sea

Tatsushi Oka

Affiliation

Research Field

Academic Positions

Education

Ph.D. (Economics) Boston University
M.A. (Economics) Osaka University

Working Papers

Inflation Target at Risk: A Time-Varying Parameter Distributional Regression

with Yunyun Wang and Dan Zhu, Mar 2024

Working Paper Abstract

Macro variables frequently display time-varying distributions, driven by the dynamic and evolving characteristics of economic, social, and environmental factors that consistently reshape the fundamental patterns and relationships governing these variables. To better understand the distributional dynamics beyond the central tendency, this paper introduces a novel semi-parametric approach for constructing time-varying conditional distributions, relying on the recent advances in distributional regression. We present an efficient precision-based Markov Chain Monte Carlo algorithm that simultaneously estimates all model parameters while explicitly enforcing the monotonicity condition on the conditional distribution function. Our model is applied to construct the forecasting distribution of inflation for the U.S., conditional on a set of macroeconomic and financial indicators. The risks of future inflation deviating excessively high or low from the desired range are carefully evaluated. Moreover, we provide a thorough discussion about the interplay between inflation and unemployment rates during the Global Financial Crisis, COVID, and the third quarter of 2023.

Keywords: Time-varying parameter model, Distributional regression, Bayesian analysis, Inflation risks

Distributional Vector Autoregression: Eliciting Macro and Financial Dependence

with Yunyun Wang and Dan Zhu, Feb 2023

Working Paper Abstract

Vector autoregression is an essential tool in empirical macroeconomics and finance for understanding the dynamic interdependencies among multivariate time series. In this study, we expand the scope of vector autoregression by incorporating a multivariate distributional regression framework and introducing a distributional impulse response function, providing a comprehensive view of dynamic heterogeneity. We propose a straightforward yet flexible estimation method and establish its asymptotic properties under weak dependence assumptions. Our empirical analysis examines the conditional joint distribution of GDP growth and financial conditions in the United States, with a focus on the global financial crisis. Our results show that tight financial conditions lead to a multimodal conditional joint distribution of GDP growth and financial conditions, and easing financial conditions significantly impacts long-term GDP growth, while improving the GDP growth during the global financial crisis has limited effects on financial conditions.

Keywords: Vector autoregression, Impulse response function, Multivariate time series, Distributional regression

Peer-Reviewed Articles

Journal Publications

Regression Adjustment for Estimating Distributional Treatment Effects in Randomized Controlled Trials

with Shota Yasui, Yuta Hayakawa, Undral Byambadalai

Econometric Reviews, forthcoming

Working Paper Abstract

In this paper, we address the issue of estimating and inferring distributional treatment effects in randomized experiments. The distributional treatment effect provides a more comprehensive understanding of treatment heterogeneity compared to average treatment effects. We propose a regression adjustment method that utilizes distributional regression and pre-treatment information, establishing theoretical efficiency gains without imposing restrictive distributional assumptions. We develop a practical inferential framework and demonstrate its advantages through extensive simulations. Analyzing water conservation policies, our method reveals that behavioral nudges systematically shift consumption from high to moderate levels. Examining health insurance coverage, we show the treatment reduces the probability of zero doctor visits by 6.6 percentage points while increasing the likelihood of 3-6 visits. In both applications, our regression adjustment method substantially improves precision and identifies treatment effects that were statistically insignificant under conventional approaches.

Semiparametric Single-Index Estimation for Average Treatment Effects

with Difang Huang and Jiti Gao

Econometric Reviews, 2025 44(6), pp. 843-885

Econometric Reviews Working Paper Abstract

We propose a semiparametric method to estimate the average treatment effect under the assumption of unconfoundedness given observational data. Our estimation method alleviates misspecification issues of the propensity score function by estimating the single-index link function involved through Hermite polynomials. Our approach is computationally tractable and allows for moderately large dimension covariates. We provide the large sample properties of the estimator and show its validity. Also, the average treatment effect estimator achieves the parametric rate and asymptotic normality. Our extensive Monte Carlo study shows that the proposed estimator is valid in finite samples. Applying our method to maternal smoking and infant health, we find that conventional estimates of smoking's impact on birth weight may be biased due to propensity score misspecification, and our analysis of job training programs reveals earnings effects that are more precisely estimated than in prior work. These applications demonstrate how addressing model misspecification can substantively affect our understanding of key policy-relevant treatment effects.

Keywords: Average treatment effects, Causal inference, Hermite series expansion, Propensity score

Quantile Random-Coefficient Regression with Interactive Fixed Effects: Heterogeneous Group-level Policy Evaluation

with Ruofan Xu, Jiti Gao and Yoon-Jae Whang

Econometric Reviews, 2025, 44(5), pp. 630-648.

Econometric Reviews Working Paper Abstract

We propose a quantile random-coefficient regression with interactive fixed effects to study the effects of group-level policies that are heterogeneous across individuals. Our approach is the first to use a latent factor structure to handle the unobservable heterogeneities in the random coefficient. The asymptotic properties and an inferential method for the policy estimators are established. The model is applied to evaluate the effect of the minimum wage policy on earnings between 1967 and 1980 in the United States. Our results suggest that the minimum wage policy has significant and persistent positive effects on black workers and female workers up to the median. Our results also indicate that the policy helps reduce income disparity up to the median between two groups: black, female workers versus white, male workers. However, the policy is shown to have little effect on narrowing the income gap between low- and high-income workers within the subpopulations.

Keywords: Heterogenous policy effects, Hierarchical regression, Random coefficient model

Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding

with Ukyo Honda, Peinan Zhang, Masato Mita

Transactions of the Association for Computational Linguistics, 2024, 12, pp. 1268–1289

TACL Working Paper Abstract

Recent models for natural language understanding are inclined to exploit simple patterns in datasets, commonly known as shortcuts. These shortcuts hinge on spurious correlations between labels and latent features existing in the training data. At inference time, shortcut-dependent models are likely to generate erroneous predictions under distribution shifts, particularly when some latent features are no longer correlated with the labels. To avoid this, previous studies have trained models to eliminate the reliance on shortcuts. In this study, we explore a different direction: pessimistically aggregating the predictions of a mixture-of-experts, assuming each expert captures relatively different latent features. The experimental results demonstrate that our post-hoc control over the experts significantly enhances the model's robustness to the distribution shift in shortcuts. Additionally, we show that our approach has some practical advantages. We also analyze our model and provide results to support the assumption.

Keywords: Natural language understanding, Mixture ofExperts, Shortcuts, Spurious correlations

Bivariate Distribution Regression with Application to Insurance Data

with Yunyun Wang and Dan Zhu

Insurance: Mathematics and Economics, 2023, 113, pp. 215-232

IME Working Paper Abstract

Understanding variable dependence, particularly eliciting their statistical properties given a set of covariates, provides the mathematical foundation in practical operations management such as risk analysis and decision-making given observed circumstances. This article presents an estimation method for modeling the conditional joint distribution of bivariate outcomes based on the distribution regression and factorization methods. This method is considered semiparametric in that it allows for flexible modeling of both the marginal and joint distributions conditional on covariates without imposing global parametric assumptions across the entire distribution. In contrast to existing parametric approaches, our method can accommodate discrete, continuous, or mixed variables, and provides a simple yet effective way to capture distributional dependence structures between bivariate outcomes and covariates. Various simulation results confirm that our method can perform similarly or better in finite samples compared to the alternative methods. In an application to the study of a motor third-party liability insurance portfolio, the proposed method effectively estimates risk measures such as the conditional Value-at-Risk and Expected Shortfall. This result suggests that this semiparametric approach can serve as an alternative in insurance risk management.

Keywords: Finance, Multivariate statistics, Risk management, Distribution Regression

Heterogeneous Impact of the Minimum Wage: Implications for Changes in Between- and Within-Group Inequality

with Ken Yamada

Journal of Human Resources, 2023, 58(1), pp. 335-362.

Journal of Human Resources Working Paper Abstract

Most of the workers who earn at or below the minimum wage are either less educated, young, or female in the United States. We examine the extent to which the minimum wage influences the wage differential among workers with different observed characteristics and the wage differential among workers with the same observed characteristics. Our results suggest that changes in the real value of the minimum wage account in part for the patterns of changes in education, experience, and gender wage differentials and for most of the changes in within-group wage differentials for workers with lower levels of experience.

Keywords: Minimum wage, wage inequality, censoring, quantile regression

The Effect of Human Mobility Restrictions on the COVID-19 Transmission Network in China

with Wei Wei and Dan Zhu

PLoS One, July 2021

PLoS One Working Paper Abstract

COVID-19 poses a severe threat worldwide. This study analyzes its propagation and evaluates statistically the effect of mobility restriction policies on the spread of the disease. We apply a variation of the stochastic Susceptible-Infectious-Recovered model to describe the temporal-spatial evolution of the disease across 33 provincial regions in China, where the disease was first identified. We employ Bayesian Markov Chain Monte-Carlo methods to estimate the model and to characterize a dynamic transmission network, which enables us to evaluate the effectiveness of various local and national policies. The spread of the disease in China was predominantly driven by community transmission within regions, which dropped substantially after local governments imposed various lockdown policies. Further, Hubei was only the epicenter of the early epidemic stage. Secondary epicenters, such as Beijing and Guangdong, had already become established by late January 2020. The transmission from these epicenters substantially declined following the introduction of mobility restrictions across regions. Conclusions: The spatial transmission network is able to differentiate the effect of the local lockdown policies and the cross-region mobility restrictions. We conclude that both are important policy tools for curbing the disease transmission. The coordination between central and local governments is important in suppressing the spread of infectious diseases.

Keywords: COVID-19, Infection, Heterogeneity, Spatial Model, Bayesian Analysis

Inference Related to Common Breaks in a Multivariate System with Joined Segmented Trends with Applications to Global and Hemispheric Temperatures

with Francisco Estrada, Dukpa Kim, Pierre Perron

Journal of Econometrics, January 2020, 214(1), pp. 130-152.

Journal of Econometrics Working Paper

Indirect Inference with a Non-Smooth Criterion Function

with David Frazier and Dan Zhu

Journal of Econometrics, October 2019, 212(2), pp. 623-645.

Journal of Econometrics Working Paper Abstract

Indirect inference requires simulating realizations of endogenous variables from the model under study. When the endogenous variables are discontinuous functions of the model parameters, the resulting indirect inference criterion function is discontinuous and does not permit the use of derivative-based optimization routines. Using a change of variables technique, we propose a novel simulation algorithm that alleviates the discontinuities inherent in such indirect inference criterion functions, and permits the application of derivative-based optimization routines to estimate the unknown model parameters. Unlike competing approaches, this approach does not rely on kernel smoothing or bandwidth parameters. Several Monte Carlo examples that have featured in the literature on indirect inference with discontinuous outcomes illustrate the approach, and demonstrate the superior performance of this approach over existing alternatives.

Keywords: Simulation estimators, Indirect inference, Discontinuous objective functions, Dynamic discrete choice models

Quantile Treatment Effects in Difference in Differences Models under Dependence Restrictions and with only Two Time Periods

with Brantly Callaway and Tong Li

Journal of Econometrics, October 2018, 206(2), pp. 395-413.

Journal of Econometrics Working Paper R-package

Testing for Common Breaks in a Multiple Equations System

with Pierre Perron

Journal of Econometrics, May 2018, 204(1) pp. 66–85.

Journal of Econometrics Working Paper Abstract

The issue addressed in this paper is that of testing for common breaks across or within equations of a multivariate system. Our framework is very general and allows integrated regressors and trends as well as stationary regressors. The null hypothesis is that breaks in different parameters occur at common locations and are separated by some positive fraction of the sample size unless they occur across different equations. Under the alternative hypothesis, the break dates across parameters are not the same and also need not be separated by a positive fraction of the sample size whether within or across equations. The test considered is the quasi-likelihood ratio test assuming normal errors, though as usual the limit distribution of the test remains valid with non-normal errors. Of independent interest, we provide results about the rate of convergence of the estimates when searching over all possible partitions subject only to the requirement that each regime contains at least as many observations as some positive fraction of the sample size, allowing break dates not separated by a positive fraction of the sample size across equations. Simulations show that the test has good finite sample properties. We also provide an application to issues related to level shifts and persistence for various measures of inflation to illustrate its usefulness.

Keywords: Change-point, Segmented regressions, Break dates, Hypothesis testing, Multiple equations systems

The Cross-Quantilogram: Measuring Quantile Dependence and Testing Directional Predictability between Time Series

with Heejoon Han, Oliver Linton and Yoon-Jae Whang

Journal of Econometrics, July 2016, 193(1), pp. 251-270.

Journal of Econometrics Working Paper R-package Abstract

This paper proposes the cross-quantilogram to measure the quantile dependence between two time series. We apply it to test the hypothesis that one time series has no directional predictability to another time series. We establish the asymptotic distribution of the cross-quantilogram and the corresponding test statistic. The limiting distributions depend on nuisance parameters. To construct consistent confidence intervals we employ a stationary bootstrap procedure; we establish consistency of this bootstrap. Also, we consider a self-normalized approach, which yields an asymptotically pivotal statistic under the null hypothesis of no predictability. We provide simulation studies and two empirical applications. First, we use the cross-quantilogram to detect predictability from stock variance to excess stock return. Compared to existing tools used in the literature of stock return predictability, our method provides a more complete relationship between a predictor and stock return. Second, we investigate the systemic risk of individual financial institutions, such as JP Morgan Chase, Morgan Stanley and AIG.

Keywords: Quantile, Correlogram, Dependence, Predictability, Systemic risk

Set Identification of the Censored Quantile Regression Model for Short Panels with Fixed Effects

with Tong Li

Journal of Econometrics, October 2015, 188(2), pp. 363–377.

Journal of Econometrics Working Paper

Divorce Law Reforms and Divorce Rates in the U.S.: An Interactive Fixed Effects Approach

with Dukpa Kim

Journal of Applied Econometrics, March 2014, 29(2), pp. 231-245.

Journal of Applied Econometrics Working Paper

Estimating Structural Changes in Regression Quantiles

with Zhongjun Qu

Journal of Econometrics, June 2011, 162(2), pp. 248-267.

Journal of Econometrics Working Paper R-code

Juvenile Crime and Punishment: Evidence from Japan

Applied Economics, 2009, 41(24), pp. 3103-3115.

Applied Economics Working Paper Abstract

Over the last decade, juvenile crime has become a serious social problem in Japan. The Juvenile Law was revised in 2001 to impose harsher punishment on juvenile offenders. This revision makes it possible to impose criminal punishment on 14- and 15-year-old criminal offenders, while those offenders aged 16–19 have always faced criminal punishment, both before and after the revision. Using this revision as a natural experiment, this study conducts a difference-in-differences estimation to examine the effect of punishment on juvenile crime. The analysis provides evidence that punishment can deter juvenile crime. In addition, this research examines the criminal behaviour of 13-year-olds, who face no change in punishment, but who soon will in the near future. The results suggest that the revision also had a negative impact on the criminal behaviour of these younger offenders.

Keywords: Juvenile crime, Juvenile law, Difference-in-differences

Conference Proceedings

Exploring Explanations Improves the Robustness of In-Context Learning

with Ukyo Honda

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL), July 2025

ACL Working Paper Abstract

In-context learning (ICL) has emerged as a successful paradigm for leveraging large language models (LLMs). However, it often struggles to generalize beyond the distribution of the provided demonstrations. A recent advancement in enhancing robustness is ICL with explanations (X-ICL), which improves prediction reliability by guiding LLMs to understand and articulate the reasoning behind correct labels. Building on this approach, we introduce an advanced framework that extends X-ICL by systematically exploring explanations for all possible labels (X2-ICL), thereby enabling more comprehensive and robust decision-making. Experimental results on multiple natural language understanding datasets validate the effectiveness of X2-ICL, demonstrating significantly improved robustness to out-of-distribution data compared to the existing ICL approaches.

Keywords: In-context learning, Large Language Models, Out-of-Distribution Data, Natural Language Processing, Spurious Correlation, Latent Variables, Generative artificial intelligence

On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive Randomization

with Undral Byambadalai, Tomu Hirata, Shota Yasui

Proceeding of International Conference on Machine Learning (ICML), July 2025

ICML Working PaperR-code Abstract

This paper focuses on the estimation of distributional treatment effects in randomized experiments that use covariate-adaptive randomization (CAR). These include designs such as Efron's biased-coin design and stratified block randomization, where participants are first grouped into strata based on baseline covariates and assigned treatments within each stratum to ensure balance across groups. In practice, datasets often contain additional covariates beyond the strata indicators. We propose a flexible distribution regression framework that leverages off-the-shelf machine learning methods to incorporate these additional covariates, enhancing the precision of distributional treatment effect estimates. We establish the asymptotic distribution of the proposed estimator and introduce a valid inference procedure. Furthermore, we derive the semiparametric efficiency bound for distributional treatment effects under CAR and demonstrate that our regression adjusted estimator attains this bound. Simulation studies and empirical analyses of microcredit programs highlight the practical advantages of our method.

Keywords: Randomized Expriment, Covariate-Adaptive Randomization, Distributional Treatment Effect, Probility Treatment Effect, Semiparametric Efficiency Bound, Regression Adjustment, Covarite Adjustment

Estimating Distributional Treatment Effects in Randomized Experiments: Machine Learning for Variance Reduction

with Undral Byambadalai, Shota Yasui

Proceeding of International Conference on Machine Learning (ICML), July 2024

ICML Working Paper R-code Abstract

We propose a novel regression adjustment method designed for estimating distributional treatment effect parameters in randomized experiments. Randomized experiments have been extensively used to estimate treatment effects in various scientific fields. However, to gain deeper insights, it is essential to estimate distributional treatment effects rather than relying solely on average effects. Our approach incorporates pre-treatment covariates into a distributional regression framework, utilizing machine learning techniques to improve the precision of distributional treatment effect estimators. The proposed approach can be readily implemented with off-the-shelf machine learning methods and remains valid as long as the nuisance components are reasonably well estimated. Also, we establish the asymptotic properties of the proposed estimator and present a uniformly valid inference method. Through simulation results and real data analysis, we demonstrate the effectiveness of integrating machine learning techniques in reducing the variance of distributional treatment effect estimators in finite samples.

Keywords: Randomized experiments, Distributional Treatment Effects, Machine Learning

Safe Collaborative Filtering

with Riku Togashi, Naoto Ohsaka, Tetsuro Morimura

Proceedings of the International Conference on Learning Representations (ICLR), May 2024

ICLR Working Paper Abstract

Excellent tail performance is crucial for modern machine learning tasks, such as algorithmic fairness, class imbalance, and risk-sensitive decision making, as it ensures the effective handling of challenging samples within a dataset. Tail performance is also a vital determinant of success for personalized recommender systems to reduce the risk of losing users with low satisfaction. This study introduces a ``safe'' collaborative filtering method that prioritizes recommendation quality for less-satisfied users rather than focusing on the average performance. Our approach minimizes the conditional value at risk (CVaR), which represents the average risk over the tails of users' loss. To overcome computational challenges for web-scale recommender systems, we develop a robust yet practical algorithm that extends the most scalable method, implicit alternating least squares (iALS). Empirical evaluation on real-world datasets demonstrates the excellent tail performance of our approach while maintaining competitive computational efficiency.

Keywords: Recommendation, Conditional Value at Risk, Matrix Factorization, Covolution

Statistical Packages

quantilogram Cross-Quantilogram

R-Package

QR.break Structural Breaks in Quantile Regression

R-Package

External Profiles

arXiv | DBLP | Google Scholar | ORCID| Web of Science

Research Grants

Grant-in-Aid for Scientific Research (C) (24K04821), 2024-2027

Grant-in-Aid for Research Activity Start-up (23K18807), 2023-2024

Australian Research Council (ARC) Discovery Projects (with Pierre Perron and Zhongjun Qu), 2021-2024

Australian Research Council (ARC) Discovery Projects (with Tong Li), 2019-2022

Academic Research Fund (Tier 1), Ministry of Education, Singapore, 2016-2019

Academic Research Fund (Tier 1), Ministry of Education, Singapore, 2013-2016



© - Tatsushi Oka