Econometrics

2025-12-10 | | Total: 7

#1 Difference-in-Differences with Interval Data [PDF] [Copy] [Kimi] [REL]

Authors: Daisuke Kurisu, Yuta Okamoto, Taisuke Otsu

Difference-in-differences (DID) is one of the most popular tools used to evaluate causal effects of policy interventions. This paper extends the DID methodology to accommodate interval outcomes, which are often encountered in empirical studies using survey or administrative data. We point out that a naive application or extension of the conventional parallel trends assumption may yield uninformative or counterintuitive results, and present a suitable identification strategy, called parallel shifts, which exhibits desirable properties. Practical attractiveness of the proposed method is illustrated by revisiting an influential minimum wage study by Card and Krueger (1994).

Subject: Econometrics

Publish: 2025-12-09 16:08:43 UTC


#2 Minimax and Bayes Optimal Adaptive Experimental Design for Treatment Choice [PDF1] [Copy] [Kimi1] [REL]

Author: Masahiro Kato

We consider an adaptive experiment for treatment choice and design a minimax and Bayes optimal adaptive experiment with respect to regret. Given binary treatments, the experimenter's goal is to choose the treatment with the highest expected outcome through an adaptive experiment, in order to maximize welfare. We consider adaptive experiments that consist of two phases, the treatment allocation phase and the treatment choice phase. The experiment starts with the treatment allocation phase, where the experimenter allocates treatments to experimental subjects to gather observations. During this phase, the experimenter can adaptively update the allocation probabilities using the observations obtained in the experiment. After the allocation phase, the experimenter proceeds to the treatment choice phase, where one of the treatments is selected as the best. For this adaptive experimental procedure, we propose an adaptive experiment that splits the treatment allocation phase into two stages, where we first estimate the standard deviations and then allocate each treatment proportionally to its standard deviation. We show that this experiment, often referred to as Neyman allocation, is minimax and Bayes optimal in the sense that its regret upper bounds exactly match the lower bounds that we derive. To show this optimality, we derive minimax and Bayes lower bounds for the regret using change-of-measure arguments. Then, we evaluate the corresponding upper bounds using the central limit theorem and large deviation bounds.

Subjects: Econometrics , Machine Learning , Statistics Theory , Methodology , Machine Learning

Publish: 2025-12-09 11:58:27 UTC


#3 Automatic Debiased Machine Learning of Structural Parameters with General Conditional Moments [PDF] [Copy] [Kimi] [REL]

Author: Facundo Argañaraz

This paper proposes a method to automatically construct or estimate Neyman-orthogonal moments in general models defined by a finite number of conditional moment restrictions (CMRs), with possibly different conditioning variables and endogenous regressors. CMRs are allowed to depend on non-parametric components, which might be flexibly modeled using Machine Learning tools, and non-linearly on finite-dimensional parameters. The key step in this construction is the estimation of Orthogonal Instrumental Variables (OR-IVs) -- "residualized" functions of the conditioning variables, which are then combined to obtain a debiased moment. We argue that computing OR-IVs necessarily requires solving potentially complicated functional equations, which depend on unknown terms. However, by imposing an approximate sparsity condition, our method finds the solutions to those equations using a Lasso-type program and can then be implemented straightforwardly. Based on this, we introduce a GMM estimator of finite-dimensional parameters (structural parameters) in a two-step framework. We derive theoretical guarantees for our construction of OR-IVs and show $\sqrt{n}$-consistency and asymptotic normality for the estimator of the structural parameters. Our Monte Carlo experiments and an empirical application on estimating firm-level production functions highlight the importance of relying on inference methods like the one proposed.

Subject: Econometrics

Publish: 2025-12-09 09:49:55 UTC


#4 Robust Counterfactuals in Centralized Schools Choice Systems: Addressing Gender Inequality in STEM Education [PDF] [Copy] [Kimi] [REL]

Authors: Lixiong Li, Ismaël Mourifié

Counterfactual analysis is central to education market design and provides a foundation for credible policy recommendations. We develop a novel methodology for counterfactual analysis in Gale-Shapley deferred-acceptance (DA) assignment mechanisms under a weaker set of assumptions than those typically imposed in existing empirical works. Instead of fully specifying utility functions or students' beliefs about admission probabilities, we rely on interpretable restrictions on behavior that yield an incomplete but flexible model of preferences. This framework addresses the challenge of partial identification by delivering sharp bounds on counterfactual stable matching outcomes, which we compute efficiently using a combination of algorithmic techniques and integer programming. We illustrate the methodology by evaluating policies aimed at increasing female enrollment in STEM fields in Chile.

Subjects: Econometrics , Theoretical Economics

Publish: 2025-12-08 23:41:20 UTC


#5 Branching Fixed Effects: A Proposal for Communicating Uncertainty [PDF] [Copy] [Kimi] [REL]

Author: Patrick Kline

Economists often rely on estimates of linear fixed effects models developed by other teams of researchers. Assessing the uncertainty in these estimates can be challenging. I propose a form of sample splitting for network data that breaks two-way fixed effects estimates into statistically independent branches, each of which provides an unbiased estimate of the parameters of interest. These branches facilitate uncertainty quantification, moment estimation, and shrinkage. Algorithms are developed for efficiently extracting branches from large datasets. I illustrate these techniques using a benchmark dataset from Veneto, Italy that has been widely used to study firm wage effects.

Subjects: Econometrics , Applications , Computation

Publish: 2025-12-08 23:18:42 UTC


#6 LLM-Generated Counterfactual Stress Scenarios for Portfolio Risk Simulation via Hybrid Prompt-RAG Pipeline [PDF] [Copy] [Kimi] [REL]

Author: Masoud Soleimani

We develop a transparent and fully auditable LLM-based pipeline for macro-financial stress testing, combining structured prompting with optional retrieval of country fundamentals and news. The system generates machine-readable macroeconomic scenarios for the G7, which cover GDP growth, inflation, and policy rates, and are translated into portfolio losses through a factor-based mapping that enables Value-at-Risk and Expected Shortfall assessment relative to classical econometric baselines. Across models, countries, and retrieval settings, the LLMs produce coherent and country-specific stress narratives, yielding stable tail-risk amplification with limited sensitivity to retrieval choices. Comprehensive plausibility checks, scenario diagnostics, and ANOVA-based variance decomposition show that risk variation is driven primarily by portfolio composition and prompt design rather than by the retrieval mechanism. The pipeline incorporates snapshotting, deterministic modes, and hash-verified artifacts to ensure reproducibility and auditability. Overall, the results demonstrate that LLM-generated macro scenarios, when paired with transparent structure and rigorous validation, can provide a scalable and interpretable complement to traditional stress-testing frameworks.

Subjects: Risk Management , Artificial Intelligence , Econometrics

Publish: 2025-11-26 19:29:22 UTC


#7 Pattern Recognition of Ozone-Depleting Substance Exports in Global Trade Data [PDF1] [Copy] [Kimi1] [REL]

Author: Muhammad Sukri Bin Ramli

New methods are needed to monitor environmental treaties, like the Montreal Protocol, by reviewing large, complex customs datasets. This paper introduces a framework using unsupervised machine learning to systematically detect suspicious trade patterns and highlight activities for review. Our methodology, applied to 100,000 trade records, combines several ML techniques. Unsupervised Clustering (K-Means) discovers natural trade archetypes based on shipment value and weight. Anomaly Detection (Isolation Forest and IQR) identifies rare "mega-trades" and shipments with commercially unusual price-per-kilogram values. This is supplemented by Heuristic Flagging to find tactics like vague shipment descriptions. These layers are combined into a priority score, which successfully identified 1,351 price outliers and 1,288 high-priority shipments for customs review. A key finding is that high-priority commodities show a different and more valuable value-to-weight ratio than general goods. This was validated using Explainable AI (SHAP), which confirmed vague descriptions and high value as the most significant risk predictors. The model's sensitivity was validated by its detection of a massive spike in "mega-trades" in early 2021, correlating directly with the real-world regulatory impact of the US AIM Act. This work presents a repeatable unsupervised learning pipeline to turn raw trade data into prioritized, usable intelligence for regulatory groups.

Subjects: Machine Learning , Econometrics , General Economics

Publish: 2025-11-26 14:58:03 UTC