2026-04-16 | | Total: 13
This paper provides a systematic comparison between Fitted Dynamic Programming (DP), where demand is estimated from data, and Reinforcement Learning (RL) methods in finite-horizon dynamic pricing problems. We analyze their performance across environments of increasing structural complexity, ranging from a single typology benchmark to multi-typology settings with heterogeneous demand and inter-temporal revenue constraints. Unlike simplified comparisons that restrict DP to low-dimensional settings, we apply dynamic programming in richer, multi-dimensional environments with multiple product types and constraints. We evaluate revenue performance, stability, constraint satisfaction behavior, and computational scaling, highlighting the trade-offs between explicit expectation-based optimization and trajectory-based learning.
We study a finite-horizon dynamic pricing problem for event tickets with limited inventory and time-varying demand. The central practical difficulty is that the total demand function $L(t)$ is not observed directly and must be estimated from data, while pricing decisions are sensitive to its temporal shape. The paper examines how the accuracy of this estimate affects revenue. We consider a model in which sales intensity is driven by the total demand $L(t)$, a price-response function $v(p)$, and a time-dependent willingness-to-pay factor $\varphi(t)$. The factor $\varphi(t)$ plays a central role: it captures the increase in customers' willingness to pay as the event date approaches and makes the temporal profile of demand economically important for pricing. Within this framework, the updated numerical study evaluates a benchmark dynamic-programming policy across nine deterministic true-demand scenarios, a collection of feature-aware misspecifications of $L(t)$, and multiple environment regimes induced by $v(p)=e^{-ηp}$, the deadline factor $\varphi(t)$, and inventory level $Q$. The reported summaries are based on stochastic simulation and a ratio-of-means relative-loss metric. The results show that a more accurate representation of the temporal demand profile leads to more effective pricing decisions and higher revenue. Over the full misspecification collection the aggregate relative revenue loss is $0.42\%$, the upper decile exceeds $1\%$, and the most expensive errors are omissions of late-demand components. The average effect is therefore modest but non-negligible, and it becomes stronger when deadline effects are pronounced and inventory is tight.
Why do large gender inequalities in everyday life persist even as women strengthen their attachment to paid work? Existing evidence shows that women continue to do more unpaid work than men, but much of that evidence is based on individual diaries, says little about how inequality is jointly organized within couples, and rarely links daily time allocation to directly measured gender attitudes. This paper addresses that gap using the TIMES Observatory, an original survey of 1,928 co-resident couples with at least one child younger than 11 in Emilia-Romagna or Campania. The data combine matched partner diaries for one weekday and one weekend day with rich socio-economic information and direct measures of gender norms. We document three main findings. First, women do substantially more unpaid work and spend more time with children, while men do more paid work and enjoy more leisure without children. Second, these asymmetries remain sizeable even among dual full-time couples, implying that stronger female labor-market attachment does not by itself equalize daily life. Third, more traditional gender attitudes - especially among men - are descriptively associated with lower male participation in childcare and domestic work and with wider gaps in discretionary leisure. The analysis is descriptive rather than causal, but it shows that gender inequality within couples is visible not only in the amount of work performed, but also in the distribution of time that is genuinely discretionary.
For networks with externalities, where each component's worth may depend on the full network structure, balanced contributions and fairness lead to distinct component-efficient allocation rules. We characterize the unique component-efficient allocation rule satisfying balanced contributions -- the BCE rule. Existence is the main challenge: balanced contributions must hold on every edge, but the construction uses only spanning-tree edges. A cycle-sum identity bridges this gap by reducing balanced contributions on non-tree edges to relations in proper subnetworks. The BCE rule coincides with the Myerson value for TU games and with its generalization by Jackson--Wolinsky for network games without externalities, it recovers the externality-free value on the complete network, and -- unlike the fairness-based FCE rule -- it does not reduce to a graph-free formula applied to the graph-restricted game.
Electricity is typically traded in day-ahead auctions because many power system decisions, such as unit commitment, must be made in advance. However, when wind and solar generators sell power one day ahead, they face uncertainty about their actual production. In current day-ahead auctions, this uncertainty cannot be directly communicated, leading to inefficient use of renewable energy and suboptimal system decisions. We show how this problem can be addressed using the concept of equilibrium under uncertainty from microeconomic theory. In particular, we demonstrate that electricity contracts should be conditioned not only on the time and location of delivery, but also on the state of the world (e.g., whether it will be windy or calm). This requires a precise definition of the state of the world. Since there are infinitely many possible definitions, criteria are needed to select among them. We develop such criteria and show that the resulting states correspond to solutions of an optimal partitioning problem. Finally, we illustrate how these states can be computed and interpreted using a case study of offshore wind farms in the European North Sea.
In centralized assignment problems, agents may have preferences over joint rather than individual assignments, such as couples in residency matching or siblings in school choice and daycare. Standard preference estimation methods typically ignore such complementarities. This paper develops an empirical framework that explicitly incorporates them. Using data from daycare assignment in a municipality in Japan, we estimate a model in which families incur both additional commuting distance and a fixed non-distance disutility when siblings are assigned to different facilities. We find that split assignment generates a large disutility, equivalent to more than twice the average commuting distance. We then simulate counterfactual assignment policies that vary the strength of sibling priority and evaluate welfare. The sibling priority reform that we designed and that was implemented in 2024 increases welfare by 6.4% while reducing inequality in assignment rates across sibling groups; models that ignore sibling complementarities substantially understate these gains. At the same time, we uncover a clear efficiency-equity tradeoff: along the frontier, increasing mean welfare by 100 meters is associated with an increase in inequality of about 1.7 percentage points, and the welfare-maximizing policy reverses much of the reform's reduction in inequality, largely through the displacement of households without siblings.
Access to mental health care is often rationed through waiting lists, yet there is limited causal evidence on the consequences of delayed access. We study whether eliminating waiting time for psychological support improves outcomes for young adults who grew up with parental substance misuse. Using a randomized waitlist-controlled trial in Denmark combined with survey and administrative data, we find that immediate access leads to sizable short-run improvements in psychological health. These gains persist three to four years after randomization, even after both groups have received the intervention. By contrast, we find limited evidence of large average effects on broader health or labor market outcomes. Our results highligth the importance of treatment timing in capacity-constrained settings.
The maximum score method (Manski, 1975, 1985) is a powerful approach for binary choice models, yet it is known to face both practical and theoretical challenges. In particular, the estimator converges at a slower-than-root-$n$ rate to a nonstandard limiting distribution. We investigate conditions under which strictly concave surrogate score functions can be employed to achieve identification through a smooth criterion function. This criterion enables root-$n$ convergence to a normal limiting distribution. While the conditions to guarantee these desired properties are nontrivial, we characterize them in terms of primitive conditions. Extensive simulation studies support, the root-$n$ convergence rate, the asymptotic normality, and the validity of the standard inference methods.
We review the "production approach" to estimating markups, the ratio of price to marginal cost. The approach is uniquely scalable: it requires no model of consumer demand or market structure and applies broadly across firms, industries, and time. Our organizing insight is that the production-based markup is a residual. Like the Solow residual, it is clean in theory but potentially contaminated by misspecification and mismeasurement. This framing helps explain why small differences in implementation can produce starkly different results from the same data. In some cases, markups have risen sharply. In others, they have not. Despite the disagreements in the literature, the importance of understanding and measuring market power cannot be overstated. We provide conceptual rationales for this disagreement, offer practical guidance on data and estimation, and call for greater transparency about how much of the variation attributed to markups may instead reflect technology.
Firms in denser areas are more productive, a pattern attributed to agglomeration economies and firm selection. To disentangle these two channels, the popular approach of Combes et al. (2012, ECTA) critically assumes that total factor productivity (TFP) distributions between denser and less dense areas are the same up to mean, variance, and left-tail truncation. We empirically validate this assumption using Spanish administrative firm-level data and recent econometric methods adapted to noisy TFP estimates. Our results find that TFP distributions are indeed statistically identical up to these parameters, validating the use of such productivity decompositions. Furthermore, using only the mean and variance is sufficient to capture differences for all sectors. Accordingly, the productivity advantage of cities may be entirely due to agglomeration rather than stronger selection, suggesting that policymakers should focus on policies targeting agglomeration. Finally, our approach extends to related contexts like differences in worker skill distributions.
This study analyses the impacts of economic complexity on environmental performance in BRICS-T countries. Annual data for the period 1999-2021, Durbin-Hausman cointegration test and Augmented Mean Group (AMG) estimator are used in the analysis. The robustness of the Panel AMG results is tested with CCEMG and CS-ARDL methods. The results indicate that economic complexity has a positive impact on environmental performance. An increase of 1% in the economic complexity index increases environmental performance in BRICS-T countries between 0.020% and 1.243%. However, economic growth, energy intensity and population density were found to have a negative impact on environmental performance. Renewable energy use, in contrast, contributes positively to environmental performance.
Why do capitalist economies recurrently generate crises whose severity is disproportionate to the size of the triggering shock? This paper proposes a structural answer grounded in the evolutionary geometry of production networks. As economies evolve through specialization, integration, and competitive selection, their inter-sectoral linkages drift toward configurations of increasing geometric fragility, eventually crossing a threshold beyond which small disturbances generate disproportionately large cascades. We introduce Sandpile Economics, a formal framework that interprets macroeconomic instability as an emergent property of disequilibrium production networks. The key state variable is the Forman--Ricci curvature of the input--output graph, capturing local substitution possibilities when supply chains are disrupted. We show that when curvature falls below an endogenous threshold, the distribution of cascade sizes follows a power law with tail index $α\in (1,2)$, implying a regime of unbounded amplification. The underlying mechanism is evolutionary: specialization reduces input substitutability, pushing the economy toward criticality, while crisis episodes induce endogenous network reconfiguration and path dependence. These dynamics are inherently non-ergodic and cannot be captured by representative-agent frameworks. Empirically, using global input--output data, we document that production networks operate in persistently negative curvature regimes and that curvature robustly predicts medium-run output dynamics. A one-standard-deviation increase in curvature is associated with higher cumulative growth over three-year horizons, and curvature systematically outperforms standard network metrics in explaining cross-country differences in resilience.
The bullwhip effect remains operationally persistent despite decades of analytical research. Two computational deficiencies hinder progress: the absence of modular open-source simulation tools for multi-echelon inventory dynamics with asymmetric costs, and the lack of a standardized benchmarking protocol for comparing mitigation strategies across shared metrics and datasets. This paper introduces deepbullwhip, an open-source Python package that integrates a simulation engine for serial supply chains (with pluggable demand generators, ordering policies, and cost functions via abstract base classes, and a vectorized Monte Carlo engine achieving 50 to 90 times speedup) with a registry-based benchmarking framework shipping a curated catalog of ordering policies, forecasting methods, six bullwhip metrics, and demand datasets including WSTS semiconductor billings. Five sets of experiments on a four-echelon semiconductor chain demonstrate cumulative amplification of 427x (Monte Carlo mean across 1,000 paths), a stochastic filtering phenomenon at upstream tiers (CV = 0.01), super-exponential lead time sensitivity, and scalability to 20.8 million simulation cells in under 7 seconds. Benchmark experiments reveal a 155x disparity between synthetic AR(1) and real WSTS bullwhip severity under the Order-Up-To policy, and quantify the BWR-NSAmp tradeoff across ordering policies, demonstrating that no single metric captures policy quality.