Cool Papers - Immersive Paper Discovery
Economics
https://papers.cool/arxiv/econ
2024-08-09T00:00:00+00:00

https://papers.cool/arxiv/2408.04385
Non-maximizing policies that fulfill multi-criterion aspirations in expectation
2024-08-09T00:00:00+00:00
Simon Dima, Simon Fischer, Jobst Heitzig, Joss Oliver
In dynamic programming and reinforcement learning, the policy for the sequential decision making of an agent in a stochastic environment is usually determined by expressing the goal as a scalar reward function and seeking a policy that maximizes the expected total reward. However, many goals that humans care about naturally concern multiple aspects of the world, and it may not be obvious how to condense those into a single reward function. Furthermore, maximization suffers from specification gaming, where the obtained policy achieves a high expected total reward in an unintended way, often taking extreme or nonsensical actions. Here we consider finite acyclic Markov Decision Processes with multiple distinct evaluation metrics, which do not necessarily represent quantities that the user wants to be maximized. We assume the task of the agent is to ensure that the vector of expected totals of the evaluation metrics falls into some given convex set, called the aspiration set. Our algorithm guarantees that this task is fulfilled by using simplices to approximate feasibility sets and propagate aspirations forward while ensuring they remain feasible. It has complexity linear in the number of possible state-action-successor triples and polynomial in the number of evaluation metrics. Moreover, the explicitly non-maximizing nature of the chosen policy and goals yields additional degrees of freedom, which can be used to apply heuristic safety criteria to the choice of actions.
We discuss several such safety criteria that aim to steer the agent towards more conservative behavior.

https://papers.cool/arxiv/2408.04617
Difference-in-Differences for Health Policy and Practice: A Review of Modern Methods
2024-08-09T00:00:00+00:00
Shuo Feng, Ishani Ganguli, Youjin Lee, John Poe, Andrew Ryan, Alyssa Bilinski
Difference-in-differences (DiD) is the most popular observational causal inference method in health policy, employed to evaluate the real-world impact of policies and programs. To estimate treatment effects, DiD relies on the "parallel trends assumption", that on average treatment and comparison groups would have had parallel trajectories in the absence of an intervention. Historically, DiD has been considered broadly applicable and straightforward to implement, but recent years have seen rapid advancements in DiD methods. This paper reviews and synthesizes these innovations for medical and health policy researchers. We focus on four topics: (1) assessing the parallel trends assumption in health policy contexts; (2) relaxing the parallel trends assumption when appropriate; (3) employing estimators to account for staggered treatment timing; and (4) conducting robust inference for analyses in which normal-based clustered standard errors are inappropriate. For each, we explain challenges and common pitfalls in traditional DiD and modern methods available to address these issues.

https://papers.cool/arxiv/2408.04508
Scarce Workers, High Wages?
2024-08-09T00:00:00+00:00
Erik-Benjamin Börschlein, Mario Bossler, Martin Popp
Labor market tightness increased tremendously in Germany between 2012 and 2022. We analyze the effect of tightness on wages by combining social security data with unusually rich information on vacancies and job seekers. Instrumental variable regressions reveal positive elasticities between 0.004 and 0.011, implying that higher tightness explains between 7 and 19 percent of the real wage increase.
We report greater elasticities for new hires, high-skilled workers, the Eastern German labor market, and the service sector. In particular, tightness raised wages at the bottom of the wage distribution, contributing to the decline in wage inequality over the last decade.

https://papers.cool/arxiv/2408.04509
Robust Market Design with Opaque Announcements
2024-08-09T00:00:00+00:00
Aram Grigoryan, Markus Möller
We introduce a framework where the announcements of a clearinghouse about the allocation process are opaque in the sense that there can be more than one outcome compatible with a realization of type reports. We ask whether desirable properties can be ensured under opacity in a robust sense. A property can be guaranteed under an opaque announcement if every mechanism compatible with it satisfies the property. We find an impossibility result: strategy-proofness cannot be guaranteed under any level of opacity. In contrast, in some environments, weak Maskin monotonicity and non-bossiness can be guaranteed under opacity.

https://papers.cool/arxiv/2408.04552
Semiparametric Estimation of Individual Coefficients in a Dyadic Link Formation Model Lacking Observable Characteristics
2024-08-09T00:00:00+00:00
L. Sanna Stephan
Dyadic network formation models have wide applicability in economic research, yet are difficult to estimate in the presence of individual specific effects and in the absence of distributional assumptions regarding the model noise component. The availability of (continuously distributed) individual or link characteristics generally facilitates estimation. Yet, while data on social networks has recently become more abundant, the characteristics of the entities involved in the link may not be measured. Adapting the procedure of \citet{KS}, I propose to use network data alone in a semiparametric estimation of the individual fixed effect coefficients, which carry the interpretation of the individual relative popularity.
This makes it possible to anticipate how a new-coming individual will connect in a pre-existing group. The estimator, needed for its fast convergence, fails to implement the monotonicity assumption regarding the model noise component, thereby potentially reversing the order of the fixed effect coefficients. This and other numerical issues can be conveniently tackled by my novel, data-driven way of normalising the fixed effects, which proves to outperform a conventional standardisation in many cases. I demonstrate that the normalised coefficients converge both at the same rate and to the same limiting distribution as if the true error distribution were known. The cost of semiparametric estimation is thus purely computational, while the potential benefits are large whenever the errors have a strongly convex or strongly concave distribution.

https://papers.cool/arxiv/2408.04573
Revealed Invariant Preference
2024-08-09T00:00:00+00:00
Peter Caradonna, Christopher P. Chambers
We consider the problem of rationalizing choice data by a preference satisfying an arbitrary collection of invariance axioms. Examples of such axioms include quasilinearity, homotheticity, independence-type axioms for mixture spaces, constant relative/absolute risk and ambiguity aversion axioms, stationarity for dated rewards or consumption streams, separability, and many others. We provide necessary and sufficient conditions for invariant rationalizability via a novel approach which relies on tools from the theoretical computer science literature on automated theorem proving. We also establish a generalization of the Dushnik-Miller theorem, which we use to give a complete description of the out-of-sample predictions generated by the data under any such collection of axioms.