USI - Email
Adaptive maximization of social welfare in theory and practice
Host: Euler Institute
USI Campus Ovest, room 355
12:30 - 13:30
University of Oxford
Abstract: We consider the problem of repeatedly choosing policy parameters, such as tax or trans- fer rates, in order to maximize social welfare, defined as the weighted sum of private utility and public revenue. The outcomes of earlier policy choices inform later choices. In contrast to multi-armed bandit models, utility is not observed, but needs to be indirectly inferred from the integral of the response function. In contrast to standard optimal tax theory, response functions need to be learned through policy choices.
We derive a lower bound on regret for this problem, and a matching adversarial upper bound on regret for a variant of the Exp3 algorithm. In both cases, cumulative regret grows 2/3at a rate of T . This implies that (i) the social welfare maximization problem is harder than the multi-armed bandit problem (with a rate of T 1/2), and (ii) that our proposed algorithm achieves the optimal rate.
Simulations confirm these results, as well as the viability of the proposed algorithm. We also compare the social welfare maximization problem to two related learning problems, monopoly pricing (which is easier), and price setting for bilateral trade (which is harder).
We lastly discuss extensions to nonlinear income taxation, and to commodity taxation.
Maximilian Kasy is Professor of Economicy at the University of Oxford. He received his PhD at UC Berkeley and joined Oxford coming from Harvard University. His current research interests focus on social foundations for statistics and machine learning, going beyond traditional single-agent decision theory. He also works on economic inequality, job guarantee programs, and basic income. He teaches a course on foundations of machine learning at the economics department at Oxford.
INF Newsletter Archive