research-article

On the convergence of regret minimization dynamics in concave games

Authors:

Yishay Mansour,

Uri NadavAuthors Info & Claims

STOC '09: Proceedings of the forty-first annual ACM symposium on Theory of computing

Pages 523 - 532

https://doi.org/10.1145/1536414.1536486

Published: 31 May 2009 Publication History

Abstract

We study a general sub-class of concave games which we call socially concave games. We show that if each player follows any no-external regret minimization procedure then the dynamics will converge in the sense that both the average action vector will converge to a Nash equilibrium and that the utility of each player will converge to her utility in that Nash equilibrium. We show that many natural games are indeed socially concave games. Specifically, we show that linear Cournot competition and linear resource allocation games are socially-concave games, and therefore our convergence result applies to them. In addition, we show that a simple best response dynamics might diverge for linear resource allocation games, and is known to diverge for linear Cournot competition. For the TCP congestion games we show that "near" the equilibrium the games are socially-concave, and using our general methodology we show the convergence of a specific regret minimization dynamics.

References

[1]

S. Arora and W. Brinkman. A randomized online algorithm for bandwidth utilization. In SODA, 2002.

Digital Library

[2]

A. Blum, E. Even-Dar, and K. Ligett. Routing without regret: On convergence to nash equilibria of regret-minimizing algorithms in routing games. In PODC, pages 45--52, 2006.

Digital Library

[3]

A. Blum, K. Ligett M. Hajiaghay and, and A. Roth. Regret minimization and the price of total anarchy. In STOC, pages 373--382, 2008.

Digital Library

[4]

A. Blum and Y. Mansour. From external to internal regret. Journal of Machine Learning Research (JMLR), 8:1307--1324, 2007. (Preliminary version in COLT 2005.).

Digital Library

[5]

S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge University Press, 2004.

Digital Library

[6]

N. Cesa-Bianchi and G. Lugosi. Potential-based algorithms in on-line prediction and game theory. Machine Learning, 51(3):239--261, 2003.

Digital Library

[7]

A. A. Cournot. Recherches sur les principes mathÈmatiques de la theorie des richesses.(english translation: Researches into the mathematical principles of the theory of wealth). 1838.

[8]

S. Floyd and Jacobson V. Random early detection gateways for congestion avoidance. IEEE/ACM Transactions on Networking, 1:397--413, 1993.

Digital Library

[9]

D. Foster and S. M. Kakade. Deterministic calibration and nash equilibrium. In COLT, pages 33--48, 2004.

[10]

D. Foster and R. Vohra. Regret in the on-line decision problem. Games and Economic Behavior, 21:40--55, 1997.

[11]

D. Foster and H. P. Young. Learning, hypothesis testing, and Nash equilibrium. Games and Economic Behavior, 45:73--96, 2003.

[12]

D. Foster and H. P. Young. Regret testing: Learning to play Nash equilibrium without knowing you have an opponent. Theoretical Economics, 1:341--367, 2006.

[13]

Y. Freund and R. Schapire. Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29:79--103, 1999.

[14]

Drew Fudenberg and David K. Levine. The theory of learning in games. MIT press, 1998.

[15]

F. Germano and G. Lugosi. Global Nash convergence of Foster and Young's regret testing. Games and Economic Behavior, 60:135--154, 2007.

[16]

B. Hajek and G. Gopalakrishnan. Do greedy autonomous systems make for a sensible internet?, 2002. presented at the Conference on Stochastic Networks, Stanford University.

[17]

S. Hart and Y. Mansour. The communication complexity of uncoupled Nash equilibrium procedures. In STOC, pages 345--353, 2007.

Digital Library

[18]

S. Hart and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127--1150, 2000.

[19]

S. Hart and A. Mas-Colell. Stochastic uncoupled dynamics and Nash equilibrium. Games and Economice Behavior, 57(2):286--303, 2006.

[20]

E. Hazan, A. Agarwal, and S. Kale. Logarithmic regret algorithms for online convex optimization. Machine Learning, 69(2-3):169--192, 2007.

Digital Library

[21]

R. Joahri and J. Tsitsiklis. Efficiency loss in resource allocation games. Mathematics of Operations Research, 29(3):407--435, 2004.

Digital Library

[22]

R. Johari and J. N. Tsitsiklis. Efficiency loss in a network resource allocation game. Mathematics of Operations Research, 29(3):407--435, 2004.

Digital Library

[23]

Richard M. Karp, Elias Koutsoupias, Christos Papadimitriou, and Scott Shenker. Optimization problems in congestion control. In FOCS, 2000.

Digital Library

[24]

F. P. Kelly. Charging and rate control for elastic traffic. European Transactions on Telecommunications, page 33ñ37, 1997.

[25]

A. Mas-Colell, J. Green, and M. D. Whinston. Microeconomic Theory. Oxford University Press, 1995.

[26]

J. B. Rosen. Existence and uniqueness of equilibrium points for concave n-person games. Econometrica, 33(3):520--534, 1965.

[27]

R. D. Theocharis. On the stability of the cournot solution on the oligopoly problem. The Review of Economic Studies., 27(2):133--134, 1960.

[28]

H. P. Young. Strategic Learning and Its Limits. Oxford University Press, 2004.

[29]

M. Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In ICML, pages 928--936, 2003.

Digital Library

[30]

M. Zinkevich. Theoretical guarantees for algorithms in multi-agent settings. Technical Report CMU-CS-04-161, Carnegie Mellon University, 2004.

Cited By

Rozen-Schiff NNavon APechtalt IBruckman L(2024)Hercules: Heterogeneous Requirements Congestion Control2024 IEEE 30th International Symposium on Local and Metropolitan Area Networks (LANMAN)10.1109/LANMAN61958.2024.10621890(106-112)Online publication date: 10-Jul-2024
https://doi.org/10.1109/LANMAN61958.2024.10621890
Yan YZhao PZhou ZKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Fast rates in time-varying strongly monotone gamesProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3620040(39139-39164)Online publication date: 23-Jul-2023
https://dl.acm.org/doi/10.5555/3618408.3620040
Cai YZheng WKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Doubly optimal no-regret learning in monotone gamesProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3618551(3507-3524)Online publication date: 23-Jul-2023
https://dl.acm.org/doi/10.5555/3618408.3618551
Show More Cited By

Index Terms

On the convergence of regret minimization dynamics in concave games
1. Theory of computation
  1. Design and analysis of algorithms

Recommendations

Auctions between Regret-Minimizing Agents
WWW '22: Proceedings of the ACM Web Conference 2022

We analyze a scenario in which software agents implemented as regret-minimizing algorithms engage in a repeated auction on behalf of their users. We study first-price and second-price auctions, as well as their generalized versions (e.g., as those used ...
Regret minimization and the price of total anarchy
STOC '08: Proceedings of the fortieth annual ACM symposium on Theory of computing

We propose weakening the assumption made when studying the price of anarchy: Rather than assume that self-interested players will play according to a Nash equilibrium (which may even be computationally hard to find), we assume only that selfish players ...
Airline revenue management games with simultaneous price and quantity competition

This paper treats airline revenue management under simultaneous price and quantity competition in a network. Within this setting, a competitor's demand is a function of his price as well as the competitor's price and booking limits. We present a model ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

STOC '09: Proceedings of the forty-first annual ACM symposium on Theory of computing

May 2009

750 pages

ISBN:9781605585062

DOI:10.1145/1536414

Program Chair:
Michael Mitzenmacher
Harvard University

Copyright © 2009 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 May 2009

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

STOC '09

Sponsor:

STOC '09: Symposium on Theory of Computing

May 31 - June 2, 2009

MD, Bethesda, USA

Acceptance Rates

Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

45
Total Citations
View Citations
1,187
Total Downloads

Downloads (Last 12 months)107
Downloads (Last 6 weeks)8

Reflects downloads up to 04 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

Rozen-Schiff NNavon APechtalt IBruckman L(2024)Hercules: Heterogeneous Requirements Congestion Control2024 IEEE 30th International Symposium on Local and Metropolitan Area Networks (LANMAN)10.1109/LANMAN61958.2024.10621890(106-112)Online publication date: 10-Jul-2024
https://doi.org/10.1109/LANMAN61958.2024.10621890
Yan YZhao PZhou ZKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Fast rates in time-varying strongly monotone gamesProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3620040(39139-39164)Online publication date: 23-Jul-2023
https://dl.acm.org/doi/10.5555/3618408.3620040
Cai YZheng WKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Doubly optimal no-regret learning in monotone gamesProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3618551(3507-3524)Online publication date: 23-Jul-2023
https://dl.acm.org/doi/10.5555/3618408.3618551
Zhuang RHan JXue KLi JWei DLi RSun QLu J(2023)Achieving Flexible and Lightweight Multipath Congestion Control Through Online LearningIEEE Transactions on Network and Service Management10.1109/TNSM.2022.320874720:1(46-59)Online publication date: Mar-2023
https://doi.org/10.1109/TNSM.2022.3208747
Cheng XMaghsudi S(2023)Distributed Task Management in Fog Computing: A Socially Concave Bandit GameIEEE Transactions on Green Communications and Networking10.1109/TGCN.2023.32764157:3(1486-1500)Online publication date: Sep-2023
https://doi.org/10.1109/TGCN.2023.3276415
Chen ZCai KZhang JZhu X(2023)PCC Priority: A Priority-Aware Bandwidth Allocation Framework for QUICIEEE Networking Letters10.1109/LNET.2023.32690545:4(279-283)Online publication date: Dec-2023
https://doi.org/10.1109/LNET.2023.3269054
Thaker PZaharia MHashimoto T(2023)Congestion Control Safety via Comparative StaticsIEEE INFOCOM 2023 - IEEE Conference on Computer Communications10.1109/INFOCOM53939.2023.10229051(1-10)Online publication date: 17-May-2023
https://doi.org/10.1109/INFOCOM53939.2023.10229051
Huang ZLiu KPan J(2023)End-to-End Congestion Control as Learning for Unknown Games with Bandit Feedback2023 IEEE 43rd International Conference on Distributed Computing Systems (ICDCS)10.1109/ICDCS57875.2023.00060(327-338)Online publication date: Jul-2023
https://doi.org/10.1109/ICDCS57875.2023.00060
Shi JYi BWang XHuang MLi PZeng CLi K(2023)Vivace-Distributed: A novel congestion control mechanism for JointCloud environmentsFuture Generation Computer Systems10.1016/j.future.2023.07.023149(317-329)Online publication date: Dec-2023
https://doi.org/10.1016/j.future.2023.07.023
Mertikopoulos PHsieh YCevher V(2023)A unified stochastic approximation framework for learning in gamesMathematical Programming10.1007/s10107-023-02001-y203:1-2(559-609)Online publication date: 4-Aug-2023
https://doi.org/10.1007/s10107-023-02001-y
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents