Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/2999792.2999916guideproceedingsArticle/Chapter ViewAbstractPublication PagesnipsConference Proceedingsconference-collections
Article

Minimax optimal algorithms for unconstrained linear optimization

Published: 05 December 2013 Publication History

Abstract

We design and analyze minimax-optimal algorithms for online linear optimization games where the player's choice is unconstrained. The player strives to minimize regret, the difference between his loss and the loss of a post-hoc benchmark strategy. While the standard benchmark is the loss of the best strategy chosen from a bounded comparator set, we consider a very broad range of benchmark functions. The problem is cast as a sequential multi-stage zero-sum game, and we give a thorough analysis of the minimax behavior of the game, providing characterizations for the value of the game, as well as both the player's and the adversary's optimal strategy. We show how these objects can be computed efficiently under certain circumstances, and by selecting an appropriate benchmark, we construct a novel hedging strategy for an unconstrained betting game.

References

[1]
Jacob Abernethy and Manfred K. Warmuth. Repeated games against budgeted adversaries. In NIPS, 2010.
[2]
Jacob Abernethy, Peter L. Bartlett, Alexander Rakhlin, and Ambuj Tewari. Optimal strategies and minimax lower bounds for online convex games. In COLT, 2008a.
[3]
Jacob Abernethy, Manfred K Warmuth, and Joel Yellin. Optimal strategies from random walks. In Proceedings of The 21st Annual Conference on Learning Theory, pages 437-146. Citeseer, 2008b.
[4]
Jacob Abernethy, Alekh Agarwal, Peter Bartlett, and Alexander Rakhlin. A stochastic view of optimal regret through minimax duality. In COLT, 2009.
[5]
Jacob Abernethy, Rafael M. Frongillo, and Andre Wibisono. Minimax option pricing meets black-scholes in the limit. In STOC, 2012.
[6]
Amit Agarwal, Elad Hazan, Satyen Kale, and Robert E. Schapire. Algorithms for portfolio management based on the Newton method. In ICML, 2006.
[7]
Nicolò Cesa-Bianchi and Gabor Lugosi. Prediction, Learning, and Games. Cambridge University Press, 2006.
[8]
A. de Moivre. The Doctrine of Chances: or, A Method of Calculating the Probabilities of Events in Play. 1718.
[9]
Ofer Dekel, Ambuj Tewari, and Raman Arora. Online bandit learning against an adaptive adversary: from regret to policy regret. In ICML, 2012.
[10]
Peter DeMarzo, Ilan Kremer, and Yishay Mansour. Online trading algorithms and robust option pricing. In Proceedings of the thirty-eighth annual ACM symposium on Theory of computing, pages 477-186. ACM, 2006.
[11]
Persi Diaconis and Sandy Zabell. Closed form summation for classical distributions: Variations on a theme of de Moivre. Statistical Science, 6(3), 1991.
[12]
Elad Hazan and Satyen Kale. On stochastic and worst-case models for investing. In NIPS. 2009.
[13]
J. L. Kelly Jr. A new interpretation of information rate. Bell System Technical Journal, 1956.
[14]
Wouter Koolen, Dmitry Adamskiy, and Manfred Warmuth. Putting bayes to sleep. In NIPS. 2012.
[15]
N. Merhav, E. Ordentlich, G. Seroussi, and M. J. Weinberger. On sequential strategies for loss functions with memory. IEEE Trans. Inf. Theor., 48(7), September 2006.
[16]
Alexander Rakhlin, Ohad Shamir, and Karthik Sridharan. Relax and randomize: From value to algorithms. In NIPS, 2012.
[17]
Ralph T. Rockafellar. Convex Analysis (Princeton Landmarks in Mathematics and Physics). Princeton University Press, 1997.
[18]
Shai Shalev-Shwartz. Online learning and online convex optimization. Foundations and Trends in Machine Learning, 2012.
[19]
Shai Shalev-Shwartz, Yoram Singer, Nathan Srebro, and Andrew Cotter. Pegasos: Primal estimated sub-gradient solver for svm. Mathematical Programming, 127(1):3-30, 2011.
[20]
Gilles Stoltz. Contributions to the sequential prediction of arbitrary sequences: applications to the theory of repeated games and empirical studies of the performance of the aggregation of experts. Habilitation à dinger des recherches, Université Paris-Sud, 2011.
[21]
Matthew Streeter and H. Brendan McMahan. No-regret algorithms for unconstrained online convex optimization. In NIPS, 2012.
[22]
Volodya Vovk. Competitive on-line statistics. International Statistical Review, 69, 2001.
[23]
Martin Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In ICML, 2003.

Cited By

View all
  • (2023)MechanicProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668196(47828-47848)Online publication date: 10-Dec-2023
  • (2018)Acceleration through optimistic no-regret dynamicsProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3327144.3327298(3828-3838)Online publication date: 3-Dec-2018
  • (2017)A survey of algorithms and analysis for adaptive online learningThe Journal of Machine Learning Research10.5555/3122009.317683418:1(3117-3166)Online publication date: 1-Jan-2017
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
NIPS'13: Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2
December 2013
3236 pages

Publisher

Curran Associates Inc.

Red Hook, NY, United States

Publication History

Published: 05 December 2013

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 04 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2023)MechanicProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668196(47828-47848)Online publication date: 10-Dec-2023
  • (2018)Acceleration through optimistic no-regret dynamicsProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3327144.3327298(3828-3838)Online publication date: 3-Dec-2018
  • (2017)A survey of algorithms and analysis for adaptive online learningThe Journal of Machine Learning Research10.5555/3122009.317683418:1(3117-3166)Online publication date: 1-Jan-2017
  • (2016)Online convex optimization with unconstrained domains and lossesProceedings of the 30th International Conference on Neural Information Processing Systems10.5555/3157096.3157180(748-756)Online publication date: 5-Dec-2016
  • (2016)Coin betting and parameter-free online learningProceedings of the 30th International Conference on Neural Information Processing Systems10.5555/3157096.3157161(577-585)Online publication date: 5-Dec-2016

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media