research-article

Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting

Authors:

Niranjan Srinivas,

Andreas Krause,

Sham M. Kakade,

Matthias W. SeegerAuthors Info & Claims

IEEE Transactions on Information Theory, Volume 58, Issue 5

Pages 3250 - 3265

https://doi.org/10.1109/TIT.2011.2182033

Published: 01 May 2012 Publication History

Abstract

Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We formalize this task as a multiarmed bandit problem, where the payoff function is either sampled from a Gaussian process (GP) or has low norm in a reproducing kernel Hilbert space. We resolve the important open problem of deriving regret bounds for this setting, which imply novel convergence rates for GP optimization. We analyze an intuitive Gaussian process upper confidence bound (${\ssr GP}\hbox{-}{\ssr UCB})$ algorithm, and bound its cumulative regret in terms of maximal information gain, establishing a novel connection between GP optimization and experimental design. Moreover, by bounding the latter in terms of operator spectra, we obtain explicit sublinear regret bounds for many commonly used covariance functions. In some important cases, our bounds have surprisingly weak dependence on the dimensionality. In our experiments on real sensor data, ${\ssr GP}\hbox{-}{\ssr UCB}$ compares favorably with other heuristical GP optimization approaches.

Cited By

View all

Yang ZDai XDubey AHirche SHattab GDastani MSichman JAlechina NDignum V(2024)Whom to Trust? Elective Learning for Distributed Gaussian Process RegressionProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3663066(2020-2028)Online publication date: 6-May-2024
https://dl.acm.org/doi/10.5555/3635637.3663066
Chen XZhang YXie GZhang J(2024)A Uniform Error Bound for Stochastic Kriging: Properties and Implications on Simulation Experimental DesignACM Transactions on Modeling and Computer Simulation10.1145/368205935:1(1-43)Online publication date: 25-Nov-2024
https://dl.acm.org/doi/10.1145/3682059
Templier PGrillotti LRachelson EWilson DCully ALi XHandl J(2024)Quality with Just Enough Diversity in Evolutionary Policy SearchProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654047(105-113)Online publication date: 14-Jul-2024
https://dl.acm.org/doi/10.1145/3638529.3654047
Show More Cited By

Recommendations

Gaussian process optimization in the bandit setting: no regret and experimental design
ICML'10: Proceedings of the 27th International Conference on International Conference on Machine Learning

Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We formalize this task as a multi-armed bandit problem, where the payoff function is either sampled from a Gaussian process (GP) or has low RKHS norm. We ...
Improved regret bounds for bandit combinatorial optimization
NIPS'19: Proceedings of the 33rd International Conference on Neural Information Processing Systems

Bandit combinatorial optimization is a bandit framework in which a player chooses an action within a given finite set A ⊆ {0, 1}^d and incurs a loss that is the inner product of the chosen action and an unobservable loss vector in ℝ^d in each round. In this ...
Misspecified Gaussian process bandit optimization
NIPS '21: Proceedings of the 35th International Conference on Neural Information Processing Systems

We consider the problem of optimizing a black-box function based on noisy bandit feedback. Kernelized bandit algorithms have shown strong empirical and theoretical performance for this problem. They heavily rely on the assumption that the model is well-...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

IEEE Transactions on Information Theory Volume 58, Issue 5

May 2012

768 pages

ISSN:0018-9448

Issue’s Table of Contents

Publisher

IEEE Press

Publication History

Published: 01 May 2012

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

163
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 28 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Yang ZDai XDubey AHirche SHattab GDastani MSichman JAlechina NDignum V(2024)Whom to Trust? Elective Learning for Distributed Gaussian Process RegressionProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3663066(2020-2028)Online publication date: 6-May-2024
https://dl.acm.org/doi/10.5555/3635637.3663066
Chen XZhang YXie GZhang J(2024)A Uniform Error Bound for Stochastic Kriging: Properties and Implications on Simulation Experimental DesignACM Transactions on Modeling and Computer Simulation10.1145/368205935:1(1-43)Online publication date: 25-Nov-2024
https://dl.acm.org/doi/10.1145/3682059
Templier PGrillotti LRachelson EWilson DCully ALi XHandl J(2024)Quality with Just Enough Diversity in Evolutionary Policy SearchProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654047(105-113)Online publication date: 14-Jul-2024
https://dl.acm.org/doi/10.1145/3638529.3654047
Saha SSandha SAggarwal MWang BHan LBriseno JSrivastava M(2024)TinyNS: Platform-aware Neurosymbolic Auto Tiny Machine LearningACM Transactions on Embedded Computing Systems10.1145/360317123:3(1-48)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3603171
Evangelisti GHirche S(2024)Data-Driven Momentum Observers With Physically Consistent Gaussian ProcessesIEEE Transactions on Robotics10.1109/TRO.2024.336681840(1938-1951)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TRO.2024.3366818
Xu WJiang YMaddalena EJones C(2024)Lower Bounds on the Noiseless Worst-Case Complexity of Efficient Global OptimizationJournal of Optimization Theory and Applications10.1007/s10957-024-02399-1201:2(583-608)Online publication date: 1-May-2024
https://dl.acm.org/doi/10.1007/s10957-024-02399-1
Nakazato SYang BShimokawa T(2024)Analyzing Human Search Behavior When Subjective Returns are UnobservableComputational Economics10.1007/s10614-023-10388-163:5(1921-1947)Online publication date: 1-May-2024
https://dl.acm.org/doi/10.1007/s10614-023-10388-1
Ohno H(2024)Empirical study of evolutionary computation-based multi-objective Bayesian optimization for materials discoverySoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-023-09058-z28:15-16(8807-8834)Online publication date: 1-Aug-2024
https://dl.acm.org/doi/10.1007/s00500-023-09058-z
Huber FBürkner PGöddeke DSchulte M(2024)Knowledge-based modeling of simulation behavior for Bayesian optimizationComputational Mechanics10.1007/s00466-023-02427-374:1(151-168)Online publication date: 1-Jul-2024
https://dl.acm.org/doi/10.1007/s00466-023-02427-3
Robledo FAyesta UAvrachenkov K(2024)Deep Reinforcement Learning for Weakly Coupled MDP’s with Continuous ActionsAnalytical and Stochastic Modelling Techniques and Applications10.1007/978-3-031-70753-7_5(67-80)Online publication date: 14-Jun-2024
https://dl.acm.org/doi/10.1007/978-3-031-70753-7_5
Show More Cited By

View Options

View options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Abstract

Cited By

Recommendations

Gaussian process optimization in the bandit setting: no regret and experimental design

Improved regret bounds for bandit combinatorial optimization

Misspecified Gaussian process bandit optimization

Comments

Information

Published In

Publisher

Publication History

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations