Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting

Published: 01 May 2012 Publication History

Abstract

Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We formalize this task as a multiarmed bandit problem, where the payoff function is either sampled from a Gaussian process (GP) or has low norm in a reproducing kernel Hilbert space. We resolve the important open problem of deriving regret bounds for this setting, which imply novel convergence rates for GP optimization. We analyze an intuitive Gaussian process upper confidence bound (${\ssr GP}\hbox{-}{\ssr UCB})$ algorithm, and bound its cumulative regret in terms of maximal information gain, establishing a novel connection between GP optimization and experimental design. Moreover, by bounding the latter in terms of operator spectra, we obtain explicit sublinear regret bounds for many commonly used covariance functions. In some important cases, our bounds have surprisingly weak dependence on the dimensionality. In our experiments on real sensor data, ${\ssr GP}\hbox{-}{\ssr UCB}$ compares favorably with other heuristical GP optimization approaches.

Cited By

View all
  • (2024)Whom to Trust? Elective Learning for Distributed Gaussian Process RegressionProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3663066(2020-2028)Online publication date: 6-May-2024
  • (2024)A Uniform Error Bound for Stochastic Kriging: Properties and Implications on Simulation Experimental DesignACM Transactions on Modeling and Computer Simulation10.1145/368205935:1(1-43)Online publication date: 25-Nov-2024
  • (2024)Quality with Just Enough Diversity in Evolutionary Policy SearchProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654047(105-113)Online publication date: 14-Jul-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Information Theory
IEEE Transactions on Information Theory  Volume 58, Issue 5
May 2012
768 pages

Publisher

IEEE Press

Publication History

Published: 01 May 2012

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Whom to Trust? Elective Learning for Distributed Gaussian Process RegressionProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3663066(2020-2028)Online publication date: 6-May-2024
  • (2024)A Uniform Error Bound for Stochastic Kriging: Properties and Implications on Simulation Experimental DesignACM Transactions on Modeling and Computer Simulation10.1145/368205935:1(1-43)Online publication date: 25-Nov-2024
  • (2024)Quality with Just Enough Diversity in Evolutionary Policy SearchProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654047(105-113)Online publication date: 14-Jul-2024
  • (2024)TinyNS: Platform-aware Neurosymbolic Auto Tiny Machine LearningACM Transactions on Embedded Computing Systems10.1145/360317123:3(1-48)Online publication date: 11-May-2024
  • (2024)Data-Driven Momentum Observers With Physically Consistent Gaussian ProcessesIEEE Transactions on Robotics10.1109/TRO.2024.336681840(1938-1951)Online publication date: 1-Jan-2024
  • (2024)Lower Bounds on the Noiseless Worst-Case Complexity of Efficient Global OptimizationJournal of Optimization Theory and Applications10.1007/s10957-024-02399-1201:2(583-608)Online publication date: 1-May-2024
  • (2024)Analyzing Human Search Behavior When Subjective Returns are UnobservableComputational Economics10.1007/s10614-023-10388-163:5(1921-1947)Online publication date: 1-May-2024
  • (2024)Empirical study of evolutionary computation-based multi-objective Bayesian optimization for materials discoverySoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-023-09058-z28:15-16(8807-8834)Online publication date: 1-Aug-2024
  • (2024)Knowledge-based modeling of simulation behavior for Bayesian optimizationComputational Mechanics10.1007/s00466-023-02427-374:1(151-168)Online publication date: 1-Jul-2024
  • (2024)Deep Reinforcement Learning for Weakly Coupled MDP’s with Continuous ActionsAnalytical and Stochastic Modelling Techniques and Applications10.1007/978-3-031-70753-7_5(67-80)Online publication date: 14-Jun-2024
  • Show More Cited By

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media