Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3533271.3561731acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicaifConference Proceedingsconference-collections
research-article

Deep Hedging: Continuous Reinforcement Learning for Hedging of General Portfolios across Multiple Risk Aversions

Published: 26 October 2022 Publication History

Abstract

We present a method for finding optimal hedging policies for arbitrary initial portfolios and market states. We develop a novel actor-critic algorithm for solving general risk-averse stochastic control problems and use it to learn hedging strategies across multiple risk aversion levels simultaneously. We demonstrate the effectiveness of the approach with a numerical example in a stochastic volatility environment.

References

[1]
Leif BG Andersen, Peter Jäckel, and Christian Kahl. 2010. Simulation of square-root processes. Encyclopedia of Quantitative Finance(2010), 1642–1649.
[2]
Sebastian Becker, Patrick Cheridito, and Arnulf Jentzen. 2020. Pricing and hedging American-style options with deep learning. Journal of Risk and Financial Management 13, 7 (2020), 158.
[3]
Lorenzo Bisi. 2022. Algorithms for risk-averse reinforcement learning. (2022).
[4]
Mark Broadie and Özgür Kaya. 2006. Exact simulation of stochastic volatility and other affine jump diffusion processes. Operations research 54, 2 (2006), 217–231.
[5]
H. Buehler, L. Gonon, J. Teichmann, and B. Wood. 2019. Deep hedging. Quantitative Finance(2019), 1–21. https://doi.org/10.1080/14697688.2019.1571683 arXiv:https://doi.org/10.1080/14697688.2019.1571683
[6]
Hans Buehler, Phillip Murray, Mikko S. Pakkanen, and Ben Wood. March 2022. Deep hedging: Learning to remove the drift. Risk (March 2022).
[7]
Yinlam Chow and Mohammad Ghavamzadeh. 2014. Algorithms for CVaR optimization in MDPs. Advances in neural information processing systems 27 (2014).
[8]
Yinlam Chow, Aviv Tamar, Shie Mannor, and Marco Pavone. 2015. Risk-sensitive and robust decision-making: a cvar optimization approach. Advances in neural information processing systems 28 (2015).
[9]
Will Dabney, Mark Rowland, Marc Bellemare, and Rémi Munos. 2018. Distributional reinforcement learning with quantile regression. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32.
[10]
Oscar Dowson, David P Morton, and Bernardo K Pagnoncelli. 2020. Multistage stochastic programs with the entropic risk measure. Preprint (2020).
[11]
Jiayi Du, Muyang Jin, Petter N Kolm, Gordon Ritter, Yixuan Wang, and Bofei Zhang. 2020. Deep reinforcement learning for option replication and hedging. The Journal of Financial Data Science 2, 4 (2020), 44–57.
[12]
Hans Föllmer and Alexander Schied. 2010. Convex and coherent risk measures. Encyclopedia of Quantitative Finance(2010), 355–363.
[13]
Javier Garcıa and Fernando Fernández. 2015. A comprehensive survey on safe reinforcement learning. Journal of Machine Learning Research 16, 1 (2015), 1437–1480.
[14]
Michael Gimelfarb, André Barreto, Scott Sanner, and Chi-Guhn Lee. 2021. Risk-Aware Transfer in Reinforcement Learning using Successor Features. Advances in Neural Information Processing Systems 34 (2021), 17298–17310.
[15]
Ben Hambly, Renyuan Xu, and Huining Yang. 2021. Recent advances in reinforcement learning in finance. arXiv preprint arXiv:2112.04553(2021).
[16]
Steven L Heston. 1993. A closed-form solution for options with stochastic volatility with applications to bond and currency options. The review of financial studies 6, 2 (1993), 327–343.
[17]
Blanka Horvath, Josef Teichmann, and Žan Žurič. 2021. Deep hedging under rough volatility. Risks 9, 7 (2021), 138.
[18]
Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International conference on machine learning. PMLR, 448–456.
[19]
Sebastian Jaimungal. 2022. Reinforcement learning and stochastic optimisation. Finance and Stochastics 26, 1 (2022), 103–129.
[20]
Petter N Kolm and Gordon Ritter. 2019. Dynamic replication and hedging: A reinforcement learning approach. The Journal of Financial Data Science 1, 1 (2019), 159–171.
[21]
Michael Kupper and Walter Schachermayer. 2009. Representation results for law invariant time consistent functions. Mathematics and Financial Economics 2, 3 (2009), 189–210.
[22]
Timothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971(2015).
[23]
Richard S. Sutton and Andrew G. Barto. 2018. Reinforcement Learning: An Introduction(2 ed.). MIT Press, Cambridge.
[24]
Aviv Tamar, Yinlam Chow, Mohammad Ghavamzadeh, and Shie Mannor. 2016. Sequential decision making with coherent risk. IEEE transactions on automatic control 62, 7 (2016), 3323–3338.
[25]
Edoardo Vittori, Michele Trapletti, and Marcello Restelli. 2020. Option hedging with risk averse reinforcement learning. In Proceedings of the First ACM International Conference on AI in Finance. 1–8.
[26]
Magnus Wiese, Ben Wood, Alexandre Pachoud, Ralf Korn, Hans Buehler, Phillip Murray, and Lianjun Bai. 2021. Multi-asset spot and option market simulation. arXiv preprint arXiv:2112.06823(2021).

Cited By

View all
  • (2024)Experimental Analysis of Deep Hedging Using Artificial Market Simulations for Underlying Asset SimulatorsSSRN Electronic Journal10.2139/ssrn.4794316Online publication date: 2024
  • (2024)Agricultural commodities market reaction to COVID-19Research in International Business and Finance10.1016/j.ribaf.2024.10228769(102287)Online publication date: Apr-2024
  • (2023)Deep Reinforcement Learning for Dynamic Stock Option Hedging: A ReviewMathematics10.3390/math1124494311:24(4943)Online publication date: 13-Dec-2023
  • Show More Cited By

Index Terms

  1. Deep Hedging: Continuous Reinforcement Learning for Hedging of General Portfolios across Multiple Risk Aversions

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICAIF '22: Proceedings of the Third ACM International Conference on AI in Finance
    November 2022
    527 pages
    ISBN:9781450393768
    DOI:10.1145/3533271
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 26 October 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Deep Hedging
    2. Reinforcement Learning
    3. Risk Averse
    4. Transaction Costs

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICAIF '22
    Sponsor:

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)80
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 22 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Experimental Analysis of Deep Hedging Using Artificial Market Simulations for Underlying Asset SimulatorsSSRN Electronic Journal10.2139/ssrn.4794316Online publication date: 2024
    • (2024)Agricultural commodities market reaction to COVID-19Research in International Business and Finance10.1016/j.ribaf.2024.10228769(102287)Online publication date: Apr-2024
    • (2023)Deep Reinforcement Learning for Dynamic Stock Option Hedging: A ReviewMathematics10.3390/math1124494311:24(4943)Online publication date: 13-Dec-2023
    • (2023)Deep treasury management for banksFrontiers in Artificial Intelligence10.3389/frai.2023.11202976Online publication date: 22-Mar-2023
    • (2023)Quantum Deep HedgingQuantum10.22331/q-2023-11-29-11917(1191)Online publication date: 29-Nov-2023
    • (2023)Applying Reinforcement Learning to Option Pricing and HedgingSSRN Electronic Journal10.2139/ssrn.4546371Online publication date: 2023
    • (2023)Adversarial Deep Hedging: Learning to Hedge without Price Process ModelingSSRN Electronic Journal10.2139/ssrn.4520273Online publication date: 2023
    • (2023)Reinforcement Learning for Quantitative TradingACM Transactions on Intelligent Systems and Technology10.1145/358256014:3(1-29)Online publication date: 24-Mar-2023

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media