Nothing Special   »   [go: up one dir, main page]

Skip to main content

Pre-trained Financial Model for Price Movement Forecasting

  • Conference paper
  • First Online:
Neural Information Processing (ICONIP 2023)

Abstract

We propose the Pre-trained Financial Model (PFM) for price movement forecasting, which is critical in the automated trading systems in the Stock and Futures markets. Inspired by recent successes of pre-trained large language models in tackling NLP tasks, our PFM adopts a pretraining-and-finetuning strategy for obtaining capable models that are adapted to various downstream price-forecasting tasks. During the pre-training stage, we train a sequence prediction backbone with multi-task learning by adopting both a supervised learning objective and an unsupervised regularization target. Our approach differs from the common masked language modeling (MLM) used in NLP studies. We develop a per-step target variable generation strategy for eliciting future predictions from the transformer encoder-decoder architecture. We verify our pre-trained model on various practical downstream forecasting tasks, including lagged movement regression, movement direction classification, and selective trading with best performing stocks. Specifically, during the fine-tuning stage, we retain the pre-trained encoder and replace the decoder with specific downstream task decoders. We then perform supervised task-specific target generation learning as the fine-tuning process. Through extensive numerical studies and analysis, we demonstrate that our fine-tuned financial model can achieve a 5–15% improvement over downstream regression and classification tasks and over 40% in selective trading task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Blondel, M., Teboul, O., Berthet, Q., Djolonga, J.: Fast differentiable sorting and ranking. In: ICML (2020)

    Google Scholar 

  2. Box, G.E.P., Jenkins, G.M.: Some recent advances in forecasting and control. J. Roy. Statist. Soc. (1968)

    Google Scholar 

  3. Brown, T.B., et al.: Language models are few-shot learners. arXiv preprint arXiv:2005.14165 (2020)

  4. Cartea, A., Donnelly, R., Jaimungal, S.: Enhancing trading strategies with order book signals. Appl. Math. Financ. 25(1), 1–35 (2018)

    Article  MathSciNet  Google Scholar 

  5. Castoe, M.: Predicting stock market price direction with uncertainty using quantile regression forest (2020)

    Google Scholar 

  6. Cuturi, M.: Sinkhorn distances: lightspeed computation of optimal transport. In: NeurIPS, pp. 2292–2300 (2013)

    Google Scholar 

  7. Cuturi, M., Teboul, O., Vert, J.P.: Differentiable ranking and sorting using optimal transport. In: NeurIPS (2019)

    Google Scholar 

  8. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)

  9. Fan, C., Lu, H., Huang, A.: A novel differentiable rank learning method towards stock movement quantile forecasting. In: European Conference on Artificial Intelligence (2023)

    Google Scholar 

  10. Fan, C., et al.: Multi-horizon time series forecasting with temporal attention learning. In: SIGKDD (2019)

    Google Scholar 

  11. Feng, F., Chen, H., He, X., Ding, J., Sun, M., Chua, T.S.: Enhancing stock movement prediction with adversarial training. In: IJCAI (2019)

    Google Scholar 

  12. Gould, M.D., Porter, M.A., Williams, S., McDonald, M., Fenn, D.J., Howison, S.D.: Limit order books. Quant. Financ. 13(11), 1709–1742 (2013)

    Article  MathSciNet  Google Scholar 

  13. Holt, C.C.: Forecasting seasonals and trends by exponentially weighted moving averages. Int. J. Forecast. (2004)

    Google Scholar 

  14. Ke, G., et al.: Lightgbm: a highly efficient gradient boosting decision tree. In: NeurIPS (2017)

    Google Scholar 

  15. Kearns, M., Kulesza, A., Nevmyvaka, Y.: Empirical limitations on high-frequency trading profitability. J. Trading 5(4), 50–62 (2010)

    Article  Google Scholar 

  16. Koenker, R., Bassett, Jr., G.: Regression quantiles. Econometrica: J. Economet. Soc. 33–50 (1978)

    Google Scholar 

  17. Lim, B., Arık, S.Ö., Loeff, N., Pfister, T.: Temporal fusion transformers for interpretable multi-horizon time series forecasting. Int. J. Forecast. (2021)

    Google Scholar 

  18. Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: ICCV (2017)

    Google Scholar 

  19. Vaswani, A., et al.: Attention is all you need. In: NIPS (2017)

    Google Scholar 

  20. Wen, R., Torkkola, K., Narayanaswamy, B., Madeka, D.: A multi-horizon quantile recurrent forecaster. arXiv preprint arXiv:1711.11053 (2017)

  21. Winters, P.R.: Forecasting sales by exponentially weighted moving averages. Manag. Sci. (1960)

    Google Scholar 

  22. Xu, Y., Cohen, S.B.: Stock movement prediction from tweets and historical prices. In: ACL (2018)

    Google Scholar 

  23. Yoo, J., Soun, Y., Park, Y., Kang, U.: Accurate multivariate stock movement prediction via data-axis transformer with multi-level contexts. In: SIGKDD (2021)

    Google Scholar 

  24. Zhang, L., Aggarwal, C., Qi, G.J.: Stock price prediction via discovering multi-frequency trading patterns. In: SIGKDD (2017)

    Google Scholar 

  25. Zhang, Z., Zohren, S.: Multi-horizon forecasting for limit order books: novel deep learning approaches and hardware acceleration using intelligent processing units. arXiv preprint arXiv:2105.10430 (2021)

  26. Zhang, Z., Zohren, S., Roberts, S.: Deeplob: deep convolutional neural networks for limit order books. In: IEEE Transactions on Signal Processing (2018)

    Google Scholar 

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation of China, Project 62106156, and Starting Fund of South China Normal University.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chenyou Fan .

Editor information

Editors and Affiliations

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (zip 4 KB)

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Fan, C., Pang, T., Huang, A. (2024). Pre-trained Financial Model for Price Movement Forecasting. In: Luo, B., Cheng, L., Wu, ZG., Li, H., Li, C. (eds) Neural Information Processing. ICONIP 2023. Communications in Computer and Information Science, vol 1969. Springer, Singapore. https://doi.org/10.1007/978-981-99-8184-7_17

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-8184-7_17

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-8183-0

  • Online ISBN: 978-981-99-8184-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics