Pre-trained Financial Model for Price Movement Forecasting

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1969))

Included in the following conference series:

International Conference on Neural Information Processing

808 Accesses
1 Citations

Abstract

We propose the Pre-trained Financial Model (PFM) for price movement forecasting, which is critical in the automated trading systems in the Stock and Futures markets. Inspired by recent successes of pre-trained large language models in tackling NLP tasks, our PFM adopts a pretraining-and-finetuning strategy for obtaining capable models that are adapted to various downstream price-forecasting tasks. During the pre-training stage, we train a sequence prediction backbone with multi-task learning by adopting both a supervised learning objective and an unsupervised regularization target. Our approach differs from the common masked language modeling (MLM) used in NLP studies. We develop a per-step target variable generation strategy for eliciting future predictions from the transformer encoder-decoder architecture. We verify our pre-trained model on various practical downstream forecasting tasks, including lagged movement regression, movement direction classification, and selective trading with best performing stocks. Specifically, during the fine-tuning stage, we retain the pre-trained encoder and replace the decoder with specific downstream task decoders. We then perform supervised task-specific target generation learning as the fine-tuning process. Through extensive numerical studies and analysis, we demonstrate that our fine-tuned financial model can achieve a 5–15% improvement over downstream regression and classification tasks and over 40% in selective trading task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Stock price nowcasting and forecasting with deep learning

Article 13 November 2024

Forecasting Chinese Overnight Stock Index Movement Using Large Language Models with Market Summary

Application and Modeling of LLM in Quantitative Trading Using Deep Learning Strategies

References

Blondel, M., Teboul, O., Berthet, Q., Djolonga, J.: Fast differentiable sorting and ranking. In: ICML (2020)
Google Scholar
Box, G.E.P., Jenkins, G.M.: Some recent advances in forecasting and control. J. Roy. Statist. Soc. (1968)
Google Scholar
Brown, T.B., et al.: Language models are few-shot learners. arXiv preprint arXiv:2005.14165 (2020)
Cartea, A., Donnelly, R., Jaimungal, S.: Enhancing trading strategies with order book signals. Appl. Math. Financ. 25(1), 1–35 (2018)
Article MathSciNet Google Scholar
Castoe, M.: Predicting stock market price direction with uncertainty using quantile regression forest (2020)
Google Scholar
Cuturi, M.: Sinkhorn distances: lightspeed computation of optimal transport. In: NeurIPS, pp. 2292–2300 (2013)
Google Scholar
Cuturi, M., Teboul, O., Vert, J.P.: Differentiable ranking and sorting using optimal transport. In: NeurIPS (2019)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Fan, C., Lu, H., Huang, A.: A novel differentiable rank learning method towards stock movement quantile forecasting. In: European Conference on Artificial Intelligence (2023)
Google Scholar
Fan, C., et al.: Multi-horizon time series forecasting with temporal attention learning. In: SIGKDD (2019)
Google Scholar
Feng, F., Chen, H., He, X., Ding, J., Sun, M., Chua, T.S.: Enhancing stock movement prediction with adversarial training. In: IJCAI (2019)
Google Scholar
Gould, M.D., Porter, M.A., Williams, S., McDonald, M., Fenn, D.J., Howison, S.D.: Limit order books. Quant. Financ. 13(11), 1709–1742 (2013)
Article MathSciNet Google Scholar
Holt, C.C.: Forecasting seasonals and trends by exponentially weighted moving averages. Int. J. Forecast. (2004)
Google Scholar
Ke, G., et al.: Lightgbm: a highly efficient gradient boosting decision tree. In: NeurIPS (2017)
Google Scholar
Kearns, M., Kulesza, A., Nevmyvaka, Y.: Empirical limitations on high-frequency trading profitability. J. Trading 5(4), 50–62 (2010)
Article Google Scholar
Koenker, R., Bassett, Jr., G.: Regression quantiles. Econometrica: J. Economet. Soc. 33–50 (1978)
Google Scholar
Lim, B., Arık, S.Ö., Loeff, N., Pfister, T.: Temporal fusion transformers for interpretable multi-horizon time series forecasting. Int. J. Forecast. (2021)
Google Scholar
Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: ICCV (2017)
Google Scholar
Vaswani, A., et al.: Attention is all you need. In: NIPS (2017)
Google Scholar
Wen, R., Torkkola, K., Narayanaswamy, B., Madeka, D.: A multi-horizon quantile recurrent forecaster. arXiv preprint arXiv:1711.11053 (2017)
Winters, P.R.: Forecasting sales by exponentially weighted moving averages. Manag. Sci. (1960)
Google Scholar
Xu, Y., Cohen, S.B.: Stock movement prediction from tweets and historical prices. In: ACL (2018)
Google Scholar
Yoo, J., Soun, Y., Park, Y., Kang, U.: Accurate multivariate stock movement prediction via data-axis transformer with multi-level contexts. In: SIGKDD (2021)
Google Scholar
Zhang, L., Aggarwal, C., Qi, G.J.: Stock price prediction via discovering multi-frequency trading patterns. In: SIGKDD (2017)
Google Scholar
Zhang, Z., Zohren, S.: Multi-horizon forecasting for limit order books: novel deep learning approaches and hardware acceleration using intelligent processing units. arXiv preprint arXiv:2105.10430 (2021)
Zhang, Z., Zohren, S., Roberts, S.: Deeplob: deep convolutional neural networks for limit order books. In: IEEE Transactions on Signal Processing (2018)
Google Scholar

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation of China, Project 62106156, and Starting Fund of South China Normal University.

Author information

Authors and Affiliations

South China Normal University, Guangzhou, China
Chenyou Fan & Tianqi Pang
Hangzhou Higgs Asset Management Co., Ltd., Hangzhou, China
Aimin Huang

Authors

Chenyou Fan
View author publications
You can also search for this author in PubMed Google Scholar
Tianqi Pang
View author publications
You can also search for this author in PubMed Google Scholar
Aimin Huang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chenyou Fan .

Editor information

Editors and Affiliations

School of Automation, Central South University, Changsha, China
Biao Luo
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Long Cheng
Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou, China
Zheng-Guang Wu
School of Automation, Guangdong University of Technology, Guangzhou, China
Hongyi Li
School of Electrical Engineering and Telecommunications, UNSW Sydney, Sydney, NSW, Australia
Chaojie Li

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (zip 4 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fan, C., Pang, T., Huang, A. (2024). Pre-trained Financial Model for Price Movement Forecasting. In: Luo, B., Cheng, L., Wu, ZG., Li, H., Li, C. (eds) Neural Information Processing. ICONIP 2023. Communications in Computer and Information Science, vol 1969. Springer, Singapore. https://doi.org/10.1007/978-981-99-8184-7_17

Download citation

DOI: https://doi.org/10.1007/978-981-99-8184-7_17
Published: 26 November 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8183-0
Online ISBN: 978-981-99-8184-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Pre-trained Financial Model for Price Movement Forecasting

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Stock price nowcasting and forecasting with deep learning

Forecasting Chinese Overnight Stock Index Movement Using Large Language Models with Market Summary

Application and Modeling of LLM in Quantitative Trading Using Deep Learning Strategies

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (zip 4 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Pre-trained Financial Model for Price Movement Forecasting

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Stock price nowcasting and forecasting with deep learning

Forecasting Chinese Overnight Stock Index Movement Using Large Language Models with Market Summary

Application and Modeling of LLM in Quantitative Trading Using Deep Learning Strategies

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (zip 4 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation