计算机科学 ›› 2020, Vol. 47 ›› Issue (5): 79-83.doi: 10.11896/jsjkx.190400145
赵澄, 叶耀威, 姚明海
ZHAO Cheng, YE Yao-wei, YAO Ming-hai
摘要: 股票市场的情绪可以在一定程度上反映投资者的行为并影响其投资决策。市场新闻作为一种非结构性数据,能够体现并引导市场的大环境情绪,与股票价格一同成为至关重要的市场参考数据,能够为投资者的投资决策提供有效帮助。文中提出了一种可以准确、快速地建立针对海量新闻数据的多维情绪特征向量化方法,利用支持向量机(Support Victor Machine,SVM)模型来预测金融新闻对股票市场的影响,并通过bootstrap来减轻过拟合问题。在沪深股指上进行实验的结果表明,相比于传统模型,所提方法能够将预测准确度提高约8%,并在3个月的回测实验中获得了6.52%的超额收益,证明了其有效性。
中图分类号:
[1]OLIVEIRA N,CORTEZ P,AREAL N.Stock market sentiment lexicon acquisition using microblogging data and statistical measures[J].Decision Support Systems,2016,85:62-73. [2]LONG W,TANG Y,TIAN Y.Investor sentiment identification based on the universum SVM[J].Neural Computing and Applications,2018,30(2):661-670. [3]PERIKOS I,HATZILYGEROUDIS I.Recognizing emotions in text using ensemble of classifiers[J].Engineering Applications of Artificial Intelligence,2016,51:191-201. [4]WU B,ZHOU X,JIN Q,et al.Analyzing Social Roles Based on a Hierarchical Model and Data Mining for Collective Decision-Making Support[J].IEEE Systems Journal,2015:1-10. [5]JIANG F,LEE J,MARTIN X,et al.Manager sentiment andstock returns[J].Journal of Financial Economics,2019,132(1):126-149. [6]MIWA K.Investor sentiment,stock mispricing,and long-termgrowth expectations[J].Research in International Business and Finance,2016,36:414-423. [7]BOLLEN J,MAO H,ZENG X.Twitter mood predicts the stock market[J].Journal of Computational Science,2011,2(1):1-8. [8]SUL H K,DENNIS A R,YUAN L.Trading on twitter:Using social media sentiment to predict stock returns[J].DecisionScie-nces,2017,48(3):454-488. [9]OLIVEIRA N,CORTEZ P,AREAL N.On the predictability of stock market behavior using stocktwits sentiment and posting volume[C]//Portuguese Conference on Artificial Intelligence.Berlin,Heidelberg:Springer,2013:355-365. [10]MAKREHCHI M,SHAH S,LIAO W.Stock prediction usingevent-based sentiment analysis[C]//Proceedings of the 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT)-Vo-lume 01.IEEE Computer Society,2013:337-342. [11]CHECKLEY M S,HIGÓN D A,ALLES H.The hasty wisdom of the mob:How market sentiment predicts stock market beha-vior[J].Expert Systems with Applications,2017,77:256-263. [12]NIKKINEN J,SAHLSTRÖM P.Impact of Scheduled US Macroeconomic News on Stock Market Uncertainty:A Multinational Perspecive[J].Multinational Finance Journal,2011,5(2):129-148. [13]REN R,WU D D,LIU T.Forecasting stock market movement direction using sentiment analysis and support vector machine[J].IEEE Systems Journal,2018,13(1):760-770. [14]CHEN Y,HAO Y.A feature weighted support vector machine and K-nearest neighbor algorithm for stock market indices prediction[J].Expert Systems with Applications,2017,80:340-355. [15]HUANG W,NAKAMORI Y,WANG S Y.Forecasting stockmarket movement direction with support vector machine[J].Computers & Operations Research,2005,32(10):2513-2522. [16]CHEN W,ZHANG Y,YEO C K,et al.Stock market prediction using neural network through news on online social networks[C]//2017 International Smart Cities Conference (ISC2).IEEE,2017:1-6. [17]HÁJEK P.Combining bag-of-words and sentiment features ofannual reports to predict abnormal stock returns[J].Neural Computing and Applications,2018,29(7):343-358. [18]SCHUMAKER R P,CHEN H.A discrete stock price prediction engine based on financial news[J].COMPUTER,2010,43(1):51-56. [19]CI Y X,ZHAO S L,LUO Y,et al.Text data preprocessingmethod based on word frequency statistics[J].Computer Scie-nce,2017,44(10):276-282,288. [20]LI L,ZHANG G Y,LI Z W,et al.Research on topic crawlertechnology based on SVM[J].Computer Science,2015,42(2):118-122. [21]LI X,XIE H,WANG R,et al.Empirical analysis:stock market prediction via extreme learning machine[J].Neural Computing and Applications,2016,27(1):67-78. [22]YAO W D,WANG R J.An Empirical Study of the Relationship between Stock Market Volatility and Policy Events from the Perspective of Structural Decomposition-Based on EEMD Algorithm [J].Shanghai Economic Research,2016(1):71-80. |
[1] | 姜梦函, 李邵梅, 郑洪浩, 张建朋. 基于改进位置编码的谣言检测模型 Rumor Detection Model Based on Improved Position Embedding 计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046 |
[2] | 毛典辉, 黄晖煜, 赵爽. 符合监管合规性的自动合成新闻检测方法研究 Study on Automatic Synthetic News Detection Method Complying with Regulatory Compliance 计算机科学, 2022, 49(6A): 523-530. https://doi.org/10.11896/jsjkx.210300083 |
[3] | 康雁, 吴志伟, 寇勇奇, 张兰, 谢思宇, 李浩. 融合Bert和图卷积的深度集成学习软件需求分类 Deep Integrated Learning Software Requirement Classification Fusing Bert and Graph Convolution 计算机科学, 2022, 49(6A): 150-158. https://doi.org/10.11896/jsjkx.210500065 |
[4] | 蒲岍岍, 雷航, 李贞昊, 李晓瑜. 增强列表信息和用户兴趣的个性化新闻推荐算法 Personalized News Recommendation Algorithm with Enhanced List Information and User Interests 计算机科学, 2022, 49(6): 142-148. https://doi.org/10.11896/jsjkx.210400173 |
[5] | 丛颖男, 王兆毓, 朱金清. 关于法律人工智能数据和算法问题的若干思考 Insights into Dataset and Algorithm Related Problems in Artificial Intelligence for Law 计算机科学, 2022, 49(4): 74-79. https://doi.org/10.11896/jsjkx.210900191 |
[6] | 李野, 陈松灿. 基于物理信息的神经网络:最新进展与展望 Physics-informed Neural Networks:Recent Advances and Prospects 计算机科学, 2022, 49(4): 254-262. https://doi.org/10.11896/jsjkx.210500158 |
[7] | 朝乐门, 尹显龙. 人工智能治理理论及系统的现状与趋势 AI Governance and System:Current Situation and Trend 计算机科学, 2021, 48(9): 1-8. https://doi.org/10.11896/jsjkx.210600034 |
[8] | 王剑, 王玉翠, 黄梦杰. 社交网络中的虚假信息:定义、检测及控制 False Information in Social Networks:Definition,Detection and Control 计算机科学, 2021, 48(8): 263-277. https://doi.org/10.11896/jsjkx.210300053 |
[9] | 景慧昀, 魏薇, 周川, 贺欣. 人工智能安全框架 Artificial Intelligence Security Framework 计算机科学, 2021, 48(7): 1-8. https://doi.org/10.11896/jsjkx.210300306 |
[10] | 谢宸琪, 张保稳, 易平. 人工智能模型水印研究综述 Survey on Artificial Intelligence Model Watermarking 计算机科学, 2021, 48(7): 9-16. https://doi.org/10.11896/jsjkx.201200204 |
[11] | 景慧昀, 周川, 贺欣. 针对人脸检测对抗攻击风险的安全测评方法 Security Evaluation Method for Risk of Adversarial Attack on Face Detection 计算机科学, 2021, 48(7): 17-24. https://doi.org/10.11896/jsjkx.210300305 |
[12] | 暴雨轩, 芦天亮, 杜彦辉, 石达. 基于i_ResNet34模型和数据增强的深度伪造视频检测方法 Deepfake Videos Detection Method Based on i_ResNet34 Model and Data Augmentation 计算机科学, 2021, 48(7): 77-85. https://doi.org/10.11896/jsjkx.210300258 |
[13] | 裴莹, 李天祥, 王鏖清, 付加胜, 韩霄松. 基于新闻的国际天然气价格趋势预测方法 Prediction Method of International Natural Gas Price Trends Based on News 计算机科学, 2021, 48(6A): 235-239. https://doi.org/10.11896/jsjkx.201000056 |
[14] | 秦智慧, 李宁, 刘晓彤, 刘秀磊, 佟强, 刘旭红. 无模型强化学习研究综述 Overview of Research on Model-free Reinforcement Learning 计算机科学, 2021, 48(3): 180-187. https://doi.org/10.11896/jsjkx.200700217 |
[15] | 郁友琴, 李弼程. 基于多粒度文本特征表示的微博用户兴趣识别 Microblog User Interest Recognition Based on Multi-granularity Text Feature Representation 计算机科学, 2021, 48(12): 219-225. https://doi.org/10.11896/jsjkx.201100128 |
|