Abstract
Generating a storyline is aiming to discover the evolution of events from news websites. Some existing approaches aim to automatically cluster news articles into events and connect related events in growing trees to generate storylines. Unfortunately, these methods did not perform well in learning the implicit associations of events. More recently, Graph Convolutional Network (GCN) based methods are proposed to learn the implicit associations between events. However, since the event representation in GCN model tends to be consistent after multi-layer propagation, the events cannot be correctly distinguished, which is not conducive to comprehensively learning the implicit associations between different events. In this paper, we propose an effective storyline generation method for news articles. Firstly, a novel model is presented based on Approximate Personalized Propagation of Neural Predictions for Story Branch Construction Model, called SBCM, which preserves local features and can better learn the implicit association between different events. Then, we utilize a statistical method to identify transition events in news articles, and connect story branches with transition events through temporal relationships to finally generate storylines. The experimental results on two real-world Chinese news datasets show that our proposal outperforms several state-of-the-art methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
References
Ansah, J., Liu, L., Kang, W., Kwashie, S., Li, J., Li, J.: A graph is worth a thousand words: telling event stories using timeline summarization graphs. In: The World Wide Web Conference, pp. 2565–2571 (2019)
Born, L., Bacher, M., Markert, K.: Dataset reproducibility and IR methods in timeline summarization. In: Proceedings of the 12th Language Resources and Evaluation Conference, pp. 1763–1771 (2020)
Chen, C.C., Chen, Y.-T., Sun, Y., Chen, M.C.: Life cycle modeling of news events using aging theory. In: Lavrač, N., Gamberger, D., Blockeel, H., Todorovski, L. (eds.) ECML 2003. LNCS (LNAI), vol. 2837, pp. 47–59. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-39857-8_7
Chen, X., Chan, Z., Gao, S., Yu, M.H., Zhao, D., Yan, R.: Learning towards abstractive timeline summarization. In: IJCAI, pp. 4939–4945 (2019)
Du, J., Zhang, S., Wu, G., Moura, J.M., Kar, S.: Topology adaptive graph convolutional networks. arXiv preprint arXiv:1710.10370 (2017)
Gasteiger, J., Bojchevski, A., Günnemann, S.: Predict then propagate: graph neural networks meet personalized pagerank. In: International Conference on Learning Representations (ICLR) (2019)
Ghalandari, D.G., Ifrim, G.: Examining the state-of-the-art in news timeline summarization. arXiv preprint arXiv:2005.10107 (2020)
Guo, L., Zhou, D., He, Y., Xu, H.: Storyline extraction from news articles with dynamic dependency. Intell. Data Anal. 24(1), 183–197 (2020)
Hawwash, B., Nasraoui, O.: From tweets to stories: using stream-dashboard to weave the Twitter data stream into dynamic cluster models. In: Proceedings of the 3rd International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications, pp. 182–197. PMLR (2014)
Hua, T., Zhang, X., et al.: Automatical storyline generation with help from Twitter. In: CIKM, pp. 2383–2388 (2016)
Huang, L., et al.: Optimized event storyline generation based on mixture-event-aspect model. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 726–735 (2013)
Jiang, H., Beeferman, D., Mao, W., Roy, D.K.: Topic detection and tracking with time-aware document embeddings. ArXiv: abs/2112.06166 (2021)
La Quatra, M., Cagliero, L., Baralis, E., Messina, A., Montagnuolo, M.: Summarize dates first: a paradigm shift in timeline summarization. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 418–427 (2021)
Li, M., et al.: Timeline summarization based on event graph compression via time-aware optimal transport. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp. 6443–6456 (2021)
Li, M., et al.: Timeline summarization based on event graph compression via time-aware optimal transport. In: EMNLP (2021)
Liao, Y., Wang, S., Lee, D.: Wilson: divide and conquer approach for fast and effective news timeline summarization. In: 24th International Conference on Extending Database Technology (2021)
Lin, C., Lin, C., et al.: Generating event storylines from microblogs. In: CIKM, pp. 175–184 (2012)
Liu, B., Han, F.X., Niu, D., Kong, L., Lai, K., Xu, Y.: Story forest. ACM Trans. Knowl. Disc. Data (TKDD) 14, 1–28 (2020)
Liu, B., Niu, D., Lai, K., Kong, L., Xu, Y.: Growing story forest online from massive breaking news. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management(CIKM), pp. 777–785 (2017)
Mehrotra, R., Sanner, S., Buntine, W., Xie, L.: Improving LDA topic models for microblogs via tweet pooling and automatic labeling. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 889–892 (2013)
Nallapati, R., Feng, A., Peng, F., Allan, J.: Event threading within news topics. In: Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, pp. 446–453 (2004)
Nguyen, K.H., Tannier, X., Moriceau, V.: Ranking multidocument event descriptions for building thematic timelines. In: COLING 2014, the 25th International Conference on Computational Linguistic, pp. 1208–1217 (2014)
Wen, A., Lin, W., Ma, Y., Xie, H., Zhang, G.: News event evolution model based on the reading willingness and modified TF-IDF formula. J. High Speed Netw. 23(1), 33–47 (2017)
Xu, K., Hu, W., Leskovec, J., Jegelka, S.: How powerful are graph neural networks? arXiv preprint arXiv:1810.00826 (2018)
Zhao, X., Wang, C., Jin, P., Zhang, H., Yang, C., Li, B.: Post2story: automatically generating storylines from microblogging platforms. In: Proceedings of the 29th ACM International Conference on Multimedia, pp. 2786–2788 (2021)
Zhou, D., Xu, H., Dai, X.Y., He, Y.: Unsupervised storyline extraction from news articles. In: IJCAI, pp. 3014–3021 (2016)
Zhou, D., Xu, H., He, Y.: An unsupervised Bayesian modelling approach for storyline detection on news articles. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1943–1948 (2015)
Acknowledgements
This paper is supported by the Humanities and Social Sciences Foundation of the Ministry of Education (17YJCZH260), the National Science Foundation of China (62072419), the Sichuan Science and Technology Program (2020YFS0057).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Wang, J., Zhao, X., Jin, P., Yang, C., Li, B., Zhang, H. (2023). Storyline Generation from News Articles Based on Approximate Personalized Propagation of Neural Predictions. In: Wang, X., et al. Database Systems for Advanced Applications. DASFAA 2023. Lecture Notes in Computer Science, vol 13946. Springer, Cham. https://doi.org/10.1007/978-3-031-30678-5_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-30678-5_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-30677-8
Online ISBN: 978-3-031-30678-5
eBook Packages: Computer ScienceComputer Science (R0)