Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3488560.3498387acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Translating Human Mobility Forecasting through Natural Language Generation

Published: 15 February 2022 Publication History

Abstract

Existing human mobility forecasting models follow the standard design of the time-series prediction model which takes a series of numerical values as input to generate a numerical value as a prediction. Although treating this as a regression problem seems straightforward, incorporating various contextual information such as the semantic category information of each Place-of-Interest (POI) is a necessary step, and often the bottleneck, in designing an effective mobility prediction model. As opposed to the typical approach, we treat forecasting as a translation problem and propose a novel forecasting through a language generation pipeline. The paper aims to address the human mobility forecasting problem as a language translation task in a sequence-to-sequence manner. A mobility-to-language template is first introduced to describe the numerical mobility data as natural language sentences. The core intuition of the human mobility forecasting translation task is to convert the input mobility description sentences into a future mobility description from which the prediction target can be obtained. Under this pipeline, a two-branch network, SHIFT (Translating Human Mobility Forecasting), is designed. Specifically, it consists of one main branch for language generation and one auxiliary branch to directly learn mobility patterns. During the training, we develop a momentum mode for better connecting and training the two branches. Extensive experiments on three real-world datasets demonstrate that the proposed SHIFT is effective and presents a new revolutionary approach to forecasting human mobility.

Supplementary Material

MP4 File (wsdm22-fp150.mp4)
Presentation Video

References

[1]
Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Luvc ić, and Cordelia Schmid. 2021. Vivit: A video vision transformer. arXiv preprint arXiv:2103.15691 (2021).
[2]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural Machine Translation by Jointly Learning to Align and Translate. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7--9, 2015, Conference Track Proceedings, Yoshua Bengio and Yann LeCun (Eds.). http://arxiv.org/abs/1409.0473
[3]
Yile Chen, Cheng Long, Gao Cong, and Chenliang Li. 2020. Context-aware deep model for joint mobility and time prediction. In Proceedings of the 13th International Conference on Web Search and Data Mining . 106--114.
[4]
Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. 2014. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling . arXiv preprint arXiv:1412.3555 (2014).
[5]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2--7, 2019, Volume 1 (Long and Short Papers), Jill Burstein, Christy Doran, and Thamar Solorio (Eds.). Association for Computational Linguistics, 4171--4186. https://doi.org/10.18653/v1/n19--1423
[6]
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby. 2021. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3--7, 2021. OpenReview.net. https://openreview.net/forum?id=YicbFdNTTy
[7]
Jie Feng, Yong Li, Zeyu Yang, Qiang Qiu, and Depeng Jin. 2020. Predicting Human Mobility with Semantic Motivation via Multi-task Attentional Recurrent Networks. IEEE Transactions on Knowledge and Data Engineering (2020).
[8]
Jie Feng, Yong Li, Chao Zhang, Funing Sun, Fanchao Meng, Ang Guo, and Depeng Jin. 2018. Deepmove: Predicting human mobility with attentional recurrent networks. In Proceedings of the 2018 world wide web conference. 1459--1468.
[9]
Qing Guo, Zhu Sun, Jie Zhang, and Yin-Leng Theng. 2020. An attentional recurrent neural network for personalized next location recommendation. In Proceedings of the AAAI Conference on artificial intelligence, Vol. 34. 83--90.
[10]
Sajal Halder, Kwan Hui Lim, Je rey Chan, and Xiuzhen Zhang. 2021. Transformer-based Multi-task Learning for euing Time Aware Next POI Recommendation. (2021).
[11]
Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. 2020. Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition . 9729--9738.
[12]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-term Memory. Neural computation, Vol. 9, 8 (1997), 1735--1780.
[13]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7--9, 2015, Conference Track Proceedings, Yoshua Bengio and Yann LeCun (Eds.).
[14]
Nikita Kitaev, Lukasz Kaiser, and Anselm Levskaya. 2020. Reformer: The Efficient Transformer. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26--30, 2020 . OpenReview.net. https://openreview.net/forum?id=rkgNKkHtvB
[15]
Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov, and Luke Zettlemoyer. 2020. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 7871--7880.
[16]
Shiyang Li, Xiaoyong Jin, Yao Xuan, Xiyou Zhou, Wenhu Chen, Yu-Xiang Wang, and Xifeng Yan. 2019. Enhancing the locality and breaking the memory bottleneck of transformer on time series forecasting. Advances in Neural Information Processing Systems, Vol. 32 (2019), 5243--5253.
[17]
Xutao Li, Gao Cong, Xiao-Li Li, Tuan-Anh Nguyen Pham, and Shonali Krishnaswamy. 2015. Rank-geofm: A ranking based geographical factorization method for point of interest recommendation. In Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval . 433--442.
[18]
Xiaolong Li, Gang Pan, Zhaohui Wu, Guande Qi, Shijian Li, Daqing Zhang, Wangsheng Zhang, and Zonghui Wang. 2012. Prediction of urban human mobility using large-scale taxi traces and its applications. Frontiers of Computer Science, Vol. 6, 1 (2012), 111--121.
[19]
Defu Lian, Cong Zhao, Xing Xie, Guangzhong Sun, Enhong Chen, and Yong Rui. 2014. GeoMF: joint geographical modeling and matrix factorization for point-of-interest recommendation. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining . 831--840.
[20]
Marco Lippi, Matteo Bertini, and Paolo Frasconi. 2013. Short-term traffic flow forecasting: An experimental comparison of time-series analysis and supervised learning. IEEE Transactions on Intelligent Transportation Systems, Vol. 14, 2 (2013), 871--882.
[21]
Qiang Liu, Shu Wu, Liang Wang, and Tieniu Tan. 2016. Predicting the next location: A recurrent model with spatial and temporal contexts. In Thirtieth AAAI conference on artificial intelligence .
[22]
Yingtao Luo, Qiang Liu, and Zhaocheng Liu. 2021. STAN: Spatio-Temporal Attention Network for Next Location Recommendation. In Proceedings of the Web Conference 2021. 2177--2185.
[23]
Shaohui Ma and Robert Fildes. 2020. Forecasting third-party mobile payments with implications for customer flow prediction. International Journal of Forecasting, Vol. 36, 3 (2020), 739--760.
[24]
Congcong Miao, Jiajun Fu, Jilong Wang, Heng Yu, Botao Yao, Anqi Zhong, Jie Chen, and Zekun He. 2021. Predicting Crowd Flows via Pyramid Dilated Deeper Spatial-temporal Network. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining . 806--814.
[25]
Congcong Miao, Ziyan Luo, Fengzhu Zeng, and Jilong Wang. 2020. Predicting Human Mobility via Attentive Convolutional Network. In Proceedings of the 13th International Conference on Web Search and Data Mining . 438--446.
[26]
Xingyi Ren, Meina Song, E Haihong, and Junde Song. 2017. Context-aware probabilistic matrix factorization modeling for point-of-interest recommendation. Neurocomputing, Vol. 241 (2017), 38--55.
[27]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998--6008.
[28]
Billy M Williams and Lester A Hoel. 2003. Modeling and forecasting vehicular traffic flow as a seasonal ARIMA process: Theoretical basis and empirical results. Journal of transportation engineering, Vol. 129, 6 (2003), 664--672.
[29]
Haixu Wu, Jiehui Xu, Jianmin Wang, and Mingsheng Long. 2021. Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting. arXiv preprint arXiv:2106.13008 (2021).
[30]
Xian Wu, Chao Huang, Chuxu Zhang, and Nitesh V Chawla. 2020. Hierarchically structured transformer networks for fine-grained spatial event forecasting. In Proceedings of The Web Conference 2020 . 2320--2330.
[31]
Hao Xue and Flora D. Salim. 2021 a. Exploring Self-Supervised Representation Ensembles for COVID-19 Cough Classification. In KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Virtual Event, Singapore, August 14--18, 2021. ACM, 1944--1952.
[32]
Hao Xue and Flora D. Salim. 2021 b. TERMCast: Temporal Relation Modeling for Effective Urban Flow Forecasting. In Advances in Knowledge Discovery and Data Mining - 25th Pacific-Asia Conference, PAKDD 2021, Virtual Event, May 11--14, 2021, Proceedings, Part I (Lecture Notes in Computer Science, Vol. 12712), Kamal Karlapalem, Hong Cheng, Naren Ramakrishnan, R. K. Agrawal, P. Krishna Reddy, Jaideep Srivastava, and Tanmoy Chakraborty (Eds.). Springer, 741--753. https://doi.org/10.1007/978--3-030--75762--5_58
[33]
Dingqi Yang, Benjamin Fankhauser, Paolo Rosso, and Philippe Cudre-Mauroux. 2020. Location Prediction over Sparse User Mobility Traces Using RNNs: Flashback in Hidden States!. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence. 2184--2190.
[34]
Junbo Zhang, Yu Zheng, and Dekang Qi. 2017. Deep spatio-temporal residual networks for citywide crowd flows prediction. In Thirty-first AAAI conference on artificial intelligence .
[35]
Junbo Zhang, Yu Zheng, Junkai Sun, and Dekang Qi. 2019. Flow prediction in spatio-temporal networks based on multitask deep learning. IEEE Transactions on Knowledge and Data Engineering, Vol. 32, 3 (2019), 468--478.
[36]
Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, Jianxin Li, Hui Xiong, and Wancai Zhang. 2021. Informer: Beyond efficient transformer for long sequence time-series forecasting. In Proceedings of AAAI .

Cited By

View all
  • (2024)Sustainable Mobility in the Century of Metropolises: Case Study of Greater LondonLand10.3390/land1310166213:10(1662)Online publication date: 12-Oct-2024
  • (2024)T-JEPA: A Joint-Embedding Predictive Architecture for Trajectory Similarity ComputationProceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems10.1145/3678717.3691271(569-572)Online publication date: 29-Oct-2024
  • (2024)Prompt Mining for Language Models-based Mobility Flow ForecastingProceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems10.1145/3678717.3691232(113-122)Online publication date: 29-Oct-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining
February 2022
1690 pages
ISBN:9781450391320
DOI:10.1145/3488560
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 February 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. human mobility prediction
  2. natural language
  3. temporal forecasting

Qualifiers

  • Research-article

Funding Sources

Conference

WSDM '22

Acceptance Rates

Overall Acceptance Rate 498 of 2,863 submissions, 17%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)107
  • Downloads (Last 6 weeks)15
Reflects downloads up to 22 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Sustainable Mobility in the Century of Metropolises: Case Study of Greater LondonLand10.3390/land1310166213:10(1662)Online publication date: 12-Oct-2024
  • (2024)T-JEPA: A Joint-Embedding Predictive Architecture for Trajectory Similarity ComputationProceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems10.1145/3678717.3691271(569-572)Online publication date: 29-Oct-2024
  • (2024)Prompt Mining for Language Models-based Mobility Flow ForecastingProceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems10.1145/3678717.3691232(113-122)Online publication date: 29-Oct-2024
  • (2024)Enhancing Spatio-temporal Quantile Forecasting with Curriculum Learning: Lessons LearnedProceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems10.1145/3678717.3691216(42-53)Online publication date: 29-Oct-2024
  • (2024)MAPLEProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36435148:1(1-25)Online publication date: 6-Mar-2024
  • (2024)Generative AI for Energy: Multi-Horizon Power Consumption Forecasting using Large Language ModelsProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679933(4015-4019)Online publication date: 21-Oct-2024
  • (2024)PromptCast: A New Prompt-Based Learning Paradigm for Time Series ForecastingIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.334213736:11(6851-6864)Online publication date: Nov-2024
  • (2024)Spatial-Temporal Large Language Model for Traffic Prediction2024 25th IEEE International Conference on Mobile Data Management (MDM)10.1109/MDM61037.2024.00025(31-40)Online publication date: 24-Jun-2024
  • (2024)Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.00078(753-766)Online publication date: 16-Jun-2024
  • (2024)Social Networks and Large Language Models for Division I Basketball Game Winner PredictionIEEE Access10.1109/ACCESS.2024.340349012(84774-84784)Online publication date: 2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media