Abstract
In this paper, we describe an inference model for deducing the location of a tweet whose geolocation information is not available in the metadata. The approach we propose is based on machine learning techniques and uses the information contained in the tweets such as the places mentioned in the tweets and the profile of the authors of the tweets. The objective of the study is to contribute to setting up an early warning system for epidemics based on the monitoring of events on social networks like twitter. For this we need to geolocate the messages even if the smartphone’s GPS is not active. We trained on three models and obtained the best result with K-nearest neighbors model with an accuracy of 0.90.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Jurgens, D., Finethy, T., McCorriston, J., Xu, Y.T., Ruths, D.: Geolocation prediction in twitter using social networks: a critical analysis and review of current practice. In: ICWSM (2015)
Mishra, P.: Geolocation of tweets with a BiLSTM regression model. In: Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, pp. 283–289 (2020)
Han, B., Cook, P., Baldwin, T.: Text-based twitter user geolocation prediction. J. Artif. Intell. Res. 49, 451–500 (2014)
Huang B., Carley K.M.: A hierarchical location prediction neural network for twitter user geolocation (2019). arXiv preprint arXiv:1910.12941
Hulden, M., Silfverberg, M., Francom, J.: Kernel density estimation for text-based geolocation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 29 (2015)
Wing, B., Baldridge, J.: Simple supervised document geolocation with geodesic grids. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 955–964 (2011)
Wing, B., Baldridge, J.: Hierarchical discriminative classification for text based geolocation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 336–348 (2014)
Han, B., Cook, P., Baldwin, T.: Geolocation prediction in social media data by finding location indicative words. In: Proceedings of COLING, vol. 2012, pp. 1045–1062 (2012)
Roller, S., Speriosu, M., Rallapalli, S., Wing, B., Baldridge, J.: Supervised text-based geolocation using language models on an adaptive grid. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 1500–1510 (2012)
Zhou, F., Wang, T., Zhong, T., Trajcevski, G.: Identifying user geolocation with hierarchical graph neural networks and explainable fusion. Inf. Fusion 81, 1–13 (2022). https://doi.org/10.1016/j.inffus.2021.11.004
Rahimi, A., Cohn, T., Baldwin, T.: A neural model for user geolocation and lexical dialectology. arXiv preprint arXiv:1704.04008 (2017)
Mostafa, A., Gad, W., Abdelkader, T., Badr, N.: Pre-HLSA: predicting home location for twitter users based on sentimental analysis. Ain Shams Eng. J. 13, 101501 (2022)
Mahajan, R., Mansotra, V.: Predicting geolocation of tweets: using combination of CNN and BiLSTM. Data Sci. Eng. 6, 402–410 (2021)
Simanjuntak, L.F., Mahendra, R., Yulianti, E.: We know you are living in Bali: location prediction of twitter users using BERT language model. Big Data Cogn. Comput. 6(3), 77 (2022)
Alsaqer, M., Alelyani, S., Mohana, M., Alreemy, K., Alqahtani, A.: Predicting location of tweets using machine learning approaches. Appl. Sci. 13, 3025 (2023). https://doi.org/10.3390/app13053025
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Julie, T., Sadouanouan, M., Yaya, T. (2023). A Geolocation Approach for Tweets Not Explicitly Georeferenced Based on Machine Learning. In: Ossowski, S., Sitek, P., Analide, C., Marreiros, G., Chamoso, P., Rodríguez, S. (eds) Distributed Computing and Artificial Intelligence, 20th International Conference. DCAI 2023. Lecture Notes in Networks and Systems, vol 740. Springer, Cham. https://doi.org/10.1007/978-3-031-38333-5_23
Download citation
DOI: https://doi.org/10.1007/978-3-031-38333-5_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-38332-8
Online ISBN: 978-3-031-38333-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)