Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3341161.3342870acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Open access

A large-scale empirical study of geotagging behavior on Twitter

Published: 15 January 2020 Publication History

Abstract

Geotagging on social media has become an important proxy for understanding people's mobility and social events. Research that uses geotags to infer public opinions relies on several key assumptions about the behavior of geotagged and non-geotagged users. However, these assumptions have not been fully validated. Lack of understanding the geotagging behavior prohibits people further utilizing it. In this paper, we present an empirical study of geotagging behavior on Twitter based on more than 40 billion tweets collected from 20 million users. There are three main findings that may challenge these common assumptions. Firstly, different groups of users have different geotagging preferences. For example, less than 3% of users speaking in Korean are geotagged, while more than 40% of users speaking in Indonesian use geotags. Secondly, users who report their locations in profiles are more likely to use geotags, which may affects the generability of those location prediction systems on non-geotagged users. Thirdly, strong homophily effect exists in users' geotagging behavior, that users tend to connect to friends with similar geotagging preferences.

References

[1]
N. Li and G. Chen, "Analysis of a location-based social network," in Computational Science and Engineering, 2009. CSE'09. International Conference on, vol. 4. Ieee, 2009, pp. 263--270.
[2]
A. Noulas, S. Scellato, C. Mascolo, and M. Pontil, "An empirical study of geographic user activity patterns in foursquare." ICwSM, vol. 11, no. 70--573, p. 2, 2011.
[3]
B. Hu and M. Ester, "Spatial topic modeling in online social media for location recommendation," in Proceedings of the 7th ACM conference on Recommender systems. ACM, 2013, pp. 25--32.
[4]
T. N. Maeda, M. Yoshida, F. Toriumi, and H. Ohashi, "Decision tree analysis of tourists' preferences regarding tourist attractions using geotag data from social media," in Proceedings of the Second International Conference on IoT in Urban Space. ACM, 2016, pp. 61--64.
[5]
Y. Wei and L. Singh, "Location-based event detection using geotagged semantic graphs," in KDD Workshop Mining and Learning with Graphs, 2017.
[6]
A. Crooks, A. Croitoru, A. Stefanidis, and J. Radzikowski, "# earthquake: Twitter as a distributed sensor system," Transactions in GIS, vol. 17, no. 1, pp. 124--147, 2013.
[7]
E. Cho, S. A. Myers, and J. Leskovec, "Friendship and mobility: user movement in location-based social networks," in Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2011, pp. 1082--1090.
[8]
M. J. Widener and W. Li, "Using geolocated twitter data to monitor the prevalence of healthy and unhealthy food references across the us," Applied Geography, vol. 54, pp. 189--197, 2014.
[9]
M. Graham, S. A. Hale, and D. Gaffney, "Where in the world are you? geolocation and language identification in twitter," The Professional Geographer, vol. 66, no. 4, pp. 568--578, 2014.
[10]
B. Huang and K. M. Carley, "On predicting geolocation of tweets using convolutional neural networks," in International Conference on Social Computing, Behavioral-Cultural Modeling and Prediction and Behavior Representation in Modeling and Simulation. Springer, 2017, pp. 281--291.
[11]
M. McPherson, L. Smith-Lovin, and J. M. Cook, "Birds of a feather: Homophily in social networks," Annual review of sociology, vol. 27, no. 1, pp. 415--444, 2001.
[12]
L. Backstrom, E. Sun, and C. Marlow, "Find me if you can: improving geographical prediction with social and spatial proximity," in Proceedings of the 19th international conference on World wide web. ACM, 2010, pp. 61--70.
[13]
A. Rahimi, T. Cohn, and T. Baldwin, "Twitter user geolocation using a unified text and network prediction model," arXiv preprint arXiv:1506.08259, 2015.
[14]
R. Jurdak, K. Zhao, J. Liu, M. AbouJaoude, M. Cameron, and D. Newth, "Understanding human mobility from twitter," PloS one, vol. 10, no. 7, p. e0131469, 2015.
[15]
B. Huang and K. M. Carley, "Location order recovery in trails with low temporal resolution," IEEE Transactions on Network Science and Engineering, 2018.
[16]
S. Kisilevich, M. Krstajic, D. Keim, N. Andrienko, and G. Andrienko, "Event-based analysis of people's activities and behavior using flickr and panoramio geotagged photo collections," in Information visualisation (IV), 2010 14th international conference. IEEE, 2010, pp. 289--296.
[17]
B. Hawelka, I. Sitko, E. Beinat, S. Sobolevsky, P. Kazakopoulos, and C. Ratti, "Geo-located twitter as proxy for global mobility patterns," Cartography and Geographic Information Science, vol. 41, no. 3, pp. 260--271, 2014.
[18]
B. Huang and K. Carley, "Parameterized convolutional neural networks for aspect level sentiment classification," in Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018, pp. 1091--1096.
[19]
B. Huang, Y. Ou, and K. M. Carley, "Aspect level sentiment classification with attention-over-attention neural networks," in International Conference on Social Computing, Behavioral-Cultural Modeling and Prediction and Behavior Representation in Modeling and Simulation. Springer, 2018, pp. 197--206.
[20]
T. Sakaki, M. Okazaki, and Y. Matsuo, "Earthquake shakes twitter users: real-time event detection by social sensors," in Proceedings of the 19th international conference on World wide web. ACM, 2010, pp. 851--860.
[21]
B. Han, P. Cook, and T. Baldwin, "Geolocation prediction in social media data by finding location indicative words," Proceedings of COLING 2012, pp. 1045--1062, 2012.
[22]
Y. Zhang, W. Wei, B. Huang, K. M. Carley, and Y. Zhang, "Rate: Overcoming noise and sparsity of textual features in real-time location estimation," in Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. ACM, 2017, pp. 2423--2426.
[23]
Y. Qian, J. Tang, Z. Yang, B. Huang, W. Wei, and K. M. Carley, "A probabilistic framework for location inference from social media," arXiv preprint arXiv:1702.07281, 2017.
[24]
H.-S. Kim, "What drives you to check in on facebook? motivations, privacy concerns, and mobile phone involvement for location-based information sharing," Computers in Human Behavior, vol. 54, pp. 397--406, 2016.
[25]
D. Tasse, Z. Liu, A. Sciuto, and J. I. Hong, "State of the geotags: Motivations and recent changes." in ICWSM, 2017, pp. 250--259.
[26]
P. J. Ludford, R. Priedhorsky, K. Reily, and L. Terveen, "Capturing, sharing, and using local place information," in Proceedings of the SIGCHI conference on Human factors in computing systems. ACM, 2007, pp. 1235--1244.
[27]
L. Sloan and J. Morgan, "Who tweets with their location? understanding the relationship between demographic characteristics and the use of geoservices and geotagging on twitter," PloS one, vol. 10, no. 11, p. e0142209, 2015.
[28]
P. Biernacki and D. Waldorf, "Snowball sampling: Problems and techniques of chain referral sampling," Sociological methods & research, vol. 10, no. 2, pp. 141--163, 1981.
[29]
L. Sloan, J. Morgan, W. Housley, M. Williams, A. Edwards, P. Burnap, and O. Rana, "Knowing the tweeters: Deriving sociologically relevant demographics from twitter," Sociological research online, vol. 18, no. 3, pp. 1--11, 2013.
[30]
B. Han, P. Cook, and T. Baldwin, "Text-based twitter user geolocation prediction," Journal of Artificial Intelligence Research, vol. 49, pp. 451--500, 2014.
[31]
B. Han, P. Cook, and T. Baldwin, "A stacking-based approach to twitter user geolocation prediction," in Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2013, pp. 7--12.
[32]
P. V. Marsden and N. E. Friedkin, "Network studies of social influence," Sociological Methods & Research, vol. 22, no. 1, pp. 127--151, 1993.

Cited By

View all
  • (2024)Long-term validation of inner-urban mobility metrics derived from Twitter/XEnvironment and Planning B: Urban Analytics and City Science10.1177/23998083241278275Online publication date: 14-Oct-2024
  • (2024)Understanding the impact of geotagging on location inference models for accurate generalization to non-geotagged datasetsGeomatica10.1016/j.geomat.2024.10000476:1(100004)Online publication date: Jul-2024
  • (2024)Beyond climate change? Environmental discourse on the planetary boundaries in Twitter networksClimatic Change10.1007/s10584-024-03729-y177:5Online publication date: 25-Apr-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ASONAM '19: Proceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
August 2019
1228 pages
ISBN:9781450368681
DOI:10.1145/3341161
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

In-Cooperation

  • IEEE CS

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 January 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. empirical study
  2. geotagging
  3. social media
  4. user behavior

Qualifiers

  • Research-article

Funding Sources

Conference

ASONAM '19
Sponsor:

Acceptance Rates

ASONAM '19 Paper Acceptance Rate 41 of 286 submissions, 14%;
Overall Acceptance Rate 116 of 549 submissions, 21%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)174
  • Downloads (Last 6 weeks)23
Reflects downloads up to 24 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Long-term validation of inner-urban mobility metrics derived from Twitter/XEnvironment and Planning B: Urban Analytics and City Science10.1177/23998083241278275Online publication date: 14-Oct-2024
  • (2024)Understanding the impact of geotagging on location inference models for accurate generalization to non-geotagged datasetsGeomatica10.1016/j.geomat.2024.10000476:1(100004)Online publication date: Jul-2024
  • (2024)Beyond climate change? Environmental discourse on the planetary boundaries in Twitter networksClimatic Change10.1007/s10584-024-03729-y177:5Online publication date: 25-Apr-2024
  • (2024)Applying social media in emergency response: an attention-based bidirectional deep learning system for location reference recognition in disaster tweetsApplied Intelligence10.1007/s10489-024-05462-654:7(5768-5793)Online publication date: 27-Apr-2024
  • (2023)Temporal Relationship between Daily Reports of COVID-19 Infections and Related GDELT and Tweet MentionsGeographies10.3390/geographies30300313:3(584-609)Online publication date: 16-Sep-2023
  • (2023)Detecting informal green, blue, and street physical activity spaces in the city using geotagged sports-related Twitter tweetsFrontiers in Sociology10.3389/fsoc.2023.11253438Online publication date: 5-May-2023
  • (2023)Enhanced geocoding precision for location inference of tweet text using spaCy, Nominatim and Google Maps. A comparative analysis of the influence of data selectionPLOS ONE10.1371/journal.pone.028294218:3(e0282942)Online publication date: 15-Mar-2023
  • (2023)Geolocated Social Media Posts are Happier: Understanding the Characteristics of Check-in Posts on TwitterProceedings of the 15th ACM Web Science Conference 202310.1145/3578503.3583596(136-146)Online publication date: 30-Apr-2023
  • (2023)Learning Social Meta-knowledge for Nowcasting Human Mobility in DisasterProceedings of the ACM Web Conference 202310.1145/3543507.3583991(2655-2665)Online publication date: 30-Apr-2023
  • (2023)Impact of COVID-19 on online grocery shopping discussion and behavior reflected from Google Trends and geotagged tweetsComputational Urban Science10.1007/s43762-023-00083-03:1Online publication date: 22-Feb-2023
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media