Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2872427.2883067acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Joint Recognition and Linking of Fine-Grained Locations from Tweets

Published: 11 April 2016 Publication History

Abstract

Many users casually reveal their locations such as restaurants, landmarks, and shops in their tweets. Recognizing such fine-grained locations from tweets and then linking the location mentions to well-defined location profiles (e.g., with formal name, detailed address, and geo-coordinates etc.) offer a tremendous opportunity for many applications. Different from existing solutions which perform location recognition and linking as two sub-tasks sequentially in a pipeline setting, in this paper, we propose a novel joint framework to perform location recognition and location linking simultaneously in a joint search space. We formulate this end-to-end location linking problem as a structured prediction problem and propose a beam-search based algorithm. Based on the concept of multi-view learning, we further enable the algorithm to learn from unlabeled data to alleviate the dearth of labeled data. Extensive experiments are conducted to recognize locations mentioned in tweets and link them to location profiles in Foursquare. Experimental results show that the proposed joint learning algorithm outperforms the state-of-the-art solutions, and learning from unlabeled data improves both the recognition and linking accuracy.

References

[1]
S. Abney. Bootstrapping. In ACL, pages 360--367, 2002.
[2]
E. Amitay, N. Har'El, R. Sivan, and A. Soffer. Web-a-where: Geotagging Web Content. In SIGIR, pages 273--280, 2004.
[3]
A. Blum and T. Mitchell. Combining labeled and unlabeled data with co-training. In COLT, pages 92--100, 1998.
[4]
U. Brefeld, C. Büscher, and T. Scheffer. Multi-view discriminative sequential learning. In ECML, pages 60--71. 2005.
[5]
B. Cao, F. Chen, D. Joshi, and P. S. Yu. Inferring Crowd-Sourced Venues for Tweets. In IEEE Big Data, 2015.
[6]
S. Chandra, L. Khan, and F. B. Muhaya. Estimating Twitter User Location Using Social Interactions--A Content Based Approach. In IEEE SocialCom, pages 838--843, 2011.
[7]
H.-w. Chang, D. Lee, M. Eltaher, and J. Lee. @Phillies Tweeting from Philly? Predicting Twitter User Locations with Spatial Word Usage. In ASONAM, pages 111--118, 2012.
[8]
M. Collins. Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms. In ACL, pages 1--8, 2002.
[9]
M. Collins and B. Roark. Incremental Parsing with the Perceptron Algorithm. In ACL, 2004.
[10]
S. Cucerzan. Large-Scale Named Entity Disambiguation Based on Wikipedia Data. In EMNLP-CoNLL, pages 708--716, 2007.
[11]
S. Dasgupta, M. L. Littman, and D. McAllester. PAC Generalization Bounds for Co-training. NIPS, pages 375--382, 2002.
[12]
V. R. de Sa. Learning Classification with Unlabeled Data. In NIPS, pages 112--119, 1994.
[13]
J. Eisenstein, B. O'Connor, N. A. Smith, and E. P. Xing. A Latent Variable Model for Geographic Lexical Variation. In EMNLP, pages 1277--1287, 2010.
[14]
P. Ferragina and U. Scaiella. Tagme: On-the-fly Annotation of Short Text Fragments (by wikipedia entities. In CIKM, pages 1625--1628, 2010.
[15]
D. Flatow, M. Naaman, K. E. Xie, Y. Volkovich, and Y. Kanza. On the Accuracy of Hyper-local Geotagging of Social Media Content. In WSDM, pages 127--136, 2015.
[16]
S. Guo, M.-W. Chang, and E. Kiciman. To Link or Not to Link? A Study on End-to-End Tweet Entity Linking. In NAACL, 2013.
[17]
B. Hecht, L. Hong, B. Suh, and E. H. Chi. Tweets from Justin Bieber's Heart: The Dynamics of the Location Field in User Profiles. In SIGCHI, pages 237--246, 2011.
[18]
H. Huang, Y. Cao, X. Huang, H. Ji, and C.-Y. Lin. Collective Tweet Wikification based on Semi-supervised Graph Regularization. In ACL, pages 380--390, 2014.
[19]
L. Huang, S. Fayong, and Y. Guo. Structured Perceptron with Inexact Search. In NAACL, pages 142--151, 2012.
[20]
D. Inkpen, J. Liu, A. Farzindar, F. Kazemi, and D. Ghazi. Detecting and Disambiguating Locations Mentioned in Twitter Messages. In Computational Linguistics and Intelligent Text Processing, pages 321--332. 2015.
[21]
S. Kinsella, V. Murdock, and N. O'Hare. "I'M Eating a Sandwich in Glasgow": Modeling Locations with Tweets. In SMUC, pages 61--68, 2011.
[22]
J. D. Lafferty, A. McCallum, and F. C. N. Pereira. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In ICML, pages 282--289, 2001.
[23]
K. Lee, R. K. Ganti, M. Srivatsa, and L. Liu. When Twitter Meets Foursquare: Tweet Location Prediction Using Foursquare. In MOBIQUITOUS, pages 198--207, 2014.
[24]
C. Li and A. Sun. Fine-grained Location Extraction from Tweets with Temporal Awareness. In SIGIR, pages 43--52, 2014.
[25]
Q. Li and H. Ji. Incremental Joint Extraction of Entity Mentions and Relations. In ACL, pages 402--412, 2014.
[26]
Q. Li, H. Ji, and L. Huang. Joint Event Extraction via Structured Prediction with Global Features. In ACL, pages 73--82, 2013.
[27]
W. Li, P. Serdyukov, A. P. de Vries, C. Eickhoff, and M. Larson. The Where in the Tweet. In CIKM, pages 2473--2476, 2011.
[28]
J. Lingad, S. Karimi, and J. Yin. Location Extraction from Disaster-related Microblogs. In WWW, pages 1017--1020, 2013.
[29]
X. Liu, Y. Li, H. Wu, M. Zhou, F. Wei, and Y. Lu. Entity Linking for Tweets. In ACL, pages 1304--1311, 2013.
[30]
G. Luo, X. Huang, C.-y. Lin, and Z. Nie. Joint Named Entity Recognition and Disambiguation. In EMNLP, pages 879--888, 2015.
[31]
J. Mahmud, J. Nichols, and C. Drews. Home Location Identification of Twitter Users. ACM Transactions on TIST, pages 47:1---47:21, 2014.
[32]
E. Meij, W. Weerkamp, and M. de Rijke. Adding Semantics to Microblog Posts. In WSDM, pages 563--572, 2012.
[33]
S. E. Middleton, L. Middleton, and S. Modafferi. Real-time Crisis Mapping of Natural Disasters using Social Media. IEEE Intelligent Systems, pages 9--17, 2014.
[34]
R. Mihalcea and A. Csomai. Wikify!: Linking Documents to Encyclopedic Knowledge. In CIKM, pages 233--242, 2007.
[35]
D. Milne and I. H. Witten. Learning to Link with Wikipedia. In CIKM, pages 509--518, 2008.
[36]
S. Paradesi. Geotagging Tweets Using Their Content. In Twenty-Fourth International FLAIRS Conference, 2011.
[37]
A. Rae, V. Murdock, A. Popescu, and H. Bouchard. Mining the Web for Points of Interest. In SIGIR, pages 711--720, 2012.
[38]
L. Ratinov and D. Roth. Design Challenges and Misconceptions in Named Entity Recognition. In CoNLL, pages 147--155, 2009.
[39]
S. Sarawagi and W. W. Cohen. Semi-markov Conditional Random Fields for Information Extraction. In NIPS, pages 1185--1192, 2004.
[40]
A. Schulz, A. Hadjakos, H. Paulheim, J. Nachtwey, and M. Max. A Multi-Indicator Approach for Geolocalization of Tweets. In ICWSM, pages 573--582, 2013.
[41]
W. Shen, J. Wang, P. Luo, and M. Wang. LIEGE: Link Entities in Web Lists with Knowledge Base. In SIGKDD, pages 1424--1432, 2012.
[42]
W. Shen, J. Wang, P. Luo, and M. Wang. Linking Named Entities in Tweets with Knowledge Base via User Interest Modeling. In SIGKDD, pages 68--76, 2013.
[43]
A. Sil and A. Yates. Re-ranking for Joint Named-Entity Recognition and Linking. In CIKM, pages 2369--2374, 2013.
[44]
J. Yin, S. Karimi, and J. Lingad. Pinpointing Locational Focus in Microblogs. In ADCS, pages 66:66---66:72, 2014.
[45]
Y. Zhang and S. Clark. Joint Word Segmentation and POS Tagging using a Single Perceptron. In ACL, pages 888--896, 2008.

Cited By

View all
  • (2024)Large Language Model-Driven Semi-Automated Construction of Named Entity Datasets in Specific Knowledge Domain2024 9th International Symposium on Computer and Information Processing Technology (ISCIPT)10.1109/ISCIPT61983.2024.10673354(312-318)Online publication date: 24-May-2024
  • (2024)DLRGeoTweet: A comprehensive social media geocoding corpus featuring fine-grained placesInformation Processing & Management10.1016/j.ipm.2024.10374261:4(103742)Online publication date: Jul-2024
  • (2024)Medical Concept NormalizationNatural Language Processing in Biomedicine10.1007/978-3-031-55865-8_6(137-164)Online publication date: 9-Jun-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '16: Proceedings of the 25th International Conference on World Wide Web
April 2016
1482 pages
ISBN:9781450341431

Sponsors

  • IW3C2: International World Wide Web Conference Committee

In-Cooperation

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Publication History

Published: 11 April 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. POI
  2. beam search
  3. location linking
  4. location recognition
  5. multi-view learning
  6. structured prediction
  7. tweet
  8. twitter

Qualifiers

  • Research-article

Funding Sources

  • Singapore Ministry of Education

Conference

WWW '16
Sponsor:
  • IW3C2
WWW '16: 25th International World Wide Web Conference
April 11 - 15, 2016
Québec, Montréal, Canada

Acceptance Rates

WWW '16 Paper Acceptance Rate 115 of 727 submissions, 16%;
Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 13 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Large Language Model-Driven Semi-Automated Construction of Named Entity Datasets in Specific Knowledge Domain2024 9th International Symposium on Computer and Information Processing Technology (ISCIPT)10.1109/ISCIPT61983.2024.10673354(312-318)Online publication date: 24-May-2024
  • (2024)DLRGeoTweet: A comprehensive social media geocoding corpus featuring fine-grained placesInformation Processing & Management10.1016/j.ipm.2024.10374261:4(103742)Online publication date: Jul-2024
  • (2024)Medical Concept NormalizationNatural Language Processing in Biomedicine10.1007/978-3-031-55865-8_6(137-164)Online publication date: 9-Jun-2024
  • (2023)A Knowledge-Grounded Task-Oriented Dialogue System with Hierarchical Structure for Enhancing Knowledge SelectionSensors10.3390/s2302068523:2(685)Online publication date: 6-Jan-2023
  • (2023)Improving Feature Extraction Using a Hybrid of CNN and LSTM for Entity IdentificationNeural Processing Letters10.1007/s11063-022-11122-y55:5(5979-5994)Online publication date: 3-Jan-2023
  • (2023)Role of Geolocation Prediction in Disaster ManagementInternational Handbook of Disaster Research10.1007/978-981-19-8388-7_176(647-677)Online publication date: 1-Oct-2023
  • (2023)Role of Geolocation Prediction in Disaster ManagementInternational Handbook of Disaster Research10.1007/978-981-16-8800-3_176-2(1-31)Online publication date: 4-Jul-2023
  • (2023)Role of Geolocation Prediction in Disaster ManagementInternational Handbook of Disaster Research10.1007/978-981-16-8800-3_176-1(1-33)Online publication date: 14-May-2023
  • (2022)Few-Shot Named Entity Recognition via Meta-LearningIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.303867034:9(4245-4256)Online publication date: 1-Sep-2022
  • (2022)Towards the Inference of Travel Purpose with Heterogeneous Urban DataIEEE Transactions on Big Data10.1109/TBDATA.2019.29218238:1(166-177)Online publication date: 1-Feb-2022
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media