research-article

Joint Recognition and Linking of Fine-Grained Locations from Tweets

Authors:

Jialong HanAuthors Info & Claims

WWW '16: Proceedings of the 25th International Conference on World Wide Web

Pages 1271 - 1281

https://doi.org/10.1145/2872427.2883067

Published: 11 April 2016 Publication History

Abstract

Many users casually reveal their locations such as restaurants, landmarks, and shops in their tweets. Recognizing such fine-grained locations from tweets and then linking the location mentions to well-defined location profiles (e.g., with formal name, detailed address, and geo-coordinates etc.) offer a tremendous opportunity for many applications. Different from existing solutions which perform location recognition and linking as two sub-tasks sequentially in a pipeline setting, in this paper, we propose a novel joint framework to perform location recognition and location linking simultaneously in a joint search space. We formulate this end-to-end location linking problem as a structured prediction problem and propose a beam-search based algorithm. Based on the concept of multi-view learning, we further enable the algorithm to learn from unlabeled data to alleviate the dearth of labeled data. Extensive experiments are conducted to recognize locations mentioned in tweets and link them to location profiles in Foursquare. Experimental results show that the proposed joint learning algorithm outperforms the state-of-the-art solutions, and learning from unlabeled data improves both the recognition and linking accuracy.

References

[1]

S. Abney. Bootstrapping. In ACL, pages 360--367, 2002.

Digital Library

[2]

E. Amitay, N. Har'El, R. Sivan, and A. Soffer. Web-a-where: Geotagging Web Content. In SIGIR, pages 273--280, 2004.

Digital Library

[3]

A. Blum and T. Mitchell. Combining labeled and unlabeled data with co-training. In COLT, pages 92--100, 1998.

Digital Library

[4]

U. Brefeld, C. Büscher, and T. Scheffer. Multi-view discriminative sequential learning. In ECML, pages 60--71. 2005.

Digital Library

[5]

B. Cao, F. Chen, D. Joshi, and P. S. Yu. Inferring Crowd-Sourced Venues for Tweets. In IEEE Big Data, 2015.

Digital Library

[6]

S. Chandra, L. Khan, and F. B. Muhaya. Estimating Twitter User Location Using Social Interactions--A Content Based Approach. In IEEE SocialCom, pages 838--843, 2011.

[7]

H.-w. Chang, D. Lee, M. Eltaher, and J. Lee. @Phillies Tweeting from Philly? Predicting Twitter User Locations with Spatial Word Usage. In ASONAM, pages 111--118, 2012.

Digital Library

[8]

M. Collins. Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms. In ACL, pages 1--8, 2002.

Digital Library

[9]

M. Collins and B. Roark. Incremental Parsing with the Perceptron Algorithm. In ACL, 2004.

Digital Library

[10]

S. Cucerzan. Large-Scale Named Entity Disambiguation Based on Wikipedia Data. In EMNLP-CoNLL, pages 708--716, 2007.

[11]

S. Dasgupta, M. L. Littman, and D. McAllester. PAC Generalization Bounds for Co-training. NIPS, pages 375--382, 2002.

[12]

V. R. de Sa. Learning Classification with Unlabeled Data. In NIPS, pages 112--119, 1994.

[13]

J. Eisenstein, B. O'Connor, N. A. Smith, and E. P. Xing. A Latent Variable Model for Geographic Lexical Variation. In EMNLP, pages 1277--1287, 2010.

Digital Library

[14]

P. Ferragina and U. Scaiella. Tagme: On-the-fly Annotation of Short Text Fragments (by wikipedia entities. In CIKM, pages 1625--1628, 2010.

Digital Library

[15]

D. Flatow, M. Naaman, K. E. Xie, Y. Volkovich, and Y. Kanza. On the Accuracy of Hyper-local Geotagging of Social Media Content. In WSDM, pages 127--136, 2015.

Digital Library

[16]

S. Guo, M.-W. Chang, and E. Kiciman. To Link or Not to Link? A Study on End-to-End Tweet Entity Linking. In NAACL, 2013.

[17]

B. Hecht, L. Hong, B. Suh, and E. H. Chi. Tweets from Justin Bieber's Heart: The Dynamics of the Location Field in User Profiles. In SIGCHI, pages 237--246, 2011.

Digital Library

[18]

H. Huang, Y. Cao, X. Huang, H. Ji, and C.-Y. Lin. Collective Tweet Wikification based on Semi-supervised Graph Regularization. In ACL, pages 380--390, 2014.

[19]

L. Huang, S. Fayong, and Y. Guo. Structured Perceptron with Inexact Search. In NAACL, pages 142--151, 2012.

Digital Library

[20]

D. Inkpen, J. Liu, A. Farzindar, F. Kazemi, and D. Ghazi. Detecting and Disambiguating Locations Mentioned in Twitter Messages. In Computational Linguistics and Intelligent Text Processing, pages 321--332. 2015.

[21]

S. Kinsella, V. Murdock, and N. O'Hare. "I'M Eating a Sandwich in Glasgow": Modeling Locations with Tweets. In SMUC, pages 61--68, 2011.

Digital Library

[22]

J. D. Lafferty, A. McCallum, and F. C. N. Pereira. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In ICML, pages 282--289, 2001.

Digital Library

[23]

K. Lee, R. K. Ganti, M. Srivatsa, and L. Liu. When Twitter Meets Foursquare: Tweet Location Prediction Using Foursquare. In MOBIQUITOUS, pages 198--207, 2014.

Digital Library

[24]

C. Li and A. Sun. Fine-grained Location Extraction from Tweets with Temporal Awareness. In SIGIR, pages 43--52, 2014.

Digital Library

[25]

Q. Li and H. Ji. Incremental Joint Extraction of Entity Mentions and Relations. In ACL, pages 402--412, 2014.

[26]

Q. Li, H. Ji, and L. Huang. Joint Event Extraction via Structured Prediction with Global Features. In ACL, pages 73--82, 2013.

[27]

W. Li, P. Serdyukov, A. P. de Vries, C. Eickhoff, and M. Larson. The Where in the Tweet. In CIKM, pages 2473--2476, 2011.

Digital Library

[28]

J. Lingad, S. Karimi, and J. Yin. Location Extraction from Disaster-related Microblogs. In WWW, pages 1017--1020, 2013.

Digital Library

[29]

X. Liu, Y. Li, H. Wu, M. Zhou, F. Wei, and Y. Lu. Entity Linking for Tweets. In ACL, pages 1304--1311, 2013.

[30]

G. Luo, X. Huang, C.-y. Lin, and Z. Nie. Joint Named Entity Recognition and Disambiguation. In EMNLP, pages 879--888, 2015.

[31]

J. Mahmud, J. Nichols, and C. Drews. Home Location Identification of Twitter Users. ACM Transactions on TIST, pages 47:1---47:21, 2014.

Digital Library

[32]

E. Meij, W. Weerkamp, and M. de Rijke. Adding Semantics to Microblog Posts. In WSDM, pages 563--572, 2012.

Digital Library

[33]

S. E. Middleton, L. Middleton, and S. Modafferi. Real-time Crisis Mapping of Natural Disasters using Social Media. IEEE Intelligent Systems, pages 9--17, 2014.

[34]

R. Mihalcea and A. Csomai. Wikify!: Linking Documents to Encyclopedic Knowledge. In CIKM, pages 233--242, 2007.

Digital Library

[35]

D. Milne and I. H. Witten. Learning to Link with Wikipedia. In CIKM, pages 509--518, 2008.

Digital Library

[36]

S. Paradesi. Geotagging Tweets Using Their Content. In Twenty-Fourth International FLAIRS Conference, 2011.

[37]

A. Rae, V. Murdock, A. Popescu, and H. Bouchard. Mining the Web for Points of Interest. In SIGIR, pages 711--720, 2012.

Digital Library

[38]

L. Ratinov and D. Roth. Design Challenges and Misconceptions in Named Entity Recognition. In CoNLL, pages 147--155, 2009.

Digital Library

[39]

S. Sarawagi and W. W. Cohen. Semi-markov Conditional Random Fields for Information Extraction. In NIPS, pages 1185--1192, 2004.

Digital Library

[40]

A. Schulz, A. Hadjakos, H. Paulheim, J. Nachtwey, and M. Max. A Multi-Indicator Approach for Geolocalization of Tweets. In ICWSM, pages 573--582, 2013.

[41]

W. Shen, J. Wang, P. Luo, and M. Wang. LIEGE: Link Entities in Web Lists with Knowledge Base. In SIGKDD, pages 1424--1432, 2012.

Digital Library

[42]

W. Shen, J. Wang, P. Luo, and M. Wang. Linking Named Entities in Tweets with Knowledge Base via User Interest Modeling. In SIGKDD, pages 68--76, 2013.

Digital Library

[43]

A. Sil and A. Yates. Re-ranking for Joint Named-Entity Recognition and Linking. In CIKM, pages 2369--2374, 2013.

Digital Library

[44]

J. Yin, S. Karimi, and J. Lingad. Pinpointing Locational Focus in Microblogs. In ADCS, pages 66:66---66:72, 2014.

Digital Library

[45]

Y. Zhang and S. Clark. Joint Word Segmentation and POS Tagging using a Single Perceptron. In ACL, pages 888--896, 2008.

Cited By

Sun BGong KLi WSong X(2025)MetaR: Few-Shot Named Entity Recognition With Meta-Learning and Relation NetworkIEEE Transactions on Audio, Speech and Language Processing10.1109/TASLPRO.2025.353728033(974-986)Online publication date: 2025
https://doi.org/10.1109/TASLPRO.2025.3537280
Liang YCao H(2024)Large Language Model-Driven Semi-Automated Construction of Named Entity Datasets in Specific Knowledge Domain2024 9th International Symposium on Computer and Information Processing Technology (ISCIPT)10.1109/ISCIPT61983.2024.10673354(312-318)Online publication date: 24-May-2024
https://doi.org/10.1109/ISCIPT61983.2024.10673354
Xiong DLi XGao LGao Y(2024)Domain‐adaptation‐based named entity recognition with information enrichment for equipment fault knowledge graphIET Collaborative Intelligent Manufacturing10.1049/cim2.700036:4Online publication date: 25-Nov-2024
https://doi.org/10.1049/cim2.70003
Show More Cited By

Index Terms

Joint Recognition and Linking of Fine-Grained Locations from Tweets
1. Information systems
  1. Data management systems
    1. Information integration
      1. Entity resolution
  2. Information retrieval
    1. Retrieval tasks and goals
      1. Information extraction

Recommendations

DLocRL: A Deep Learning Pipeline for Fine-Grained Location Recognition and Linking in Tweets
WWW '19: The World Wide Web Conference

In recent years, with the prevalence of social media and smart devices, people causally reveal their locations such as shops, hotels, and restaurants in their tweets. Recognizing and linking such fine-grained location mentions to well-defined location ...
Fine-grained location extraction from tweets with temporal awareness
SIGIR '14: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval

Twitter is a popular platform for sharing activities, plans, and opinions. Through tweets, users often reveal their location information and short term visiting plans. In this paper, we are interested in extracting fine-grained locations mentioned in ...
Fine-grained location prediction of non geo-tagged tweets: a multi-view learning approach
GeoAI '22: Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery

Geotagged Social Media (GTSM) data, especially geotagged tweets are valuable sources of information for many important applications. Only small portions of geotagged tweets are available (less than 3%). Identifying tweet location is a challenging ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

WWW '16: Proceedings of the 25th International Conference on World Wide Web

April 2016

1482 pages

ISBN:9781450341431

General Chairs:
Jacqueline Bourdeau
Tele-university (TELUQ), Montreal, QC, Canada
,
Jim A. Hendler
Rensselaer Polytechnic Institute, Troy, NY, USA
,
Roger Nkambou Nkambou
Université du Québec à Montréal, Montreal, QC, Canada
,
Program Chairs:
Ian Horrocks
University of Oxford, UK
,
Ben Y. Zhao
University of California at Santa Barbara, CA, USA

Copyright © 2016 Copyright is held by the International World Wide Web Conference Committee (IW3C2).

Sponsors

IW3C2: International World Wide Web Conference Committee

In-Cooperation

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Publication History

Published: 11 April 2016

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Singapore Ministry of Education

Conference

WWW '16

Sponsor:

IW3C2

WWW '16: 25th International World Wide Web Conference

April 11 - 15, 2016

Québec, Montréal, Canada

Acceptance Rates

WWW '16 Paper Acceptance Rate 115 of 727 submissions, 16%;

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

36
Total Citations
View Citations
436
Total Downloads

Downloads (Last 12 months)5
Downloads (Last 6 weeks)1

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Sun BGong KLi WSong X(2025)MetaR: Few-Shot Named Entity Recognition With Meta-Learning and Relation NetworkIEEE Transactions on Audio, Speech and Language Processing10.1109/TASLPRO.2025.353728033(974-986)Online publication date: 2025
https://doi.org/10.1109/TASLPRO.2025.3537280
Liang YCao H(2024)Large Language Model-Driven Semi-Automated Construction of Named Entity Datasets in Specific Knowledge Domain2024 9th International Symposium on Computer and Information Processing Technology (ISCIPT)10.1109/ISCIPT61983.2024.10673354(312-318)Online publication date: 24-May-2024
https://doi.org/10.1109/ISCIPT61983.2024.10673354
Xiong DLi XGao LGao Y(2024)Domain‐adaptation‐based named entity recognition with information enrichment for equipment fault knowledge graphIET Collaborative Intelligent Manufacturing10.1049/cim2.700036:4Online publication date: 25-Nov-2024
https://doi.org/10.1049/cim2.70003
Hu XElßner TZheng SSerere HKersten JKlan FQiu Q(2024)DLRGeoTweet: A comprehensive social media geocoding corpus featuring fine-grained placesInformation Processing & Management10.1016/j.ipm.2024.10374261:4(103742)Online publication date: Jul-2024
https://doi.org/10.1016/j.ipm.2024.103742
Xu HDemner Fushman DHong NRaja K(2024)Medical Concept NormalizationNatural Language Processing in Biomedicine10.1007/978-3-031-55865-8_6(137-164)Online publication date: 9-Jun-2024
https://doi.org/10.1007/978-3-031-55865-8_6
Lee HJeong O(2023)A Knowledge-Grounded Task-Oriented Dialogue System with Hierarchical Structure for Enhancing Knowledge SelectionSensors10.3390/s2302068523:2(685)Online publication date: 6-Jan-2023
https://doi.org/10.3390/s23020685
Parsaeimehr EFartash MAkbari Torkestani J(2023)Improving Feature Extraction Using a Hybrid of CNN and LSTM for Entity IdentificationNeural Processing Letters10.1007/s11063-022-11122-y55:5(5979-5994)Online publication date: 3-Jan-2023
https://doi.org/10.1007/s11063-022-11122-y
Suwaileh RElsayed TImran M(2023)Role of Geolocation Prediction in Disaster ManagementInternational Handbook of Disaster Research10.1007/978-981-19-8388-7_176(647-677)Online publication date: 1-Oct-2023
https://doi.org/10.1007/978-981-19-8388-7_176
Suwaileh RElsayed TImran M(2023)Role of Geolocation Prediction in Disaster ManagementInternational Handbook of Disaster Research10.1007/978-981-16-8800-3_176-2(1-31)Online publication date: 4-Jul-2023
https://doi.org/10.1007/978-981-16-8800-3_176-2
Suwaileh RElsayed TImran M(2023)Role of Geolocation Prediction in Disaster ManagementInternational Handbook of Disaster Research10.1007/978-981-16-8800-3_176-1(1-33)Online publication date: 14-May-2023
https://doi.org/10.1007/978-981-16-8800-3_176-1
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten