Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2525314.2525455acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Weighted multi-attribute matching of user-generated points of interest

Published: 05 November 2013 Publication History

Abstract

To a large degree, the attraction of Big Data lies in the variety of its heterogeneous multi-thematic and multi-dimensional data sources and not merely its volume. To fully exploit this variety, however, requires conflation. This is a two step process. First, one has to establish identity relations between information entities across the different data sources; and second, attribute values have to be merged according to certain procedures which avoid logical contradictions. The first step, also called matching, can be thought of as a weighted combination of common attributes according to some similarity measures. In this work, we propose such a matching based on multiple attributes of Points of Interests (POI) from the Location-based Social Network Foursquare and the Yelp local directory service. While both contain overlapping attributes that can be use for matching, they have specific strengths and weaknesses which makes their conflation desirable. We present a weighted multi-attribute matching strategy and evaluate its performance. Our strategy can automatically match 97% of randomly selected Yelp POI to their corresponding Foursquare entities.

References

[1]
B. Adams and K. Janowicz. On the geo-indicativeness of non-georeferenced text. In J. G. Breslin, N. B. Ellison, J. G. Shanahan, and Z. Tufekci, editors, ICWSM, pages 375--378. The AAAI Press, 2012.
[2]
B. Adams, L. Li, M. Raubal, and M. F. Goodchild. A general framework for conflation. Extended Abstracts Volume, GIScience 2010, 2010.
[3]
J. Bao, Y. Zheng, and M. F. Mokbel. Location-based & preference-aware recommendation using sparse geo-social networking data. In ACM SIGSPATIAL, 2012.
[4]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, 2003.
[5]
C.-C. Chen, C. A. Knoblock, and C. Shahabi. Automatically conflating road vector data with orthoimagery. GeoInformatica, 10(4):495--530, 2006.
[6]
Z. Cheng, J. Caverlee, and K. Lee. A content-driven framework for geolocating microblog users. ACM Transactions on Intelligent Systems and Technology, 4(1):2, 2013.
[7]
A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios. Duplicate record detection: A survey. Knowledge & Data Engineering, IEEE Transactions, 19(1):1--16, 2007.
[8]
J. Hastings. Automated conflation of digital gazetteer data. International Journal of Geographical Information Science, 22(10):1109--1127, 2008.
[9]
C. B. Jones and R. S. Purves. Geographical information retrieval. Int. Journal of Geographical Information Science, 22(3):219--228, 2008.
[10]
A. Lait and B. Randell. An assessment of name matching algorithms. Technical Report Series-University of Newcastle Upon Tyne Computing Science, 1996.
[11]
L. Li and M. F. Goodchild. An optimisation model for linear feature matching in geographical data conflation. International Journal of Image and Data Fusion, 2(4):309--328, 2011.
[12]
Q. Li, Y. Zheng, X. Xie, Y. Chen, W. Liu, and W.-Y. Ma. Mining user similarity based on location history. In ACM SIGSPATIAL 2008, page 34. ACM, 2008.
[13]
A. K. McCallum. Mallet: A machine learning for language toolkit. http://mallet.cs.umass.edu, 2002.
[14]
G. McKenzie, B. Adams, and K. Janowicz. A thematic approach to user similarity built on geosocial check-ins. In Proceedings of the 2013 AGILE Conference, 2013.
[15]
C. Mülligann, K. Janowicz, M. Ye, and W.-C. Lee. Analyzing the spatial-semantic interaction of points of interest in volunteered geographic information. Spatial Information Theory, pages 350--370, 2011.
[16]
Y. Wu and S. Winter. Inferring relevant gazetteer instances to a placename. In 10th International Conference on GeoComputation. UNSW, Sydney, Australia, 2009.

Cited By

View all
  • (2024)A Semantic-Spatial Aware Data Conflation Approach for Place Knowledge GraphsISPRS International Journal of Geo-Information10.3390/ijgi1304010613:4(106)Online publication date: 22-Mar-2024
  • (2023)Research on the Spatial and Temporal Dynamics of Crowd Activities in Commercial Streets and Their Relationship with Formats—A Case Study of Lao Men Dong Commercial Street in NanjingSustainability10.3390/su15241683815:24(16838)Online publication date: 14-Dec-2023
  • (2022)A System for Aligning Geographical Entities from Large Heterogeneous SourcesISPRS International Journal of Geo-Information10.3390/ijgi1102009611:2(96)Online publication date: 28-Jan-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGSPATIAL'13: Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
November 2013
598 pages
ISBN:9781450325219
DOI:10.1145/2525314
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 November 2013

Check for updates

Author Tags

  1. POI
  2. conflation
  3. location-based services
  4. point of interest
  5. similarity
  6. volunteered geographic information

Qualifiers

  • Research-article

Conference

SIGSPATIAL'13
Sponsor:

Acceptance Rates

Overall Acceptance Rate 220 of 1,116 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A Semantic-Spatial Aware Data Conflation Approach for Place Knowledge GraphsISPRS International Journal of Geo-Information10.3390/ijgi1304010613:4(106)Online publication date: 22-Mar-2024
  • (2023)Research on the Spatial and Temporal Dynamics of Crowd Activities in Commercial Streets and Their Relationship with Formats—A Case Study of Lao Men Dong Commercial Street in NanjingSustainability10.3390/su15241683815:24(16838)Online publication date: 14-Dec-2023
  • (2022)A System for Aligning Geographical Entities from Large Heterogeneous SourcesISPRS International Journal of Geo-Information10.3390/ijgi1102009611:2(96)Online publication date: 28-Jan-2022
  • (2020)Different Sourcing Point of Interest Matching Method Considering Multiple ConstraintsISPRS International Journal of Geo-Information10.3390/ijgi90402149:4(214)Online publication date: 31-Mar-2020
  • (2019)SONETProceedings of the 3rd ACM SIGSPATIAL International Workshop on Geospatial Humanities10.1145/3356991.3365474(1-6)Online publication date: 5-Nov-2019
  • (2019)Spatial Entity Matching with GeoAlign (demo paper)Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems10.1145/3347146.3359345(580-583)Online publication date: 5-Nov-2019
  • (2019)Automatic analysis of positional plausibility for points of interest in OpenStreetMap using coexistence patternsInternational Journal of Geographical Information Science10.1080/13658816.2019.1584803(1-24)Online publication date: 5-Mar-2019
  • (2018)Venue matching in social network APIs using neural networksProceedings of the 22nd Pan-Hellenic Conference on Informatics10.1145/3291533.3291558(133-138)Online publication date: 29-Nov-2018
  • (2016)Open Data for Local SearchProceedings of the 25th International Conference Companion on World Wide Web10.1145/2872518.2890487(641-644)Online publication date: 11-Apr-2016
  • (2016)Automatic geospatial data conflation using semantic web technologiesProceedings of the Australasian Computer Science Week Multiconference10.1145/2843043.2843375(1-10)Online publication date: 1-Feb-2016

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media