Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3139958.3139965acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

LiveMaps: Learning Geo-Intent from Images of Maps on a Large Scale

Published: 07 November 2017 Publication History

Abstract

Image search is a popular application on web search engines. Issuing a location-related query on an image search engine often returns multiple images of maps among the top ranked results. Traditionally, clicking on such images either opens the image in a new browser tab or takes users to a web page containing the image. However, finding the area of intent on an interactive web map (e.g., Bing Maps) is a manual process. In this paper, we describe a novel system, LiveMaps, for analyzing and retrieving an appropriate map viewport for a given image of a map. This provides annotation of images of maps returned by image search engines, allowing users to directly open a link to an interactive map centered on the location of interest.
LiveMaps works in several stages. It first checks whether the input image represents a map. If yes, then the system attempts to identify what geographical area this map image represents. In the process, we use textual as well as visual information extracted from the image. Finally, we construct an interactive map object capturing the geographical area inferred for the image. Evaluation results on a dataset of labeled map images indicate our system constructs precise map representations while also achieving good levels of coverage.

References

[1]
2017. Bing Images. (2017). http://www.bing.com/images
[2]
2017. Bing Maps. (2017). http://www.bing.com/maps
[3]
Sean Bell and Kavita Bala. 2015. Learning visual similarity for product design with convolutional neural networks. ACM Transactions on Graphics (TOG) 34, 4 (2015), 98.
[4]
Pavel Berkhin, Michael R. Evans, Florin Teodorescu, Wei Wu, and Dragomir Yankov. 2015. A New Approach to Geocoding: BingGC (SIGSPATIAL '15). ACM, New York, NY, USA.
[5]
Gal Chechik, Varun Sharma, Uri Shalit, and Samy Bengio. 2010. Large scale online learning of image similarity through ranking. Journal of Machine Learning Research 11, Mar (2010), 1109--1135.
[6]
J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. 2009. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR09.
[7]
Zhengming Ding, Nasser M Nasrabadi, and Yun Fu. 2016. Task-driven deep transfer learning for image classification. In Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on. IEEE, 2414--2418.
[8]
Michael R. Evans, Dragomir Yankov, Pavel Berkhin, Pavel Yudin, Florin Teodorescu, and Wei Wu. 2017. LiveMaps - Converting Map Images into Interactive Maps. In SIGIR '17. Shinjuku, Japan.
[9]
Jerome H Friedman. 2001. Greedy function approximation: a gradient boosting machine. Annals of statistics (2001).
[10]
Andrew Gallagher, Dhiraj Joshi, Jie Yu, and Jiebo Luo. 2009. Geo-location inference from image content and user tags. In CVPR Workshops 2009. IEEE, 55--62.
[11]
Moritz Hardt and Tengyu Ma. 2016. Identity Matters in Deep Learning. (2016). http://arxiv.org/abs/1611.04231
[12]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In CVPR16.
[13]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. 1097--1105.
[14]
Sihan Li, Jiantao Jiao, Yanjun Han, and Tsachy Weissman. 2016. Demystifying ResNet. (2016). http://arxiv.org/abs/1611.01186
[15]
Yunpeng Li, Noah Snavely, Daniel P Huttenlocher, and Pascal Fua. 2016. Worldwide pose estimation using 3d point clouds. In Large-Scale Visual Geo-Localization. Springer, 147--163.
[16]
Heng Liu, Tao Mei, Jiebo Luo, Houqiang Li, and Shipeng Li. 2012. Finding perfect rendezvous on the go: accurate mobile visual localization and its applications to routing. In ACM Multimedia '12. ACM, 9--18.
[17]
Jiebo Luo, Dhiraj Joshi, Jie Yu, and Andrew Gallagher. 2011. Geotagging in multimedia and computer vision - survey. Multimedia Tools and Applications 51, 1 (2011), 187--211.
[18]
Sinno Jialin Pan and Qiang Yang. 2010. A survey on transfer learning. IEEE Transactions on knowledge and data engineering 22, 10 (2010), 1345--1359.
[19]
Arun Sacheti and Eason Want. 2015. The Image Graph-Powering the Next Generation of Bing Image Search. https://blogs.bing.com/search-quality-insights/2015/04/22/the-image-graph-powering-the-next-generation-of-bing-image-search. (2015).
[20]
Hanan Samet and Aya Soffer. 1996. Marco: Map retrieval by content. IEEE Transactions on Pattern Analysis and Machine Intelligence 18, 8 (1996), 783--798.
[21]
Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. (2014). http://arxiv.org/abs/1409.1556
[22]
Krysta M Svore and CJ Burges. 2011. Large-scale learning to rank using boosted decision trees. Scaling Up Machine Learning: Parallel and Distributed Approaches 2 (2011).
[23]
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott E. Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going deeper with convolutions. In CVPR. IEEE Computer Society, 1--9.
[24]
Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jon Shlens, and Zbigniew Wojna. 2016. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2818--2826.
[25]
Benjamin E Teitler, Michael D Lieberman, Daniele Panozzo, Jagan Sankaranarayanan, Hanan Samet, and Jon Sperling. 2008. NewsStand: A new view on news. In Proceedings of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems. ACM, 18.
[26]
Jiang Wang, Yang Song, Thomas Leung, Chuck Rosenberg, Jingbin Wang, James Philbin, Bo Chen, and Ying Wu. 2014. Learning fine-grained image similarity with deep ranking. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1386--1393.
[27]
Zhou Wang and Eero P Simoncelli. 2005. Translation insensitive image similarity in complex wavelet domain. In Acoustics, Speech, and Signal Processing, 2005. Proceedings.(ICASSP'05). IEEE International Conference on, Vol. 2. IEEE, ii--573.
[28]
Benjamin P. Wing and Jason Baldridge. 2011. Simple Supervised Document Geolocation with Geodesic Grids. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies -Volume 1 (HLT '11). Association for Computational Linguistics, Stroudsburg, PA, USA, 955--964. http://dl.acm.org/citation.cfm?id=2002472.2002593
[29]
Ruimao Zhang, Liang Lin, Rui Zhang, Wangmeng Zuo, and Lei Zhang. 2015. Bit-scalable deep hashing with regularized similarity learning for image retrieval and person re-identification. IEEE Transactions on Image Processing 24, 12 (2015), 4766--4779.
[30]
Yan-Tao Zheng, Ming Zhao, Yang Song, Hartwig Adam, Ulrich Buddemeier, Alessandro Bissacco, Fernando Brucher, Tat-Seng Chua, and Hartmut Neven. 2009. Tour the world: building a web-scale landmark recognition engine. In Computer vision and pattern recognition, 2009. CVPR 2009. IEEE conference on. IEEE, 1085--1092.
[31]
Fuzhen Zhuang, Xiaohu Cheng, Ping Luo, Sinno Jialin Pan, and Qing He. 2015. Supervised Representation Learning: Transfer Learning with Deep Autoencoders. In IJCAI. 4119--4125.

Cited By

View all
  • (2024)Artificial intelligence studies in cartography: a review and synthesis of methods, applications, and ethicsCartography and Geographic Information Science10.1080/15230406.2023.2295943(1-32)Online publication date: 16-Jan-2024
  • (2021)Enriching the metadata of map images: a deep learning approach with GIS-based data augmentationInternational Journal of Geographical Information Science10.1080/13658816.2021.196840736:4(799-821)Online publication date: 23-Aug-2021

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGSPATIAL '17: Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
November 2017
677 pages
ISBN:9781450354905
DOI:10.1145/3139958
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 November 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Geographic Information Retrieval
  2. Image Search
  3. Map Search

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

SIGSPATIAL'17
Sponsor:

Acceptance Rates

SIGSPATIAL '17 Paper Acceptance Rate 39 of 193 submissions, 20%;
Overall Acceptance Rate 220 of 1,116 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Artificial intelligence studies in cartography: a review and synthesis of methods, applications, and ethicsCartography and Geographic Information Science10.1080/15230406.2023.2295943(1-32)Online publication date: 16-Jan-2024
  • (2021)Enriching the metadata of map images: a deep learning approach with GIS-based data augmentationInternational Journal of Geographical Information Science10.1080/13658816.2021.196840736:4(799-821)Online publication date: 23-Aug-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media