Abstract
In this demonstration, we present Country Guesser, a live system that guesses the country that a photo is taken in. In particular, given a Google Street View image, our federated ranking model uses a combination of computer vision, machine learning and text retrieval methods to compute a ranking of likely countries of the location shown in a given image from Street View. Interestingly, using text-based features to probe large pre-trained language models can assist to provide cross-modal supervision. We are not aware of previous country guessing systems informed by visual and textual features.
The bulk of this work was undertaken by the first author during his Master’s thesis. We gratefully acknowledge the funding by the High Tech Agenda Bavaria (https://www.bayern.de/wp-content/uploads/2019/10/Regierungserklaerung_101019_engl.pdf) to the third author that supported part of this research.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Between acceptance of this paper and our preparation of the publication version, we found another work, [1] which attempts country guessing of photos that are not necessarily from Street View, which is very much in the spirit of this paper.
- 2.
https://gitlab.com/Timperator/which-country-is-this (cited 2022-01-10).
References
Alamayreh, O., Dimitri, G.M., Wang, J., Tondi, B., Barni, M.: Which country is this picture from? New data and methods for DNN-based country recognition, September 2022. Unpublished Manuscript, Cornell University, ArXiv Pre-Print Server. https://arxiv.org/abs/2209.02429
Brejcha, J., Čadík, M.: State-of-the-art in visual geo-localization. Pattern Anal. Appl. 20(3), 613–637 (2017). https://doi.org/10.1007/s10044-017-0611-1
Hays, J., Efros, A.A.: IM2GPS: estimating geographic information from a single image. In: 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE (2008)
Higgins, E.: We Are Bellingcat: Global Crime, Online Sleuths, and the Bold Future of News. Bloomsbury, London (2021)
Jocher, G., et al.: ultralytics/yolov5: v6.1 - TensorRT, TensorFlow edge TPU and OpenVINO export and inference (2022). https://doi.org/10.5281/ZENODO.6222936. https://zenodo.org/record/6222936
Kittinaradorn, R.: Easy OCR. Software (2022). https://github.com/JaidedAI/EasyOCR
Leidner, J.L.: Toponym Resolution in Text: Annotation, Evaluation and Applications of Spatial Grounding of Place Names. Universal Press, Boca Raton (2008)
Mokady, R., Hertz, A., Bermano, A.H.: ClipCap: CLIP prefix for image captioning. arXiv preprint arXiv:2111.09734 (2021)
Nakatani, S.: Language detection library for Java (2010). https://github.com/shuyo/language-detection
Royal Geographical Society (ed.): Where in the World – Castles & Walled Cities. Geographical (2022). https://geographical.co.uk/crossword-and-quizzes/where-in-the-world-castles-walled-cities. Accessed 22 Aug 2022
Stahl, P.M.: Lingua (2022). https://github.com/pemistahl/lingua-py
Theethira, N.S.P., Ravindranath, D.: GeoguessrLSTM. Technical report, University of Colorado (unpublished). https://github.com/Nirvan66/geoguessrLSTM/blob/master/documentation/CSCI5922_ProjectReport.pdf. Accessed 14 Oct 2022
Weyand, T., Kostrikov, I., Philbin, J.: PlaNet - photo geolocation with convolutional neural networks. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9912, pp. 37–55. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46484-8_3
Zamir, A.R., Shah, M.: Accurate image localization based on Google Maps Street view. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 255–268. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15561-1_19
Zhang, W., Kosecka, J.: Image based localization in urban environments. In: Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT 2006), pp. 33–40. IEEE (2006)
Acknowledgements
The authors wish to thank the three anonymous reviewers, whose comments improved the quality of this paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Menzner, T., Mittag, F., Leidner, J.L. (2023). Which Country Is This? Automatic Country Ranking of Street View Photos. In: Kamps, J., et al. Advances in Information Retrieval. ECIR 2023. Lecture Notes in Computer Science, vol 13982. Springer, Cham. https://doi.org/10.1007/978-3-031-28241-6_26
Download citation
DOI: https://doi.org/10.1007/978-3-031-28241-6_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-28240-9
Online ISBN: 978-3-031-28241-6
eBook Packages: Computer ScienceComputer Science (R0)