research-article

Location-Based Parallel Tag Completion for Geo-tagged Social Image Retrieval

Authors:

Qingming HuangAuthors Info & Claims

ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval

Pages 355 - 362

https://doi.org/10.1145/2671188.2749353

Published: 22 June 2015 Publication History

Abstract

Benefit from tremendous growth of user-generated content, social annotated tags get higher importance in organization and retrieval of large scale image database on Online Sharing Websites (OSW). To obtain high-quality tags from existing community contributed tags with missing information and noise, tag-based annotation or recommendation methods have been proposed for performance promotion of tag prediction. While images from OSW contain rich social attributes, existing studies only utilize the relations between visual content and tags to construct global information completion models. In this paper, beyond the image-tag relation, we take full advantage of the ubiquitous GPS locations and image-user relationship, to enhance the accuracy of tag prediction and improve the computational efficiency. For GPS locations, we define the popular geo-locations where people tend to take more images as Points of Interests (POI), which are discovered by mean shift approach. For image-user relationship, we integrate a localized prior constraint, expecting the completed tag sub-matrix in each POI to maintain consistency with users' tagging behaviors. Based on these two key issues, we propose a unified tag matrix completion framework which learns the image-tag relation within each POI. To solve the proposed model, an efficient proximal sub-gradient descent algorithm is designed. The model optimization can be easily parallelized and distributed to learn the tag sub-matrix for each POI. Extensive experimental results reveal that the learned tag sub-matrix of each POI reflects the major trend of users' tagging results with respect to different POIs and users, and the parallel learning process provides strong support for processing large scale online image database.

References

[1]

K. Barnard, P. Duygulu, D. Forsyth, N. De Freitas, D. M. Blei, and M. I. Jordan. Matching words and pictures. Journal of Machine Learning Research, 3:1107--1135, 2003.

Digital Library

[2]

G. Carneiro, A. B. Chan, P. J. Moreno, and N. Vasconcelos. Supervised learning of semantic classes for image annotation and retrieval. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 29(3):394--410, 2007.

Digital Library

[3]

C. Cartis, N. I. Gould, and P. L. Toint. On the evaluation complexity of composite function minimization with applications to nonconvex nonlinear programming. SIAM Journal on Optimization, 21(4):1721--1739, 2011.

Digital Library

[4]

L. Chen, D. Xu, I. W. Tsang, and J. Luo. Tag-based web photo retrieval improved by batch mode re-tagging. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 3440--3446. IEEE, 2010.

[5]

D. J. Crandall, L. Backstrom, D. Huttenlocher, and J. Kleinberg. Mapping the world's photos. In WWW'09, pages 761--770. ACM, 2009.

Digital Library

[6]

C. Desai, D. Ramanan, and C. C. Fowlkes. Discriminative models for multi-class object layout. International Journal of Computer Vision, 95(1):1--12, 2011.

Digital Library

[7]

K.-S. Goh, E. Y. Chang, and B. Li. Using one-class and two-class svms for multiclass image annotation. Knowledge and Data Engineering, IEEE Transactions on, 17(10):1333--1346, 2005.

Digital Library

[8]

A. Goldberg, B. Recht, J. Xu, R. Nowak, and X. Zhu. Transduction with matrix completion: Three birds with one stone. In Advances in neural information processing systems, pages 757--765, 2010.

Digital Library

[9]

M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid. Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation. In Computer Vision, 2009 IEEE 12th International Conference on, pages 309--316. IEEE, 2009.

[10]

H. Halpin, V. Robu, and H. Shepherd. The complex dynamics of collaborative tagging. In Proceedings of the 16th international conference on World Wide Web, pages 211--220. ACM, 2007.

Digital Library

[11]

B. Hariharan, L. Zelnik-Manor, M. Varma, and S. Vishwanathan. Large scale max-margin multi-label classification with priors. In Proceedings of the 27th International Conference on Machine Learning (ICML-10), pages 423--430, 2010.

Digital Library

[12]

Y.-G. Jiang, C.-W. Ngo, and J. Yang. Towards optimal bag-of-features for object categorization and semantic video retrieval. In Proceedings of the 6th ACM international conference on Image and video retrieval, pages 494--501. ACM, 2007.

Digital Library

[13]

M. E. Kipp and D. G. Campbell. Patterns and inconsistencies in collaborative tagging systems: An examination of tagging practices. Proceedings of the American Society for Information Science and Technology, 43(1):1--18, 2006.

[14]

X. Li, C. G. Snoek, and M. Worring. Learning social tag relevance by neighbor voting. Multimedia, IEEE Transactions on, 11(7):1310--1322, 2009.

Digital Library

[15]

X. Li, T. Uricchio, L. Ballan, M. Bertini, C. G. Snoek, and A. Del Bimbo. Socializing the semantic gap: A comparative survey on image tag assignment, refinement and retrieval. arXiv preprint arXiv:1503.08248, 2015.

[16]

D. Liu, S. Yan, X.-S. Hua, and H.-J. Zhang. Image retagging using collaborative tag propagation. Multimedia, IEEE Transactions on, 13(4):702--712, 2011.

Digital Library

[17]

J. Liu, Z. Li, J. Tang, Y. Jiang, and H. Lu. Personalized geo-specific tag recommendation for photos on social websites. IEEE Transactions on Multimedia, 16(3):588--600, 2014.

Digital Library

[18]

S. Liu, Y. Liu, L. M. Ni, J. Fan, and M. Li. Towards mobility-based clustering. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 919--928. ACM, 2010.

Digital Library

[19]

E. Moxley, J. Kleban, and B. Manjunath. Spirittagger: a geo-aware tag suggestion tool mined from flickr. In ACM MIR, pages 24--30. ACM, 2008.

Digital Library

[20]

B. C. Russell, A. Torralba, K. P. Murphy, and W. T. Freeman. Labelme: a database and web-based tool for image annotation. International journal of computer vision, 77(1-3):157--173, 2008.

Digital Library

[21]

J. Sang, C. Xu, and J. Liu. User-aware image tag refinement via ternary semantic analysis. Multimedia, IEEE Transactions on, 14(3):883--895, 2012.

Digital Library

[22]

A. Vedaldi and B. Fulkerson. Vlfeat: An open and portable library of computer vision algorithms. In ACM Multimedia, pages 1469--1472. ACM, 2010.

Digital Library

[23]

L. Wu, S. C. Hoi, R. Jin, J. Zhu, and N. Yu. Distance metric learning from uncertain side information with application to automated photo tagging. In ACM Multimedia, pages 135--144. ACM, 2009.

Digital Library

[24]

L. Wu, X.-S. Hua, N. Yu, W.-Y. Ma, and S. Li. Flickr distance. In ACM Multimedia'08, pages 31--40. ACM, 2008.

Digital Library

[25]

L. Wu, R. Jin, and A. K. Jain. Tag completion for image retrieval. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 35(3):716--727, 2013.

Digital Library

[26]

L. Wu, M. Li, Z. Li, W.-Y. Ma, and N. Yu. Visual language modeling for image classification. In Proceedings of the International Workshop on MIR, pages 115--124. ACM, 2007.

Digital Library

[27]

L. Wu, L. Yang, N. Yu, and X.-S. Hua. Learning to tag. In WWW'09, pages 361--370. ACM, 2009.

Digital Library

[28]

H. Xu, J. Wang, X.-S. Hua, and S. Li. Tag refinement by regularized lda. In Proceedings of the 17th ACM international conference on Multimedia, pages 573--576. ACM, 2009.

Digital Library

[29]

Z.-J. Zha, T. Mei, J. Wang, Z. Wang, and X.-S. Hua. Graph-based semi-supervised learning with multiple labels. Journal of Visual Communication and Image Representation, 20(2):97--103, 2009.

Digital Library

[30]

J. Zheng, S. Liu, and L. M. Ni. User characterization from geographic topic analysis in online social media. In Advances in Social Networks Analysis and Mining (ASONAM), 2014 IEEE/ACM International Conference on, pages 464--471. IEEE, 2014.

Digital Library

[31]

N. Zhou, W. K. Cheung, G. Qiu, and X. Xue. A hybrid probabilistic model for unified collaborative and content-based image tagging. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 33(7):1281--1294, 2011.

Digital Library

[32]

G. Zhu, S. Yan, and Y. Ma. Image tag refinement towards low-rank, content-tag prior and error sparsity. In Proceedings of the international conference on Multimedia, pages 461--470. ACM, 2010.

Digital Library

Cited By

Bui T(2023)Estimating Bounding Box for Point of Interest Using Social Media Geo-Tagged PhotosIEEE Access10.1109/ACCESS.2023.323901411(7837-7849)Online publication date: 2023
https://doi.org/10.1109/ACCESS.2023.3239014
Song WZhou LChen MZheng C(2019)Research of Mobile Recommender Service Based on User Preferences and Social TagsProceedings of the 2nd International Conference on Computer Science and Software Engineering10.1145/3339363.3339365(6-12)Online publication date: 24-May-2019
https://dl.acm.org/doi/10.1145/3339363.3339365
Peng YZhu WZhao YXu CHuang QLu HZheng QHuang TGao W(2017)Cross-media analysis and reasoning: advances and directionsFrontiers of Information Technology & Electronic Engineering10.1631/FITEE.160178718:1(44-57)Online publication date: 4-Feb-2017
https://doi.org/10.1631/FITEE.1601787
Show More Cited By

Index Terms

Location-Based Parallel Tag Completion for Geo-tagged Social Image Retrieval
1. Information systems
  1. Information retrieval
    1. Document representation

Recommendations

Location-Based Parallel Tag Completion for Geo-Tagged Social Image Retrieval
Special Issue: Mobile Social Multimedia Analytics in the Big Data Era and Regular Papers

Having benefited from tremendous growth of user-generated content, social annotated tags get higher importance in the organization and retrieval of large-scale image databases on Online Sharing Websites (OSW). To obtain high-quality tags from existing ...
Learning tag relevance by neighbor voting for social image retrieval
MIR '08: Proceedings of the 1st ACM international conference on Multimedia information retrieval

Social image retrieval is important for exploiting the increasing amounts of amateur-tagged multimedia such as Flickr images. Since amateur tagging is known to be uncontrolled, ambiguous, and personalized, a fundamental problem is how to reliably ...
Tag relevance fusion for social image retrieval

Due to the subjective nature of social tagging, measuring the relevance of social tags with respect to the visual content is crucial for retrieving the increasing amounts of social-networked images. Witnessing the limit of a single measurement of tag ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval

June 2015

700 pages

ISBN:9781450332743

DOI:10.1145/2671188

General Chairs:
Alex Hauptmann
Carnegie Mellon University, USA
,
Chong-Wah Ngo
City University of Hong Kong, China
,
Xiangyang Xue
Fudan University, China
,
Program Chairs:
Yu-Gang Jiang
Fudan University, China
,
Cees Snoek
University of Amsterdam and Qualcomm Research Netherlands
,
Nuno Vasconcelos
University of California, San Diego, USA

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 June 2015

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Postdoctoral Science Foundation of China
Basic Research Program of Shenzhen
National Basic Research Program of China (973 Program)
863 program of China
National Natural Science Foundation of China

Conference

ICMR '15

Sponsor:

SIGMM

ICMR '15: International Conference on Multimedia Retrieval

June 23 - 26, 2015

Shanghai, China

Acceptance Rates

ICMR '15 Paper Acceptance Rate 48 of 127 submissions, 38%;

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

16
Total Citations
View Citations
198
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 13 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Bui T(2023)Estimating Bounding Box for Point of Interest Using Social Media Geo-Tagged PhotosIEEE Access10.1109/ACCESS.2023.323901411(7837-7849)Online publication date: 2023
https://doi.org/10.1109/ACCESS.2023.3239014
Song WZhou LChen MZheng C(2019)Research of Mobile Recommender Service Based on User Preferences and Social TagsProceedings of the 2nd International Conference on Computer Science and Software Engineering10.1145/3339363.3339365(6-12)Online publication date: 24-May-2019
https://dl.acm.org/doi/10.1145/3339363.3339365
Peng YZhu WZhao YXu CHuang QLu HZheng QHuang TGao W(2017)Cross-media analysis and reasoning: advances and directionsFrontiers of Information Technology & Electronic Engineering10.1631/FITEE.160178718:1(44-57)Online publication date: 4-Feb-2017
https://doi.org/10.1631/FITEE.1601787
Wang SSinnott RNepal S(2017)Sensitive gazetteer discovery and protection for mobile social media users2017 IEEE International Conference on Big Data (Big Data)10.1109/BigData.2017.8258036(1109-1116)Online publication date: Dec-2017
https://doi.org/10.1109/BigData.2017.8258036
Wang SSinnott RNepal S(2017)Privacy-protected place of activity mining on big location data2017 IEEE International Conference on Big Data (Big Data)10.1109/BigData.2017.8258035(1101-1108)Online publication date: Dec-2017
https://doi.org/10.1109/BigData.2017.8258035
Bui TPark S(2017)Point of interest mining with proper semantic annotationMultimedia Tools and Applications10.1007/s11042-016-4114-776:22(23435-23457)Online publication date: 1-Nov-2017
https://dl.acm.org/doi/10.1007/s11042-016-4114-7
Zhang JWang SQu QHuang Q(2017)JEREMIE: Joint Semantic Feature Learning via Multi-relational Matrix CompletionMobility Analytics for Spatio-Temporal and Social Data10.1007/978-3-319-73521-4_6(87-108)Online publication date: 28-Dec-2017
https://doi.org/10.1007/978-3-319-73521-4_6
Shah RZimmermann RShah RZimmermann R(2017)Conclusion and Future WorkMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_8(235-260)Online publication date: 1-Sep-2017
https://doi.org/10.1007/978-3-319-61807-4_8
Shah RZimmermann RShah RZimmermann R(2017)Adaptive News Video UploadingMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_7(205-234)Online publication date: 1-Sep-2017
https://doi.org/10.1007/978-3-319-61807-4_7
Shah RZimmermann RShah RZimmermann R(2017)Lecture Video SegmentationMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_6(173-203)Online publication date: 1-Sep-2017
https://doi.org/10.1007/978-3-319-61807-4_6
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten