Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2671188.2749353acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Location-Based Parallel Tag Completion for Geo-tagged Social Image Retrieval

Published: 22 June 2015 Publication History

Abstract

Benefit from tremendous growth of user-generated content, social annotated tags get higher importance in organization and retrieval of large scale image database on Online Sharing Websites (OSW). To obtain high-quality tags from existing community contributed tags with missing information and noise, tag-based annotation or recommendation methods have been proposed for performance promotion of tag prediction. While images from OSW contain rich social attributes, existing studies only utilize the relations between visual content and tags to construct global information completion models. In this paper, beyond the image-tag relation, we take full advantage of the ubiquitous GPS locations and image-user relationship, to enhance the accuracy of tag prediction and improve the computational efficiency. For GPS locations, we define the popular geo-locations where people tend to take more images as Points of Interests (POI), which are discovered by mean shift approach. For image-user relationship, we integrate a localized prior constraint, expecting the completed tag sub-matrix in each POI to maintain consistency with users' tagging behaviors. Based on these two key issues, we propose a unified tag matrix completion framework which learns the image-tag relation within each POI. To solve the proposed model, an efficient proximal sub-gradient descent algorithm is designed. The model optimization can be easily parallelized and distributed to learn the tag sub-matrix for each POI. Extensive experimental results reveal that the learned tag sub-matrix of each POI reflects the major trend of users' tagging results with respect to different POIs and users, and the parallel learning process provides strong support for processing large scale online image database.

References

[1]
K. Barnard, P. Duygulu, D. Forsyth, N. De Freitas, D. M. Blei, and M. I. Jordan. Matching words and pictures. Journal of Machine Learning Research, 3:1107--1135, 2003.
[2]
G. Carneiro, A. B. Chan, P. J. Moreno, and N. Vasconcelos. Supervised learning of semantic classes for image annotation and retrieval. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 29(3):394--410, 2007.
[3]
C. Cartis, N. I. Gould, and P. L. Toint. On the evaluation complexity of composite function minimization with applications to nonconvex nonlinear programming. SIAM Journal on Optimization, 21(4):1721--1739, 2011.
[4]
L. Chen, D. Xu, I. W. Tsang, and J. Luo. Tag-based web photo retrieval improved by batch mode re-tagging. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 3440--3446. IEEE, 2010.
[5]
D. J. Crandall, L. Backstrom, D. Huttenlocher, and J. Kleinberg. Mapping the world's photos. In WWW'09, pages 761--770. ACM, 2009.
[6]
C. Desai, D. Ramanan, and C. C. Fowlkes. Discriminative models for multi-class object layout. International Journal of Computer Vision, 95(1):1--12, 2011.
[7]
K.-S. Goh, E. Y. Chang, and B. Li. Using one-class and two-class svms for multiclass image annotation. Knowledge and Data Engineering, IEEE Transactions on, 17(10):1333--1346, 2005.
[8]
A. Goldberg, B. Recht, J. Xu, R. Nowak, and X. Zhu. Transduction with matrix completion: Three birds with one stone. In Advances in neural information processing systems, pages 757--765, 2010.
[9]
M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid. Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation. In Computer Vision, 2009 IEEE 12th International Conference on, pages 309--316. IEEE, 2009.
[10]
H. Halpin, V. Robu, and H. Shepherd. The complex dynamics of collaborative tagging. In Proceedings of the 16th international conference on World Wide Web, pages 211--220. ACM, 2007.
[11]
B. Hariharan, L. Zelnik-Manor, M. Varma, and S. Vishwanathan. Large scale max-margin multi-label classification with priors. In Proceedings of the 27th International Conference on Machine Learning (ICML-10), pages 423--430, 2010.
[12]
Y.-G. Jiang, C.-W. Ngo, and J. Yang. Towards optimal bag-of-features for object categorization and semantic video retrieval. In Proceedings of the 6th ACM international conference on Image and video retrieval, pages 494--501. ACM, 2007.
[13]
M. E. Kipp and D. G. Campbell. Patterns and inconsistencies in collaborative tagging systems: An examination of tagging practices. Proceedings of the American Society for Information Science and Technology, 43(1):1--18, 2006.
[14]
X. Li, C. G. Snoek, and M. Worring. Learning social tag relevance by neighbor voting. Multimedia, IEEE Transactions on, 11(7):1310--1322, 2009.
[15]
X. Li, T. Uricchio, L. Ballan, M. Bertini, C. G. Snoek, and A. Del Bimbo. Socializing the semantic gap: A comparative survey on image tag assignment, refinement and retrieval. arXiv preprint arXiv:1503.08248, 2015.
[16]
D. Liu, S. Yan, X.-S. Hua, and H.-J. Zhang. Image retagging using collaborative tag propagation. Multimedia, IEEE Transactions on, 13(4):702--712, 2011.
[17]
J. Liu, Z. Li, J. Tang, Y. Jiang, and H. Lu. Personalized geo-specific tag recommendation for photos on social websites. IEEE Transactions on Multimedia, 16(3):588--600, 2014.
[18]
S. Liu, Y. Liu, L. M. Ni, J. Fan, and M. Li. Towards mobility-based clustering. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 919--928. ACM, 2010.
[19]
E. Moxley, J. Kleban, and B. Manjunath. Spirittagger: a geo-aware tag suggestion tool mined from flickr. In ACM MIR, pages 24--30. ACM, 2008.
[20]
B. C. Russell, A. Torralba, K. P. Murphy, and W. T. Freeman. Labelme: a database and web-based tool for image annotation. International journal of computer vision, 77(1-3):157--173, 2008.
[21]
J. Sang, C. Xu, and J. Liu. User-aware image tag refinement via ternary semantic analysis. Multimedia, IEEE Transactions on, 14(3):883--895, 2012.
[22]
A. Vedaldi and B. Fulkerson. Vlfeat: An open and portable library of computer vision algorithms. In ACM Multimedia, pages 1469--1472. ACM, 2010.
[23]
L. Wu, S. C. Hoi, R. Jin, J. Zhu, and N. Yu. Distance metric learning from uncertain side information with application to automated photo tagging. In ACM Multimedia, pages 135--144. ACM, 2009.
[24]
L. Wu, X.-S. Hua, N. Yu, W.-Y. Ma, and S. Li. Flickr distance. In ACM Multimedia'08, pages 31--40. ACM, 2008.
[25]
L. Wu, R. Jin, and A. K. Jain. Tag completion for image retrieval. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 35(3):716--727, 2013.
[26]
L. Wu, M. Li, Z. Li, W.-Y. Ma, and N. Yu. Visual language modeling for image classification. In Proceedings of the International Workshop on MIR, pages 115--124. ACM, 2007.
[27]
L. Wu, L. Yang, N. Yu, and X.-S. Hua. Learning to tag. In WWW'09, pages 361--370. ACM, 2009.
[28]
H. Xu, J. Wang, X.-S. Hua, and S. Li. Tag refinement by regularized lda. In Proceedings of the 17th ACM international conference on Multimedia, pages 573--576. ACM, 2009.
[29]
Z.-J. Zha, T. Mei, J. Wang, Z. Wang, and X.-S. Hua. Graph-based semi-supervised learning with multiple labels. Journal of Visual Communication and Image Representation, 20(2):97--103, 2009.
[30]
J. Zheng, S. Liu, and L. M. Ni. User characterization from geographic topic analysis in online social media. In Advances in Social Networks Analysis and Mining (ASONAM), 2014 IEEE/ACM International Conference on, pages 464--471. IEEE, 2014.
[31]
N. Zhou, W. K. Cheung, G. Qiu, and X. Xue. A hybrid probabilistic model for unified collaborative and content-based image tagging. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 33(7):1281--1294, 2011.
[32]
G. Zhu, S. Yan, and Y. Ma. Image tag refinement towards low-rank, content-tag prior and error sparsity. In Proceedings of the international conference on Multimedia, pages 461--470. ACM, 2010.

Cited By

View all
  • (2023)Estimating Bounding Box for Point of Interest Using Social Media Geo-Tagged PhotosIEEE Access10.1109/ACCESS.2023.323901411(7837-7849)Online publication date: 2023
  • (2019)Research of Mobile Recommender Service Based on User Preferences and Social TagsProceedings of the 2nd International Conference on Computer Science and Software Engineering10.1145/3339363.3339365(6-12)Online publication date: 24-May-2019
  • (2017)Cross-media analysis and reasoning: advances and directionsFrontiers of Information Technology & Electronic Engineering10.1631/FITEE.160178718:1(44-57)Online publication date: 4-Feb-2017
  • Show More Cited By

Index Terms

  1. Location-Based Parallel Tag Completion for Geo-tagged Social Image Retrieval

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval
    June 2015
    700 pages
    ISBN:9781450332743
    DOI:10.1145/2671188
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 June 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. geo-location information
    2. social image retrieval
    3. tag matrix completion

    Qualifiers

    • Research-article

    Funding Sources

    • Postdoctoral Science Foundation of China
    • Basic Research Program of Shenzhen
    • National Basic Research Program of China (973 Program)
    • 863 program of China
    • National Natural Science Foundation of China

    Conference

    ICMR '15
    Sponsor:

    Acceptance Rates

    ICMR '15 Paper Acceptance Rate 48 of 127 submissions, 38%;
    Overall Acceptance Rate 254 of 830 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Estimating Bounding Box for Point of Interest Using Social Media Geo-Tagged PhotosIEEE Access10.1109/ACCESS.2023.323901411(7837-7849)Online publication date: 2023
    • (2019)Research of Mobile Recommender Service Based on User Preferences and Social TagsProceedings of the 2nd International Conference on Computer Science and Software Engineering10.1145/3339363.3339365(6-12)Online publication date: 24-May-2019
    • (2017)Cross-media analysis and reasoning: advances and directionsFrontiers of Information Technology & Electronic Engineering10.1631/FITEE.160178718:1(44-57)Online publication date: 4-Feb-2017
    • (2017)Sensitive gazetteer discovery and protection for mobile social media users2017 IEEE International Conference on Big Data (Big Data)10.1109/BigData.2017.8258036(1109-1116)Online publication date: Dec-2017
    • (2017)Privacy-protected place of activity mining on big location data2017 IEEE International Conference on Big Data (Big Data)10.1109/BigData.2017.8258035(1101-1108)Online publication date: Dec-2017
    • (2017)Point of interest mining with proper semantic annotationMultimedia Tools and Applications10.1007/s11042-016-4114-776:22(23435-23457)Online publication date: 1-Nov-2017
    • (2017)JEREMIE: Joint Semantic Feature Learning via Multi-relational Matrix CompletionMobility Analytics for Spatio-Temporal and Social Data10.1007/978-3-319-73521-4_6(87-108)Online publication date: 28-Dec-2017
    • (2017)Conclusion and Future WorkMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_8(235-260)Online publication date: 1-Sep-2017
    • (2017)Adaptive News Video UploadingMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_7(205-234)Online publication date: 1-Sep-2017
    • (2017)Lecture Video SegmentationMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_6(173-203)Online publication date: 1-Sep-2017
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media