Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2448556.2448577acmconferencesArticle/Chapter ViewAbstractPublication PagesicuimcConference Proceedingsconference-collections
research-article

Multi-label image annotation based on multi-model

Published: 17 January 2013 Publication History

Abstract

Image automatic annotation is a promising and essential step for semantic image retrieval, and it's still a challenge because of the open problem of semantic gap. Recently, most of image annotation approaches paid more attention to detect single label for an image, but in fact they are multi-label learning problems. In this paper, we propose a new multi-model method for image multi-label annotation, which includes two different models for foreground and background semantic detection in terms of their distinct characters of semantic and visual features respectively, and a semantic correlation analysis model for refining the annotation results. A new visual saliency analysis algorithm based on multi-feature is proposed to obtaining the salient object, and multiple Nyström-approximating kernel discriminant analysis is used to acquire foreground semantic concept. Region semantic analysis is proposed to get annotation words of background, and semantic correlation matrix constructed by Latent Semantic Analysis is used to remove the unreliable labels. Experimental results show that our multi-model image labeling method could achieve promising performance for multi-labeling, and outperform previous methods on benchmark datasets.

References

[1]
Tsoumakas, G., Katakis, I. and Vlahavas, I. 2010. Mining Multi-Label Data. Journal of Data Mining and Knowledge Discovery Handbook, Part 6, Springer, (2010), 667--685.
[2]
Zhang, M. L., Zhou, Z. H. MI-KNN: A lazy learning approach to multi-label learning. Journal of Pattern Recognition, (2007), 2038--2048.
[3]
Su, J. H., Chou, C. L., Lin C. Y. and Tseng, V. S. 2001. Effective semantic annotation by image-to-concept distribution model. IEEE Transaction on Multimedia. 13, 3(Jun. 2011), 530--538.
[4]
Hu, J., Lam, K. M. and Qiu, G. 2010. A hierarchical algorithm for image multi-labeling. In Proceedings of the IEEE 17th International Conference on Image Process, (Sept. 2010), 2349--2352.
[5]
Mylonas, P., Spyrou, E., Avrithis, Y., and Kollias, S. 2009. Using visual context and region semantics for high-level concept detection. Journal of IEEE Transaction on Multimedia. 11, 2(Feb. 2009). 229--243.
[6]
Tsoumakas, G., Katakis, I., Vlahvas, I. 2010. Random k-Label sets for Multi-Label Classification, Journal of IEEE Transactions on Knowledge and Data Engineering. 23, Los Alamitos, CA, USA (2010). 1--12.
[7]
Spyromitros, E., Tsoumakas, G., Vlahavas, I. 2008. An empirical study of lazy multilabel classification algorithms. In Proceedings of Hellenic conference on Artificial Intelligence: Theories, Models and Applications. (2008). 401--406.
[8]
Harel, J., Koch, C., Perona, P. 2006. Graph-based visual saliency. In Proceedings of Neural Information Processing Systems. (Dec. 2006). 545--552.
[9]
Z. Wang, W. B. Jie, and D. Q. Gao. A novel multiple Nystrom-approximating kernel discriminant analysis. Journal of Neurocomputing, in press.
[10]
Landauera, T. K., Foltzb, P. W., Lahamc, D. 1988. An introduction of latent semantic analysis. Journal of Transaction on Discourse Processes. 25, 2--3. (1988).259--284.
[11]
Naphade, M., Smith, J. R., Tesic, J., Chang, S. F., Hsu, W., Kennedy, L., Hauptmann, A. et al. 2006. Large-Scale Concept Ontology for Multimedia. Journal of Transaction on Multimedia. 13, 3. (Jul. 2006). 86--91.
[12]
Liu, T., Yuan, Z. J., Sun, J., Wang J. D. 2011. Learning to detect a salient object. Journal of IEEE Transactions on Pattern Analysis and Machine Intelligence. 2, 33(Feb. 2011). 1--8.

Cited By

View all
  • (2018)A novel image annotation model based on content representation with multi-layer segmentationNeural Computing and Applications10.1007/s00521-014-1815-626:6(1407-1422)Online publication date: 27-Dec-2018
  • (2015)Representation of image content based on RoI-BoWJournal of Visual Communication and Image Representation10.1016/j.jvcir.2014.10.00726:C(37-49)Online publication date: 1-Jan-2015

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ICUIMC '13: Proceedings of the 7th International Conference on Ubiquitous Information Management and Communication
January 2013
772 pages
ISBN:9781450319584
DOI:10.1145/2448556
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 January 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. MFBSA
  2. MNKDA
  3. RSA
  4. image annotation
  5. multi-label
  6. multi-model

Qualifiers

  • Research-article

Funding Sources

Conference

ICUIMC '13
Sponsor:

Acceptance Rates

Overall Acceptance Rate 251 of 941 submissions, 27%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2018)A novel image annotation model based on content representation with multi-layer segmentationNeural Computing and Applications10.1007/s00521-014-1815-626:6(1407-1422)Online publication date: 27-Dec-2018
  • (2015)Representation of image content based on RoI-BoWJournal of Visual Communication and Image Representation10.1016/j.jvcir.2014.10.00726:C(37-49)Online publication date: 1-Jan-2015

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media