Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2020408.2020592acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
poster

Mining partially annotated images

Published: 21 August 2011 Publication History

Abstract

In this paper, we study the problem of mining partially annotated images. We first define what the problem of mining partially annotated images is, and argue that in many real-world applications annotated images are typically partially annotated and thus that the problem of mining partially annotated images exists in many situations. We then propose an effective solution to this problem based on a statistical model we have developed called the Semi-Supervised Correspondence Hierarchical Dirichlet Process (SSCHDP). The main idea of this model lies in exploiting the information pertaining to partially annotated images or even unannotated images to achieve semi-supervised learning under the HDP structure. We apply this model to completing the annotations appropriately for partially annotated images in the training data and then to predicting the annotations appropriately and completely for all the unannotated images either in the training data or in any unseen data beyond the training process. Experiments show that SSC-HDP is superior to the peer models from the recent literature when they are applied to solving the problem of mining partially annotated images.

References

[1]
http://www.fruitfly.org/.
[2]
K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D. M. Blei, and M. I. Jordan. Matching words and pictures. Journal of Machine Learning Research, 3:1107--1135, 2003.
[3]
D. M. Blei and M. I. Jordan. Modeling annotated data. In Proceedings of the 26th International ACM SIGIR Conference, 2003.
[4]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, 2003.
[5]
M. R. Boutell, J. Luo, X. Shen, and C. M. Brown. Learning multi-label scene classification. Pattern Recognition, 37(9):1757--1771, 2004.
[6]
T. S. Chua, J. Tang, R. Hong, H. Li, Z. Luo, and Y. Zheng. Nus-wide: a real-world web image database from national university of singapore. In Proceedings of the ACM International Conference on Image and Video Retrieval, pages 1--9, 2009.
[7]
R. Datta, D. Joshi, J. Li, and J. Z. Wang. Image retrieval: Ideas, influences, and trends of the new age. ACM Computing Surveys, 40:1--60, 2008.
[8]
M. Everingham, L. V. Gool, C. Williams, C. K. I., J. Winn, and A. Zisserman. The pascal visual object classes (voc) challenge. International Journal of Computer Vision, 88(2):303--338, 2010.
[9]
S. Feng, R. Manmatha, and V. Lavrenko. Multiple bernoulli relevance models for image and video annotation. In Proceedings of International Conference on Computer Vision and Pattern Recognition, pages 1002--1009, 2004.
[10]
Z. Guo, Z. Zhang, E. P. Xing, and C. Faloutsos. Enhanced max margin learning on multimodal data mining in a multimedia database. In Proceedings of 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007.
[11]
H. Ishwaran and L. F. James. Gibbs sampling methods for stick-breaking priors. Journal of the American Statistical Association, 96(453):161--173, 2001.
[12]
J. Jeon, V. Lavrenko, and R. Manmatha. Automatic image annotation and retrieval using cross-media relevance models. In Proceedings of ACM Special Interest Group on Information Retrieval, pages 119--126, 2003.
[13]
V. Lavrenko, R. Manmatha, and J. Jeon. A model for learning the semantics of pictures. In Proceedings of Neural Information Processing Systems, 2003.
[14]
L.-J. Li and L. Fei-Fei. What, where and who? classifying events by scene and object recognition. In Proceedings of International Conference Computer Vision, 2007.
[15]
W. Li and M. Sun. Semi-supervised learning for image annotation based on conditional random fields. In Proceedings of ACM International Conference on Image and Video Retrieval, pages 463--472, 2006.
[16]
Z. Li, J. Liu, X. Zhu, T. Liu, and H. Lu. Image annotation using multi-correlation probabilistic matrix factorization. In Proceedings of ACM international conference on Multimedia, pages 1187--1190, 2010.
[17]
P. Liang, Petrov, M. I. Jordan, and D. Klein. The infinite pcfg using hierarchical dirichlet processes. In Proceedings of Empirical Methods in Natural Language Processing, pages 688--697, 2007.
[18]
B. Liu, W. S. Lee, P. S. Yu, and X. Li. Partially supervised classification of text documents. In Proceedings of the 19th International Conference on Machine Learning, pages 387--394, 2002.
[19]
N. Loeff, A. Farhadi, I. Endres, and D. A. Forsyth. Unlabeled data improves word prediction. In Proceedings of International Conference Computer Vision, 2009.
[20]
D. G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2):91--110, 2004.
[21]
D. G. Luenberger and Y. Ye. Linear and Nonlinear Programming. Springer, third edition, 2008.
[22]
J. Sethuraman. A constructive definition of dirichlet priors. Statistica Sinica, 4:639--650, 1994.
[23]
A. Sharma, G. Hua, Z. Liu, and Z. Zhang. Meta-tag propagation by co-training an ensemble classifier for improving image search relevance. In Computer Vision and Pattern Recognition Workshop, pages 1--6, 2008.
[24]
Y.-Y. Sun, Y. Zhang, and Z.-H. Zhou. Multi-label learning with weak label. In Proceedings of Association for the Advancement of Artificial Intelligence, pages 593--598, 2010.
[25]
Y. W. Teh, M. I. Jordan, M. J. Beal, and D. M. Blei. Hierarchical dirichlet processes. Journal of the American Statistical Association, 101:1566--1581, 2004.
[26]
O. Yakhnenko and V. Honavar. Annotating images and image objects using a hierarchical dirichlet process model. In Proceedings of the 9th International Workshop on Multimedia Data Mining, pages 1--7, 2008.
[27]
S.-H. Yang, H. Zha, and B.-G. Hu. Dirichlet-bernoulli alignment: A generative model for multi-class multi-label multi-instance corpora. In Proceedings of Neural Information Processing Systems, pages 2143--2150, 2009.
[28]
R. Zhang, Z. Zhang, M. Li, W.-Y. Ma, and H.-J. Zhang. A probabilistic semantic model for image annotation and multi-modal image retrieval. In Proceedings of International Conference Computer Vision, pages 846--851, 2005.
[29]
Z.-H. Zhou and M.-L. Zhang. Multi-instance multilabel learning with application to scene classification. In Proccedings of Neural Information Processing Systems, pages 1609--1616, 2007.
[30]
X. Zhu. Semi-supervised learning literature survey. Technical report, Computer Sciences TR 1530, University of Wisconsin-Madison, 2005.

Cited By

View all
  • (2024)Dual Perspective of Label-Specific Feature Learning for Multi-Label ClassificationACM Transactions on Knowledge Discovery from Data10.1145/370500619:1(1-30)Online publication date: 21-Nov-2024
  • (2022)The Emerging Trends of Multi-Label LearningIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2021.311933444:11(7955-7974)Online publication date: 1-Nov-2022
  • (2022)Semi‐supervised multi‐label learning with missing labels by exploiting feature‐label correlationsStatistical Analysis and Data Mining: The ASA Data Science Journal10.1002/sam.1160716:2(187-209)Online publication date: 31-Dec-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '11: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
August 2011
1446 pages
ISBN:9781450308137
DOI:10.1145/2020408
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 August 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. image annotation completion and prediction
  2. partially annotated training set
  3. semi-supervised learning

Qualifiers

  • Poster

Conference

KDD '11
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Dual Perspective of Label-Specific Feature Learning for Multi-Label ClassificationACM Transactions on Knowledge Discovery from Data10.1145/370500619:1(1-30)Online publication date: 21-Nov-2024
  • (2022)The Emerging Trends of Multi-Label LearningIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2021.311933444:11(7955-7974)Online publication date: 1-Nov-2022
  • (2022)Semi‐supervised multi‐label learning with missing labels by exploiting feature‐label correlationsStatistical Analysis and Data Mining: The ASA Data Science Journal10.1002/sam.1160716:2(187-209)Online publication date: 31-Dec-2022
  • (2021)Semi-Supervised Low-Rank Semantics Grouping for Zero-Shot LearningIEEE Transactions on Image Processing10.1109/TIP.2021.305067730(2207-2219)Online publication date: 1-Jan-2021
  • (2019)Topic-Based Algorithm for Multilabel Learning With Missing LabelsIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2018.287443430:7(2138-2152)Online publication date: Jul-2019
  • (2016)A Framework of the Semi-supervised Multi-label Classification with Non-uniformly Distributed Incomplete LabelsBig Data Analytics and Knowledge Discovery10.1007/978-3-319-43946-4_18(267-280)Online publication date: 6-Aug-2016
  • (2015)Semi-supervised multi-label learning with incomplete labelsProceedings of the 24th International Conference on Artificial Intelligence10.5555/2832747.2832815(4062-4068)Online publication date: 25-Jul-2015
  • (2014)Protein function prediction with incomplete annotationsIEEE/ACM Transactions on Computational Biology and Bioinformatics10.1109/TCBB.2013.14211:3(579-591)Online publication date: 1-May-2014
  • (2014)Image tag completion via dual-view linear sparse reconstructionsComputer Vision and Image Understanding10.1016/j.cviu.2014.03.012124(42-60)Online publication date: Jul-2014
  • (2012)Multi-view learning from imperfect taggingProceedings of the 20th ACM international conference on Multimedia10.1145/2393347.2393416(479-488)Online publication date: 29-Oct-2012
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media