
Mining text snippets for images on the web

Authors: Krishnan Ramnath, Lucy Vanderwende, Matt Uyttendaele, Lei Zhang

KDD '14: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining

Pages 1534 - 1543

https://doi.org/10.1145/2623330.2623346

Published: 24 August 2014

Abstract

Images are often used to convey many different concepts or to illustrate many different stories. We propose an algorithm to mine multiple diverse, relevant, and interesting text snippets for images on the web. Our algorithm scales to all images on the web. For each image, all webpages that contain it are considered. The top-K text snippet selection problem is posed as combinatorial subset selection, with the goal of choosing an optimal set of snippets that maximizes a combination of relevancy, interestingness, and diversity. Relevancy and interestingness are scored by machine-learned models. Our algorithm is run at scale on the entire image index of a major search engine, resulting in a database of images with their corresponding text snippets. We validate the quality of the database through a large-scale comparative study. We showcase the utility of the database through two web-scale applications: (a) augmentation of images on the web as webpages are browsed and (b) an image browsing experience (similar in spirit to web browsing) that is enabled by interconnecting semantically related images (which may not be visually related) through shared concepts in their corresponding text snippets.
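
The abstract does not spell out the objective function or the solver, so the following is only a minimal sketch of what top-K subset selection over relevancy, interestingness, and diversity could look like. It assumes that each snippet already has a relevancy and an interestingness score (in the paper these come from learned models; here they are plain input numbers), that diversity is measured by Jaccard distance between word sets, and that a simple greedy heuristic is acceptable. The names `select_top_k`, `jaccard_distance`, and `diversity_weight` are hypothetical and do not come from the paper.

```python
from typing import List, Sequence, Set


def jaccard_distance(a: Set[str], b: Set[str]) -> float:
    """Return 1 - |a & b| / |a | b|; two empty sets count as identical."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def select_top_k(
    snippets: Sequence[str],
    relevancy: Sequence[float],
    interestingness: Sequence[float],
    k: int,
    diversity_weight: float = 1.0,
) -> List[int]:
    """Greedily pick up to k snippet indices (assumed objective, not the paper's).

    Each step adds the snippet with the largest gain, where
    gain = relevancy + interestingness
           + diversity_weight * (distance to the closest already-chosen snippet).
    """
    tokens = [set(s.lower().split()) for s in snippets]
    chosen: List[int] = []
    remaining = set(range(len(snippets)))

    def gain(i: int) -> float:
        quality = relevancy[i] + interestingness[i]
        if not chosen:
            return quality
        novelty = min(jaccard_distance(tokens[i], tokens[j]) for j in chosen)
        return quality + diversity_weight * novelty

    while remaining and len(chosen) < k:
        # sorted() makes ties resolve to the lowest index deterministically.
        best = max(sorted(remaining), key=gain)
        chosen.append(best)
        remaining.remove(best)
    return chosen


if __name__ == "__main__":
    texts = [
        "the eiffel tower in paris",
        "the eiffel tower in paris france",
        "gustave eiffel designed the tower for the 1889 fair",
    ]
    print(select_top_k(texts, [0.9, 0.9, 0.7], [0.5, 0.5, 0.6], k=2))
```

With the diversity term switched off (`diversity_weight=0.0`) the sketch reduces to ranking by score alone, which tends to return near-duplicate snippets; the diversity term is what pushes the selection toward snippets that tell different stories about the same image.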

Supplementary Material

MP4 File (p1534-sidebyside.mp4), 298.14 MB

Cited By

T. Bracamonte, B. Bustos, B. Poblete, and T. Schreck. Extracting semantic knowledge from web context for multimedia IR. Multimedia Tools and Applications, 77(11):13853--13889, 2018. DOI: 10.1007/s11042-017-4997-y. Online publication date: 1-Jun-2018.
https://dl.acm.org/doi/10.1007/s11042-017-4997-y

Index Terms

Mining text snippets for images on the web
1. Information systems
  1. Information systems applications
    1. Multimedia information systems
      1. Multimedia databases

Published In

KDD '14 proceedings volume, August 2014, 2028 pages

ISBN: 9781450329569

DOI: 10.1145/2623330

General Chairs: Sofus Macskassy (Facebook), Claudia Perlich (Dstillery)

Program Chairs: Jure Leskovec (Stanford University), Wei Wang (UCLA), Rayid Ghani (University of Chicago)
Copyright © 2014 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Conference

KDD '14: The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 24 - 27, 2014

New York, New York, USA

Acceptance Rates

KDD '14 Paper Acceptance Rate: 151 of 1,036 submissions, 15%

Overall Acceptance Rate: 1,133 of 8,635 submissions, 13%

Bibliometrics

Total Citations: 1

Total Downloads: 489 (last 12 months: 1; last 6 weeks: 0)

Reflects downloads up to 21 Nov 2024
