Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

ImageSense: Towards contextual image advertising

Published: 03 February 2012 Publication History

Abstract

The daunting volumes of community-contributed media contents on the Internet have become one of the primary sources for online advertising. However, conventional advertising treats image and video advertising as general text advertising by displaying relevant ads based on the contents of the Web page, without considering the inherent characteristics of visual contents. This article presents a contextual advertising system driven by images, which automatically associates relevant ads with an image rather than the entire text in a Web page and seamlessly inserts the ads in the nonintrusive areas within each individual image. The proposed system, called ImageSense, supports scalable advertising of, from root to node, Web sites, pages, and images. In ImageSense, the ads are selected based on not only textual relevance but also visual similarity, so that the ads yield contextual relevance to both the text in the Web page and the image content. The ad insertion positions are detected based on image salience, as well as face and text detection, to minimize intrusiveness to the user. We evaluate ImageSense on a large-scale real-world images and Web pages, and demonstrate the effectiveness of ImageSense for online image advertising.

References

[1]
Adams, B., Phung, D. Q., and Venkatesh, S. 2006. Extraction of social context and application to personal multimedia exploration. In Proceedings of ACM Multimedia.
[2]
AdSense. http://www.google.com/adsense/.
[3]
AdWords. http://adwords.google.com/.
[4]
Ahlers, D. and Boll, S. 2008. ‘Oh web image, where art thou?’. In Proceedings of the International Conference on Multi-Media Modeling.
[5]
Aroyo, L., Nack, F., Schiphorst, T., Schut, H., and KauwATjoe, M. 2007. Personalized ambient media experience: move.me case study. In Proceedings of the Conference on Intelligent User Interfaces. 298--301.
[6]
Baeza-Yates, R. and Ribeiro-Neto, B. 1999. Modern Information Retrieval. Addison Wesley.
[7]
Blei, D. M., Ng, A. Y., and Jordan, M. I. 2003. Latent dirichlet allocation. J. Mach. Learn. Resear. 3, 993--1022.
[8]
Boyd, S. and Vandenberghe, L. 2004. Convex Optimization. Cambridge University Press.
[9]
BritePic. http://www.britepic.com/.
[10]
Broder, A., Fontoura, M., Josifovski, V., and Riedel, L. 2007. A semantic approach to contextual advertising. In Proceedings of Annual ACM SIGIR Conference on Research and Development in Information Retrieval.
[11]
Cai, D., Yu, S., Wen, J.-R., and Ma, W.-Y. 2003. VIPS: a vision-based page segmentation algorithm. Tech. rep. MSR-TR-2003-79, Microsoft.
[12]
Chatterjee, P., Hoffman, D. L., and Novak, T. P. 2003. Modeling the clickstream: Implications for web-based advertising efforts. Market. Science 22, 4, 520--541.
[13]
Chen, X. and Zhang, H.-J. 2001. Text area detection from video frames. In Proceedings of the IEEE Pacific Rim Conference on Multimedia. 222--228.
[14]
CNN. http://www.cnn.com/.
[15]
Dai, H., Zhao, L., Nie, Z., Wen, J.-R., Wang, L., and Li, Y. 2006. Detecting online commercial intention. In Proceedings of International World Wide Web Conference.
[16]
Flickr. http://www.flickr.com/.
[17]
Gerrig, R. J. and Zimbardo, P. G. 2001. Psychology and Life (16 Edition). Allyn & Bacon.
[18]
He, X., Cai, D., Wen, J.-R., Ma, W.-Y., and Zhang, H.-J. 2007. Clustering and searching www images using link and page layout analysis. ACM Transactions on Multimedia Computing, Communications, and Applications 3, 2.
[19]
Hu, J., Zeng, H.-J., Li, H., Niu, C., and Chen, Z. 2007. Demographic prediction based on user's browsing behavior. In Proceedings of International World Wide Web Conference.
[20]
Itti, L. and Koch, C. 2001. Computational modeling of visual attention. Nature Rev. Neurosci. 2, 3, 194--203.
[21]
Jonas, P. 1976. Photographic Composition Simplified. Amphoto Publishers.
[22]
Joshi, A. and Motwani, R. 2006. Keyword generation for search engine advertising. In Proceedings of the Workshops of the IEEE International Conference on Data Mining.
[23]
Kastidou, G. and Cohen, R. 2006. An approach for delivering personalized ads in interactive TV customized to both users and advertisers. In Proceedings of the European Conference on Interactive Television.
[24]
Kennedy, L., Naaman, M., Ahern, S., Nair, R., and Rattenbury, T. 2007. How Flickr helps us make sense of the world: Context and content in community-contributed media collections. In Proceedings of ACM Multimedia.
[25]
Kullback, S. and Leibler, R. A. 1951. On information and sufficiency. Ann. Math. Stat. 22, 1, 79--86.
[26]
Lacerda, A., Cristo, M., Goncalves, M. A., et al. 2006. Learning to advertise. In Proceedings of Annual ACM SIGIR Conference on Research and Development in Information Retrieval.
[27]
Li, H., Zhang, D., Hu, J., Zeng, H.-J., and Chen, Z. 2007. Finding keyword from online broadcasting content for targeted advertising. In Proceedings of the International Workshop on Data Mining and Audience Intelligence for Advertising.
[28]
Li, S. Z., Zhu, L., Zhang, Z., Blake, A., Zhang, H.-J., and Shum, H. 2002. Statistical learning of multi-view face detection. In Proceedings of European Conference on Computer Vision. 67--81.
[29]
Li, Z., Zhang, L., and Ma, W.-Y. 2008. Delivering online advertisements inside images. In Proceedings of ACM Multimedia. 1051--1060.
[30]
Liao, W.-S., Chen, K.-T., and Hsu, W. H. 2008. Adimage: video advertising by image matching and ad scheduling optimization. In Proceedings of Annual ACM SIGIR Conference on Research and Development in Information Retrieval. 767--768.
[31]
Live Image Search. http://www.live.com/?&scope=images.
[32]
Lowe, D. G. 2004. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 2, 91--110.
[33]
Ma, Y.-F. and Zhang, H.-J. 2003. Contrast-based image attention analysis by using fuzzy growing. In Proceedings of ACM Multimedia. 374--381.
[34]
McCoy, S., Everard, A., Polak, P., and Galletta, D. F. 2007. The effects of online advertising. Comm. ACM 50, 3, 84--88.
[35]
Mehta, A., Saberi, A., Vazirani, U., and Vazirani, V. 2007. Adwords and generalized on-line matching. J. ACM 54, 5.
[36]
Mei, T., Hua, X.-S., Lai, W., Yang, L., et al. 2007. MSRA-USTC-SJTU at TRECVID 2007: High-level feature extraction and search. In TREC Video Retrieval Evaluation Online Proceedings.
[37]
Mei, T., Hua, X.-S., and Li, S. 2008. Contextual in-image advertising. In Proceedings of ACM Multimedia. 439--448.
[38]
Mei, T., Hua, X.-S., and Li, S. 2009. VideoSense: A contextual in-video advertising system. IEEE Trans. Circ. Syst. Video Techn. 19, 12, 1866--1879.
[39]
Mei, T., Hua, X.-S., Yang, L., and Li, S. 2007. VideoSense: Towards effective online video advertising. In Proceedings of ACM Multimedia. 1075--1084.
[40]
MSN. http://www.msn.com/.
[41]
Murdock, V., Ciaramita, M., and Plachouras, V. 2007. A noisy channel approach to contextual advertising. In Proceedings of the International Workshop on Data Mining and Audience Intelligence for Advertising.
[42]
MySpace. http://www.myspace.com/.
[43]
New York Times. http://www.nytimes.com/.
[44]
photoSIG. http://www.photosig.com/.
[45]
Ribeiro-Neto, B., Cristo, M., Golgher, P. B., and Moura, E. S. 2005. Impedance coupling in content-targeted advertising. In Proceedings of Annual ACM SIGIR Conference on Research and Development in Information Retrieval.
[46]
Richardson, M., Dominowska, E., and Ragno, R. 2007. Predicting clicks: Estimating click-through rate for new ads. In Proceedings of International World Wide Web Conference.
[47]
Shen, D., Sun, J.-T., Yang, Q., and Chen, Z. 2006. Building bridges for web query classification. In Proceedings of Annual ACM SIGIR Conference on Research and Development in Information Retrieval.
[48]
Srinivasan, S. H., Sawant, N., and Wadhwa, S. 2007. vADeo: Video Advertising System. In Proceedings of ACM Multimedia. 455--456.
[49]
Webshot. http://www.webshot.com/.
[50]
Whitley, D. 1994. A genetic algorithm tutorial. Statistics and Computing 4, 65--85.
[51]
Yahoo! http://images.search.yahoo.com/images.
[52]
Yih, W.-T., Goodman, J., and Carvalho, V. R. 2006. Finding advertising keywords on web pages. In Proceedings of International World Wide Web Conference.
[53]
YouTube. http://www.youtube.com/.

Cited By

View all
  • (2023)Learning Pixel Affinity Pyramid for Arbitrary-Shaped Text DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/352461719:1s(1-24)Online publication date: 3-Feb-2023
  • (2023)Learning an insertion region for advertisement embedding on planesSignal Processing: Image Communication10.1016/j.image.2023.116963116(116963)Online publication date: Aug-2023
  • (2021)Interpreting the Rhetoric of Visual AdvertisementsIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2019.294744043:4(1308-1323)Online publication date: 1-Apr-2021
  • Show More Cited By

Index Terms

  1. ImageSense: Towards contextual image advertising

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 8, Issue 1
      January 2012
      149 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/2071396
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 03 February 2012
      Accepted: 01 August 2010
      Revised: 01 July 2010
      Received: 01 March 2010
      Published in TOMM Volume 8, Issue 1

      Permissions

      Request permissions for this article.

      Check for updates

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)28
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 21 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Learning Pixel Affinity Pyramid for Arbitrary-Shaped Text DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/352461719:1s(1-24)Online publication date: 3-Feb-2023
      • (2023)Learning an insertion region for advertisement embedding on planesSignal Processing: Image Communication10.1016/j.image.2023.116963116(116963)Online publication date: Aug-2023
      • (2021)Interpreting the Rhetoric of Visual AdvertisementsIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2019.294744043:4(1308-1323)Online publication date: 1-Apr-2021
      • (2021)Predicting Content Similarity via Multimodal Modeling for Video-In-Video AdvertisingIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2020.297992831:2(569-581)Online publication date: Feb-2021
      • (2021)COVID-19 ads on purchase intention of online consumer behavior as business innovation activityElectronic Commerce Research and Applications10.1016/j.elerap.2021.10108649:COnline publication date: 1-Sep-2021
      • (2020)Metadata Matters in User Engagement PredictionProceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3397271.3401201(1529-1532)Online publication date: 25-Jul-2020
      • (2019)LiveSenseProceedings of the 27th ACM International Conference on Multimedia10.1145/3343031.3350888(392-400)Online publication date: 15-Oct-2019
      • (2019)BTDPACM Transactions on Multimedia Computing, Communications, and Applications10.1145/328246915:2s(1-21)Online publication date: 3-Jul-2019
      • (2019)Combination of 1D CNN and 2D CNN to Evaluate the Attractiveness of Display Image Advertisement and CTR PredictionSoftware Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing10.1007/978-3-030-26428-4_11(157-170)Online publication date: 23-Aug-2019
      • (2018)PageSenseIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2016.259870228:1(254-266)Online publication date: 1-Jan-2018
      • Show More Cited By

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media