Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1027527.1027572acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

From context to content: leveraging context to infer media metadata

Published: 10 October 2004 Publication History

Abstract

The recent popularity of mobile camera phones allows for new opportunities to gather important metadata at the point of capture. This paper describes a method for generating metadata for photos using spatial, temporal, and social context. We describe a system we implemented for inferring location information for pictures taken with camera phones and its performance evaluation. We propose that leveraging contextual metadata at the point of capture can address the problems of the semantic and sensory gaps. In particular, combining and sharing spatial, temporal, and social contextual metadata from a given user and across users allows us to make inferences about media content.

References

[1]
Aigrain, P., Zhang, H. and Petkovic, D. Content-Based Representation and Retrieval of Visual Media: A State-of-the-Art Review. Multimedia Tools and Applications, 3, 3 (Nov. 1996), 179--202.
[2]
Davis, M. Media Streams: An Iconic Visual Language for Video Representation. In Readings in Human-Computer Interaction: Toward the Year 2000, eds. Baecker, R., Grudin, J., Buxton, W., and Greenberg, S. 2nd ed. Morgan Kaufmann Publishers, Inc., San Francisco, CA, 1995, 854--866.
[3]
Davis, M. Active Capture: Integrating Human-Computer Interaction and Computer Vision/Audition to Automate Media Capture. In Proc. of 2003 IEEE International Conference on Multimedia and Expo (ICME2003) Special Session on Moving from Features to Semantics Using Computational Media Aesthetics (Baltimore, MD, July 6-9, 2003). IEEE Computer Society Press, New York, NY, 2003, Vol. II, 185--188.
[4]
Davis, M. Active Capture: Automatic Direction for Automatic Movies. In Video Proc. of 11th Annual ACM International Conference on Multimedia (MM2003) (Berkeley, CA, November 2-8, 2003). ACM Press, New York, NY, 2003.
[5]
Davis, M. and Sarvas, R. Mobile Media Metadata for Mobile Imaging. In Proc. of 2004 IEEE International Conference on Multimedia and Expo (ICME2004) Special Session on Mobile Imaging (Taipei, Taiwan, June 27-30, 2004). IEEE Computer Society Press, New York, NY, 2004.
[6]
Davis, M. Mobile Media Metadata: Metadata Creation System for Mobile Images. In Video Proc. of 12th Annual ACM International Conference on Multimedia (MM2004) (New York, NY, October 10-16, 2004). ACM Press, New York, NY, Forthcoming 2004.
[7]
Dey, A. K. Understanding and Using Context. Personal and Ubiquitous Computing Journal, 5, 1 (Feb. 2001), 4--7.
[8]
Dorai, C. and Venkatesh, S. Computational Media Aesthetics: Finding Meaning Beautiful. IEEE MultiMedia, 8, 4 (Oct.-Dec. 2001), 10--12.
[9]
Haase, K. and Tames, D. Babelvision: Better Image Searching Through Shared Annotation. ACM Interactions, 11, 2 (Mar.-Apr. 2004), 18--26.
[10]
Hull, R., Kumar, B., Lieuwen, D., Patel-Schneider, P. F., Sahuguet, A., Varadarajan, S., and Vyas, A. "Enabling Context-Aware and Privacy-Conscious User Data Sharing. In Proc. of 2004 IEEE International Conference on Mobile Data Management (MDM'04) (Berkeley, CA, January 19-22, 2004). IEEE Computer Society Press, New York, NY, 2004, 187--198.
[11]
Lieberman, H., Rosenzweig, E., and Singh, P. Aria: An Agent For Annotating And Retrieving Images. IEEE Computer, 34, 7 (Jul. 2001), 57--62.
[12]
Naaman, M., Paepcke, A., and Garcia-Molina, H. From Where to What: Metadata Sharing for Digital Photographs with Geographic Coordinates. In Proc. of 10th International Conference on Cooperative Information Systems (CoopIS) (Catania, Sicily, November 3-7, 2003). Springer-Verlag, Heidelberg, Germany, 2003, 196--217.
[13]
Sarvas, R., Herrarte, E., Wilhelm, A., and Davis, M. Metadata Creation System for Mobile Images. In Proc. of Second International Conference on Mobile Systems, Applications, and Services (MobiSYS2004) (Boston, MA, June 6-9, 2004). ACM Press, New York, NY, 2004, 36--48.
[14]
Smeulders, A. W. M., Worring, M., Santini, S., Gupta, A., and Jain, R. Content-Based Image Retrieval at the End of the Early Years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22, 12 (Dec. 2000), 1349--1380.
[15]
Toyama, K., Logan, R., and Roseway, A. Geographic Location Tags on Digital Images. In Proc. of 11th Annual ACM International Conference on Multimedia (MM2003) (Berkeley, CA, November 2-8, 2003). ACM Press, New York, NY, 2003, 156--166.
[16]
Vartiainen, P. Using Metadata and Context Information in Sharing Personal Content of Mobile Users, Master's Thesis, University of Helsinki, Finland, 2003.
[17]
Wilhelm, A., Takhteyev, Y., Sarvas, R., Van House, N., and Davis, M. Photo Annotation on a Camera Phone. In Extended Abstracts of 2004 Conference on Human Factors in Computing Systems (CHI 2004) (Vienna, Austria, April 24-29, 2004). ACM Press, New York, NY, 2004, 1403--1406.

Cited By

View all
  • (2022)Machine Learning Methods for Forest Image Analysis and Classification: A Survey of the State of the ArtIEEE Access10.1109/ACCESS.2022.317004910(45290-45316)Online publication date: 2022
  • (2021)A CNN-RNN Framework for Image Annotation from Visual Cues and Social Network Metadata2020 25th International Conference on Pattern Recognition (ICPR)10.1109/ICPR48806.2021.9412275(231-238)Online publication date: 10-Jan-2021
  • (2019)Automatic and semi-automatic annotation of people in photography using shared eventsMultimedia Tools and Applications10.1007/s11042-018-6715-978:10(13841-13875)Online publication date: 1-May-2019
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MULTIMEDIA '04: Proceedings of the 12th annual ACM international conference on Multimedia
October 2004
1028 pages
ISBN:1581138938
DOI:10.1145/1027527
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 October 2004

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. content-based image retrieval
  2. context-to-content inference
  3. contextual metadata
  4. location-based services
  5. mobile camera phones
  6. wireless multimedia applications

Qualifiers

  • Article

Conference

MM04

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Machine Learning Methods for Forest Image Analysis and Classification: A Survey of the State of the ArtIEEE Access10.1109/ACCESS.2022.317004910(45290-45316)Online publication date: 2022
  • (2021)A CNN-RNN Framework for Image Annotation from Visual Cues and Social Network Metadata2020 25th International Conference on Pattern Recognition (ICPR)10.1109/ICPR48806.2021.9412275(231-238)Online publication date: 10-Jan-2021
  • (2019)Automatic and semi-automatic annotation of people in photography using shared eventsMultimedia Tools and Applications10.1007/s11042-018-6715-978:10(13841-13875)Online publication date: 1-May-2019
  • (2018)The Evolution of Contextual Information Processing in InformaticsInformation10.3390/info90300479:3(47)Online publication date: 27-Feb-2018
  • (2018)Photo annotationMultimedia Tools and Applications10.1007/s11042-016-4281-677:1(423-457)Online publication date: 1-Jan-2018
  • (2017)Organizing photographs with geospatial and image semanticsMultimedia Systems10.1007/s00530-014-0426-523:1(53-61)Online publication date: 1-Feb-2017
  • (2016)The 32 Days of ChristmasProceedings of the 2016 CHI Conference on Human Factors in Computing Systems10.1145/2858036.2858255(5710-5714)Online publication date: 7-May-2016
  • (2015)An overview of context types within multimedia and social computingProceedings of the 16th International Conference on Engineering Applications of Neural Networks (INNS)10.1145/2797143.2797184(1-5)Online publication date: 25-Sep-2015
  • (2015)4streamsProceedings of the 2015 British HCI Conference10.1145/2783446.2783589(165-174)Online publication date: 13-Jul-2015
  • (2015)ImageCLEF annotation with explicit context-aware kernel mapsInternational Journal of Multimedia Information Retrieval10.1007/s13735-015-0082-34:2(113-128)Online publication date: 20-Mar-2015
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media