Article

From context to content: leveraging context to infer media metadata

Authors:

Risto Sarvas

MULTIMEDIA '04: Proceedings of the 12th annual ACM international conference on Multimedia

Pages 188 - 195

https://doi.org/10.1145/1027527.1027572

Published: 10 October 2004

Abstract

The recent popularity of mobile camera phones allows for new opportunities to gather important metadata at the point of capture. This paper describes a method for generating metadata for photos using spatial, temporal, and social context. We describe a system we implemented for inferring location information for pictures taken with camera phones and its performance evaluation. We propose that leveraging contextual metadata at the point of capture can address the problems of the semantic and sensory gaps. In particular, combining and sharing spatial, temporal, and social contextual metadata from a given user and across users allows us to make inferences about media content.
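
The abstract does not spell out the inference procedure, so the following is only a minimal sketch of how spatial, temporal, and social context might be combined to suggest a location label for a new photo. It is not the system described in the paper. The `Photo` fields, the `guess_location` function, the use of a cell identifier as spatial context, and all weights are hypothetical choices made for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Photo:
    """Hypothetical record of capture-time context (not the paper's schema)."""
    user: str                       # social context: who took the photo
    cell_id: str                    # spatial context: assumed coarse cell identifier
    hour: int                       # temporal context: hour of day, 0-23
    location: Optional[str] = None  # human-readable label, if already annotated


def guess_location(new, history, own_weight=2.0, hour_window=2):
    """Rank candidate location labels for `new` by weighted voting.

    Assumed heuristic: labelled photos from the same cell vote for their
    label; the photographer's own photos count more than other users'
    photos, and photos taken at a similar hour of day get a bonus.
    """
    scores = defaultdict(float)
    for past in history:
        if past.location is None or past.cell_id != new.cell_id:
            continue  # only labelled photos from the same cell may vote
        weight = own_weight if past.user == new.user else 1.0
        diff = abs(past.hour - new.hour)
        if min(diff, 24 - diff) <= hour_window:  # circular hour distance
            weight += 1.0
        scores[past.location] += weight
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda label: (-scores[label], label))
```

Under these assumptions, labels shared by other users in the same cell still contribute votes, which is one simple way to read "combining and sharing ... metadata from a given user and across users". A real system would need a learned or evaluated weighting rather than the fixed constants used here.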

Published In

MULTIMEDIA '04: Proceedings of the 12th annual ACM international conference on Multimedia
October 2004, 1028 pages
ISBN: 1581138938
DOI: 10.1145/1027527
General Chairs: Henning Schulzrinne (Columbia University), Nevenka Dimitrova (Philips Research)
Program Chairs: Angela Sasse (UCL), Sue Moon (KAIST), Rainer Lienhart (U Augsburg)

Copyright © 2004 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

MM04: 2004 12th Annual ACM International Conference on Multimedia
October 10 - 16, 2004
New York, NY, USA

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%
