Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Intelligent Access to Digital Video: Informedia Project

Published: 01 May 1996 Publication History

Abstract

Carnegie Mellon's Informedia Digital Video Library project will establish a large, on-line digital video library featuring full-content and knowledge-based search and retrieval. Intelligent, automatic mechanisms will be developed to populate the library. Search and retrieval from digital video, audio, and text libraries will take place via desktop computer over local-, metropolitan-, and wide-area networks. The project's approach applies several techniques for content-based searching and video-sequence retrieval. Content is conveyed in both the narrative (speech and language) and the image. Only by the collaborative interaction of image, speech, and natural-language understanding technology is it possible to successfully populate, segment, index, and search diverse video collections with satisfactory recall and precision. This collaborative interaction approach uniquely compensates for problems of interpretation and search in error-ridden and ambiguous data sets. The authors have focused the work on two corpuses. One is science documentaries and lectures, the other is broadcast news content with partial closed-captions. Further work will continue to improve the accuracy and performance of the underlying processing as well as explore performance issues related to Web-based access and interoperability with other digital video resources.

References

[1]
M. Christel, et al., "Techniques for the Creation and Exploration of Digital Video Libraries," in Multimedia Tools and Applications, Vol. 2, Borko Furht, ed., Kluwer Academic Publishers, Boston, 1995.
[2]
E. Fox, et al., "Introduction," special issue on digital libraries, Comm. ACM, Apr. 1995, pp. 22-28.
[3]
M. Davis, "Knowledge Representation for Video," Proc. AAAI, AAAI Press/MIT Press, Cambridge, Mass., 1994, pp. 128-127.
[4]
M.Y. Hwang E. Thayer and X. Huang, "Semi-Continuous HMMs with Phone Dependent VQ Codebooks for Continuous Speech Recognition," Proc. ICASSP, IEEE Press, Piscataway, N.J., 1994.
[5]
M. Hawley, "Structure out of Sound," doctoral dissertation, MIT, Cambridge, Mass., 1993.
[6]
H. Zhang C. Low and S. Smoliar, "Video Parsing and Indexing of Compressed Data," Multimedia Tools and Applications, Mar. 1995, pp. 89-111.
[7]
B.D. Lucas and T. Kanade, "An Iterative Technique of Image Registration and Its Application to Stereo," Proc. 7th Int'l Joint Conf. Artificial Intelligence, Morgan Kaufmann, Los Altos, Calif., 1981, pp. 674-679.
[8]
H. Rowley S. Baluja and K. Kanade, "Human Face Detection in Visual Scenes," Tech. Report CMU-CS-95-158, Computer Science Dept., Carnegie Mellon Univ., Pittsburgh, 1995.
[9]
M. Smith and T. Kanade, "Video Skimming for Quick Browsing Based on Audio and Image Characterization," Tech. Report CMU-CS-95-186, Carnegie Mellon Univ., Pittsburgh, 1995.
[10]
M. Mauldin, "Information Retrieval by Text Skimming," doctoral dissertation, Carnegie Mellon Univ., Pittsburgh, 1989. (Also available as CMU Tech. Report CMU-CS-89-193.) Revised edition published as "Conceptual Information Retrieval: A Case Study in Adaptive Partial Parsing," Kluwer Academic Publishers, Boston, Sept. 1991.
[11]
S. Stevens, "Intelligent Interactive Video Simulation of a Code Inspection," Comm. ACM, JULY 1989, pp. 832-843.
[12]
M. Christel and S. Stevens, "Rule Base and Digital Video Technologies Applied to Training Simulations," Software Eng. Inst. Tech. Review '92, Software Eng. Inst., Pittsburgh, 1992.

Cited By

View all
  • (2022)An Exploration of Captioning Practices and Challenges of Individual Content Creators on YouTube for People with Hearing ImpairmentsProceedings of the ACM on Human-Computer Interaction10.1145/35129226:CSCW1(1-26)Online publication date: 7-Apr-2022
  • (2021)Congestion-Aware Suspicious Object Detection System Using Information-Centric Networking2021 IEEE 18th Annual Consumer Communications & Networking Conference (CCNC)10.1109/CCNC49032.2021.9369510(1-6)Online publication date: 9-Jan-2021
  • (2018)BlackthornIEEE Transactions on Multimedia10.1109/TMM.2017.275598620:3(687-698)Online publication date: 1-Mar-2018
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Computer
Computer  Volume 29, Issue 5
May 1996
99 pages

Publisher

IEEE Computer Society Press

Washington, DC, United States

Publication History

Published: 01 May 1996

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2022)An Exploration of Captioning Practices and Challenges of Individual Content Creators on YouTube for People with Hearing ImpairmentsProceedings of the ACM on Human-Computer Interaction10.1145/35129226:CSCW1(1-26)Online publication date: 7-Apr-2022
  • (2021)Congestion-Aware Suspicious Object Detection System Using Information-Centric Networking2021 IEEE 18th Annual Consumer Communications & Networking Conference (CCNC)10.1109/CCNC49032.2021.9369510(1-6)Online publication date: 9-Jan-2021
  • (2018)BlackthornIEEE Transactions on Multimedia10.1109/TMM.2017.275598620:3(687-698)Online publication date: 1-Mar-2018
  • (2016)Visual Movie AnalyticsIEEE Transactions on Multimedia10.1109/TMM.2016.261418418:11(2149-2160)Online publication date: 1-Nov-2016
  • (2016)Serving a video into an image carouselCluster Computing10.1007/s10586-016-0639-919:4(1843-1851)Online publication date: 1-Dec-2016
  • (2015)Evaluating Alternatives for Better Deaf Accessibility to Selected Web-Based MultimediaProceedings of the 17th International ACM SIGACCESS Conference on Computers & Accessibility10.1145/2700648.2809857(231-238)Online publication date: 26-Oct-2015
  • (2014)Texture-based medical image retrieval in compressed domain using compressive sensingInternational Journal of Bioinformatics Research and Applications10.1504/IJBRA.2014.05951910:2(129-144)Online publication date: 1-Feb-2014
  • (2013)Annotation of endoscopic videos on mobile devicesProceedings of the 4th ACM Multimedia Systems Conference10.1145/2483977.2483996(141-145)Online publication date: 28-Feb-2013
  • (2013)Narrative theme navigation for sitcoms supported by fan-generated scriptsMultimedia Tools and Applications10.1007/s11042-011-0877-z63:2(387-406)Online publication date: 1-Mar-2013
  • (2012)On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video SoundtracksInternational Journal of Multimedia Data Engineering & Management10.4018/jmdem.20120701013:3(1-19)Online publication date: 1-Jul-2012
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media