Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2254556.2254579acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaviConference Proceedingsconference-collections
research-article

CinemaGazer: a system for watching videos at very high speed

Published: 21 May 2012 Publication History

Abstract

This paper presents a technology that enables the watching of videos at very high speed. Subtitles are widely used in DVD movies, and provide useful supplemental information for understanding video contents. We propose a "two-level fast-forwarding" scheme for videos with subtitles, which controls the speed of playback depending on the context: very fast during segments without language, such as subtitles or speech, and "understandably fast" during segments with such language. This makes it possible to watch videos at a higher speed than usual while preserving the entertainment values of the contents. We also propose "centering" and "fading" features for the display of subtitles to reduce fatigue when watching high-speed video. We implement a versatile video encoder that enables movie viewing with two-level fast-forwarding on any mobile device by specifying the speed of playback, the reading rate, or the overall viewing time. The effectiveness of our proposed method was demonstrated in an evaluation study.

References

[1]
Clifton Forlines, "Content aware video presentation on high-resolution displays," In proceedings of AVI'08, pp.57--64, 2008.
[2]
Divakaran, A., and Otsuka, I., "A video-browsing-enhanced personal video recorder," In Proceedings of IEEE International Conference of Image Analysis and Processing Workshops (ICIAPW), pp.137--142, 2007.
[3]
Foulke, W., and Sticht, T. G., "Review of research on the intelligibility and comprehension of accelerated speech," Psychological Bulletin, 72, pp.50--62, 1969.
[4]
Hidenori Aoki, and Homei Miyashita, "A Trial for Video Summarization and Chorus-Section Detecting on Nicovideo," IPSJ SIG Notes 2008-HCI-128/2008-MUS-75, Vol. 2008, No.50, pp.37--42, 2008 (In Japanese).
[5]
Hidenori Aoki, and Homei Miyashita, "Design and Evaluation of a Fast Nonvisual Interface for Searching Music," Trans. IPS Japan, Vol.51, No.2, pp.356--364, 2010 (In Japanese).
[6]
INFOGRAPHIC: What Happens Online Every 60 S. http://www.scribbal.com/2011/06/infographic-what-happens-online-every-60-s/
[7]
Kai-Yin Cheng, Sheng-Jie Luo, Bing-Yu Chen, and Hao-Hua Chu, "SmartPlayer: User-Centric Video Fast-Forwarding," In Proceedings of CHI'09, pp. 789--798, 2009.
[8]
Klaus Schoeffmann, "Facilitating interactive search and navigation in videos," In Proceedings of MM'10, pp. 1609--1612, 2010.
[9]
Klaus Schoeffmann, Mario Taschwer, and Laszlo Boeszoermenyi, "The video explorer: a tool for navigation and searching within a single video based on fast content analysis," In Proceedings of MMSys'10, pp. 247--258, 2010.
[10]
Kurihara, K., Nagano, N., Watanabe, Y., Fujimura, Y., Minaduki, A., Hayashi, H., and Tutiya, Y., "Toward localizing audiences' gaze using a multi-touch electronic whiteboard with sPieMenu," In Proceedings of IUI'11, pp.379--382, 2011.
[11]
Let's start fast reading, http://www.ponp.jp/info/speed.html, (In Japanese).
[12]
Manfred Del Fabro, Klaus Schoeffmann, Laszlo Boszormenyi, "Instant video browsing: a tool for fast non-sequential hierarchical video browsing," In Proceedings of USAB'10, pp. 443--446, 2010.
[13]
Nobumasa Seiyama, Atsushi Imai, Takeshi Mishima, Tohru Takagi, and Eiichi Miyasaka, "Development of a high-quality real-time speech rate conversion system," IEICE Trans. D, Vol. J84-D-II, No.6, pp.918--926, 2001 (In Japanese).
[14]
Peker, K. A., and Divaskaran, A., "An extended framework for adaptive playback-based video summarization," SPIE Internet Multimedia Management Systems IV 5242, pp.26--33, 2003.
[15]
Peker, K. A., Divakaran, A., and Sun, H., "Constant pace skimming and temporal sub-sampling of video using motion activity," In Proceedings of IEEE International Conference on Image Processing (ICIP), Vol.3, pp.414--417, 2001.
[16]
Pierre Dragicevic, Gonzalo Ramos, Jacobo Bibliowitcz, Derek Nowrouzezahrai, Ravin Balakrishnan, and Karan Singh, "Video browsing by direct manipulation," In Proceedings of CHI'08, pp. 237--246, 2008.
[17]
Suporn Pongnumkul, Jue Wang, Gonzalo Ramos, and Michael Cohen, "Content-aware dynamic timeline for video browsing," In Proceedings of UIST'10, pp.139--142, 2010.
[18]
Vazquez Alvarez Yolanda, and Brewster A. Stephen, "Designing spatial audio interfaces to support multiple audio streams," In Proceedings of MobileHCI '10, pp.253--256, 2010.
[19]
Vemuri, S., DeCamp, P., Bender, W., and Schmandt, C., "Improving speech playback using time-compression and speech recognition," In Proceedings of CHI'04, pp.295--302, 2004.
[20]
Victor Valdes, and Jose' M. Martinez, "Introducing risplayer: real-time interactive generation of personalized video summaries," In Proceedings of SAPMIA'10, pp.9--14, 2010.

Cited By

View all
  • (2024)Toward Effective Communication of AI-Based Decisions in Assistive Tools: Conveying Confidence and Doubt to People with Visual Impairments at Accelerated SpeechProceedings of the 21st International Web for All Conference10.1145/3677846.3677862(177-189)Online publication date: 13-May-2024
  • (2024)The Three-Stage Hierarchical Logistic Model Controlling Personalized Playback of Audio Information for Intelligent Tutoring SystemsIEEE Transactions on Learning Technologies10.1109/TLT.2024.343947017(2005-2019)Online publication date: 6-Aug-2024
  • (2022)The Design and Evaluation of Emergency Call Taking User Interfaces for Next Generation 9-1-1Frontiers in Human Dynamics10.3389/fhumd.2022.6706474Online publication date: 16-Feb-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
AVI '12: Proceedings of the International Working Conference on Advanced Visual Interfaces
May 2012
846 pages
ISBN:9781450312875
DOI:10.1145/2254556
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • Consulta Umbria SRL
  • University of Salerno: University of Salerno

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 May 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. audience gaze localization
  2. two-level fast-forwarding
  3. video

Qualifiers

  • Research-article

Funding Sources

Conference

AVI'12
Sponsor:
  • University of Salerno

Acceptance Rates

Overall Acceptance Rate 128 of 490 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)3
Reflects downloads up to 18 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Toward Effective Communication of AI-Based Decisions in Assistive Tools: Conveying Confidence and Doubt to People with Visual Impairments at Accelerated SpeechProceedings of the 21st International Web for All Conference10.1145/3677846.3677862(177-189)Online publication date: 13-May-2024
  • (2024)The Three-Stage Hierarchical Logistic Model Controlling Personalized Playback of Audio Information for Intelligent Tutoring SystemsIEEE Transactions on Learning Technologies10.1109/TLT.2024.343947017(2005-2019)Online publication date: 6-Aug-2024
  • (2022)The Design and Evaluation of Emergency Call Taking User Interfaces for Next Generation 9-1-1Frontiers in Human Dynamics10.3389/fhumd.2022.6706474Online publication date: 16-Feb-2022
  • (2021)Playing chunk-transferred DASH segments at low latency with QLiveProceedings of the 12th ACM Multimedia Systems Conference10.1145/3458305.3463376(51-64)Online publication date: 24-Jun-2021
  • (2018)Detecting Utterance Scenes of a Specific PersonCompanion Proceedings of the 23rd International Conference on Intelligent User Interfaces10.1145/3180308.3180323(1-2)Online publication date: 5-Mar-2018
  • (2018)Support System to Review Manufacturing Workshop through Multiple VideosCompanion Proceedings of the 23rd International Conference on Intelligent User Interfaces10.1145/3180308.3180312(1-2)Online publication date: 5-Mar-2018
  • (2017)EgoScanningProceedings of the 2017 CHI Conference on Human Factors in Computing Systems10.1145/3025453.3025821(6536-6546)Online publication date: 2-May-2017
  • (2016)An Automatic Video Reinforcing System for TV Programs using Semantic Metadata from Closed CaptionsInternational Journal of Multimedia Data Engineering & Management10.4018/IJMDEM.20160101017:1(1-21)Online publication date: 1-Jan-2016
  • (2016)Frame-Wise Continuity-Based Video Summarization and StretchingProceedings, Part I, of the 22nd International Conference on MultiMedia Modeling - Volume 951610.1007/978-3-319-27671-7_67(806-817)Online publication date: 4-Jan-2016
  • (2015)Automatically Adjusting the Speed of E-Learning VideosProceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems10.1145/2702613.2732711(1451-1456)Online publication date: 18-Apr-2015
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media