research-article

CinemaGazer: a system for watching videos at very high speed

Author:

Kazutaka KuriharaAuthors Info & Claims

AVI '12: Proceedings of the International Working Conference on Advanced Visual Interfaces

Pages 108 - 115

https://doi.org/10.1145/2254556.2254579

Published: 21 May 2012 Publication History

Abstract

This paper presents a technology that enables the watching of videos at very high speed. Subtitles are widely used in DVD movies, and provide useful supplemental information for understanding video contents. We propose a "two-level fast-forwarding" scheme for videos with subtitles, which controls the speed of playback depending on the context: very fast during segments without language, such as subtitles or speech, and "understandably fast" during segments with such language. This makes it possible to watch videos at a higher speed than usual while preserving the entertainment values of the contents. We also propose "centering" and "fading" features for the display of subtitles to reduce fatigue when watching high-speed video. We implement a versatile video encoder that enables movie viewing with two-level fast-forwarding on any mobile device by specifying the speed of playback, the reading rate, or the overall viewing time. The effectiveness of our proposed method was demonstrated in an evaluation study.

References

[1]

Clifton Forlines, "Content aware video presentation on high-resolution displays," In proceedings of AVI'08, pp.57--64, 2008.

Digital Library

[2]

Divakaran, A., and Otsuka, I., "A video-browsing-enhanced personal video recorder," In Proceedings of IEEE International Conference of Image Analysis and Processing Workshops (ICIAPW), pp.137--142, 2007.

Digital Library

[3]

Foulke, W., and Sticht, T. G., "Review of research on the intelligibility and comprehension of accelerated speech," Psychological Bulletin, 72, pp.50--62, 1969.

[4]

Hidenori Aoki, and Homei Miyashita, "A Trial for Video Summarization and Chorus-Section Detecting on Nicovideo," IPSJ SIG Notes 2008-HCI-128/2008-MUS-75, Vol. 2008, No.50, pp.37--42, 2008 (In Japanese).

[5]

Hidenori Aoki, and Homei Miyashita, "Design and Evaluation of a Fast Nonvisual Interface for Searching Music," Trans. IPS Japan, Vol.51, No.2, pp.356--364, 2010 (In Japanese).

[6]

INFOGRAPHIC: What Happens Online Every 60 S. http://www.scribbal.com/2011/06/infographic-what-happens-online-every-60-s/

[7]

Kai-Yin Cheng, Sheng-Jie Luo, Bing-Yu Chen, and Hao-Hua Chu, "SmartPlayer: User-Centric Video Fast-Forwarding," In Proceedings of CHI'09, pp. 789--798, 2009.

Digital Library

[8]

Klaus Schoeffmann, "Facilitating interactive search and navigation in videos," In Proceedings of MM'10, pp. 1609--1612, 2010.

Digital Library

[9]

Klaus Schoeffmann, Mario Taschwer, and Laszlo Boeszoermenyi, "The video explorer: a tool for navigation and searching within a single video based on fast content analysis," In Proceedings of MMSys'10, pp. 247--258, 2010.

Digital Library

[10]

Kurihara, K., Nagano, N., Watanabe, Y., Fujimura, Y., Minaduki, A., Hayashi, H., and Tutiya, Y., "Toward localizing audiences' gaze using a multi-touch electronic whiteboard with sPieMenu," In Proceedings of IUI'11, pp.379--382, 2011.

Digital Library

[11]

Let's start fast reading, http://www.ponp.jp/info/speed.html, (In Japanese).

[12]

Manfred Del Fabro, Klaus Schoeffmann, Laszlo Boszormenyi, "Instant video browsing: a tool for fast non-sequential hierarchical video browsing," In Proceedings of USAB'10, pp. 443--446, 2010.

Digital Library

[13]

Nobumasa Seiyama, Atsushi Imai, Takeshi Mishima, Tohru Takagi, and Eiichi Miyasaka, "Development of a high-quality real-time speech rate conversion system," IEICE Trans. D, Vol. J84-D-II, No.6, pp.918--926, 2001 (In Japanese).

[14]

Peker, K. A., and Divaskaran, A., "An extended framework for adaptive playback-based video summarization," SPIE Internet Multimedia Management Systems IV 5242, pp.26--33, 2003.

[15]

Peker, K. A., Divakaran, A., and Sun, H., "Constant pace skimming and temporal sub-sampling of video using motion activity," In Proceedings of IEEE International Conference on Image Processing (ICIP), Vol.3, pp.414--417, 2001.

[16]

Pierre Dragicevic, Gonzalo Ramos, Jacobo Bibliowitcz, Derek Nowrouzezahrai, Ravin Balakrishnan, and Karan Singh, "Video browsing by direct manipulation," In Proceedings of CHI'08, pp. 237--246, 2008.

Digital Library

[17]

Suporn Pongnumkul, Jue Wang, Gonzalo Ramos, and Michael Cohen, "Content-aware dynamic timeline for video browsing," In Proceedings of UIST'10, pp.139--142, 2010.

Digital Library

[18]

Vazquez Alvarez Yolanda, and Brewster A. Stephen, "Designing spatial audio interfaces to support multiple audio streams," In Proceedings of MobileHCI '10, pp.253--256, 2010.

Digital Library

[19]

Vemuri, S., DeCamp, P., Bender, W., and Schmandt, C., "Improving speech playback using time-compression and speech recognition," In Proceedings of CHI'04, pp.295--302, 2004.

Digital Library

[20]

Victor Valdes, and Jose' M. Martinez, "Introducing risplayer: real-time interactive generation of personalized video summaries," In Proceedings of SAPMIA'10, pp.9--14, 2010.

Digital Library

Cited By

Akter TSwaminathan MKapadia A(2024)Toward Effective Communication of AI-Based Decisions in Assistive Tools: Conveying Confidence and Doubt to People with Visual Impairments at Accelerated SpeechProceedings of the 21st International Web for All Conference10.1145/3677846.3677862(177-189)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3677846.3677862
Varnavsky A(2024)The Three-Stage Hierarchical Logistic Model Controlling Personalized Playback of Audio Information for Intelligent Tutoring SystemsIEEE Transactions on Learning Technologies10.1109/TLT.2024.343947017(2005-2019)Online publication date: 6-Aug-2024
https://dl.acm.org/doi/10.1109/TLT.2024.3439470
Dash PNeustaedter CJones BYip C(2022)The Design and Evaluation of Emergency Call Taking User Interfaces for Next Generation 9-1-1Frontiers in Human Dynamics10.3389/fhumd.2022.6706474Online publication date: 16-Feb-2022
https://doi.org/10.3389/fhumd.2022.670647
Show More Cited By

Index Terms

CinemaGazer: a system for watching videos at very high speed
1. Human-centered computing
  1. Human computer interaction (HCI)
2. Information systems
  1. Information systems applications
    1. Multimedia information systems

Recommendations

Interactive video stories from user generated content: a school concert use case
ICIDS'12: Proceedings of the 5th international conference on Interactive Storytelling

This paper describes a web-based narrative system able to generate video compilations, framed as event stories, from a shared repository of video recordings of the event itself and possibly of related events. For this, it employs narrative techniques ...
Automatic generation of video narratives from shared UGC
HT '11: Proceedings of the 22nd ACM conference on Hypertext and hypermedia

This paper introduces an evaluated approach to the automatic generation of video narratives from user generated content gathered in a shared repository. In the context of social events, end-users record video material with their personal cameras and ...
Development of a Questionnaire to Measure Immersion in Video Media: The Film IEQ
TVX '19: Proceedings of the 2019 ACM International Conference on Interactive Experiences for TV and Online Video

Researchers and practitioners are keen to understand how new video viewing practices driven by technological developments impact viewers’ experiences. We detail the development of the Immersive Experience Questionnaire for Film and TV (Film IEQ). An ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

AVI '12: Proceedings of the International Working Conference on Advanced Visual Interfaces

May 2012

846 pages

ISBN:9781450312875

DOI:10.1145/2254556

Editors:
Genny Tortora
Università di Salerno, Italy
,
Stefano Levialdi
Sapienza Università di Roma, Italy
,
Maurizio Tucci
Università di Salerno, Italy

Copyright © 2012 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Consulta Umbria SRL
University of Salerno: University of Salerno

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 May 2012

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Japan Society for the Promotion of Science

Conference

AVI'12

Sponsor:

University of Salerno

AVI'12: International Working Conference on Advanced Visual Interfaces

May 21 - 25, 2012

Capri Island, Italy

Acceptance Rates

Overall Acceptance Rate 128 of 490 submissions, 26%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

13
Total Citations
View Citations
301
Total Downloads

Downloads (Last 12 months)18
Downloads (Last 6 weeks)3

Reflects downloads up to 18 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Akter TSwaminathan MKapadia A(2024)Toward Effective Communication of AI-Based Decisions in Assistive Tools: Conveying Confidence and Doubt to People with Visual Impairments at Accelerated SpeechProceedings of the 21st International Web for All Conference10.1145/3677846.3677862(177-189)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3677846.3677862
Varnavsky A(2024)The Three-Stage Hierarchical Logistic Model Controlling Personalized Playback of Audio Information for Intelligent Tutoring SystemsIEEE Transactions on Learning Technologies10.1109/TLT.2024.343947017(2005-2019)Online publication date: 6-Aug-2024
https://dl.acm.org/doi/10.1109/TLT.2024.3439470
Dash PNeustaedter CJones BYip C(2022)The Design and Evaluation of Emergency Call Taking User Interfaces for Next Generation 9-1-1Frontiers in Human Dynamics10.3389/fhumd.2022.6706474Online publication date: 16-Feb-2022
https://doi.org/10.3389/fhumd.2022.670647
Yadav PBentaleb ALim MHuang JOoi WZimmermann RAlay ÖHsu CBegen A(2021)Playing chunk-transferred DASH segments at low latency with QLiveProceedings of the 12th ACM Multimedia Systems Conference10.1145/3458305.3463376(51-64)Online publication date: 24-Jun-2021
https://dl.acm.org/doi/10.1145/3458305.3463376
Sato KRekimoto J(2018)Detecting Utterance Scenes of a Specific PersonCompanion Proceedings of the 23rd International Conference on Intelligent User Interfaces10.1145/3180308.3180323(1-2)Online publication date: 5-Mar-2018
https://dl.acm.org/doi/10.1145/3180308.3180323
Nakae KTsukada K(2018)Support System to Review Manufacturing Workshop through Multiple VideosCompanion Proceedings of the 23rd International Conference on Intelligent User Interfaces10.1145/3180308.3180312(1-2)Online publication date: 5-Mar-2018
https://dl.acm.org/doi/10.1145/3180308.3180312
Higuchi KYonetani RSato YMark GFussell SLampe Cschraefel mHourcade JAppert CWigdor D(2017)EgoScanningProceedings of the 2017 CHI Conference on Human Factors in Computing Systems10.1145/3025453.3025821(6536-6546)Online publication date: 2-May-2017
https://dl.acm.org/doi/10.1145/3025453.3025821
Wang YKitayama DKawai YSumiya KIshikawa Y(2016)An Automatic Video Reinforcing System for TV Programs using Semantic Metadata from Closed CaptionsInternational Journal of Multimedia Data Engineering & Management10.4018/IJMDEM.20160101017:1(1-21)Online publication date: 1-Jan-2016
https://dl.acm.org/doi/10.4018/IJMDEM.2016010101
Hirai TMorishima S(2016)Frame-Wise Continuity-Based Video Summarization and StretchingProceedings, Part I, of the 22nd International Conference on MultiMedia Modeling - Volume 951610.1007/978-3-319-27671-7_67(806-817)Online publication date: 4-Jan-2016
https://dl.acm.org/doi/10.1007/978-3-319-27671-7_67
Song SHong JOakley ICho JBianchi ABegole BKim JInkpen KWoo W(2015)Automatically Adjusting the Speed of E-Learning VideosProceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems10.1145/2702613.2732711(1451-1456)Online publication date: 18-Apr-2015
https://dl.acm.org/doi/10.1145/2702613.2732711
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents