Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2713168.2713178acmconferencesArticle/Chapter ViewAbstractPublication PagesmmsysConference Proceedingsconference-collections
research-article

Video composition by the crowd: a system to compose user-generated videos in near real-time

Published: 18 March 2015 Publication History

Abstract

To compose high-quality movies directors need life-long learning and talent. User-generated video defines a new era of video production in which non-professionals record videos and share them on platforms such as YouTube. As hiring professional directors results in high costs, our work focuses on replacing those directors by crowdsourcing. The proposed system allows users to record and stream live videos to servers on which workers create a video mashup. A smartphone application for recording live video has been designed that supports the composition in the crowd by a multi-modal analysis of the recording quality. The contributions of this work are: The proposed system demonstrates that composing a large number of video views can be achieved in near real-time. Second, the system achieves comparable video quality for user-generated video in comparison to manual composition. Third, it offers insights on how to design real-time capable crowdsourcing systems. Fourth, by leveraging multi-modal features that can already be evaluated during recording the number of streams considered for presentation can be reduced.

References

[1]
S. A. Ay, R. Zimmermann, and S. H. Kim. Viewable scene modeling for geospatial video search. In Proceeding of the 16th ACM international conference on Multimedia - MM '08, pages 309--318, 2008.
[2]
M. S. Bernstein, J. Brandt, R. C. Miller, and D. R. Karger. Crowds in two seconds: enabling realtime crowd-powered interfaces. In ACM Symposium on User Interface Software and Technology, pages 33--42, 2011.
[3]
F. Cricri, K. Dabov, I. D. D. Curcio, S. Mate, and M. Gabbouj. Multimodal extraction of events and of information about the recording activity in user generated videos. Multimedia Tools and Applications, 70(1):119--158, 2012.
[4]
F. Cricri, M. Roininen, J. Leppaenen, S. Mate, I. Curcio, S. Uhlmann, and M. Gabbouj. Sport type classification of mobile videos. IEEE Transactions on Multimedia, 16(4): 917--932, 2014.
[5]
A. Engström, M. Esbjörnsson, and O. Juhlin. Mobile Collaborative Live Video Mixing. In Int. Conference on Human Computer Interaction with Mobile Devices and Services, pages 157--166, 2008.
[6]
R. Graham. An efficient algorithm for determining the convex hull of a finite planar set. In Information Processing Letters, 1972.
[7]
ITU. ITU-R Recommendation P. 910. Technical report, 2008.
[8]
ITU. ITU-R Recommendation BT.500. Technical report, 2012.
[9]
S. Kim, Y. Lu, and G. Constantinou. MediaQ: mobile multimedia management system. In 5th ACM Multimedia Systems Conference, pages 224--235, 2014.
[10]
W. S. Lasecki, K. I. Murray, S. White, R. C. Miller, and J. P. Bigham. Real-time crowd control of existing interfaces. In ACM Symposium on User Interface Software and Technology, pages 23--32.
[11]
M. A. Mughal and O. Juhlin. Context-dependent software solutions to handle video synchronization and delay in collaborative live mobile video production. Personal and Ubiquitous Computing, 18(3):709--721, 2013.
[12]
M. J. Murphy, C. D. Miller, W. S. Lasecki, and J. P. Bigham. Adaptive time windows for real-time crowd captioning. In Extended Abstracts on Human Factors in Computing Systems, pages 13--18, 2013.
[13]
M. K. Saini, R. Gadde, S. Yan, and W. T. Ooi. MoViMash: Online Mobile Video Mashup. In ACM Int. Conference on Multimedia, pages 139--148, 2012.
[14]
P. Shrestha, P. H. de With, H. Weda, M. Barbieri, and E. H. Aarts. Automatic mashup generation from multiple-camera concert recordings. In ACM Int. Conference on Multimedia, pages 541--550, 2010.
[15]
G. Wang, B. Seo, and R. Zimmermann. Automatic positioning data correction for sensor-annotated mobile videos. In ACM Int. Conference on Advances in Geographic Information Systems, pages 470--473, 2012.
[16]
S. Wilk and W. Effelsberg. The influence of camera shakes, harmful occlusions and camera misalignment on the perceived quality in user-generated video. In IEEE Int. Conference on Multimedia and Expo, pages 1--6, 2014.
[17]
T. Yan, V. Kumar, and D. Ganesan. Crowdsearch: exploiting crowds for accurate real-time image search on mobile phones. In ACM Int. Conference on Mobile Systems, Applications, and Services, pages 77--90, 2010.
[18]
K.-c. Yang, C. Guest, and P. Das. Perceptual Sharpness Metric (PSM) for Compressed Video. In IEEE Int. Conference on Multimedia and Expo, pages 777--780, 2006.

Cited By

View all
  • (2024)Composition and Transmission of Videos Generated by Multiple UsersFrom Multimedia Communications to the Future Internet10.1007/978-3-031-71874-8_14(202-218)Online publication date: 13-Sep-2024
  • (2020)A Controlled Benchmark of Video Violence Detection TechniquesInformation10.3390/info1106032111:6(321)Online publication date: 13-Jun-2020
  • (2018)The crowd as a cameramanMultimedia Tools and Applications10.1007/s11042-016-4257-677:1(597-629)Online publication date: 1-Jan-2018
  • Show More Cited By

Index Terms

  1. Video composition by the crowd: a system to compose user-generated videos in near real-time

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MMSys '15: Proceedings of the 6th ACM Multimedia Systems Conference
    March 2015
    277 pages
    ISBN:9781450333511
    DOI:10.1145/2713168
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 18 March 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. crowdsourcing
    2. video composition
    3. video mashup

    Qualifiers

    • Research-article

    Funding Sources

    • DFG

    Conference

    MMSys '15
    Sponsor:
    MMSys '15: Multimedia Systems Conference 2015
    March 18 - 20, 2015
    Oregon, Portland

    Acceptance Rates

    MMSys '15 Paper Acceptance Rate 12 of 41 submissions, 29%;
    Overall Acceptance Rate 176 of 530 submissions, 33%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Composition and Transmission of Videos Generated by Multiple UsersFrom Multimedia Communications to the Future Internet10.1007/978-3-031-71874-8_14(202-218)Online publication date: 13-Sep-2024
    • (2020)A Controlled Benchmark of Video Violence Detection TechniquesInformation10.3390/info1106032111:6(321)Online publication date: 13-Jun-2020
    • (2018)The crowd as a cameramanMultimedia Tools and Applications10.1007/s11042-016-4257-677:1(597-629)Online publication date: 1-Jan-2018
    • (2018)Automated Video Mashups: Research and ChallengesMediaSync10.1007/978-3-319-65840-7_6(167-190)Online publication date: 27-Mar-2018
    • (2017)Video Annotation by Cascading MicrotasksProceedings of the 23rd Brazillian Symposium on Multimedia and the Web10.1145/3126858.3126897(49-56)Online publication date: 17-Oct-2017
    • (2016)One Sensor is not EnoughProceedings of the 24th ACM international conference on Multimedia10.1145/2964284.2967297(626-630)Online publication date: 1-Oct-2016
    • (2016)RivuletProceedings of the ACM International Conference on Interactive Experiences for TV and Online Video10.1145/2932206.2932211(31-42)Online publication date: 17-Jun-2016
    • (2016)Scalable mobile quality assessment for User-generated Video2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)10.1109/ICMEW.2016.7574687(1-6)Online publication date: Jul-2016
    • (2015)Continuous Processing of Real-Time Multimedia Requests Using Semantic TechniquesProceedings of the 13th International Conference on Advances in Mobile Computing and Multimedia10.1145/2837126.2837170(216-220)Online publication date: 11-Dec-2015
    • (2015)Cloudlet-based Large-scale 3D Reconstruction Using Real-time Data from Mobile Depth CamerasProceedings of the 6th International Workshop on Mobile Cloud Computing and Services10.1145/2802130.2802134(8-14)Online publication date: 11-Sep-2015
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media