Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3474085.3478554acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
demonstration

VideoDiscovery: An Automatic Short-Video Generation System for E-commerce Live-streaming

Published: 17 October 2021 Publication History

Abstract

We demonstrate an end-to-end intelligent system of short-video generation for live-streaming, namely "VideoDiscovery'', which aims to automatically produce batches of high-value short-videos by discovering and organizing highlight content for commodity delivery. Traditionally, production of high-value short-videos for live-streaming is cost-expensive and time-consuming, which also demands experienced editing skills. To this end, we construct this system with three modules: 1)Semantic segment structuring first decodes live-streaming into a series of semantic candidates including commodity, Q&A, action, multi-modal, etc. 2)Hierarchical search engine performs automatically searches for semantically matching candidate shots from scripts. 3)Script-aware shot assembly is formulated combination problem over a graph of shots, considering temporal constraints and candidate idioms. Specifically, given an input live-streaming, the recommended video results illustrate diverse visual-semantic content, and follow script guidelines. Currently, our system has been launched online for Taobao stores, which enables to generate appealing videos in minutes for advertising and recommendation. The entry of our system is available at https://discovery.aliyun.com/index.

References

[1]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).
[2]
Mackenzie Leake, Abe Davis, Anh Truong, and Maneesh Agrawala. 2017. Computational video editing for dialogue-driven scenes. ACM Trans. Graph. 36, 4 (2017), 130--1.
[3]
Junhua Liao, Haihan Duan, Xin Li, Haoran Xu, Yanbing Yang, Wei Cai, Yanru Chen, and Liangyin Chen. 2020. Occlusion Detection for Automatic Video Editing. In Proceedings of the 28th ACM International Conference on Multimedia. 2255--2263.
[4]
Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. 2021. Learning transferable visual models from natural language supervision. arXiv preprint arXiv:2103.00020 (2021).
[5]
Miao Wang, Guo-Wei Yang, Shi-Min Hu, Shing-Tung Yau, and Ariel Shamir. 2019. Write-a-video: computational video montage from themed text. ACM Trans. Graph. 38, 6 (2019), 177--1.
[6]
Songyang Zhang, Houwen Peng, Jianlong Fu, and Jiebo Luo. 2020. Learning 2d temporal adjacent networks for moment localization with natural language. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 12870--12877.
[7]
Yanhao Zhang, Qiang Wang, Pan Pan, Yun Zheng, Cheng Da, Siyang Sun, and Yinghui Xu. 2021. Fashion Focus: Multi-modal Retrieval System for Video Commodity Localization in E-commerce. In Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI2021, February 2-9, 2021. 16127--16128.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '21: Proceedings of the 29th ACM International Conference on Multimedia
October 2021
5796 pages
ISBN:9781450386517
DOI:10.1145/3474085
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2021

Check for updates

Author Tags

  1. content structuring
  2. short-video generation
  3. video assembly

Qualifiers

  • Demonstration

Conference

MM '21
Sponsor:
MM '21: ACM Multimedia Conference
October 20 - 24, 2021
Virtual Event, China

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 290
    Total Downloads
  • Downloads (Last 12 months)23
  • Downloads (Last 6 weeks)2
Reflects downloads up to 14 Nov 2024

Other Metrics

Citations

Cited By

View all

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media