demonstration

VideoDiscovery: An Automatic Short-Video Generation System for E-commerce Live-streaming

Authors:

Yanhao Zhang,

Qiang Wang,

Yun Zheng,

Pan Pan,

Yinghui XuAuthors Info & Claims

MM '21: Proceedings of the 29th ACM International Conference on Multimedia

Pages 2771 - 2773

https://doi.org/10.1145/3474085.3478554

Published: 17 October 2021 Publication History

Get Access

Abstract

We demonstrate an end-to-end intelligent system of short-video generation for live-streaming, namely "VideoDiscovery'', which aims to automatically produce batches of high-value short-videos by discovering and organizing highlight content for commodity delivery. Traditionally, production of high-value short-videos for live-streaming is cost-expensive and time-consuming, which also demands experienced editing skills. To this end, we construct this system with three modules: 1)Semantic segment structuring first decodes live-streaming into a series of semantic candidates including commodity, Q&A, action, multi-modal, etc. 2)Hierarchical search engine performs automatically searches for semantically matching candidate shots from scripts. 3)Script-aware shot assembly is formulated combination problem over a graph of shots, considering temporal constraints and candidate idioms. Specifically, given an input live-streaming, the recommended video results illustrate diverse visual-semantic content, and follow script guidelines. Currently, our system has been launched online for Taobao stores, which enables to generate appealing videos in minutes for advertising and recommendation. The entry of our system is available at https://discovery.aliyun.com/index.

References

[1]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).

Google Scholar

[2]

Mackenzie Leake, Abe Davis, Anh Truong, and Maneesh Agrawala. 2017. Computational video editing for dialogue-driven scenes. ACM Trans. Graph. 36, 4 (2017), 130--1.

Digital Library

Google Scholar

[3]

Junhua Liao, Haihan Duan, Xin Li, Haoran Xu, Yanbing Yang, Wei Cai, Yanru Chen, and Liangyin Chen. 2020. Occlusion Detection for Automatic Video Editing. In Proceedings of the 28th ACM International Conference on Multimedia. 2255--2263.

Digital Library

Google Scholar

[4]

Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. 2021. Learning transferable visual models from natural language supervision. arXiv preprint arXiv:2103.00020 (2021).

Google Scholar

[5]

Miao Wang, Guo-Wei Yang, Shi-Min Hu, Shing-Tung Yau, and Ariel Shamir. 2019. Write-a-video: computational video montage from themed text. ACM Trans. Graph. 38, 6 (2019), 177--1.

Digital Library

Google Scholar

[6]

Songyang Zhang, Houwen Peng, Jianlong Fu, and Jiebo Luo. 2020. Learning 2d temporal adjacent networks for moment localization with natural language. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 12870--12877.

Crossref

Google Scholar

[7]

Yanhao Zhang, Qiang Wang, Pan Pan, Yun Zheng, Cheng Da, Siyang Sun, and Yinghui Xu. 2021. Fashion Focus: Multi-modal Retrieval System for Video Commodity Localization in E-commerce. In Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI2021, February 2-9, 2021. 16127--16128.

Google Scholar

Cited By

View all

Index Terms

VideoDiscovery: An Automatic Short-Video Generation System for E-commerce Live-streaming
1. Information systems
  1. Information retrieval
    1. Users and interactive retrieval
      1. Personalization
  2. Information systems applications
    1. Multimedia information systems
      1. Multimedia content creation

Recommendations

Smart style: combining RDF semantics with XML document transformations

'Document Web' (XML-based) and 'Semantic Web' (RDF-based) methods are often surprisingly hard to integrate. In this paper, we analyse the role of (RDF) semantics in selecting, structuring and styling (XML) content. We use our analysis to derive a more ...
Multimedia Analytics Challenges and Opportunities for Creating Interactive Radio Content
MultiMedia Modeling
Abstract
The emergence of audio streaming services and evolved listening habits notwithstanding, broadcast radio is still a popular medium that plays an important role in contemporary users’ media consumption mix. While radio’s strength has traditionally ...
Spotlight browsing of resource archives
HYPERTEXT '05: Proceedings of the sixteenth ACM conference on Hypertext and hypermedia

Many organizations, particularly in the heritage sector, have large archives of digital content that they could make available to the general public or special interest groups if they had the appropriate mechanisms. Currently, these organizations can ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

MM '21: Proceedings of the 29th ACM International Conference on Multimedia

October 2021

5796 pages

ISBN:9781450386517

DOI:10.1145/3474085

General Chairs:
Heng Tao Shen
University of Electronic Science&Technology of China, China
,
Yueting Zhuang
Zhejiang University, China
,
John R. Smith
IBM, USA
,
Program Chairs:
Yang Yang
University of Electronic Science and Technology of China, China
,
Pablo Cesar
CWI&TU Delft, The Netherlands
,
Florian Metze
FACEBOOK, Inc., USA
,
Balakrishnan Prabhakaran
University of Texas at Dallas, USA

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2021

Check for updates

Author Tags

Qualifiers

Demonstration

Conference

MM '21

Sponsor:

SIGMM

MM '21: ACM Multimedia Conference

October 20 - 24, 2021

Virtual Event, China

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
290
Total Downloads

Downloads (Last 12 months)23
Downloads (Last 6 weeks)2

Reflects downloads up to 14 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Index Terms

Recommendations

Smart style: combining RDF semantics with XML document transformations

Multimedia Analytics Challenges and Opportunities for Creating Interactive Radio Content

Spotlight browsing of resource archives