DOI: 10.1145/3394171.3413526
Research Article

Stable Video Style Transfer Based on Partial Convolution with Depth-Aware Supervision

Published: 12 October 2020

Abstract

As an important research topic in digital media art, neural-learning-based video style transfer has attracted increasing attention. Many recent works incorporate optical flow into image style transfer frameworks to preserve inter-frame coherency and prevent flicker. However, these methods rely heavily on paired datasets of content video and stylized video, which are often difficult to obtain. Another limitation of existing methods is that, while maintaining inter-frame coherency, they introduce strong ghosting artifacts. To address these problems, this paper makes the following contributions: (1) it presents a novel training framework for video style transfer that does not depend on a video dataset of the target style; (2) it is the first to focus on the ghosting problem present in most previous works, using a partial convolution-based strategy to exploit inter-frame context and correlation, together with an additional depth loss that constrains the generated frames, suppressing ghosting artifacts while preserving stability. Extensive experiments demonstrate that our method produces natural and stable video frames in the target style. Qualitative and quantitative comparisons also show that the proposed approach outperforms previous works in terms of overall image quality and inter-frame stability. To facilitate future research, we publish our experiment code at https://github.com/Huage001/Artistic-Video-Partial-Conv-Depth-Loss.
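The partial convolution the abstract refers to (Liu et al., 2018) re-weights each convolution window by the fraction of valid pixels beneath it, so a network can draw on masked, warped previous-frame context without letting invalid regions bleed into the output. The following is a minimal single-channel NumPy sketch of the operator itself, not the authors' network; the function name and the loop-based layout are ours for illustration.

```python
import numpy as np

def partial_conv2d(x, mask, weight, bias=0.0):
    """Single-channel partial convolution (after Liu et al., 2018):
    convolve only over pixels marked valid by a binary mask,
    re-normalize each window by its count of valid pixels, and
    produce an updated mask for the next layer."""
    k = weight.shape[0]          # square kernel, size k x k
    pad = k // 2
    xp = np.pad(x * mask, pad)   # invalid pixels contribute zero
    mp = np.pad(mask, pad)
    H, W = x.shape
    out = np.zeros((H, W))
    new_mask = np.zeros((H, W))
    for i in range(H):
        for j in range(W):
            valid = mp[i:i + k, j:j + k].sum()
            if valid > 0:
                conv = (xp[i:i + k, j:j + k] * weight).sum()
                # re-normalize by (window size / valid-pixel count)
                out[i, j] = conv * (k * k / valid) + bias
                new_mask[i, j] = 1.0  # window saw >= 1 valid pixel
    return out, new_mask

# Tiny check: with a fully valid mask and an all-ones 3x3 kernel, the
# centre pixel reduces to a plain box filter (sum of 0..8 = 36).
x = np.arange(9, dtype=float).reshape(3, 3)
out, m = partial_conv2d(x, np.ones((3, 3)), np.ones((3, 3)))
```

Near image borders (or mask holes) the re-normalization scales the partial sum up, which is what keeps partially valid windows from being systematically darker than fully valid ones.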

Supplementary Material

ZIP File (mmfp2161aux.zip)
* PartialConvDepthLossVST.wmv: Supplementary video providing a better visual demonstration of our video style transfer method.
MP4 File (3394171.3413526.mp4)
We propose a novel training framework for video style transfer that learns the general style of a set of images and relies only on a target image dataset rather than a video dataset. Meanwhile, ours is the first work to focus on the ghosting problem present in most previous works: we use a partial convolution-based strategy to exploit inter-frame context and correlation, together with an additional depth loss that constrains the generated frames, suppressing ghosting artifacts while preserving stability. Extensive experiments demonstrate that our method produces natural and stable video frames in the target style. Qualitative and quantitative comparisons also show that the proposed approach outperforms previous works in terms of overall image quality and inter-frame stability. To facilitate future research, we publish our experiment code at https://github.com/Huage001/Artistic-Video-Partial-Conv-Depth-Loss.
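For readers unfamiliar with the flow-based coherency term that methods in this space optimize, here is a minimal NumPy sketch of an occlusion-masked temporal consistency loss: the previous stylized frame is warped by optical flow and compared with the current one only where the flow is reliable. This is an illustrative formulation, not the paper's exact loss; the function names and the nearest-neighbour warp are our simplifications.

```python
import numpy as np

def warp(frame, flow):
    """Backward-warp `frame` by a dense flow field (nearest-neighbour
    sampling for brevity; real systems use bilinear sampling)."""
    H, W = frame.shape[:2]
    ys, xs = np.mgrid[0:H, 0:W]
    src_x = np.clip(np.round(xs + flow[..., 0]).astype(int), 0, W - 1)
    src_y = np.clip(np.round(ys + flow[..., 1]).astype(int), 0, H - 1)
    return frame[src_y, src_x]

def temporal_loss(stylized_t, stylized_prev, flow, occlusion_mask):
    """Mean squared difference between the current stylized frame and
    the warped previous stylized frame, counted only where the flow is
    reliable (occlusion_mask == 1)."""
    diff = (stylized_t - warp(stylized_prev, flow)) ** 2
    n = occlusion_mask.sum()
    return (diff * occlusion_mask).sum() / max(n, 1.0)

# A frame shifted left by one pixel, with flow (+1, 0) and the wrapped
# last column masked out, incurs zero temporal loss.
prev = np.arange(16.0).reshape(4, 4)
flow = np.zeros((4, 4, 2))
flow[..., 0] = 1.0
cur = np.roll(prev, -1, axis=1)
mask = np.ones((4, 4))
mask[:, -1] = 0.0
```

The occlusion mask is typically derived from a forward-backward flow consistency check, so dis-occluded regions are free to take on new stylized content instead of being forced to match a stale warp.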


Cited By

View all
  • (2023) "Master: Meta Style Transformer for Controllable Zero-Shot and Few-Shot Artistic Style Transfer." 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 18329-18338. DOI: 10.1109/CVPR52729.2023.01758. Online publication date: Jun-2023.
  • (2023) "A Temporal Consistency Enhancement Algorithm Based on Pixel Flicker Correction." Neural Information Processing, 65-78. DOI: 10.1007/978-981-99-1639-9_6. Online publication date: 15-Apr-2023.
  • (2021) "AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer." 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 6629-6638. DOI: 10.1109/ICCV48922.2021.00658. Online publication date: Oct-2021.


Information & Contributors

Information

Published In

MM '20: Proceedings of the 28th ACM International Conference on Multimedia
October 2020
4889 pages
ISBN:9781450379885
DOI:10.1145/3394171

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. cycle-gan
  2. ghosting artifact
  3. optical flow
  4. video style transfer

Qualifiers

  • Research-article

Funding Sources

  • National High Technology Research and Development Program of China
  • National Natural Science Foundation of China
  • Innovation Fund of State Key Laboratory for Novel Software Technology

Conference

MM '20

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

