research-article

SeqVAE: Sequence variational autoencoder with policy gradient

Authors:

Fanyu DingAuthors Info & Claims

Applied Intelligence, Volume 51, Issue 12

Pages 9030 - 9037

https://doi.org/10.1007/s10489-021-02374-7

Published: 01 December 2021 Publication History

Abstract

In the paper, we propose a variant of Variational Autoencoder (VAE) for sequence generation task, called SeqVAE, which is a combination of recurrent VAE and policy gradient in reinforcement learning. The goal of SeqVAE is to reduce the deviation of the optimization goal of VAE, which we achieved by adding the policy-gradient loss to SeqVAE. In the paper, we give two ways to calculate the policy-gradient loss, one is from SeqGAN and the other is proposed by us. In the experiments on them, our proposed method is better than all baselines, and experiments show that SeqVAE can alleviate the “post-collapse” problem. Essentially, SeqVAE can be regarded as a combination of VAE and Generative Adversarial Net (GAN) and has better learning ability than the plain VAE because of the increased adversarial process. Finally, an application of our SeqVAE to music melody generation is available online¹².

References

[1]

Arjovsky M, Chintala S, Bottou L (2017) Wasserstein gan. arXiv:1701.07875

[2]

Bachman P, Precup D Data generation as sequential decision making. In: Advances in Neural Information Processing Systems, pp. 3249–3257

[3]

Bao J, Chen D, Wen F, Li H, Hua G Cvae-gan: fine-grained image generation through asymmetric training. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2745–2754

[4]

Bengio S, Vinyals O, Jaitly N, Shazeer N Scheduled sampling for sequence prediction with recurrent neural networks. In: Advances in Neural Information Processing Systems, pp. 1171–1179

[5]

Bowman SR, Vilnis L, Vinyals O, Dai AM, Jozefowicz R, Bengio S (2015) Generating sentences from a continuous space. arXiv:1511.06349

[6]

Carter S and Nielsen M Using artificial intelligence to augment human intelligence Distill 2017 2 12 e9

[7]

Dong HW, Hsiao WY, Yang LC, Yang YH Musegan: Multi-track sequential generative adversarial networks for symbolic music generation and accompaniment. In: Thirty-Second AAAI Conference on Artificial Intelligence

[8]

Engel J, Resnick C, Roberts A, Dieleman S, Norouzi M, Eck D, Simonyan K Neural audio synthesis of musical notes with wavenet autoencoders. In: Proceedings of the 34th International Conference on Machine Learning, vol 70, pp 1068–1077. JMLR. org

[9]

Goodfellow I (2016) Generative adversarial networks for text http://goo.gl/wg9DR7

[10]

Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y Generative adversarial nets. In: Advances in neural information processing systems, pp. 2672–2680

[11]

Ha D, Eck D (2017) A neural representation of sketch drawings. arXiv:1704.03477

[12]

He K, Zhang X, Ren S, Sun J Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

[13]

Hochreiter S and Schmidhuber J Long short-term memory Neural Comput 1997 9 8 1735-1780

[14]

Huszár F (2015) How (not) to train your generative model: Scheduled sampling, likelihood, adversary. arXiv:1511.05101

[15]

Karras T, Laine S, Aila T A style-based generator architecture for generative adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 4401–4410

[16]

Kim Y (2014) Convolutional neural networks for sentence classification. arXiv:1408.5882

[17]

KingmaD A (2015) A methodforstochasticoptimization. arxiv: 1412.6980

[18]

Konda VR, Tsitsiklis JN Actor-critic algorithms. In: Advances in neural information processing systems, pp 1008–1014

[19]

Mnih V, Kavukcuoglu K, Silver D, Graves A, Antonoglou I, Wierstra D, Riedmiller M (2013) Playing atari with deep reinforcement learning. arXiv:1312.5602

[20]

Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, and Ostrovski G Human-level control through deep reinforcement learning Nature 2015 518 7540 529-533

[21]

Oord A.v.d, Dieleman S, Zen H, Simonyan K, Vinyals O, Graves A, Kalchbrenner N, Senior A, Kavukcuoglu K (2016) Wavenet: A generative model for raw audio. arXiv:1609.03499

[22]

Papineni K, Roukos S, Ward T, Zhu WJ Bleu: a method for automatic evaluation of machine translation. In: Proceedings of the 40th annual meeting on association for computational linguistics, pp 311–318. Association for Computational Linguistics

[23]

Roberts A, Engel J, Raffel C, Hawthorne C, Eck D (2018) A hierarchical latent vector model for learning long-term structure in music. arXiv:1803.05428

[24]

Semeniuta S, Severyn A, Barth E (2017) A hybrid convolutional variational autoencoder for text generation. arXiv:1702.02390

[25]

Sutton RS, McAllester DA, Singh SP, Mansour Y Policy gradient methods for reinforcement learning with function approximation. In: Advances in neural information processing systems, pp. 1057–1063

[26]

Veselý K, Ghoshal A, Burget L, Povey D Sequence-discriminative training of deep neural networks. In: Interspeech, vol 2013, pp 2345–2349

[27]

Wang H, Qin Z, Wan T Text generation based on generative adversarial nets with latent variables. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, pp 92–103

[28]

Yu L, Zhang W, Wang J, Yu Y Seqgan: Sequence generative adversarial nets with policy gradient. In: Thirty-First AAAI Conference on Artificial Intelligence

[29]

Zhou F, Yang S, Fujita H, Chen D, and Wen C Deep learning fault diagnosis method based on global optimization gan for unbalanced data Knowl-Based Syst 2020 187 104 837

[30]

Zhu JY, Park T, Isola P, Efros AA Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE international conference on computer vision, pp. 2223–2232

Cited By

Wang JWu JJia CZhang Z(2023)Self-supervised variational autoencoder towards recommendation by nested contrastive learningApplied Intelligence10.1007/s10489-023-04488-653:15(18887-18897)Online publication date: 1-Aug-2023
https://dl.acm.org/doi/10.1007/s10489-023-04488-6
Qiu DChen LYu Y(2023)Document-level paraphrase generation base on attention enhanced graph LSTMApplied Intelligence10.1007/s10489-022-04031-z53:9(10459-10471)Online publication date: 1-May-2023
https://dl.acm.org/doi/10.1007/s10489-022-04031-z

Recommendations

VAEPP: Variational Autoencoder with a Pull-Back Prior
Neural Information Processing
Abstract
Many approaches to training generative models by distinct training objectives have been proposed in the past. Variational Autoencoder (VAE) is an outstanding model of them based on log-likelihood. In this paper, we propose a novel learnable prior, ...
Text Generation Based on Generative Adversarial Nets with Latent Variables
Advances in Knowledge Discovery and Data Mining
Abstract
In this paper, we propose a model using generative adversarial net (GAN) to generate realistic text. Instead of using standard GAN, we combine variational autoencoder (VAE) with generative adversarial net. The use of high-level latent random ...
Initial Study of Batik Generation using Variational Autoencoder
Abstract
One of the most promising architectures for generative models is the variational autoencoder (VAE). To reconstruct Batik patterns for this work, we used a deep convolutional VAE architecture. Reconstruction outcomes from various batik motifs are ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Applied Intelligence

Applied Intelligence Volume 51, Issue 12

Dec 2021

516 pages

ISSN:0924-669X

Issue’s Table of Contents

© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021.

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 01 December 2021

Accepted: 23 March 2021

Author Tags

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 25 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Wang JWu JJia CZhang Z(2023)Self-supervised variational autoencoder towards recommendation by nested contrastive learningApplied Intelligence10.1007/s10489-023-04488-653:15(18887-18897)Online publication date: 1-Aug-2023
https://dl.acm.org/doi/10.1007/s10489-023-04488-6
Qiu DChen LYu Y(2023)Document-level paraphrase generation base on attention enhanced graph LSTMApplied Intelligence10.1007/s10489-022-04031-z53:9(10459-10471)Online publication date: 1-May-2023
https://dl.acm.org/doi/10.1007/s10489-022-04031-z

View Options

View options

Media

Figures

Other

Tables

View Issue’s Table of Contents