Computer Science > Machine Learning

arXiv:2407.13734 (cs)

[Submitted on 18 Jul 2024]

Title:Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review

Authors:Masatoshi Uehara, Yulai Zhao, Tommaso Biancalani, Sergey Levine

Abstract:This tutorial provides a comprehensive survey of methods for fine-tuning diffusion models to optimize downstream reward functions. While diffusion models are widely known to provide excellent generative modeling capability, practical applications in domains such as biology require generating samples that maximize some desired metric (e.g., translation efficiency in RNA, docking score in molecules, stability in protein). In these cases, the diffusion model can be optimized not only to generate realistic samples but also to explicitly maximize the measure of interest. Such methods are based on concepts from reinforcement learning (RL). We explain the application of various RL algorithms, including PPO, differentiable optimization, reward-weighted MLE, value-weighted sampling, and path consistency learning, tailored specifically for fine-tuning diffusion models. We aim to explore fundamental aspects such as the strengths and limitations of different RL-based fine-tuning algorithms across various scenarios, the benefits of RL-based fine-tuning compared to non-RL-based approaches, and the formal objectives of RL-based fine-tuning (target distributions). Additionally, we aim to examine their connections with related topics such as classifier guidance, Gflownets, flow-based diffusion models, path integral control theory, and sampling from unnormalized distributions such as MCMC. The code of this tutorial is available at this https URL

Comments:	We plan to add more content/codes. Please let us know if there are any comments
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
Cite as:	arXiv:2407.13734 [cs.LG]
	(or arXiv:2407.13734v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.13734

Submission history

From: Masatoshi Uehara [view email]
[v1] Thu, 18 Jul 2024 17:35:32 UTC (2,289 KB)

Computer Science > Machine Learning

Title:Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators