Computer Science > Computer Vision and Pattern Recognition

arXiv:2204.02035 (cs)

[Submitted on 5 Apr 2022]

Title:DT2I: Dense Text-to-Image Generation from Region Descriptions

Authors:Stanislav Frolov, Prateek Bansal, Jörn Hees, Andreas Dengel

View PDF

Abstract:Despite astonishing progress, generating realistic images of complex scenes remains a challenging problem. Recently, layout-to-image synthesis approaches have attracted much interest by conditioning the generator on a list of bounding boxes and corresponding class labels. However, previous approaches are very restrictive because the set of labels is fixed a priori. Meanwhile, text-to-image synthesis methods have substantially improved and provide a flexible way for conditional image generation. In this work, we introduce dense text-to-image (DT2I) synthesis as a new task to pave the way toward more intuitive image generation. Furthermore, we propose DTC-GAN, a novel method to generate images from semantically rich region descriptions, and a multi-modal region feature matching loss to encourage semantic image-text matching. Our results demonstrate the capability of our approach to generate plausible images of complex scenes using region captions.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2204.02035 [cs.CV]
	(or arXiv:2204.02035v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2204.02035

Submission history

From: Stanislav Frolov [view email]
[v1] Tue, 5 Apr 2022 07:57:11 UTC (3,375 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DT2I: Dense Text-to-Image Generation from Region Descriptions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DT2I: Dense Text-to-Image Generation from Region Descriptions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators