Computer Science > Computer Vision and Pattern Recognition

arXiv:2410.00307 (cs)

[Submitted on 1 Oct 2024]

Title:RadGazeGen: Radiomics and Gaze-guided Medical Image Generation using Diffusion Models

Authors:Moinak Bhattacharya, Gagandeep Singh, Shubham Jain, Prateek Prasanna

Abstract:In this work, we present RadGazeGen, a novel framework for integrating experts' eye gaze patterns and radiomic feature maps as controls to text-to-image diffusion models for high fidelity medical image generation. Despite the recent success of text-to-image diffusion models, text descriptions are often found to be inadequate and fail to convey detailed disease-specific information to these models to generate clinically accurate images. The anatomy, disease texture patterns, and location of the disease are extremely important to generate realistic images; moreover the fidelity of image generation can have significant implications in downstream tasks involving disease diagnosis or treatment repose assessment. Hence, there is a growing need to carefully define the controls used in diffusion models for medical image generation. Eye gaze patterns of radiologists are important visuo-cognitive information, indicative of subtle disease patterns and spatial location. Radiomic features further provide important subvisual cues regarding disease phenotype. In this work, we propose to use these gaze patterns in combination with standard radiomics descriptors, as controls, to generate anatomically correct and disease-aware medical images. RadGazeGen is evaluated for image generation quality and diversity on the REFLACX dataset. To demonstrate clinical applicability, we also show classification performance on the generated images from the CheXpert test set (n=500) and long-tailed learning performance on the MIMIC-CXR-LT test set (n=23550).

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2410.00307 [cs.CV]
	(or arXiv:2410.00307v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2410.00307

Submission history

From: Moinak Bhattacharya [view email]
[v1] Tue, 1 Oct 2024 01:10:07 UTC (23,764 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:RadGazeGen: Radiomics and Gaze-guided Medical Image Generation using Diffusion Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:RadGazeGen: Radiomics and Gaze-guided Medical Image Generation using Diffusion Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators