Rethinking Data Augmentation for Single-Source Domain Generalization in Medical Image Segmentation

Authors

  • Zixian Su University of Liverpool, Liverpool, the United Kingdom Xi'an Jiaotong-Liverpool University, Suzhou, China
  • Kai Yao University of Liverpool, Liverpool, the United Kingdom Xi'an Jiaotong-Liverpool University, Suzhou, China
  • Xi Yang Xi’an Jiaotong Liverpool University, Suzhou, China
  • Kaizhu Huang Duke Kunshan University, Kunshan, China
  • Qiufeng Wang Xi'an Jiaotong-Liverpool University, Suzhou, China
  • Jie Sun Xi'an Jiaotong-Liverpool University, Suzhou, China

DOI:

https://doi.org/10.1609/aaai.v37i2.25332

Keywords:

CV: Medical and Biological Imaging

Abstract

Single-source domain generalization (SDG) in medical image segmentation is a challenging yet essential task as domain shifts are quite common among clinical image datasets. Previous attempts most conduct global-only/random augmentation. Their augmented samples are usually insufficient in diversity and informativeness, thus failing to cover the possible target domain distribution. In this paper, we rethink the data augmentation strategy for SDG in medical image segmentation. Motivated by the class-level representation invariance and style mutability of medical images, we hypothesize that unseen target data can be sampled from a linear combination of C (the class number) random variables, where each variable follows a location-scale distribution at the class level. Accordingly, data augmented can be readily made by sampling the random variables through a general form. On the empirical front, we implement such strategy with constrained Bezier transformation on both global and local (i.e. class-level) regions, which can largely increase the augmentation diversity. A Saliency-balancing Fusion mechanism is further proposed to enrich the informativeness by engaging the gradient information, guiding augmentation with proper orientation and magnitude. As an important contribution, we prove theoretically that our proposed augmentation can lead to an upper bound of the generalization risk on the unseen target domain, thus confirming our hypothesis. Combining the two strategies, our Saliency-balancing Location-scale Augmentation (SLAug) exceeds the state-of-the-art works by a large margin in two challenging SDG tasks. Code is available at https://github.com/Kaiseem/SLAug.

Downloads

Published

2023-06-26

How to Cite

Su, Z., Yao, K., Yang, X., Huang, K., Wang, Q., & Sun, J. (2023). Rethinking Data Augmentation for Single-Source Domain Generalization in Medical Image Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 37(2), 2366-2374. https://doi.org/10.1609/aaai.v37i2.25332

Issue

Section

AAAI Technical Track on Computer Vision II