TY - GEN
T1 - MedCDA
T2 - 8th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2025
AU - Yao, Kexin
AU - Zhao, Jing
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2026.
PY - 2026
Y1 - 2026
N2 - Recent advances in medical imaging have intensified the demand for robust analysis methods to support diagnosis, treatment planning, and monitoring. However, current approaches still face significant challenges in two critical limitations: i) scarcity of annotated data, ii) poor model interpretability. To address these challenges, we propose MedCDA, a zero-shot segmentation-driven counterfactual data augmentation framework. Our approach introduces: (1) a boundary-aware gradient attention mechanism that sharpens focus on target boundaries via automatically simulated click points; (2) systematic counterfactual generation that removes lesions while preserving healthy semantics, enhancing diversity and reducing spurious correlations; and (3) multimodal large model integration for vision-language alignment, paired with a weighted loss fine-tuning strategy to improve classification robustness. Quantitative results demonstrate consistent improvements over state-of-the-art methods on three public benchmarks: dermoscopic image dataset HAM10000, breast ultrasound image dataset BUSI, and mammography dataset CBIS-DDSM. Ablation studies validate the effectiveness of each module. MedCDA establishes a novel “segmentation-augmentation-reasoning” paradigm, offering an extensible framework for other healthcare decision-making scenarios and providing new solutions for medical image analysis.
AB - Recent advances in medical imaging have intensified the demand for robust analysis methods to support diagnosis, treatment planning, and monitoring. However, current approaches still face significant challenges in two critical limitations: i) scarcity of annotated data, ii) poor model interpretability. To address these challenges, we propose MedCDA, a zero-shot segmentation-driven counterfactual data augmentation framework. Our approach introduces: (1) a boundary-aware gradient attention mechanism that sharpens focus on target boundaries via automatically simulated click points; (2) systematic counterfactual generation that removes lesions while preserving healthy semantics, enhancing diversity and reducing spurious correlations; and (3) multimodal large model integration for vision-language alignment, paired with a weighted loss fine-tuning strategy to improve classification robustness. Quantitative results demonstrate consistent improvements over state-of-the-art methods on three public benchmarks: dermoscopic image dataset HAM10000, breast ultrasound image dataset BUSI, and mammography dataset CBIS-DDSM. Ablation studies validate the effectiveness of each module. MedCDA establishes a novel “segmentation-augmentation-reasoning” paradigm, offering an extensible framework for other healthcare decision-making scenarios and providing new solutions for medical image analysis.
KW - Counterfactual Data Augmentation
KW - Medical Image Analysis
KW - Zero-Shot Segmentation
UR - https://www.scopus.com/pages/publications/105036957450
U2 - 10.1007/978-981-95-5631-1_32
DO - 10.1007/978-981-95-5631-1_32
M3 - 会议稿件
AN - SCOPUS:105036957450
SN - 9789819556304
T3 - Lecture Notes in Computer Science
SP - 451
EP - 464
BT - Pattern Recognition and Computer Vision - 8th Chinese Conference, PRCV 2025, Proceedings
A2 - Kittler, Josef
A2 - Xiong, Hongkai
A2 - Lin, Weiyao
A2 - Yang, Jian
A2 - Chen, Xilin
A2 - Lu, Jiwen
A2 - Yu, Jingyi
A2 - Zheng, Weishi
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 15 October 2025 through 18 October 2025
ER -