Adaptive Control of Data Augmentation for Thyroid Ultrasound Image Segmentation

D. V. Kholod, E. V. Bobrova, K. S. Zaitsev, A. A. Trukhin, E. A. Troshina

Abstract


The aim of this study was to investigate the influence of the proportion of real and augmented data on the performance of thyroid nodule segmentation in ultrasound images. For this purpose, a parameter R was introduced, defined as the ratio of the number of augmented images to the number of original images, and its effect on segmentation quality was analyzed for different encoder architectures. The study considered the formation of training datasets with varying shares of augmented data, followed by model training and evaluation of segmentation results on the DDTI dataset. Segmentation performance was assessed using standard evaluation metrics, namely the Dice coefficient and Intersection over Union (IoU). The results show that the dependence of segmentation quality on the parameter R is non-monotonic: there exists an optimal value 𝑅maxat which the best performance is achieved, as well as a region 𝑅minwhere the segmentation quality deteriorates. It is demonstrated that the ViT-Base architecture is the most sensitive to excessive augmentation, whereas ConvNeXt-Tiny and Swin Transformer-Tiny exhibit greater robustness to changes in the proportion of synthetic data.

Full Text:

PDF (Russian)

References


Ronneberger O., Fischer P., Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation // MICCAI. 2015. P. 234–241.

Shorten C., Khoshgoftaar T. A survey on Image Data Augmentation for Deep Learning // Journal of Big Data. 2019. Vol. 6. Art. 60.

Kim M., Bae H.-J. Data Augmentation Techniques for Deep Learning-Based Medical Image Analyses // J Korean Soc Radiol. 2020. Vol. 81(6). P. 1290–1304.

Garcı́a S., et al. Data augmentation for medical imaging: A systematic literature review. 2022.

Dosovitskiy A., Beyer L., Kolesnikov A., et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. arXiv:2010.11929. 2020.

Touvron H., Cord M., Douze M., Massa F., Sablayrolles A., Jégou H. Training data-efficient image transformers & distillation through attention // ICML. 2021. P. 10347–10357.

Liu Z., Lin Y., Cao Y., et al. Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows // ICCV. 2021. P. 10012–10022.

Liu Z., Mao H., Wu C.-Y., Feichtenhofer C., Darrell T., Xie S. A ConvNet for the 2020s. arXiv:2201.03545. 2022.

Lee L. H., Gao Y., Noble J. A. Principled Ultrasound Data Augmentation for Classification of Standard Planes. arXiv:2103.07895. 2021.

Pedraza L., Vargas C., Narváez F., Durán O., Muñoz E., Romero E. An Open Access Thyroid Ultrasound Image Database // Proc. SPIE. 2015. Vol. 9287. Art. 92870W. DOI: 10.1117/12.2073532.

Kim K., Lee H. S. Probabilistic Anchor Assignment with IoU Prediction for Object Detection // European Conference on Computer Vision. Cham: Springer International Publishing, 2020. P. 355–371.

Shamir R. R., et al. Continuous Dice Coefficient: a Method for Evaluating Probabilistic Segmentations. arXiv:1906.11031. 2019.

M. Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review // J. Med. Syst. 2024. Vol. 48. Article 84.

Advantages of transformer and its application for medical image segmentation: a survey // BioMed Eng. Online. 2024.

Chen J., et al. TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation. arXiv. 2021.

Medical image segmentation by combining feature enhancement Swin Transformer and UperNet // Sci. Rep. 2025


Refbacks

  • There are currently no refbacks.


Abava  Кибербезопасность Monetec 2026 СНЭ

ISSN: 2307-8162