Home > Published Issues > 2025 > Volume 16, No. 1, 2025 >
JAIT 2025 Vol.16(1): 37-48
doi: 10.12720/jait.16.1.37-48

An Improved Pix2pix Generative Adversarial Network Model to Enhance Thyroid Nodule Segmentation

Huda F. AL-Shahad 1,2, Razali Yaakob 1,*, Nurfadhlina Mohd Sharef 1, Hazlina Hamdan 1, and Hasyma Abu Hassan 3
1. Department of Computer Science, Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, Selangor, Malaysia
2. College of Science, University of Kerbala, Karbala, Iraq
3. Department of Radiology, Faculty of Medicine and Health Sciences, Universiti Putra Malaysia, Selangor, Malaysia
Email: gs59967@student.upm.edu.my (H.F.A.); razaliy@upm.edu.my (R.Y.); nurfadhlina@upm.edu.my (N.M.S.); hazlina@upm.edu.my (H.H.); hasyma@upm.edu.my (H.A.H.)
*Corresponding author

Manuscript received April 16, 2024; revised June 12, 2024; accepted September 10, 2024; published January 9, 2025.

Abstract—Thyroid nodules are a type of lesion, which doctors often need advanced diagnostic tools to detect and conduct follow-up diagnoses. Supervised deep learning techniques, particularly Generative Adversarial Networks (GANs), have been used to extract essential features, detect nodules and generate thyroid masks. However, these approaches suffer significant challenges in obtaining training data due to the high cost of identifying the cancer area and mode collapse during training. Therefore, this study proposed an improvement to one GAN model, namely, the pixel-to-pixel (pix2pix) model, for thyroid nodule segmentation, where the generator was incorporated with a supervised loss function to address instabilities during GAN training. The model used a generator with an encode-decoder structure inspired by U-Net architecture to produce the mask. The discriminator of the model consists of a multilayered Convolutional Neural Network (CNN) to compare the real and generated masks. In addition, three loss functions, namely, binary cross-entropy loss, soft dice loss and Jaccard loss, combined with loss GAN were used to stabilise the GAN model. Based on the results, the proposed model achieved 97% detection accuracy of the cancer area from the ultrasound thyroid nodule images and segmented it using the stabilised model with a generator loss function value of 0.5. In short, this study showed that the improved pix2pix model produced greater flexibility in nodule segmentation accuracy compared with semisupervised segmentation models.
 
Keywords—thyroid nodules segmentation, ultrasound image, deep learning, generative adversarial networks, pix2pix, loss function

Cite: Huda F. AL-Shahad, Razali Yaakob, Nurfadhlina Mohd Sharef, Hazlina Hamdan, and Hasyma Abu Hassan, "An Improved Pix2pix Generative Adversarial Network Model to Enhance Thyroid Nodule Segmentation," Journal of Advances in Information Technology, Vol. 16, No. 1, pp. 37-48, 2025. doi: 10.12720/jait.16.1.37-48

Copyright © 2025 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).