TY - GEN
T1 - The Battle of the Titans
T2 - 25th IEEE International Conference on Data Mining Workshops, ICDMW 2025
AU - Debnath, Tanmoy
AU - Narbutt, Miroslaw
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Accurate hippocampal segmentation is critical for neuroimaging research, clinical diagnosis, and surgical planning. This paper presents a rigorous comparison of three leading deep learning frameworks - 3D U-Net, nnU-Net V2, and MONAI U-Net - on the Medical Segmentation Decathlon (MSD) Task 4 hippocampus dataset. Using five-fold cross-validation and fifteen complementary metrics spanning overlap, boundary accuracy, classification, and statistical reliability, we evaluate each model's strengths and limitations. Results show that nnU-Net delivers the highest volumetric accuracy (Dice: 0.891) and strongest statistical consistency (Pearson: 0.989), 3D U-Net achieves superior boundary precision (ASSD: 0.159 mm), and MONAI offers high sensitivity (Recall: 0.991) with strong adaptability for research workflows. We provide application-driven recommendations: nnU-Net is optimal for longitudinal and multi-center studies, 3D U-Net for boundary-sensitive surgical tasks, and MONAI for rapid prototyping. This work establishes a comprehensive benchmark for hippocampal segmentation and offers practical guidance for framework selection in clinical and research settings.
AB - Accurate hippocampal segmentation is critical for neuroimaging research, clinical diagnosis, and surgical planning. This paper presents a rigorous comparison of three leading deep learning frameworks - 3D U-Net, nnU-Net V2, and MONAI U-Net - on the Medical Segmentation Decathlon (MSD) Task 4 hippocampus dataset. Using five-fold cross-validation and fifteen complementary metrics spanning overlap, boundary accuracy, classification, and statistical reliability, we evaluate each model's strengths and limitations. Results show that nnU-Net delivers the highest volumetric accuracy (Dice: 0.891) and strongest statistical consistency (Pearson: 0.989), 3D U-Net achieves superior boundary precision (ASSD: 0.159 mm), and MONAI offers high sensitivity (Recall: 0.991) with strong adaptability for research workflows. We provide application-driven recommendations: nnU-Net is optimal for longitudinal and multi-center studies, 3D U-Net for boundary-sensitive surgical tasks, and MONAI for rapid prototyping. This work establishes a comprehensive benchmark for hippocampal segmentation and offers practical guidance for framework selection in clinical and research settings.
KW - Artificial Intelligence
KW - Brain
KW - Medical Imaging
UR - https://www.scopus.com/pages/publications/105035384305
U2 - 10.1109/ICDMW69685.2025.00084
DO - 10.1109/ICDMW69685.2025.00084
M3 - Conference contribution
AN - SCOPUS:105035384305
T3 - IEEE International Conference on Data Mining Workshops, ICDMW
SP - 691
EP - 700
BT - Proceedings - 25th IEEE International Conference on Data Mining Workshops, ICDMW 2025
PB - IEEE Computer Society
Y2 - 12 November 2025 through 15 November 2025
ER -