Skip to main navigation Skip to search Skip to main content

An Ensemble Deep Learning Approach for Breast Cancer Classification Using Vision Transformers and CNNs

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Breast cancer is still one of the most prevalent causes of cancer death globally, indicating that there is a critical need for early and precise diagnostic methods. While histopathological examination remains best method for breast cancer diagnosis, its accuracy and reliability depend significantly on the experience and subjective interpretations of pathologists. This research examined three state-of-the-art deep neural network architectures - Inception-v4, EfficientNet-B0, and Vision Trans former (ViT) - for the binary classification of breast histology images into benign and malignant categories. This research utilized the BreakHis dataset containing 7909 histopathological images as the training dataset, in addition, an external validation BACH dataset was used consisting of 400 images. Diagnostic performance on the external test set was further increased using a novel stacking ensemble method that combined predictions from individual models via Random Forest as a meta-classifier. The stacking ensemble method significantly outperformed individual models, achieving an accuracy of 96.0% and ROC-AUC of 0.99 on the external BACH dataset. Robustness analysis was also conducted to evaluate performance against common imaging artifacts, and visual interpretability was provided through Grad-CAM analyses, enhancing the clinical relevance of the models. The study also checked how the models performed when there were common image issues or distortions. Grad-CAM visualizations were used to see which areas in the image the models relied on for making predictions. In this work, CNNs were used to capture fine details from the images, while the Vision Transformer helped to recognize the overall structure. Bringing both models together improved the accuracy and made the results clearer for medical use.

Original languageEnglish
Title of host publicationInternational Conference on Electrical, Computer, and Energy Technologies, ICECET 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331535599
DOIs
Publication statusPublished - 2025
Externally publishedYes
EventIEEE International Conference on Electrical, Computer and Energy Technologies, ICECET 2025 - Paris, France
Duration: 3 Jul 20256 Jul 2025

Publication series

NameInternational Conference on Electrical, Computer, and Energy Technologies, ICECET 2025

Conference

ConferenceIEEE International Conference on Electrical, Computer and Energy Technologies, ICECET 2025
Country/TerritoryFrance
CityParis
Period3/07/256/07/25

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being
  2. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • BACH
  • BreakHis
  • breast cancer
  • CNN
  • histopathology
  • image classification
  • stacking ensemble
  • Vision Transformer

Fingerprint

Dive into the research topics of 'An Ensemble Deep Learning Approach for Breast Cancer Classification Using Vision Transformers and CNNs'. Together they form a unique fingerprint.

Cite this