Skip to main navigation Skip to search Skip to main content

From Clinic to Code: Using Clinician Insights to Develop a Framework for Fair and Representative Datasets in Women’s Health AI

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The development of Artificial Intelligence (AI) in healthcare is largely dependent on the quality of medical datasets. However, these datasets often fail to accurately represent women (and those seen as women) due to historic, implicit and biological biases. This under-representation can lead to biased, inequitable and even harmful models. Building upon the findings of previously conducted qualitative semi-structured semantic interviews with clinicians on their perceptions of women’s health, this paper presents a framework for translating qualitative findings to dataset characteristics via operationalisation. This framework outlines the key characteristics and considerations a dataset should include or consider to more accurately represent women in these datasets. Some of these factors include: pregnancy status, gender of provider of the care, menstruation and menopause, and ethnicity. Rather than considering fairness after model development and employing de-biasing metrics, this approach places fairness at the initial selection stages, with the goal of embedding equity throughout the entire development pipeline. It is both a checklist for data selection for model developers and also a guideline for those who collect medical data. The framework is divided into Necessities, Data Comprehension, Gender-Specific Factors, Clinician Information, Patient-Specific Factors, and Additional considerations. This framework is a step towards creating more gender-conscious, equitable and fair medical AI systems.

Original languageEnglish
Title of host publicationArtificial Intelligence in Healthcare - 2nd International Conference, AIiH 2025, Proceedings
EditorsDaniele Cafolla, Timothy Rittman, Hao Ni
PublisherSpringer Science and Business Media Deutschland GmbH
Pages100-114
Number of pages15
ISBN (Print)9783032006554
DOIs
Publication statusPublished - 2026
Event2nd International Conference on Artificial Intelligence on Healthcare, AIiH 2025 - Cambridge, United Kingdom
Duration: 8 Sep 202510 Sep 2025

Publication series

NameLecture Notes in Computer Science
Volume16039 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference2nd International Conference on Artificial Intelligence on Healthcare, AIiH 2025
Country/TerritoryUnited Kingdom
CityCambridge
Period8/09/2510/09/25

Keywords

  • Equitable Healthcare
  • Ethical AI
  • Health Informatics

Fingerprint

Dive into the research topics of 'From Clinic to Code: Using Clinician Insights to Develop a Framework for Fair and Representative Datasets in Women’s Health AI'. Together they form a unique fingerprint.

Cite this