Skip to main navigation Skip to search Skip to main content

Towards a framework for Bias Evaluation of AI Agents in Agentic Workflows

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Multi-agent AI systems, with minimal human oversight, are increasingly deployed in high-stakes domains like hiring, healthcare, and criminal justice. However, existing bias evaluation methodologies focus on isolated Large Language Model responses, failing to address sequential agent interactions where biases can propagate through decision chains undetected. Current approaches suffer from three limitations: absence of multi-stage bias tracking, inability to distinguish systematic discrimination from model non-determinism, and risk of contamination when agents infer bias evaluation intent. This study presents a novel evaluation framework incorporating demographic swapping methodology, contamination prevention architecture, and statistical analysis for multi-agent workflows. The framework employs dual presentation systems separating agent-visible content from research metadata and control scenarios to isolate bias from random variation. Evaluation across multiple models revealed unexpected preferences favouring traditionally disadvantaged groups, while 85% of apparent variation was attributable to model non-determinism rather than demographic factors. This approach advances methodological frameworks for tracking bias propagation in autonomous AI systems.

Original languageEnglish
Title of host publicationHCAI-ep 2026 - Proceedings of the 2026 Conference on Human Centered Artificial Intelligence - Education and Practice
PublisherAssociation for Computing Machinery (ACM)
Pages46-52
Number of pages7
ISBN (Electronic)9798400721533
DOIs
Publication statusPublished - 16 Feb 2026
Event3rd International Conference on Human-Centred AI - Education and Practice, HCAI-ep 2026 - Kildare, Ireland
Duration: 21 Jan 202622 Jan 2026

Publication series

NameHCAI-ep 2026 - Proceedings of the 2026 Conference on Human Centered Artificial Intelligence - Education and Practice

Conference

Conference3rd International Conference on Human-Centred AI - Education and Practice, HCAI-ep 2026
Country/TerritoryIreland
CityKildare
Period21/01/2622/01/26

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being
  2. SDG 16 - Peace, Justice and Strong Institutions
    SDG 16 Peace, Justice and Strong Institutions

Keywords

  • Agentic Artificial Intelligence
  • Bias Detection
  • Ethical AI
  • Human-Centred AI
  • Non-determinism

Fingerprint

Dive into the research topics of 'Towards a framework for Bias Evaluation of AI Agents in Agentic Workflows'. Together they form a unique fingerprint.

Cite this