Abstract
Multi-agent AI systems, with minimal human oversight, are increasingly deployed in high-stakes domains like hiring, healthcare, and criminal justice. However, existing bias evaluation methodologies focus on isolated Large Language Model responses, failing to address sequential agent interactions where biases can propagate through decision chains undetected. Current approaches suffer from three limitations: absence of multi-stage bias tracking, inability to distinguish systematic discrimination from model non-determinism, and risk of contamination when agents infer bias evaluation intent. This study presents a novel evaluation framework incorporating demographic swapping methodology, contamination prevention architecture, and statistical analysis for multi-agent workflows. The framework employs dual presentation systems separating agent-visible content from research metadata and control scenarios to isolate bias from random variation. Evaluation across multiple models revealed unexpected preferences favouring traditionally disadvantaged groups, while 85% of apparent variation was attributable to model non-determinism rather than demographic factors. This approach advances methodological frameworks for tracking bias propagation in autonomous AI systems.
| Original language | English |
|---|---|
| Title of host publication | HCAI-ep 2026 - Proceedings of the 2026 Conference on Human Centered Artificial Intelligence - Education and Practice |
| Publisher | Association for Computing Machinery (ACM) |
| Pages | 46-52 |
| Number of pages | 7 |
| ISBN (Electronic) | 9798400721533 |
| DOIs | |
| Publication status | Published - 16 Feb 2026 |
| Event | 3rd International Conference on Human-Centred AI - Education and Practice, HCAI-ep 2026 - Kildare, Ireland Duration: 21 Jan 2026 → 22 Jan 2026 |
Publication series
| Name | HCAI-ep 2026 - Proceedings of the 2026 Conference on Human Centered Artificial Intelligence - Education and Practice |
|---|
Conference
| Conference | 3rd International Conference on Human-Centred AI - Education and Practice, HCAI-ep 2026 |
|---|---|
| Country/Territory | Ireland |
| City | Kildare |
| Period | 21/01/26 → 22/01/26 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
-
SDG 16 Peace, Justice and Strong Institutions
Keywords
- Agentic Artificial Intelligence
- Bias Detection
- Ethical AI
- Human-Centred AI
- Non-determinism
Fingerprint
Dive into the research topics of 'Towards a framework for Bias Evaluation of AI Agents in Agentic Workflows'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver