DiFair-LLM: Evaluating Fairness Disparities in LLMs Toward Demographic Groups

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Large Language Models (LLMs) are increasingly integrated into real-world applications, making equitable treatment of all demographic groups a critical concern. Existing fairness evaluations often rely on binary, template-based tests, which overlook subtle disparities in open-ended responses. We present DiFair-LLM, a model-agnostic framework for detecting and quantifying fairness disparities - any unequal treatment that benefits or disadvantages a demographic group. DiFair-LLM uses open-ended, group-specific and neutral prompts, measures semantic distances between groups' responses, applies non-parametric statistical tests, and ranks groups by deviation from a neutral baseline. Evaluations across eight state-of-the-art LLMs and multiple demographic attributes reveal minimal disparities for gender but significant differences for age, especially older adults, and ethnicity, with the largest gaps affecting certain non-Caucasian groups. By mapping nuanced patterns of differential treatment rather than flagging only overt bias, DiFair-LLM offers a practical, reproducible approach for auditing fairness and guiding more inclusive LLM deployments.

Original languageEnglish
Title of host publicationECAI 2025 - 28th European Conference on Artificial Intelligence, including 14th Conference on Prestigious Applications of Intelligent Systems, PAIS 2025 - Proceedings
EditorsInes Lynce, Nello Murano, Mauro Vallati, Serena Villata, Federico Chesani, Michela Milano, Andrea Omicini, Mehdi Dastani
PublisherIOS Press BV
Pages1027-1034
Number of pages8
ISBN (Electronic)9781643686318
DOIs
StatePublished - 21 Oct 2025
Event28th European Conference on Artificial Intelligence, ECAI 2025, including 14th Conference on Prestigious Applications of Intelligent Systems, PAIS 2025 - Bologna, Italy
Duration: 25 Oct 202530 Oct 2025

Publication series

NameFrontiers in Artificial Intelligence and Applications
Volume413
ISSN (Print)0922-6389
ISSN (Electronic)1879-8314

Conference

Conference28th European Conference on Artificial Intelligence, ECAI 2025, including 14th Conference on Prestigious Applications of Intelligent Systems, PAIS 2025
Country/TerritoryItaly
CityBologna
Period25/10/2530/10/25

ASJC Scopus subject areas

  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'DiFair-LLM: Evaluating Fairness Disparities in LLMs Toward Demographic Groups'. Together they form a unique fingerprint.

Cite this