|国家预印本平台
首页|Fair for a few: Improving Fairness in Doubly Imbalanced Datasets

Fair for a few: Improving Fairness in Doubly Imbalanced Datasets

Fair for a few: Improving Fairness in Doubly Imbalanced Datasets

来源:Arxiv_logoArxiv
英文摘要

Fairness has been identified as an important aspect of Machine Learning and Artificial Intelligence solutions for decision making. Recent literature offers a variety of approaches for debiasing, however many of them fall short when the data collection is imbalanced. In this paper, we focus on a particular case, fairness in doubly imbalanced datasets, such that the data collection is imbalanced both for the label and the groups in the sensitive attribute. Firstly, we present an exploratory analysis to illustrate limitations in debiasing on a doubly imbalanced dataset. Then, a multi-criteria based solution is proposed for finding the most suitable sampling and distribution for label and sensitive attribute, in terms of fairness and classification accuracy

Ata Yalcin、Asli Umay Ozturk、Yigit Sever、Viktoria Pauw、Stephan Hachinger、Ismail Hakki Toroslu、Pinar Karagoz

计算技术、计算机技术

Ata Yalcin,Asli Umay Ozturk,Yigit Sever,Viktoria Pauw,Stephan Hachinger,Ismail Hakki Toroslu,Pinar Karagoz.Fair for a few: Improving Fairness in Doubly Imbalanced Datasets[EB/OL].(2025-06-17)[2025-07-16].https://arxiv.org/abs/2506.14306.点此复制

评论