Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Discovering the Hidden Role of Gini Index In Prompt-based Classification
0
Zitationen
1
Autoren
2026
Jahr
Abstract
In classification tasks, the long-tailed minority classes usually offer the predictions that are most important. Yet these classes consistently exhibit low accuracies, whereas a few high-performing classes dominate the game. We pursue a foundational understanding of the hidden role of Gini Index as a tool for detecting and optimizing (debiasing) disparities in class accuracy, focusing on the case of prompt-based classification. We introduce the intuitions, benchmark Gini scores in real-world LLMs and vision models, and thoroughly discuss the insights of Gini not only as a measure of relative accuracy dominance but also as a direct optimization metric. Through rigorous case analyses, we first show that weak to strong relative accuracy imbalance exists in both prompt-based, text and image classification results and regardless of whether the classification is high-dimensional or low-dimensional. Then, we harness the Gini metric to propose a post-hoc model-agnostic bias mitigation method. Experimental results across few-shot news, biomedical, and zero-shot image classification show that our method significantly reduces both relative and absolute accuracy imbalances, minimizing top class relative dominance while elevating weakest classes.
Ähnliche Arbeiten
SMOTE: Synthetic Minority Over-sampling Technique
2002 · 30.223 Zit.
An introduction to ROC analysis
2005 · 20.796 Zit.
Mining association rules between sets of items in large databases
1993 · 14.765 Zit.
pROC: an open-source package for R and S+ to analyze and compare ROC curves
2011 · 13.669 Zit.
Fast algorithms for mining association rules
1998 · 10.753 Zit.