Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Fairness
1139 directly classified papers
Papers per year
2013: 1
2017: 7
2018: 15
2019: 33
2020: 64
2021: 96
2022: 166
2023: 167
2024: 221
2025: 364
2026: 5
Papers
Social Bias in Large Language Models For Bangla: An Empirical Study on Gender and Religious Bias
COLING 2025
Towards Inclusive Arabic LLMs: A Culturally Aligned Benchmark in Arabic Large Language Model Evaluation
COLING 2025
Evaluating Dialect Robustness of Language Models via Conversation Understanding
COLING 2025
Navigating Dialectal Bias and Ethical Complexities in Levantine Arabic Hate Speech Detection
COLING 2025
Dehumanization of LGBTQ+ Groups in Sexual Interactions with ChatGPT
NAACL 2025
LLMs are Biased Teachers: Evaluating LLM Bias in Personalized Education
NAACL 2025
Guardrails, not Guidance: Understanding Responses to LGBTQ+ Language in Large Language Models
NAACL 2025
Do Prevalent Bias Metrics Capture Allocational Harms from LLMs?
NAACL 2025
HEARTS: A Holistic Framework for Explainable, Sustainable and Robust Text Stereotype Detection
IJCNLP 2025
Leveraging Large Language Models in Detecting Anti-LGBTQIA+ User-generated Texts
NAACL 2025
From Anger to Joy: How Nationality Personas Shape Emotion Attribution in Large Language Models
IJCNLP 2025
BiasEdit: Debiasing Stereotyped Language Models via Model Editing
NAACL 2025
Breaking Language Barriers or Reinforcing Bias? A Study of Gender and Racial Disparities in Multilingual Contrastive Vision Language Models
IJCNLP 2025
Gender Bias in Instruction-Guided Speech Synthesis Models
NAACL 2025
“Women do not have heart attacks!” Gender Biases in Automatically Generated Clinical Cases in French
NAACL 2025
Judging the Judges: A Systematic Study of Position Bias in LLM-as-a-Judge
IJCNLP 2025
Enhancing Training Data Quality through Influence Scores for Generalizable Classification: A Case Study on Sexism Detection
IJCNLP 2025
Rejected Dialects: Biases Against African American Language in Reward Models
NAACL 2025
Social Bias in Popular Question-Answering Benchmarks
IJCNLP 2025
Aligning to What? Limits to RLHF Based Alignment
NAACL 2025
FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language Models
NAACL 2025
VLind-Bench: Measuring Language Priors in Large Vision-Language Models
NAACL 2025
Music for All: Representational Bias and Cross-Cultural Adaptability of Music Generation Models
NAACL 2025
Richer Output for Richer Countries: Uncovering Geographical Disparities in Generated Stories and Travel Recommendations
NAACL 2025
Analysis of LLM as a grammatical feature tagger for African American English
NAACL 2025
<
1
2
3
4
5
…
46
>