Artificial Intelligence › Core AI ›

Responsible AI

1991 directly classified papers

Papers per year

Papers

Knowing When to Abstain: Medical LLMs Under Clinical Uncertainty EACL 2026

Detecting Subtle Biases: An Ethical Lens on Underexplored Areas in AI Language Models Biases EACL 2026

Jailbreaks as Inference-Time Alignment: A Framework for Understanding Safety Failures in LLMs EACL 2026

CAIRE: Cultural Attribution of Images with Retrieval EACL 2026

When Words Wear Masks: Detecting Malicious Intents and Hostile Impacts of Online Hate Speech EACL 2026

Integrity Shield A System for Ethical AI Use & Authorship Transparency in Assessments EACL 2026

Rethinking the Evaluation of Alignment Methods: Insights into Diversity, Generalisation, and Safety EACL 2026

Analysing LLM Persona Generation and Fairness Interpretation in Polarised Geopolitical Contexts EACL 2026

Beyond Bias Scores: Unmasking Vacuous Neutrality in Small Language Models EACL 2026

VortexPIA: Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy EACL 2026

CodeGuard: Improving LLM Guardrails in CS Education EACL 2026

Do Large Language Models Reflect Demographic Pluralism in Safety? EACL 2026

Investigating Gender Stereotypes in Large Language Models via Social Determinants of Health EACL 2026

Bias in the East, Bias in the West: A Bilingual Analysis of LLM Political Bias on U.S.- and China-Related Issues EACL 2026

Seeing All Sides: Multi-Perspective In-Context Learning for Subjective NLP EACL 2026

What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance EACL 2026

From Numbers to Narratives: Efficient Language Model-Based Detection for Safety-Critical Minority Classes EACL 2026

When Do Language Models Endorse Limitations on Human Rights Principles? EACL 2026

GlobLingDiv: A global dataset linking linguistic diversity and digital support to reveal landscapes with under-resourced languages for NLP EACL 2026

Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations EACL 2026

MPD-SGR: Robust Spiking Neural Networks with Membrane Potential Distribution-Driven Surrogate Gradient Regularization AAAI 2026

An Information Theoretic Evaluation Metric for Strong Unlearning AAAI 2026

On the Misalignment Between Data Learnability and Forgettability in Machine Unlearning AAAI 2026

Robust Learning from Noisily Labeled Long-Tailed Data via Fairness Regularizer AAAI 2026

Hallucination as a Computational Boundary: A Hierarchy of Inevitability and the Oracle Escape AAAI 2026