Research Explorer
Responsible AI
1991 directly classified papers
Papers per year
2011: 1
2016: 1
2017: 7
2018: 10
2019: 22
2020: 51
2021: 91
2022: 145
2023: 207
2024: 526
2025: 760
2026: 170
Papers
Knowing When to Abstain: Medical LLMs Under Clinical Uncertainty
EACL 2026
Detecting Subtle Biases: An Ethical Lens on Underexplored Areas in AI Language Models Biases
EACL 2026
Jailbreaks as Inference-Time Alignment: A Framework for Understanding Safety Failures in LLMs
EACL 2026
CAIRE: Cultural Attribution of Images with Retrieval
EACL 2026
When Words Wear Masks: Detecting Malicious Intents and Hostile Impacts of Online Hate Speech
EACL 2026
Integrity Shield: A System for Ethical AI Use & Authorship Transparency in Assessments
EACL 2026
Rethinking the Evaluation of Alignment Methods: Insights into Diversity, Generalisation, and Safety
EACL 2026
Analysing LLM Persona Generation and Fairness Interpretation in Polarised Geopolitical Contexts
EACL 2026
Beyond Bias Scores: Unmasking Vacuous Neutrality in Small Language Models
EACL 2026
VortexPIA: Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy
EACL 2026
CodeGuard: Improving LLM Guardrails in CS Education
EACL 2026
Do Large Language Models Reflect Demographic Pluralism in Safety?
EACL 2026
Investigating Gender Stereotypes in Large Language Models via Social Determinants of Health
EACL 2026
Bias in the East, Bias in the West: A Bilingual Analysis of LLM Political Bias on U.S.- and China-Related Issues
EACL 2026
Seeing All Sides: Multi-Perspective In-Context Learning for Subjective NLP
EACL 2026
What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance
EACL 2026
From Numbers to Narratives: Efficient Language Model-Based Detection for Safety-Critical Minority Classes
EACL 2026
When Do Language Models Endorse Limitations on Human Rights Principles?
EACL 2026
GlobLingDiv: A global dataset linking linguistic diversity and digital support to reveal landscapes with under-resourced languages for NLP
EACL 2026
Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations
EACL 2026
MPD-SGR: Robust Spiking Neural Networks with Membrane Potential Distribution-Driven Surrogate Gradient Regularization
AAAI 2026
An Information Theoretic Evaluation Metric for Strong Unlearning
AAAI 2026
On the Misalignment Between Data Learnability and Forgettability in Machine Unlearning
AAAI 2026
Robust Learning from Noisily Labeled Long-Tailed Data via Fairness Regularizer
AAAI 2026
Hallucination as a Computational Boundary: A Hierarchy of Inevitability and the Oracle Escape
AAAI 2026