Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Responsible AI
1991 directly classified papers
Papers per year
2011: 1
2016: 1
2017: 7
2018: 10
2019: 22
2020: 51
2021: 91
2022: 145
2023: 207
2024: 526
2025: 760
2026: 170
Papers
Towards a Deep Multi-layered Dialectal Language Analysis: A Case Study of African-American English
NAACL 2022
Uncertainty and Inclusivity in Gender Bias Annotation: An Annotation Taxonomy and Annotated Datasets of British English Text
NAACL 2022
Efficient Counterfactual Debiasing for Visual Question Answering
WACV 2022
An Investigation of Critical Issues in Bias Mitigation Techniques
WACV 2022
An Information-Theoretic Approach and Dataset for Probing Gender Stereotypes in Multilingual Masked Language Models
NAACL 2022
Towards Automatic Generation of Messages Countering Online Hate Speech and Microaggressions
NAACL 2022
On Facility Location Problem in the Local Differential Privacy Model
AISTATS 2022
Flexible Accuracy for Differential Privacy
AISTATS 2022
UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA
IJCNLP 2022
Grammatical Error Correction Systems for Automated Assessment: Are They Susceptible to Universal Adversarial Attacks?
IJCNLP 2022
An Embarrassingly Simple Approach for Intellectual Property Rights Protection on Recurrent Neural Networks
IJCNLP 2022
StereoKG: Data-Driven Knowledge Graph Construction For Cultural Knowledge and Stereotypes
NAACL 2022
Lost in Distillation: A Case Study in Toxicity Modeling
NAACL 2022
Flexible text generation for counterfactual fairness probing
NAACL 2022
A Study on the Distribution of Social Biases in Self-Supervised Learning Visual Models
CVPR 2022
Training Text-to-Text Transformers with Privacy Guarantees
ACL 2022
Improving Factual Consistency in Summarization with Compression-Based Post-Editing
EMNLP 2022
Faithful to the Document or to the World? Mitigating Hallucinations via Entity-Linked Knowledge in Abstractive Summarization
EMNLP 2022
ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection
ACL 2022
Open Problem: Better Differentially Private Learning Algorithms with Margin Guarantees
COLT 2022
Differentially private multi-party data release for linear regression
UAI 2022
Director: Generator-Classifiers For Supervised Language Modeling
AACL 2022
Does Representational Fairness Imply Empirical Fairness?
AACL 2022
UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA
AACL 2022
HERB: Measuring Hierarchical Regional Bias in Pre-trained Language Models
AACL 2022
<
1
…
67
68
69
…
80
>