Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Responsible AI
1991 directly classified papers
Papers per year
2011: 1
2016: 1
2017: 7
2018: 10
2019: 22
2020: 51
2021: 91
2022: 145
2023: 207
2024: 526
2025: 760
2026: 170
Papers
Can We Improve Model Robustness through Secondary Attribute Counterfactuals?
EMNLP 2021
Guiding Principles for Participatory Design-inspired Natural Language Processing
IJCNLP 2021
You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership
NIPS 2021
Sociolectal Analysis of Pretrained Language Models
EMNLP 2021
Annotating and Modeling Fine-grained Factuality in Summarization
NAACL 2021
On the State of Social Media Data for Mental Health Research
NAACL 2021
On Releasing Annotator-Level Labels and Information in Datasets
EMNLP 2021
Sexism in the Judiciary: The Importance of Bias Definition in NLP and In Our Courts
ACL 2021
Agree to Disagree: Analysis of Inter-Annotator Disagreements in Human Evaluation of Machine Translation Output
EMNLP 2021
Unpacking the Interdependent Systems of Discrimination: Ableist Bias in NLP Systems through an Intersectional Lens
EMNLP 2021
Detect and Perturb: Neutral Rewriting of Biased and Sensitive Text via Gradient-based Decoding
EMNLP 2021
What Will it Take to Fix Benchmarking in Natural Language Understanding?
NAACL 2021
Eliciting Bias in Question Answering Models through Ambiguity
EMNLP 2021
Privacy Regularization: Joint Privacy-Utility Optimization in LanguageModels
NAACL 2021
Preregistering NLP research
NAACL 2021
Decision Making with Differential Privacy under a Fairness Lens
IJCAI 2021
Learning with Selective Forgetting
IJCAI 2021
“It seemed like an annoying woman”: On the Perception and Ethical Considerations of Affective Language in Text-Based Conversational Agents
EMNLP 2021
Never guess what I heard... Rumor Detection in Finnish News: a Dataset and a Baseline
NAACL 2021
Leveraging Community and Author Context to Explain the Performance and Bias of Text-Based Deception Detection Models
NAACL 2021
Identifying Automatically Generated Headlines using Transformers
NAACL 2021
Quantifying and Avoiding Unfair Qualification Labour in Crowdsourcing
ACL 2021
StereoSet: Measuring stereotypical bias in pretrained language models
ACL 2021
All That’s ‘Human’ Is Not Gold: Evaluating Human Evaluation of Generated Text
ACL 2021
Gender Bias in Text: Origin, Taxonomy, and Implications
ACL 2021
<
1
…
74
75
76
…
80
>