Artificial Intelligence › Core AI ›

Responsible AI

1991 directly classified papers

Papers per year

Papers

Can We Improve Model Robustness through Secondary Attribute Counterfactuals? EMNLP 2021

Guiding Principles for Participatory Design-inspired Natural Language Processing IJCNLP 2021

You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership NIPS 2021

Sociolectal Analysis of Pretrained Language Models EMNLP 2021

Annotating and Modeling Fine-grained Factuality in Summarization NAACL 2021

On the State of Social Media Data for Mental Health Research NAACL 2021

On Releasing Annotator-Level Labels and Information in Datasets EMNLP 2021

Sexism in the Judiciary: The Importance of Bias Definition in NLP and In Our Courts ACL 2021

Agree to Disagree: Analysis of Inter-Annotator Disagreements in Human Evaluation of Machine Translation Output EMNLP 2021

Unpacking the Interdependent Systems of Discrimination: Ableist Bias in NLP Systems through an Intersectional Lens EMNLP 2021

Detect and Perturb: Neutral Rewriting of Biased and Sensitive Text via Gradient-based Decoding EMNLP 2021

What Will it Take to Fix Benchmarking in Natural Language Understanding? NAACL 2021

Eliciting Bias in Question Answering Models through Ambiguity EMNLP 2021

Privacy Regularization: Joint Privacy-Utility Optimization in LanguageModels NAACL 2021

Preregistering NLP research NAACL 2021

Decision Making with Differential Privacy under a Fairness Lens IJCAI 2021

Learning with Selective Forgetting IJCAI 2021

“It seemed like an annoying woman”: On the Perception and Ethical Considerations of Affective Language in Text-Based Conversational Agents EMNLP 2021

Never guess what I heard... Rumor Detection in Finnish News: a Dataset and a Baseline NAACL 2021

Leveraging Community and Author Context to Explain the Performance and Bias of Text-Based Deception Detection Models NAACL 2021

Identifying Automatically Generated Headlines using Transformers NAACL 2021

Quantifying and Avoiding Unfair Qualification Labour in Crowdsourcing ACL 2021

StereoSet: Measuring stereotypical bias in pretrained language models ACL 2021

All That’s ‘Human’ Is Not Gold: Evaluating Human Evaluation of Generated Text ACL 2021

Gender Bias in Text: Origin, Taxonomy, and Implications ACL 2021