Artificial Intelligence › Core AI ›

Responsible AI

1991 directly classified papers

Papers per year

Papers

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment NIPS 2022

Fairness-Aware Adversarial Perturbation Towards Bias Mitigation for Deployed Deep Models CVPR 2022

Red Teaming Language Models with Language Models EMNLP 2022

Contextualizing Language Models for Norms Diverging from Social Majority EMNLP 2022

Language Model Detoxification in Dialogue with Contextualized Stance Control EMNLP 2022

Analyzing Gender Translation Errors to Identify Information Flows between the Encoder and Decoder of a NMT System EMNLP 2022

Challenges and Opportunities in Information Manipulation Detection: An Examination of Wartime Russian Media EMNLP 2022

Geographic Citation Gaps in NLP Research EMNLP 2022

Towards Procedural Fairness: Uncovering Biases in How a Toxic Language Classifier Uses Sentiment Information EMNLP 2022

Large-Scale Differentially Private BERT EMNLP 2022

Controlling Bias Exposure for Fair Interpretable Predictions EMNLP 2022

Improving the Factual Correctness of Radiology Report Generation with Semantic Rewards EMNLP 2022

What Do Compressed Multilingual Machine Translation Models Forget? EMNLP 2022

Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language Models EMNLP 2022

Constructing Highly Inductive Contexts for Dialogue Safety through Controllable Reverse Generation EMNLP 2022

Mitigating Covertly Unsafe Text within Natural Language Systems EMNLP 2022

Are Large Pre-Trained Language Models Leaking Your Personal Information? EMNLP 2022

Handling and Presenting Harmful Text in NLP Research EMNLP 2022

Extracted BERT Model Leaks More Information than You Think! EMNLP 2022

How Large Language Models are Transforming Machine-Paraphrase Plagiarism EMNLP 2022

A Robust Bias Mitigation Procedure Based on the Stereotype Content Model EMNLP 2022

Securely Capturing People’s Interactions with Voice Assistants at Home: A Bespoke Tool for Ethical Data Collection EMNLP 2022

Should I disclose my dataset? Caveats between reproducibility and individual data rights EMNLP 2022

Analyzing the Limits of Self-Supervision in Handling Bias in Language EMNLP 2022

Balancing out Bias: Achieving Fairness Through Balanced Training EMNLP 2022