Responsible AI

1991 directly classified papers

Papers

Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4 EMNLP 2023

Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition EMNLP 2023

Mirages. On Anthropomorphism in Dialogue Systems EMNLP 2023

The Troubling Emergence of Hallucination in Large Language Models - An Extensive Definition, Quantification, and Prescriptive Remediations EMNLP 2023

PEFTDebias: Capturing debiasing information using PEFTs EMNLP 2023

SeqXGPT: Sentence-Level AI-Generated Text Detection EMNLP 2023

Incorporating Worker Perspectives into MTurk Annotation Practices for NLP EMNLP 2023

Establishing Trustworthiness: Rethinking Tasks and Model Evaluation EMNLP 2023

ToViLaG: Your Visual-Language Generative Model is Also An Evildoer EMNLP 2023

A State-Vector Framework for Dataset Effects EMNLP 2023

The Intended Uses of Automated Fact-Checking Artefacts: Why, How and Who EMNLP 2023

“Fifty Shades of Bias”: Normative Ratings of Gender Bias in GPT Generated English Text EMNLP 2023

Improving Bias Mitigation through Bias Experts in Natural Language Understanding EMNLP 2023

An Integrative Survey on Mental Health Conversational Agents to Bridge Computer Science and Medical Perspectives EMNLP 2023

MaNtLE: Model-agnostic Natural Language Explainer EMNLP 2023

Measuring bias in Instruction-Following models with P-AT EMNLP 2023

Detoxifying Online Discourse: A Guided Response Generation Approach for Reducing Toxicity in User-Generated Text ACL 2023

Risks and NLP Design: A Case Study on Procedural Document QA ACL 2023

Ethical Considerations for Machine Translation of Indigenous Languages: Giving a Voice to the Speakers ACL 2023

What Do NLP Researchers Believe? Results of the NLP Community Metasurvey ACL 2023

Evaluating Verifiability in Generative Search Engines EMNLP 2023

Countering Misinformation via Emotional Response Generation EMNLP 2023

‘Person’ == Light-skinned, Western Man, and Sexualization of Women of Color: Stereotypes in Stable Diffusion EMNLP 2023

The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment Analysis EMNLP 2023

ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation EMNLP 2023