Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Responsible AI
1991 directly classified papers
Papers per year
2011: 1
2016: 1
2017: 7
2018: 10
2019: 22
2020: 51
2021: 91
2022: 145
2023: 207
2024: 526
2025: 760
2026: 170
Papers
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4
EMNLP 2023
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition
EMNLP 2023
Mirages. On Anthropomorphism in Dialogue Systems
EMNLP 2023
The Troubling Emergence of Hallucination in Large Language Models - An Extensive Definition, Quantification, and Prescriptive Remediations
EMNLP 2023
PEFTDebias : Capturing debiasing information using PEFTs
EMNLP 2023
SeqXGPT: Sentence-Level AI-Generated Text Detection
EMNLP 2023
Incorporating Worker Perspectives into MTurk Annotation Practices for NLP
EMNLP 2023
Establishing Trustworthiness: Rethinking Tasks and Model Evaluation
EMNLP 2023
ToViLaG: Your Visual-Language Generative Model is Also An Evildoer
EMNLP 2023
A State-Vector Framework for Dataset Effects
EMNLP 2023
The Intended Uses of Automated Fact-Checking Artefacts: Why, How and Who
EMNLP 2023
“Fifty Shades of Bias”: Normative Ratings of Gender Bias in GPT Generated English Text
EMNLP 2023
Improving Bias Mitigation through Bias Experts in Natural Language Understanding
EMNLP 2023
An Integrative Survey on Mental Health Conversational Agents to Bridge Computer Science and Medical Perspectives
EMNLP 2023
MaNtLE: Model-agnostic Natural Language Explainer
EMNLP 2023
Measuring bias in Instruction-Following models with P-AT
EMNLP 2023
Detoxifying Online Discourse: A Guided Response Generation Approach for Reducing Toxicity in User-Generated Text
ACL 2023
Risks and NLP Design: A Case Study on Procedural Document QA
ACL 2023
Ethical Considerations for Machine Translation of Indigenous Languages: Giving a Voice to the Speakers
ACL 2023
What Do NLP Researchers Believe? Results of the NLP Community Metasurvey
ACL 2023
Evaluating Verifiability in Generative Search Engines
EMNLP 2023
Countering Misinformation via Emotional Response Generation
EMNLP 2023
‘Person’ == Light-skinned, Western Man, and Sexualization of Women of Color: Stereotypes in Stable Diffusion
EMNLP 2023
The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment Analysis
EMNLP 2023
ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation
EMNLP 2023
<
1
…
65
66
67
…
80
>