CrowdAgent: Multi-Agent Managed Multi-Source Annotation System

Maosheng Qin; Renyu Zhu; Mingxuan Xia; Chenchenkai; Zhen Zhu; Minmin Lin; Junbo Zhao; Lu Xu; Changjie Fan; Runze Wu; Haobo Wang

2025 EMNLP EMNLP 2025

CrowdAgent: Multi-Agent Managed Multi-Source Annotation System

Abstract

AbstractHigh-quality annotated data is a cornerstone of modern Natural Language Processing (NLP). While recent methods begin to leverage diverse annotation sources—including Large Language Models (LLMs), Small Language Models (SLMs), and human experts—they often focus narrowly on the labeling step itself. A critical gap remains in the holistic process control required to manage these sources dynamically, addressing complex scheduling and quality-cost trade-offs in a unified manner. Inspired by real-world crowdsourcing companies, we introduce CrowdAgent, a multi-agent system that provides end-to-end process control by integrating task assignment, data annotation, and quality/cost management. It implements a novel methodology that rationally assigns tasks, enabling LLMs, SLMs, and human experts to advance synergistically in a collaborative annotation workflow. We demonstrate the effectiveness of CrowdAgent through extensive experiments on six diverse multimodal classification tasks. The source code and video demo are available at https://github.com/QMMMS/CrowdAgent.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Maosheng Qin , Renyu Zhu , Mingxuan Xia , Chenchenkai , Zhen Zhu , Minmin Lin , Junbo Zhao , Lu Xu , Changjie Fan , Runze Wu , Haobo Wang

Topics

Artificial Intelligence > Core AI > Multi-Agent Systems Machine Learning > Application Areas > Data Augmentation Natural Language Processing > Applications > Text Classification Natural Language Processing > Resources & Methods Machine Learning > Learning Types > Multi-Agent Systems Deep Learning > Learning Types > Multi-Modal Learning

Keywords

text classification data annotation language model multimodal classification annotation quality small language model large language model multi-agent system annotation workflow

Download PDF

Related papers

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework 2025

VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing 2025

Model-based Large Language Model Customization as Service 2025

ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration 2025

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design 2025