Papers
TRACE: Trajectory-based Activation Change Estimation for Task-specific Data Selection
Ye He, Shangzhan Li, Yuxin Zhou et al.
OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification
Shangyu Li, Juyong Jiang, Tiancheng Zhao et al.
Learning from Guidelines: Structured Prompt Optimization for Expert Annotation Tasks
Wenliang Zhong, Haiqing Li, Thao M. Dang et al.
SharedRep-RLHF: A Shared Representation Approach to RLHF with Diverse Preferences
Arpan Mukherjee, Marcello Bullo, Deniz Gündüz
Overview of BLP-2025 Task 2: Code Generation in Bangla
Nishat Raihan, Mohammad Anas Jawad, Md Mezbaur Rahman et al.
Overview of BLP-2025 Task 1: Bangla Hate Speech Identification
Md Arid Hasan, Firoj Alam, Md Fahad Hossain et al.
Task-Aware Evaluation and Error-Overlap Analysis for Large Language Models
Pranava Madhyastha
Findings of the First Patent Claims Translation Task at WAT2025
Toshiaki Nakazawa, Takashi Tsunakawa, Isao Goto et al.
Findings of WAT2025 English-to-Indic Multimodal Translation Task
Shantipriya Parida, Ondřej Bojar