Papers
17,973 papers found
1+1>2: A Synergistic Sparse and Low-Rank Compression Method for Large Language Models
Zeliang Zong, Kai Zhang, Zheyang Li et al.
2Columns1Row: A Russian Benchmark for Textual and Multimodal Table Understanding and Reasoning
Vildan Saburov, Daniil Vodolazsky, Danil Sazanakov et al.
3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation
Seonho Lee, Jiho Choi, Inha Kang et al.
3DS: Medical Domain Adaptation of LLMs via Decomposed Difficulty-based Data Selection
Hongxin Ding, Yue Fang, Runchuan Zhu et al.
3LM: Bridging Arabic, STEM, and Code through Benchmarking
Basma El Amel Boussaha, Leen Al Qadi, Mugariya Farooq et al.
3MDBench: Medical Multimodal Multi-agent Dialogue Benchmark
Ivan Sviridov, Amina Miftakhova, Artemiy Tereshchenko et al.
3R: Enhancing Sentence Representation Learning via Redundant Representation Reduction
Longxuan Ma, Xiao Wu, Yuxin Huang et al.
A Benchmark for Hindi Verb-Argument Structure Alternations
Kanishka Jain, Ashwini Vaidya
A Benchmark for Translations Across Styles and Language Variants
Xin Tan, Bowei Zou, AiTi Aw
AbsVis – Benchmarking How Humans and Vision-Language Models “See” Abstract Concepts in Images
Tarun Tater, Diego Frassinelli, Sabine Schulte im Walde
A Case Against Implicit Standards: Homophone Normalization in Machine Translation for Languages that use the Ge’ez Script.
Hellina Hailu Nigatu, Atnafu Lambebo Tonja, Henok Biadglign Ademtew et al.
A Category-Theoretic Approach to Neural-Symbolic Task Planning with Bidirectional Search
Shuhui Qu, Jie Wang, Kincho Law
A Causal Lens for Evaluating Faithfulness Metrics
Kerem Zaman, Shashank Srivastava
Accelerated Test-Time Scaling with Model-Free Speculative Sampling
Woomin Song, Saket Dingliwal, Sai Muralidhar Jayanthi et al.
Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence
Yijiong Yu, Wei Wang, Ran Chen et al.
Accelerating LLM Reasoning via Early Rejection with Partial Reward Modeling
Seyyed Saeid Cheshmi, Azal Ahmad Khan, Xinran Wang et al.
Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches
Israel Abebe Azime, Deborah D. Kanubala, Tejumade Afonja et al.
AccessEval: Benchmarking Disability Bias in Large Language Models
Srikant Panda, Amit Agarwal, Hitesh Laxmichand Patel
ACEBench: A Comprehensive Evaluation of LLM Tool Usage
Chen Chen, Xinlong Hao, Weiwen Liu et al.
ACING: Actor-Critic for Instruction Learning in Black-Box LLMs
Salma Kharrat, Fares Fourati, Marco Canini
A Closer Look at Bias and Chain-of-Thought Faithfulness of Large (Vision) Language Models
Sriram Balasubramanian, Samyadeep Basu, Soheil Feizi
A Comparison of Elementary Baselines for BabyLM
Rareș Păpușoi, Sergiu Nisioi
A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation
Neal Gregory Lawton, Alfy Samuel, Anoop Kumar et al.
A Comprehensive Framework to Operationalize Social Stereotypes for Responsible AI Evaluations
Aida Mostafazadeh Davani, Sunipa Dev, Héctor Pérez-Urbina et al.
A Comprehensive Literary Chinese Reading Comprehension Dataset with an Evidence Curation Based Solution
Dongning Rao, Rongchu Zhou, Peng Chen et al.