Papers
10,712 papers found
Stationary and Clustering Transformer Hashing for Cross-modal Retrieval
Zhan Yang, Yiran Liu, Youyuan Huang et al.
Learning Adaptive and Expandable Mixture Model for Continual Learning
Fei Ye, YongCheng Zhong, Qihe Liu et al.
A Flat Minima Perspective on Understanding Augmentations and Model Robustness
Weebum Yoo, Sung Whan Yoon
Activating Visual Context and Commonsense Reasoning Through Masked Prediction in VLMs
Jiaao Yu, Shenwei Li, Mingjie Han et al.
PCFormer: Accelerating Privacy-preserving Transformer Inference by Partition and Combination
Bo Zeng, Zhi Pang, Yuyang Zhang et al.
On the Robustness of Bandit Multiple Testing
Zhengyu Zhou, Weiwei Liu
ArchetypeTrader: Reinforcement Learning for Selecting and Refining Learnable Strategic Archetypes in Quantitative Trading
Chuqiao Zong, Molei Qin, Haochong Xia et al.
CAMAR: Continuous Actions Multi-Agent Routing
Artem Pshenitsyn, Aleksandr Panov, Alexey Skrynnik
Enhancing PIBT via Multi-Action Operations
Egor Yukhnevich, Anton Andreychuk
Where Norms and References Collide: Evaluating LLMs on Normative Reasoning
Mitchell Abrams, Kaveh Eskandari Miandoab, Felix Gervits et al.
RaCoT: Plug-and-Play Contrastive Example Generation Mechanism for Enhanced LLM Reasoning Reliability
Kaitong Cai, Jusheng Zhang, Yijia Fan et al.
Calibrating and Rotating: A Unified Framework for Weight Conditioning in PEFT
Da Chang, Peng Xue, Yu Li et al.
ModalSyncSum: Synchronizing Image and Text for Reliable Summary Generation
Xuanqi Chen, Ziying Rong, Xinfeng Liao et al.
CounterBench: Evaluating and Improving Counterfactual Reasoning in Large Language Models
Yuefei Chen, Vivek K. Singh, Jing Ma et al.
Reasoning with Exploration: An Entropy Perspective
Daixuan Cheng, Shaohan Huang, Xuekai Zhu et al.
Guess or Recall? Training CNNs to Classify and Localize Memorization in LLMs
Jérémie Dentan, Davide Buscaldi, Sonia Vanier
Expert-Guided Prompting and Retrieval-Augmented Generation for Emergency Medical Service Question Answering
Xueren Ge, Sahil Murtaza, Anthony Cortez et al.
Uncovering and Mitigating Transient Blindness in Multimodal Model Editing
XiaoQi Han, Ru Li, Ran Yi et al.
ConInstruct: Evaluating Large Language Models on Conflict Detection and Resolution in Instructions
Xingwei He, Qianru Zhang, Pengfei Chen et al.
Long-form RewardBench: Evaluating Reward Models for Long-form Generation
Hui Huang, Yancheng He, Wei Liu et al.
Backdooring Rationalization
Lingxiao Kong, Jiahui Jiang, Wenchao Xu et al.
Safe RAG by RAG: Untying the Bell That RAG Rang with the RAG Hand
Xun Liang, Mengwei Wang, Yuefeng Ma et al.
LLM Collaboration with Multi-Agent Reinforcement Learning
Shuo Liu, Zeyu Liang, Xueguang Lyu et al.
Focusing on Language: Revealing and Exploiting Language Attention Heads in Multilingual Large Language Models
Xin Liu, Qiyang Song, Qihang Zhou et al.