Papers

235 papers found
Aligned LLMs Are Not Aligned Browser Agents
Priyanshu Kumar, Elaine Lau, Saranya Vijayakumar et al.
2025 ICLR
Do LLMs have Consistent Values?
Naama Rozen, Liat Bezalel, Gal Elidan et al.
2025 ICLR
BOND: Aligning LLMs with Best-of-N Distillation
Pier Giuseppe Sessa, Robert Dadashi-Tazehozi, Leonard Hussenot et al.
2025 ICLR
PersonalLLM: Tailoring LLMs to Individual Preferences
Thomas P Zollo, Andrew Wei Tung Siah, Naimeng Ye et al.
2025 ICLR
Shh, don't say that! Domain Certification in LLMs
Cornelius Emde, Alasdair Paren, Preetham Arvind et al.
2025 ICLR
Transformer-Squared: Self-adaptive LLMs
Qi Sun, Edoardo Cetin, Yujin Tang
2025 ICLR
PAD: Personalized Alignment of LLMs at Decoding-time
Ruizhe Chen, Xiaotian Zhang, Meng Luo et al.
2025 ICLR
Scaling FP8 training to trillion-token LLMs
Maxim Fishman, Brian Chmiel, Ron Banner et al.
2025 ICLR
Mixture Compressor for Mixture-of-Experts LLMs Gains More
Wei Huang, Yue Liao, Jianhui Liu et al.
2025 ICLR
Can LLMs Solve Longer Math Word Problems Better?
Xin Xu, Tong Xiao, Zitong Chao et al.
2025 ICLR
What If LLMs Can Smell: A Prototype
Xueyi Zhou, Qi Lu, Dong-Kyu Chae
2025 IJCAI
Interpreting the Effects of Quantization on LLMs
Manpreet Singh, Hassan Sajjad
2025 IJCNLP
Gatsby without the ‘E’: Creating Lipograms with LLMs
Nitish Gokulakrishnan, Rohan Balasubramanian, Syeda Jannatus Saba et al.
2025 IJCNLP
2025 IJCNLP
Atomic Calibration of LLMs in Long-Form Generations
Caiqi Zhang, Ruihan Yang, Zhisong Zhang et al.
2025 IJCNLP
Fake Alignment: Are LLMs Really Aligned Well?
Yixu Wang, Yan Teng, Kexin Huang et al.
2024 NAACL
2024 NAACL
Human-AI Interaction in the Age of LLMs
Diyi Yang, Sherry Tongshuang Wu, Marti A. Hearst
2024 NAACL
Efficiently Distilling LLMs for Edge Applications
Achintya Kundu, Yu Chin Fabian Lim, Aaron Chew et al.
2024 NAACL
Leveraging LLMs for Dialogue Quality Measurement
Jinghan Jia, Abi Komma, Timothy Leffel et al.
2024 NAACL
What Makes Math Word Problems Challenging for LLMs?
Kv Aditya Srivatsa, Ekaterina Kochmar
2024 NAACL
Evaluating Vocabulary Usage in LLMs
Matthew Durward, Christopher Thomson
2024 NAACL
Data-Augmentation-Based Dialectal Adaptation for LLMs
Fahim Faisal, Antonios Anastasopoulos
2024 NAACL
Can LLMs Convert Graphs to Text-Attributed Graphs?
Zehong Wang, Sidney Liu, Zheyuan Zhang et al.
2025 NAACL