Papers
Aligned LLMs Are Not Aligned Browser Agents
Priyanshu Kumar, Elaine Lau, Saranya Vijayakumar et al.
Do LLMs have Consistent Values?
Naama Rozen, Liat Bezalel, Gal Elidan et al.
BOND: Aligning LLMs with Best-of-N Distillation
Pier Giuseppe Sessa, Robert Dadashi-Tazehozi, Leonard Hussenot et al.
PersonalLLM: Tailoring LLMs to Individual Preferences
Thomas P Zollo, Andrew Wei Tung Siah, Naimeng Ye et al.
Shh, don't say that! Domain Certification in LLMs
Cornelius Emde, Alasdair Paren, Preetham Arvind et al.
Transformer-Squared: Self-adaptive LLMs
Qi Sun, Edoardo Cetin, Yujin Tang
PAD: Personalized Alignment of LLMs at Decoding-time
Ruizhe Chen, Xiaotian Zhang, Meng Luo et al.
Scaling FP8 training to trillion-token LLMs
Maxim Fishman, Brian Chmiel, Ron Banner et al.
Mixture Compressor for Mixture-of-Experts LLMs Gains More
Wei Huang, Yue Liao, Jianhui Liu et al.
Can LLMs Solve Longer Math Word Problems Better?
Xin Xu, Tong Xiao, Zitong Chao et al.
What If LLMs Can Smell: A Prototype
Xueyi Zhou, Qi Lu, Dong-Kyu Chae
Interpreting the Effects of Quantization on LLMs
Manpreet Singh, Hassan Sajjad
Gatsby without the ‘E’: Creating Lipograms with LLMs
Nitish Gokulakrishnan, Rohan Balasubramanian, Syeda Jannatus Saba et al.
Testing Simulation Theory in LLMs’ Theory of Mind
Koshiro Aoki, Daisuke Kawahara
Atomic Calibration of LLMs in Long-Form Generations
Caiqi Zhang, Ruihan Yang, Zhisong Zhang et al.
Fake Alignment: Are LLMs Really Aligned Well?
Yixu Wang, Yan Teng, Kexin Huang et al.
CPopQA: Ranking Cultural Concept Popularity by LLMs
Ming Jiang, Mansi Joshi
Human-AI Interaction in the Age of LLMs
Diyi Yang, Sherry Tongshuang Wu, Marti A. Hearst
Efficiently Distilling LLMs for Edge Applications
Achintya Kundu, Yu Chin Fabian Lim, Aaron Chew et al.
Leveraging LLMs for Dialogue Quality Measurement
Jinghan Jia, Abi Komma, Timothy Leffel et al.
What Makes Math Word Problems Challenging for LLMs?
Kv Aditya Srivatsa, Ekaterina Kochmar
Evaluating Vocabulary Usage in LLMs
Matthew Durward, Christopher Thomson
Data-Augmentation-Based Dialectal Adaptation for LLMs
Fahim Faisal, Antonios Anastasopoulos
Can LLMs Convert Graphs to Text-Attributed Graphs?
Zehong Wang, Sidney Liu, Zheyuan Zhang et al.