Papers
235 papers found
On scalable oversight with weak LLMs judging strong LLMs
Zachary Kenton, Noah Y. Siegel, János Kramár et al.
LLMs + Persona-Plug = Personalized LLMs
Jiongnan Liu, Yutao Zhu, Shuting Wang et al.
QLoRA: Efficient Finetuning of Quantized LLMs
Tim Dettmers, Artidoro Pagnoni, Ari Holtzman et al.
VPGTrans: Transfer Visual Prompt Generator across LLMs
Ao Zhang, Hao Fei, Yuan Yao et al.
Evaluating the Moral Beliefs Encoded in LLMs
Nino Scherrer, Claudia Shi, Amir Feder et al.
QBB: Quantization with Binary Bases for LLMs
Adrian Bulat, Yassine Ouali, Georgios Tzimiropoulos
Benchmarking LLMs via Uncertainty Quantification
Fanghua Ye, Mingming Yang, Jianhui Pang et al.
Efficient multi-prompt evaluation of LLMs
Felipe Maia Polo, Ronald Xu, Lucas Weber et al.
Protecting Your LLMs with Information Bottleneck
Zichuan Liu, Zefan Wang, Linjie Xu et al.
Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs
Zhao XU, Fan LIU, Hao LIU
StackEval: Benchmarking LLMs in Coding Assistance
Nidhish Shah, Zulkuf Genc, Dogu Araci
Verified Code Transpilation with LLMs
Sahil Bhatia, Jie Qiu, Niranjan Hasabnis et al.
Is Programming by Example Solved by LLMs?
Wen-Ding Li, Kevin Ellis
Hypothesis Testing the Circuit Hypothesis in LLMs
Claudia Shi, Nicolas Beltran-Velez, Achille Nazaret et al.
QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs
Saleh Ashkboos, Amirkeivan Mohtashami, Maximilian L. Croci et al.
Truth is Universal: Robust Detection of Lies in LLMs
Lennart Bürger, Fred A. Hamprecht, Boaz Nadler
Enhancing LLMs via High-Knowledge Data Selection
Feiyu Duan, Xuemiao Zhang, Sirui Wang et al.
Scaling Trends for Data Poisoning in LLMs
Dillon Bowen, Brendan Murphy, Will Cai et al.
Interpreting the Effects of Quantization on LLMs
Manpreet Singh, Hassan Sajjad
Gatsby without the ‘E’: Creating Lipograms with LLMs
Nitish Gokulakrishnan, Rohan Balasubramanian, Syeda Jannatus Saba et al.
The ROOTS Search Tool: Data Transparency for LLMs
Aleksandra Piktus, Christopher Akiki, Paulo Villegas et al.
FineSurE: Fine-grained Summarization Evaluation using LLMs
Hwanjun Song, Hang Su, Igor Shalyminov et al.
Metaphor Understanding Challenge Dataset for LLMs
Xiaoyu Tong, Rochelle Choenni, Martha Lewis et al.
A Multi-Task Embedder For Retrieval Augmented LLMs
Peitian Zhang, Zheng Liu, Shitao Xiao et al.
Learning to Edit: Aligning LLMs with Knowledge Editing
Yuxin Jiang, Yufei Wang, Chuhan Wu et al.