Research Explorer

On scalable oversight with weak LLMs judging strong LLMs

Zachary Kenton, Noah Y. Siegel, János Kramár et al.

2024 NIPS

LLMs + Persona-Plug = Personalized LLMs

Jiongnan Liu, Yutao Zhu, Shuting Wang et al.

2025 ACL

QLoRA: Efficient Finetuning of Quantized LLMs

Tim Dettmers, Artidoro Pagnoni, Ari Holtzman et al.

2023 NIPS

VPGTrans: Transfer Visual Prompt Generator across LLMs

Ao Zhang, Hao Fei, Yuan Yao et al.

2023 NIPS

Evaluating the Moral Beliefs Encoded in LLMs

Nino Scherrer, Claudia Shi, Amir Feder et al.

2023 NIPS

QBB: Quantization with Binary Bases for LLMs

Adrian Bulat, Yassine Ouali, Georgios Tzimiropoulos

2024 NIPS

Benchmarking LLMs via Uncertainty Quantification

Fanghua Ye, Mingming Yang, Jianhui Pang et al.

2024 NIPS

Efficient multi-prompt evaluation of LLMs

Felipe Maia Polo, Ronald Xu, Lucas Weber et al.

2024 NIPS

Protecting Your LLMs with Information Bottleneck

Zichuan Liu, Zefan Wang, Linjie Xu et al.

2024 NIPS

Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs

Zhao XU, Fan LIU, Hao LIU

2024 NIPS

StackEval: Benchmarking LLMs in Coding Assistance

Nidhish Shah, Zulkuf Genc, Dogu Araci

2024 NIPS

Verified Code Transpilation with LLMs

Sahil Bhatia, Jie Qiu, Niranjan Hasabnis et al.

2024 NIPS

Is Programming by Example Solved by LLMs?

Wen-Ding Li, Kevin Ellis

2024 NIPS

Hypothesis Testing the Circuit Hypothesis in LLMs

Claudia Shi, Nicolas Beltran-Velez, Achille Nazaret et al.

2024 NIPS

QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs

Saleh Ashkboos, Amirkeivan Mohtashami, Maximilian L. Croci et al.

2024 NIPS

Truth is Universal: Robust Detection of Lies in LLMs

Lennart Bürger, Fred A. Hamprecht, Boaz Nadler

2024 NIPS

Enhancing LLMs via High-Knowledge Data Selection

Feiyu Duan, Xuemiao Zhang, Sirui Wang et al.

2025 AAAI

Scaling Trends for Data Poisoning in LLMs

Dillon Bowen, Brendan Murphy, Will Cai et al.

2025 AAAI

Interpreting the Effects of Quantization on LLMs

Manpreet Singh, Hassan Sajjad

2025 AACL

Gatsby without the ‘E’: Creating Lipograms with LLMs

Nitish Gokulakrishnan, Rohan Balasubramanian, Syeda Jannatus Saba et al.

2025 AACL

The ROOTS Search Tool: Data Transparency for LLMs

Aleksandra Piktus, Christopher Akiki, Paulo Villegas et al.

2023 ACL

FineSurE: Fine-grained Summarization Evaluation using LLMs

Hwanjun Song, Hang Su, Igor Shalyminov et al.

2024 ACL

Metaphor Understanding Challenge Dataset for LLMs

Xiaoyu Tong, Rochelle Choenni, Martha Lewis et al.

2024 ACL

A Multi-Task Embedder For Retrieval Augmented LLMs

Peitian Zhang, Zheng Liu, Shitao Xiao et al.

2024 ACL

Learning to Edit: Aligning LLMs with Knowledge Editing

Yuxin Jiang, Yufei Wang, Chuhan Wu et al.

2024 ACL

Papers