ICML 2024

The WMDP Benchmark: Measuring and Reducing Malicious Use with Unlearning

46 authors

Authors

Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew Bo Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Ariel Herbert-Voss, Cort B Breuer, Andy Zou, Mantas Mazeika, Zifan Wang, Palash Oswal, Weiran Lin, Adam Alfred Hunt, Justin Tienken-Harder, Kevin Y. Shih, Kemper Talley, John Guan, Ian Steneker, David Campbell, Brad Jokubaitis, Steven Basart, Stephen Fitz, Ponnurangam Kumaraguru, Kallol Krishna Karmakar, Uday Tupakula, Vijay Varadharajan, Yan Shoshitaishvili, Jimmy Ba, Kevin M. Esvelt, Alexandr Wang, Dan Hendrycks
