Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Back to papers
2024
ICML
ICML 2024
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
Authors
Boyi Wei
,
Kaixuan Huang
,
Yangsibo Huang
,
Tinghao Xie
,
Xiangyu Qi
,
Mengzhou Xia
,
Prateek Mittal
,
Mengdi Wang
,
Peter Henderson
Download PDF
Related papers
Learning Latent Dynamic Robust Representations for World Models
2024
Beyond Individual Input for Deep Anomaly Detection on Tabular Data
2024
Risk Estimation in a Markov Cost Process: Lower and Upper Bounds
2024
Collapse-Aware Triplet Decoupling for Adversarially Robust Image Retrieval
2024
Ranking-based Client Imitation Selection for Efficient Federated Learning
2024