2025 ICML ICML 2025

The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence