2025 ICML ICML 2025

To Steer or Not to Steer? Mechanistic Error Reduction with Abstention for Language Models

The Questioner