UAI 2025

Contrast-CAT: Contrasting Activations for Enhanced Interpretability in Transformer-based Text Classifiers