AISTATS 2025

InnerThoughts: Disentangling Representations and Predictions in Large Language Models