AISTATS 2025

Exposing Privacy Gaps: Membership Inference Attack on Preference Data for LLM Alignment