Tag
1 articles
This paper argues that accuracy alone is not enough to trust LLM safety judges, and proposes policy invariance as a reliability test.