Hacker News May 2, 2026Refusal in Language Models Is Mediated by a Single DirectionRead full article Share: