ThinkingNews

Back to feed
Hacker News July 2, 2026

Is One Layer Enough? A Single Transformer Layer Matches Full-Parameter RL Train