ThinkingNews

Hacker News May 5, 2026

Accelerating Gemma 4: faster inference with multi-token prediction drafters