Hacker News May 5, 2026Accelerating Gemma 4: faster inference with multi-token prediction draftersRead full article Share: