Hacker News May 7, 2026ZAYA1-8B: An 8B Moe Model with 760M Active Params Matching DeepSeek-R1 on MathRead full article Share: