71
AI 摘要
Google为Gemma 4系列发布了多令牌预测(MTP)草稿模型。它在不损失性能的情况下带来了3倍的速度提升。 期待在Mac Mini上测试带有MTP草稿模型的量化版Gemma 4!
Google released Multi-Token Prediction (MTP) drafters for the Gemma 4 family. It comes with a 3x speed boost without losing performance.
Looking forward to testing a quantized Gemma 4 with MTP drafters on a Mac Mini!
Gemma 4: Now up to 3x Faster. ⚡ Same quality, way more speed. Our new MTP drafters allow Gemma 4 to predict multiple tokens at once, effectively tripling your o...