Google DeepMind 的 Gemma 4 12B 已在硅基流动上线,定价输入 $0.1/1M tokens,输出 $0.3/1M tokens。支持 262K 上下文、内置思考、原生工具调用及 140+ 种语言。采用无编码器架构,视觉和音频输入直接注入 LLM 主干,降低处理延迟。12B 参数但配备 26B “大脑”,性能接近 Google 26B 级别,擅长多步推理与智能体工作流。
If you need one model for agents, long context, and multimodal inputs - this is it. Meet @GoogleDeepMind 's Gemma 4 12B on SiliconFlow 🔥
💰Input / Output: $0.1 / $0.3 per 1M tokens on SiliconFlow 🛠️ 262K Context | Built-in Thinking | Native Tool Calling | 140+ Languages ✨ Encoder-free architecture: vision and audio inputs flow directly into the LLM backbone, reducing process latency 🧠 12B Size, 26B Brain: nearing Google's 26B performance, excel at multi-step reasoning and agentic workflows
Try it on SiliconFlow ⬇️