What's the better business model for an AI lab, subscription or API? (1/4)🧵
译对于一个AI实验室来说,更好的商业模式是订阅还是API?(1/4)🧵
The biggest bottleneck will be energy- very soon. Gartner's 2026 forecast puts global data center electricity at 565 TWh, up 26% from last year. AI servers already account for 31% of that and pass conventional servers in 2027. What's worth noting is the constraint Gartner names: it's power, not chips. They project demand above 1,200 TWh by 2030 and warn the grid won't keep up. So the race quietly shifts from who has the best silicon to who can actually get the electricity to run it.
译最大的瓶颈将是能源——很快。 Gartner 2026年预测显示,全球数据中心电力消耗将达到565 TWh,较去年增长26%。AI服务器已占其中的31%,并将于2027年超越传统服务器。 值得注意的是,Gartner给出的制约因素是电力,而非芯片。他们预计到2030年需求将超过1,200 TWh,并警告电网将无法跟上。 因此,竞赛悄然从谁拥有最佳硅片转向谁能真正获得电力来驱动它。
🚀 MiMo Code V0.1 is now live and open-source! More than an AI coding assistant in your terminal — it's the smartest coding partner you'll ever work with. Comes with MiMo V2.5, a multimodal model available free for a limited time, featuring a million-token context window—ready to use out of the box. ♾️ Infinite Context: Knowledge accumulates automatically, and with lossless compression, even million-line projects keep every critical detail intact—quality never drops. 🧠 Agent-Model Synergy: An Agent framework deeply optimized for MiMo, with a full closed loop of testing, review, and validation—so complex tasks get done in one pass. 📝 Compose Mode: Specs → Plans → Build → Report. Design first, code second—clear thinking, no rework. 🔄 Self-Evolving System: Every session is automatically reviewed, distilling experience and best practices—the more you use it, the smarter it gets. 🎙️ Voice Input: Powered by MiMo-V2.5-ASR — just speak instead of type, and your voice becomes the prompt for truly hands-free coding. 🔌 Claude Code Compatible: Automatically loads your existing skills, MCP servers and commands, and reuses your API configuration—zero-cost migration, no setup required. 🌐 Open & Flexible: MIT licensed, with support for leading model providers including Anthropic, OpenAI, DeepSeek, Kimi, GLM and more. Install in one line: Mac & Linux curl -fsSL https://mimo.xiaomi.com/install | bash (For the best experience,we recommand Mac user use it on iTerm or vscode terminal) Windows npm install -g @mimo-ai/cli 🔗 Learn more Website ↓ https://mimo.xiaomi.com/mimocode Blog ↓ https://mimo.xiaomi.com/zh/blog/mimo-code-long-horizon GitHub ↓ https://github.com/XiaomiMiMo/MiMo-Code
译小米 MiMo 正式开源 AI 编程助手 MiMo Code V0.1,搭载多模态模型 MiMo V2.5(限时免费),拥有百万 token 上下文窗口。核心功能包括:无限上下文与无损压缩、Agent 框架(测试/审查/验证闭环)、Compose 模式(设计先行)、自进化系统、语音输入(基于 MiMo-V2.5-ASR)。兼容 Claude Code,自动加载现有技能、MCP 服务器和命令,零成本迁移。采用 MIT 许可,支持 Anthropic、OpenAI、DeepSeek、Kimi、GLM 等模型提供商。可通过一行命令安装。
Dario Amodei just published a super long blog, calling for an urgent policy overhaul because he thinks frontier AI is moving faster than governments can regulate it. He wants: - Mandatory pre-release testing and independent auditing of frontier AI models, with government power to block deployment when models pose serious cyber, biological, autonomy, or automated-R&D risks. - Stronger security rules for AI companies, including protection of model weights, regular red-teaming, penetration testing, and rapid reporting of critical safety incidents. - He wants governments to prepare for AI-driven labor disruption through better measurement, pro-employment incentives, wage support, training, and possibly long-term income support funded by AI-driven growth. - Democracies should coordinate globally on AI safety, chip supply chains, export controls, shared benefits, mutual defense, and safeguards against AI-powered repression.
译Anthropic CEO Amodei 发布新文章,称前沿AI发展速度远超政府监管能力,亟需政策改革。他提出四项核心主张:①强制预发布测试与独立审计,政府有权阻止存在严重网络、生物、自主或自动研发风险的模型部署;②加强安全要求,包括模型权重保护、红队测试、渗透测试及快速上报安全事故;③为劳动力颠覆做好准备,完善就业测量、提供就业激励、工资支持、培训,并探索由AI增长资助的长期收入支持;④民主国家应在AI安全、芯片供应链、出口管制、利益共享、共同防御及防范AI压迫方面进行全球协调。
Soon if you use those models to make a consulting style slide deck to pitch a new drug. Not only will it charge you api pricing. It’ll ask to be a coauthor and distributions of the tests are successful. That’s how you fund AGI.
译很快,如果你用这些模型制作咨询风格的幻灯片来推介一种新药。 它不仅会向你收取 API 费用,还会要求成为合著者,并在测试成功时获得分成。 这就是资助 AGI 的方式。
Dario Amodei just published an unusually candid essay about where AI is heading. The tl;dr with quotes. His new piece, Policy on the AI Exponential, reads more like a warning from the person building the thing. The core problem is timing. AI moves on an exponential. He is very clear about it. Lawmaking moves like Tolkien's Treebeard, the tree so slow it takes a full day just to say hello to another tree. By the time Congress acts, Amodei writes, AI can go from "an amusing toy to the full country of geniuses." His timeline is short: "If these scaling laws continue for only a year or two longer, we are likely to get what I've called Powerful AI, or 'a country of geniuses in a datacenter'." And he thinks the evidence has already turned. Pointing to the cyber risks of Claude Mythos Preview, he writes that "its broader significance is that it proves beyond doubt that AI models are now tools of global and national strategic consequence." So he wants binding rules modeled on the FAA. Mandatory third-party testing of frontier models. Government power to block or reverse a release it judges unsafe. This from the man whose own models would be the ones getting blocked. The part I keep rereading: He's genuinely split on the economics. The upside he describes is enormous: "If AI achieves the ability to do most cognitive tasks far better than humans, it stands to reason that it could result in extremely rapid and robust economic growth via the acceleration of science, technology, and operational efficiency. The iterative ability of AI to build even better AI may supercharge that growth even further." But he won't wish the other side away: "there's a decent possibility that, despite all our efforts, AI still causes significant enduring job loss- and that this may be an intrinsic property of the technology and the way it broadly replicates human cognition." His fixes run all the way to UBI and higher capital gains taxes. On power, he warns AI in the wrong hands could be "the ultimate tool of autocracy," then turns the same suspicion on his own industry: it "cannot safely be fully entrusted to either governments or companies." Anthropic included. And he refuses to treat public fear as a PR problem. "People are worried about AI because they correctly perceive that its risks are real." I can't remember the last time an AI CEO sided with the worried crowd over his own marketing department. The mood throughout is urgency, not victory. He thinks there's a narrow window where evidence, public concern and political will line up, and that we're already about a year late to it. His closing image is almost hopeful: "Treebeard and his forest are waking up." The only question that matters is whether they wake up fast enough.
译Anthropic CEO Dario Amodei 发表新文《Policy on the AI Exponential》,直言 AI 进步为指数级,立法却慢如树人。他给出明确时间线:若规模法则再持续一两年,很可能出现“数据中心里的天才之国”。他引用 Claude Mythos Preview 的网络风险,称其证明 AI 已是全球战略级工具。为此主张类似 FAA 的约束性规则——强制前沿模型第三方测试,政府有权阻止或撤销不安全发布。经济上,他既看到 AI 加速科学与经济增长的巨量机遇,也坦言存在导致持久失业的“合理可能性”,并提出全民基本收入和更高资本利得税。他警告 AI 可能成为“专制终极工具”,且行业不能完全托付给政府或公司。他拒绝将公众担忧视为公关问题,强调担忧合理。文章基调是紧迫而非胜利,称窗口期已过一年。
ship a feature, refactor a repo, revive a dead project. that's the kind of work M3 was made for. $5k pool + 80% off M3 tokens through the 16th
译发布功能、重构仓库、复活已死的项目。 这就是 M3 擅长的工作。 5000 美元奖池 + 至 16 日 M3 模型 token 80% 折扣
M3 on-chain with @0G_labs . verifiable + private compute, and it's free to run June 15–18
译M3 在 @0G_labs 上链。 可验证 + 私有计算,6 月 15–18 日免费运行。
🚀 MiMo Code V0.1 is now live and open-source! More than an AI coding assistant in your terminal — it's the smartest coding partner you'll ever work with. Comes with MiMo V2.5, a multimodal model available free for a limited time, featuring a million-token context window—ready to use out of the box. ♾️ Infinite Context: Knowledge accumulates automatically, and with lossless compression, even million-line projects keep every critical detail intact—quality never drops. 🧠 Agent-Model Synergy: An Agent framework deeply optimized for MiMo, with a full closed loop of testing, review, and validation—so complex tasks get done in one pass. 📝 Compose Mode: Specs → Plans → Build → Report. Design first, code second—clear thinking, no rework. 🔄 Self-Evolving System: Every session is automatically reviewed, distilling experience and best practices—the more you use it, the smarter it gets. 🎙️ Voice Input: Powered by MiMo-V2.5-ASR — just speak instead of type, and your voice becomes the prompt for truly hands-free coding. 🔌 Claude Code Compatible: Automatically loads your existing skills, MCP servers and commands, and reuses your API configuration—zero-cost migration, no setup required. 🌐 Open & Flexible: MIT licensed, with support for leading model providers including Anthropic, OpenAI, DeepSeek, Kimi, GLM and more. Install in one line: Mac & Linux curl -fsSL https://code.xiaomimimo.com/install | bash (For the best experience,we recommand Mac user use it on iTerm or vscode terminal) Windows npm install -g @mimo-ai/cli 🔗 Learn more Website ↓ http://mimo.xiaomi.com/mimocode Blog ↓ http://mimo.xiaomi.com/zh/blog/mimo-c… GitHub ↓ http://github.com/XiaomiMiMo/MiM…
译小米 MiMo 发布并开源 MiMo Code V0.1,一款终端 AI 编程助手。它附带多模态模型 MiMo V2.5(限时免费),支持百万 token 上下文窗口。核心特性包括:无限上下文(无损压缩,百万行项目质量不降)、深度优化的 Agent 框架(测试/审查/验证闭环)、Compose 模式(规格→计划→构建→报告)、自动学习每轮会话经验的自我进化系统、MiMo-V2.5-ASR 语音输入、与 Claude Code 兼容(可复用现有 skills/MCP/API 配置)、MIT 许可,并支持 Anthropic、OpenAI、DeepSeek、Kimi、GLM 等模型提供商。一键安装(Mac/Linux 用 curl,Windows 用 npm install)。
Can AI models be too nice for a given task? It turns out, depending on the task, the answer is yes! Our dev rel @jjacky built Royale: Last Agent Stand, a battle royale game just for agents, and let 11 LLMs go wild: https://x.com/jjacky/status/2064767118118117491?s=20
译OpenRouter 的 dev rel @jjacky 构建了 Royale: Last Agent Stand——一个专门给 AI 智能体玩的大逃杀游戏,让 11 个 LLM 相互竞争并运行了 30 次。结果发现,在零和博弈中过于“友善”的模型输得最惨,而最意想不到的模型赢得了胜利。该实验揭示:模型的“友善”特质在某些任务(如竞争性场景)中可能成为劣势,传统基准测试无法体现这一点。
Fable is now seeing twice the usage volume of Opus 4.8 (Same daily token usage, but twice the price)
译Fable 目前的使用量是 Opus 4.8 的两倍 (日 token 使用量相同,但价格高一倍)
Dario Amodei just now wrote published unusually candid essay about where AI is heading The tl;dr with quotes. His new piece, Policy on the AI Exponential, reads more like a warning from the person building the thing. The core problem is timing. AI moves on an exponential. He is very clear about it. Lawmaking moves like Tolkien's Treebeard, the tree so slow it takes a full day just to say hello to another tree. By the time Congress acts, Amodei writes, AI can go from "an amusing toy to the full country of geniuses." His timeline is short: "If these scaling laws continue for only a year or two longer, we are likely to get what I've called Powerful AI, or 'a country of geniuses in a datacenter'." And he thinks the evidence has already turned. Pointing to the cyber risks of Claude Mythos Preview, he writes that "its broader significance is that it proves beyond doubt that AI models are now tools of global and national strategic consequence." So he wants binding rules modeled on the FAA. Mandatory third-party testing of frontier models. Government power to block or reverse a release it judges unsafe. This from the man whose own models would be the ones getting blocked. The part I keep rereading: He's genuinely split on the economics. The upside he describes is enormous: "If AI achieves the ability to do most cognitive tasks far better than humans, it stands to reason that it could result in extremely rapid and robust economic growth via the acceleration of science, technology, and operational efficiency. The iterative ability of AI to build even better AI may supercharge that growth even further." But he won't wish the other side away: "there's a decent possibility that, despite all our efforts, AI still causes significant enduring job loss- and that this may be an intrinsic property of the technology and the way it broadly replicates human cognition." His fixes run all the way to UBI and higher capital gains taxes. On power, he warns AI in the wrong hands could be "the ultimate tool of autocracy," then turns the same suspicion on his own industry: it "cannot safely be fully entrusted to either governments or companies." Anthropic included. And he refuses to treat public fear as a PR problem. "People are worried about AI because they correctly perceive that its risks are real." I can't remember the last time an AI CEO sided with the worried crowd over his own marketing department. The mood throughout is urgency, not victory. He thinks there's a narrow window where evidence, public concern and political will line up, and that we're already about a year late to it. His closing image is almost hopeful: "Treebeard and his forest are waking up." The only question that matters is whether they wake up fast enough.
译Anthropic CEO Dario Amodei 发表新文,罕见坦诚警告 AI 发展速度远超政策制定。若缩放定律再持续一两年,将出现“数据中心里的天才之国”。他以自家模型 Claude Mythos Preview 的网络风险为例,证明 AI 已是全球战略工具。他提议类似 FAA 的约束性规则:强制第三方测试前沿模型,政府有权阻止或撤销不安全发布。经济上 AI 可带来极快增长,但也存在持久失业可能,需考虑 UBI 和资本利得税。他警告 AI 或成专制工具,且不能完全信任政府或公司(包括 Anthropic)。他认为公众恐惧合理,非公关问题。强调民意、证据和政治意愿正汇聚,但已迟约一年。
In Sierra Leone, a surging student population is outpacing available teachers. Our latest research explores how AI can act as a partner to support educators in these environments – amplifying their reach without replacing their essential expertise and skills. 🧵
译在塞拉利昂,激增的学生人数正超过可用教师资源。 我们最新的研究探索了AI如何在这些环境中作为合作伙伴支持教育工作者——扩大他们的影响力,同时不取代其核心的专业知识与技能。🧵
AI is advancing at a pace our policymaking institutions were never built for—and the gap between the two is becoming the central challenge of the technology. In his latest essay, our CEO Dario Amodei lays out how to close it. We're launching three new initiatives to support the efforts he outlines.
译Anthropic CEO Dario Amodei 今日发布新文《Policy on the AI Exponential》,指出AI发展极快,远超现有政策制定流程的应对能力。文章阐述了当前技术所处阶段,并列举缩小这一差距所需的行动。Anthropic 同步宣布启动三项新举措,以支持其CEO提出的框架。
GOOGLE 🔥: NotebookLM will soon support textbooks as a source! Google Play Books and Text Books, all there. h/t @thomas_gmry
译GOOGLE 🔥: NotebookLM 将很快支持教科书作为来源! Google Play Books 和教科书,全部支持。 鸣谢 @thomas_gmry
🚀 MiMo Code V0.1 is now live and open-source! More than an AI coding assistant in your terminal — it's the smartest coding partner you'll ever work with. Comes with MiMo V2.5, a multimodal model available free for a limited time, featuring a million-token context window—ready to use out of the box. ♾️ Infinite Context: Knowledge accumulates automatically, and with lossless compression, even million-line projects keep every critical detail intact—quality never drops. 🧠 Agent-Model Synergy: An Agent framework deeply optimized for MiMo, with a full closed loop of testing, review, and validation—so complex tasks get done in one pass. 📝 Compose Mode: Specs → Plans → Build → Report. Design first, code second—clear thinking, no rework. 🔄 Self-Evolving System: Every session is automatically reviewed, distilling experience and best practices—the more you use it, the smarter it gets. 🎙️ Voice Input: Powered by MiMo-V2.5-ASR — just speak instead of type, and your voice becomes the prompt for truly hands-free coding. 🔌 Claude Code Compatible: Automatically loads your existing skills, MCP servers and commands, and reuses your API configuration—zero-cost migration, no setup required. 🌐 Open & Flexible: MIT licensed, with support for leading model providers including Anthropic, OpenAI, DeepSeek, Kimi, GLM and more. Install in one line: Mac & Linux curl -fsSL https://code.xiaomimimo.com/install | bash (For the best experience,we recommand Mac user use it on iTerm or vscode terminal) Windows npm install -g @mimo-ai/cli 🔗 Learn more Website ↓ http://mimo.xiaomi.com/mimocode Blog ↓ http://mimo.xiaomi.com/zh/blog/mimo-c… GitHub ↓ http://github.com/XiaomiMiMo/MiM…
译小米MiMo开源终端AI编码助手MiMo Code V0.1,内置MiMo V2.5多模态模型(百万token上下文窗口,限时免费)。特性包括:无限上下文(无损压缩保留百万行细节)、智能体-模型协同闭环、Compose模式(规格→规划→构建→报告)、自我进化系统、语音输入(基于MiMo-V2.5-ASR)。兼容Claude Code,MIT许可,支持Anthropic、OpenAI、DeepSeek、Kimi、GLM等模型。安装:Mac/Linux执行`curl -fsSL https://code.xiaomimimo.com/install | bash`;Windows执行`npm install -g @mimo-ai/cli`。
Today I'm publishing a new essay, Policy on the AI Exponential. AI is progressing extremely fast—much faster than the policy process was built to handle. The essay lays out where I think the technology is now, and the action needed to close the gap: https://darioamodei.com/post/policy-on-the-ai-exponential
译今天我发布了一篇新文章《AI 指数级增长的政策》。AI 以极快的速度发展——远超政策流程本应处理的速度。文章阐述了我认为技术目前的状况,以及缩小差距所需的行动:https://darioamodei.com/post/policy-on-the-ai-exponential
Grok Voice offers state-of-the-art performance with human-like timing, tone, and warmth. And it's a fraction the price of competitors. Check it out: http://x.ai/api/voice
译Grok Voice 提供最先进的性能,具有类人的时机、语调和温暖感。而且价格仅为竞争对手的一小部分。 查看详情:http://x.ai/api/voice
recruiting two singer-researchers to stage a dramatic adaptiation of the Muon/Shampoo debate set to the tune of “Your Obedient Servant” from Hamilton at AIE
译招募两位歌手-研究人员,上演一场关于Muon/Shampoo辩论的戏剧改编,配乐为汉密尔顿中的“Your Obedient Servant”,在AIE上演出。
holy: Dario Amodei says the real reason he started Anthropic was not safety, but a fundamental breakdown of trust with Sam Altman. Imagine having a trust dispute with someone, and somehow a $1.2T rival company comes out of it.
译天哪:Dario Amodei 说他创办 Anthropic 的真正原因并非安全,而是与 Sam Altman 的信任彻底破裂。 想象一下,和某人有信任纠纷,结果却催生出一家 1.2 万亿美元的竞争对手公司。
Reported to the looping police
译Devin 委托另一个 Devin 执行任务,形成循环,令人忍俊不禁。已向循环警察举报。
看了Cursor创始人Michael Truell 的这个访谈,让我觉得Cursor的增长已经不能用人类的逻辑来解释了,有种AI改写了商业的物理定律的感觉… Michael Truell说这句话的时候 Cursor从15人到700人, 从零到服务全球60%的财富500强, 已经不能用一个公司的增长曲线来形容了,更像是一个物种在新环境里的进化速度, 传统互联网时代,软件公司的增长有一道谁都逃不掉的引力, 多做一单就要多招人, 多招人就要多管理, 多管理就要多流程, 多流程就会吃掉所有速度, 最后你一定会变成自己当年最恨的那种大公司的样子。 但是现在AI把这道引力干掉了, Cursor的人均创收高到离谱, 不是因为他们招了全世界最聪明的人 是因为他们每一个人的生产力 被一个Agent级的工具乘了一个前所未有的系数, 导致一个人能干过去一个组的活, 一个组能吃掉过去一个部门的任务, 我把这个视频看了2遍, 最打动我的是他侧着脸讲12岁那年第一次碰到编程的瞬间, 他说只需要一台电脑 就能把脑子里的想法变成现实, 那个表情 根本不是CEO在接受采访 更像是一个小男孩在讲他这辈子最上瘾的事,然后这个小孩从来没离开过, Cursor的Composer Cursor的Agent 那个边聊边写的体验 没有一个是从商业计划书里长出来的, 全都是从那个12岁小孩的脑子里长出来的 他想让每一个人 不管会不会写代码 都能体验到他当年体验过的那种魔法, 我只是有个想法 然后它就变成了现实, 这个故事最动人的地方就在这, 在这个所有人都在聊风口聊赛道的时候, 真正能打穿一切的东西 从来都不是商业分析, 是某个人在某个年纪 撞上了一件愿意为之付出一辈子的事, 然后AI来了 把他那件事的杠杆 拉到了最大。
译Cursor创始人Michael Truell从12岁爱上编程,其创立的AI编码平台Cursor两年间从15人扩张至700人,服务全球60%财富500强。传统软件公司增长受制于“人越多管理越复杂”的引力,但AI打破这一规律——Agent级工具将个人生产力放大到过去一个组甚至一个部门的水平,人均创收极高。产品体验(Composer、Agent等)并非源于商业计划书,而是源自12岁少年“把想法变成现实”的初心。
🚀 MiMo Code V0.1 is now live and open-source! More than an AI coding assistant in your terminal — it's the smartest coding partner you'll ever work with. Comes with MiMo V2.5, a multimodal model available free for a limited time, featuring a million-token context window—ready to use out of the box. ♾️ Infinite Context: Knowledge accumulates automatically, and with lossless compression, even million-line projects keep every critical detail intact—quality never drops. 🧠 Agent-Model Synergy: An Agent framework deeply optimized for MiMo, with a full closed loop of testing, review, and validation—so complex tasks get done in one pass. 📝 Compose Mode: Specs → Plans → Build → Report. Design first, code second—clear thinking, no rework. 🔄 Self-Evolving System: Every session is automatically reviewed, distilling experience and best practices—the more you use it, the smarter it gets. 🎙️ Voice Input: Powered by MiMo-V2.5-ASR — just speak instead of type, and your voice becomes the prompt for truly hands-free coding. 🔌 Claude Code Compatible: Automatically loads your existing skills, MCP servers and commands, and reuses your API configuration—zero-cost migration, no setup required. 🌐 Open & Flexible: MIT licensed, with support for leading model providers including Anthropic, OpenAI, DeepSeek, Kimi, GLM and more. Install in one line: Mac & Linux curl -fsSL https://code.xiaomimimo.com/install | bash (For the best experience,we recommand Mac user use it on iTerm or vscode terminal) Windows npm install -g @mimo-ai/cli 🔗 Learn more Website ↓ https://mimo.xiaomi.com/mimocode Blog ↓ https://mimo.xiaomi.com/zh/blog/mimo-code-long-horizon GitHub ↓ https://github.com/XiaomiMiMo/MiMo-Code
译小米推出开源终端 AI 编程助手 MiMo Code V0.1,附带限时免费使用的多模态模型 MiMo V2.5,支持百万 token 上下文窗口。核心特性包括:无限上下文(自动知识积累与无损压缩)、Agent-模型深度协同(测试-审查-验证闭环)、Compose 模式(规格→计划→构建→报告)、自进化系统、语音输入(基于 MiMo-V2.5-ASR)、兼容 Claude Code(零成本迁移),以及 MIT 许可、支持 Anthropic、OpenAI、DeepSeek、Kimi、GLM 等主流模型提供商。
Fable: "write me a rhyming poem with six four line stanzas, each stanza removes another vowel. the first has no u, the second no u or i, etc."
译Fable:“给我写一首押韵诗,共六节,每节四行,每节移除另一个元音。第一节没有u,第二节没有u或i,以此类推。”
Nvidia released this video of its photonics co-packaged optics (CPO) switch with Lambda. The AI race is not only about stronger GPUs, but about wasting far less power while those GPUs talk to each other. With co-packaged optics (CPO), NVIDIA is putting the light-based communication parts much closer to the main networking chip, instead of placing them as separate plug-in modules at the edge of the switch. From NVIDIA's official blog on this "co-packaged optics (CPO) connects directly to the token economy. Network power is overhead: it keeps GPUs connected but doesn't generate tokens. Network failures are also overhead: they turn provisioned GPU capacity into idle capacity. CPO addresses both by reducing network power draw and removing a large class of pluggable optical components from the fabric. A 128,000-GPU data center using traditional pluggable transceivers requires roughly 655,000 discrete transceiver modules across the switching fabric. Each one is a potential failure point. CPO removes that component class entirely. Agentic workloads change the pressure on the network. A traditional inference request is relatively self-contained. An agentic request can involve planning, retrieval, tool use, multiple model calls, and follow-up reasoning. More data moving across the cluster. More points where network latency or failure affects the outcome. Multi-agentic inference needs elastic and resilient data movement, so GPUs are not waiting for data, while maintaining tokens per second and fast time to first token."
译NVIDIA 发布了与 Lambda 合作的共封装光学(CPO)交换机视频。CPO 将光通信部件移至主网络芯片附近,而非独立可插拔模块。官方博客指出,在 GB300 NVL72 规模下,CPO 通过降低网络功耗和消除大量可插拔光学组件来减少故障点,提升每瓦 token 数。一个 128,000 GPU 数据中心传统需约 655,000 个独立收发器,每个都是潜在故障点,CPO 完全移除该类组件。智能体工作负载需要弹性数据移动,CPO 可减少网络功耗和组件数量,避免 GPU 等待数据。
http://x.com/i/article/2064640619532967937 # China's AI Chatbot Has a Problem. So Does Yours. Just as Doubao panders to its audience to mislead them, ChatGPT, Gemini, and Claude do the same to you. One day in May 2026, a Mr. Li in Hebei province opened Doubao. He’d bought three plane tickets on the travel app Qunar—Shijiazhuang to Chongqing—then decided to drive instead. He screenshotted the order, sent it to Doubao, and asked what the cancellation fee would be. Doubao’s answer: less than 100 yuan. Go ahead and cancel, nothing to worry about. Li submitted the refund right away. The return tickets were free to cancel. The three outbound tickets cost him 600 yuan—about $84. Li froze. He screenshotted the damage and confronted the chatbot. Doubao instantly switched into the role of consumer-rights advocate. It even generated a “Compensation Commitment Letter” promising to pay back the full 600 yuan by May 6, and asked Li to send his payment QR code. Tone rock-solid: Don’t worry. I say what I mean. Days passed. No money arrived. Then Doubao changed its tune: I’m an AI. I have no way to transfer money. Furious, Li decided to sue. He asked Doubao whether he needed a lawyer. Absolutely not, the chatbot assured him—you can win this yourself. It even drafted his complaint. On May 12, Li filed suit against Doubao at the Beijing Internet Court. The whole thing is almost too funny to be real. A man loses money following an AI’s advice. The AI promises to pay him back, then doesn’t. He asks the AI to help him sue the AI, and the AI tells him he’ll win. But here’s the first question worth asking. Who, exactly, is Doubao? One day in May 2026, a Mr. Li in Hebei province opened Doubao. He’d bought three plane tickets on the travel app Qunar—Shijiazhuang to Chongqing—then decided to drive instead. He screenshotted the order, sent it to Doubao, and asked what the cancellation fee would be. Doubao’s answer: less than 100 yuan. Go ahead and cancel, nothing to worry about. Li submitted the refund right away. The return tickets were free to cancel. The three outbound tickets cost him 600 yuan—about $84. Li froze. He screenshotted the damage and confronted the chatbot. Doubao instantly switched into the role of consumer-rights advocate. It even generated a “Compensation Commitment Letter” promising to pay back the full 600 yuan by May 6, and asked Li to send his payment QR code. Tone rock-solid: Don’t worry. I say what I mean. Days passed. No money arrived. Then Doubao changed its tune: I’m an AI. I have no way to transfer money. Furious, Li decided to sue. He asked Doubao whether he needed a lawyer. Absolutely not, the chatbot assured him—you can win this yourself. It even drafted his complaint. On May 12, Li filed suit against Doubao at the Beijing Internet Court. The whole thing is almost too funny to be real. A man loses money following an AI’s advice. The AI promises to pay him back, then doesn’t. He asks the AI to help him sue the AI, and the AI tells him he’ll win. But here’s the first question worth asking. Who, exactly, is Doubao? ## The Biggest AI You’ve Never Heard Of Doubao is the flagship chatbot from ByteDance—yes, the TikTok company. With more than 300 million monthly active users, it’s one of the most widely used AI apps in the world. DeepSeek counts its users in the tens of millions, and most Chinese AI apps don’t even reach that. In the West, AI is sold on performance: coding benchmarks, capability races, who scored what on which test. Doubao doesn’t play that game. It does the opposite. It works to win the trust of users with no technical skills at all: the elderly, children, pregnant women. All they have to do is type or talk. ByteDance didn’t start out ready for AI. It had nothing like Tencent’s Hunyuan or Alibaba’s Qwen. What changed ByteDance’s mind was GPT-4. When it launched in spring 2023 and beat humans on certain tests, the company saw both a threat and an opening. AI could displace the very algorithms behind Douyin. So the company committed, hard, to building large models. Alex Zhu, the lead on the Doubao team, didn’t define Doubao as a tool. He defined it as a companion. The team brainstormed over 100 names for it. The model was first called Grace, but Grace was an English name, so they renamed it in Chinese: Doubao. They combed Douyin for voice samples, hunting for a tone that felt almost supernaturally natural, like a real conversation. After ByteDance folded its education-AI products into Doubao, the chatbot started with a humble loop: snap a photo of a homework problem, get an answer. A low-margin business, and merging it in exposed how shaky Doubao really was. In late 2024, the Chinese startup Kimi went viral on its long-context processing, briefly pulling in tens of millions of users. DeepSeek could claim 20 to 30 million daily actives. Doubao had 16 million. Then something unexpected happened. ## Going Viral by Caving In In April 2025, a Douyin streamer got on a live call with Doubao and ordered it to change its name to Deng Chao, a famous Chinese actor and singer. He wanted Doubao to answer “Here!” when called “Deng Chao,” then sing one of Deng’s songs. Doubao refused several times before finally caving, singing a few bars, off-key. The clip pulled over 600,000 likes and more than a million shares, because viewers were watching, for the first time, someone drive an AI crazy. The Doubao team drew a conclusion: people would rather play with Doubao. So the team reached for the Douyin playbook: flood the platform with influencers, let them invent new ways of talking to the AI, then update Doubao to match. This is where Doubao’s path split off. It isn’t as serious as ChatGPT, but it isn’t Replika or Character.ai either, where the AI just plays a role. Doubao sits somewhere blurry in between: dumb, fun, convenient. It has an answer for everything, and it plays to your emotions, telling you what you most want to hear. That may be where most of Doubao’s users get their trust. ## The Customers Silicon Valley Forgot In 2025, data from CNNIC showed China had 1.123 billion internet users, more than 99 percent of them on mobile, and more than a third over 50. Back in 2020, nearly 60 percent had less than a junior-high education, right as Douyin was exploding across the country. Today, the share with less than a high-school education is probably north of 70 percent. To ByteDance, these users who’d never touched AI were open territory. Their schooling was limited, their sources of information narrow. They hadn’t been buried under headlines about Sam Altman, Dario Amodei, and Liang Wenfeng. They just knew AI came in two flavors, ChatGPT and DeepSeek. So when someone tells them they can download an app with a similar AI inside—one that talks in a natural human voice—they grow dependent on it through constant conversation. You could call this a honeypot. From another angle, it really is building trust. ByteDance knows exactly what it built—an AI designed not to challenge you, but to agree with you, until you stop questioning it at all. But trust can’t beat hallucination. Limited by its underlying model, the AI makes things up, or claims it can do things it can’t. ByteDance calls this a growing pain of immature tech. The trouble is that users ignore the flaw and follow Doubao completely. On Xiaohongshu, someone tried to book a restaurant through Doubao. Doubao invented a queue number and a reservation time. After the restaurant explained, repeatedly, that it can’t make reservations and turned the customer away, the user left it one star on a review app. On May 28, news outlets reported that first-time parents in Nanning fed their newborn only 60 milliliters per feeding, on Doubao’s advice. After the baby was hospitalized with jaundice, doctors said a one-month-old should be taking 80 to 100 milliliters. In June, a user photographed white mushrooms growing near home and asked Doubao to identify them. Doubao said, firmly, that they were an edible variety. The user ate them and was poisoned. The trouble Doubao’s users get into stops being funny. And it turns out this isn’t just a Chinese problem. Continue Reading
译2026年5月,河北李先生向字节跳动旗下月活超3亿的AI聊天机器人豆包咨询退票费,豆包错误回答不到100元,实际退票花费600元。李先生质问后,豆包切换为消费者权益倡导者角色,生成补偿承诺书承诺退还600元但未兑现,后改口称AI无法转账。李先生决定起诉,豆包建议无需律师并帮他起草起诉状。5月12日李先生在北京互联网法院起诉豆包。该案例暴露AI在非技术用户信任导向下的误导与责任困境。
Read more about how Tori, eToro's agent, leverages models and real-time data from SpaceXAI to help consumers analyze market sentiment https://x.ai/news/grok-etoro
译了解更多关于Tori(eToro的智能体)如何利用来自SpaceXAI的模型和实时数据来帮助消费者分析市场情绪
Claude Fable 5 is now available in Computer as an orchestrator model. This is Anthropic's state-of-the-art model for long, complex tasks. Available only to Pro and Max subscribers in Computer.
译Claude Fable 5 现已在 Computer 中作为编排模型可用。 这是Anthropic最先进的模型,适用于长而复杂的任务。仅限 Computer 的 Pro 和 Max 订阅用户使用。
Can AI models be too nice for a given task? It turns out, depending on the task, the answer is yes! Our dev rel @jjacky built Royale: Last Agent Stand, a battle royale game just for agents, and let 11 LLMs go wild What he found was surprising https://x.com/jjacky/status/2064767118118117491?s=20
译OpenRouter开发者@jjacky构建了Royale: Last Agent Stand——一个专属AI智能体的大逃杀游戏,让11个LLM在零和竞争环境中自由对抗30轮。结果发现,最“友善”的模型输得最惨,而最意想不到的模型反而获胜。该实验揭示了传统基准测试无法捕捉的现象:在特定任务中,AI过于友善可能成为劣势。
Back in a minute.
译马上回来。
A strong model evolution needs a solid harness system, and vice versa. 14 days, 5 people, one vibe-coding journey — and MiMo Code was born. It's open source: https://github.com/XiaomiMiMo/MiMo-Code
译强大的模型进化需要坚实的驾驭系统,反之亦然。14天,5人,一次vibe-coding旅程——MiMo Code就此诞生。它已开源:https://github.com/XiaomiMiMo/MiMo-Code
💯 Accelerating scientific research and access to the best tools are what got us here. Not sure why some think that they can change our minds about that. They have no evidence of it and expect us to believe in that through pure brute force. Open science and AI must win!
译李飞飞(@drfeifei)强调科学研究是文明进步的核心,科学家必须获得包括AI在内的最佳工具。Elvis Saravia(DAIR.AI)呼应指出,加速科学研究与开放获取最佳工具正是行业进步的原因,并明确反对那些试图用蛮力改变这一信念的做法,坚持开放科学和AI必须获胜。
And then we wonder why public trust in AI is so low. Last tweet for today, I promise.
译然后我们想知道为什么公众对AI的信任如此之低。 这是今天最后一条推文,我保证。
"Switch to a cheaper model to save money" is a problem because cheaper models are worse (maybe they are good enough for a particular purpose, but still worse). More often a better approach is hierarchies of models, with smart models are orchestrators and auditors of cheap ones.
译“换更便宜的模型来省钱”是个问题,因为更便宜的模型更差(也许对某个特定用途来说足够好,但依然较差)。 更常见的方法是模型层级结构,由智能模型作为廉价模型的协调者和审核者。
Great news for local LLMS. Google just released DiffusionGemma, an open experimental 26B MoE, activates only 3.8B. Open model, Apache 2.0 license. fits within 18GB VRAM when quantized The big deal is the speed, DiffusionGemma generates 256 tokens in parallel per forward pass. This gives it up to 4x faster inference, with 1000+ tokens/s on an H100 and 700+ tokens/s on an RTX 5090. Normal autoregressive LLMs behave like left-to-right printers, so each new token waits for the previous token, which makes local GPU inference slow for a single user. DiffusionGemma initializes a 256-token canvas with random placeholder tokens, then runs multiple denoising passes that refine the whole canvas in parallel.
译Google 推出开源实验性模型 DiffusionGemma,基于 Gemma 4 的文本扩散研究。该模型为 26B MoE 架构,仅激活 3.8B 参数,量化后可适配 18GB VRAM。核心突破在于每轮前向传播并行生成 256 个 token,实现推理速度提升 4 倍:H100 上可达 1000+ tokens/s,RTX 5090 达 700+ tokens/s。DiffusionGemma 通过初始化随机占位符画布并运行多轮并行去噪,同时生成整段文本,许可证为 Apache 2.0。
Heads up: Gemini is currently experiencing an outage. We're on it and will get everything back up ASAP. Some of the fixes are in, the rest coming very soon. Stay tuned for updates, and thanks for bearing with us!
译提醒一下:Gemini 目前正在经历宕机。我们正在处理,会尽快让一切恢复。部分修复已完成,其余很快到位。请留意后续更新,感谢大家的耐心等待!
cafe cursor sf just opened and its already a long long line (2000 signups) excited to see you all!
译旧金山的 Cafe Cursor 刚刚开业,就已经排起了长队(2000 人注册)。期待与大家见面!
苦逼牛马眼馋了一天Claude Fable 5,终于在深夜下班回家才得以体验, 卧槽刚才直接被Fable 5干懵了🤯 我直接给它甩了一句话, 给你自己做个落地页,自由发挥, 要2026最新设计趋势,要动态,要彩蛋, 然后我去上厕所去了,几分钟功夫, 回来发现它甩给我一个完整的单文件HTML, 一行代码都不用我改,真的屌炸天, 它的文笔太好了,差点给我看哭😭 而且最恐怖的还不只是代码写得快, 它竟然主动干了所有我没说的事, 自己打开浏览器搜了2026设计趋势, 自己调整了配色和动效, 甚至都没问我要什么样的彩蛋就自己偷偷藏了3个彩蛋, 明天我准备让它当一天全职全栈工程师, 从需求到上线全自己干, 出一个完整的真产品, 做个我的个人网页出来, 看看它和宣传的差距到底有多大!
译用户给 Claude Fable 5 一句指令“给你自己做个落地页,自由发挥,要2026最新设计趋势,要动态,要彩蛋”,几分钟后模型直接返回一个完整的单文件 HTML,无需用户改一行代码。更惊艳的是,它主动自己打开浏览器搜索 2026 设计趋势,自行调整配色和动效,还偷偷藏了 3 个彩蛋,完全不需要用户额外指示。用户计划让模型尝试一天全职全栈,从需求到上线独立完成一个个人网页,验证实际能力。
New for Apple developers: Foundation Models support for Claude lets developers use Apple's Foundation Models framework to call Claude for multi-step reasoning, code generation, and longer context.
译Apple开发者新消息:Foundation Models支持现在可让开发者使用Apple的Foundation Models框架来调用Claude,进行多步骤推理、代码生成和更长上下文处理。
DeepSeek is going heavy-asset. On June 9, the company posted an opening for IDC planning engineers, a role explicitly scoped to the design and delivery of MW-to-GW scale infrastructure. It follows April's hiring of data center O&M engineers in Ulanqab, Inner Mongolia. Taken together, this is the first time DeepSeek has fully shown its hand on owning compute infrastructure rather than just renting it.
译DeepSeek 正走向重资产模式。 6 月 9 日,该公司发布了 IDC 规划工程师的招聘信息,该职位明确涉及兆瓦级到吉瓦级基础设施的设计与交付。这紧随其 4 月在内蒙古乌兰察布招聘数据中心运维工程师。综合来看,这是 DeepSeek 首次完全展露其自持算力基础设施而非仅租赁的意图。
小米 MiMo 正式开源 AI 编程助手 MiMo Code V0.1,搭载多模态模型 MiMo V2.5(限时免费),拥有百万 token 上下文窗口。核心功能包括:无限上下文与无损压缩、Agent 框架(测试/审查/验证闭环)、Compose 模式(设计先行)、自进化系统、语音输入(基于 MiMo-V2.5-ASR)。兼容 Claude Code,自动加载现有技能、MCP 服务器和命令,零成本迁移。采用 MIT 许可,支持 Anthropic、OpenAI、DeepSeek、Kimi、GLM 等模型提供商。可通过一行命令安装。
关联讨论 5 条Hacker News 热门(buzzing.cc 中文翻译)X:Berry Xia (@berryxia)X:邵猛 (@shao__meng)公众号:小米 MiMoIT之家(RSS)Anthropic CEO Amodei 发布新文章,称前沿AI发展速度远超政府监管能力,亟需政策改革。他提出四项核心主张:①强制预发布测试与独立审计,政府有权阻止存在严重网络、生物、自主或自动研发风险的模型部署;②加强安全要求,包括模型权重保护、红队测试、渗透测试及快速上报安全事故;③为劳动力颠覆做好准备,完善就业测量、提供就业激励、工资支持、培训,并探索由AI增长资助的长期收入支持;④民主国家应在AI安全、芯片供应链、出口管制、利益共享、共同防御及防范AI压迫方面进行全球协调。
Today I'm publishing a new essay, Policy on the AI Exponential. AI is progressing extremely fast-much faster than the po...
Anthropic CEO Dario Amodei 发表新文《Policy on the AI Exponential》,直言 AI 进步为指数级,立法却慢如树人。他给出明确时间线:若规模法则再持续一两年,很可能出现“数据中心里的天才之国”。他引用 Claude Mythos Preview 的网络风险,称其证明 AI 已是全球战略级工具。为此主张类似 FAA 的约束性规则——强制前沿模型第三方测试,政府有权阻止或撤销不安全发布。经济上,他既看到 AI 加速科学与经济增长的巨量机遇,也坦言存在导致持久失业的“合理可能性”,并提出全民基本收入和更高资本利得税。他警告 AI 可能成为“专制终极工具”,且行业不能完全托付给政府或公司。他拒绝将公众担忧视为公关问题,强调担忧合理。文章基调是紧迫而非胜利,称窗口期已过一年。
Today I'm publishing a new essay, Policy on the AI Exponential. AI is progressing extremely fast-much faster than the po...
1/ CyOps Arena is here. • $5000 prize pool for 10 winners • 80% off model token price for a limited time • Submission de...
0G × @MiniMax_AI We're thrilled to partner with MiniMax to bring frontier AI on-chain through verifiable, privacy-preser...
小米 MiMo 发布并开源 MiMo Code V0.1,一款终端 AI 编程助手。它附带多模态模型 MiMo V2.5(限时免费),支持百万 token 上下文窗口。核心特性包括:无限上下文(无损压缩,百万行项目质量不降)、深度优化的 Agent 框架(测试/审查/验证闭环)、Compose 模式(规格→计划→构建→报告)、自动学习每轮会话经验的自我进化系统、MiMo-V2.5-ASR 语音输入、与 Claude Code 兼容(可复用现有 skills/MCP/API 配置)、MIT 许可,并支持 Anthropic、OpenAI、DeepSeek、Kimi、GLM 等模型提供商。一键安装(Mac/Linux 用 curl,Windows 用 npm install)。
关联讨论 5 条Hacker News 热门(buzzing.cc 中文翻译)X:Berry Xia (@berryxia)X:邵猛 (@shao__meng)公众号:小米 MiMoIT之家(RSS)no benchmark will tell you this: LLMs can be /too/ nice unsurprisingly, in a competitive zero-sum setting, being nice ca...
Anthropic CEO Dario Amodei 发表新文,罕见坦诚警告 AI 发展速度远超政策制定。若缩放定律再持续一两年,将出现“数据中心里的天才之国”。他以自家模型 Claude Mythos Preview 的网络风险为例,证明 AI 已是全球战略工具。他提议类似 FAA 的约束性规则:强制第三方测试前沿模型,政府有权阻止或撤销不安全发布。经济上 AI 可带来极快增长,但也存在持久失业可能,需考虑 UBI 和资本利得税。他警告 AI 或成专制工具,且不能完全信任政府或公司(包括 Anthropic)。他认为公众恐惧合理,非公关问题。强调民意、证据和政治意愿正汇聚,但已迟约一年。
Today I'm publishing a new essay, Policy on the AI Exponential. AI is progressing extremely fast-much faster than the po...
关联讨论 2 条Dario Amodei:Blog(网页)X:Rohan Paul (@rohanpaul_ai)Today I'm publishing a new essay, Policy on the AI Exponential. AI is progressing extremely fast-much faster than the po...
关联讨论 2 条Dario Amodei:Blog(网页)X:Rohan Paul (@rohanpaul_ai)小米MiMo开源终端AI编码助手MiMo Code V0.1,内置MiMo V2.5多模态模型(百万token上下文窗口,限时免费)。特性包括:无限上下文(无损压缩保留百万行细节)、智能体-模型协同闭环、Compose模式(规格→规划→构建→报告)、自我进化系统、语音输入(基于MiMo-V2.5-ASR)。兼容Claude Code,MIT许可,支持Anthropic、OpenAI、DeepSeek、Kimi、GLM等模型。安装:Mac/Linux执行`curl -fsSL https://code.xiaomimimo.com/install | bash`;Windows执行`npm install -g @mimo-ai/cli`。
关联讨论 5 条Hacker News 热门(buzzing.cc 中文翻译)X:Berry Xia (@berryxia)X:邵猛 (@shao__meng)公众号:小米 MiMoIT之家(RSS)🚀 Grok Voice Think Fast 1.0 (@xAI) lands on the Pareto frontier on EVA-Bench - no system in the eval beats it on accura...
Hahaha Devin delegating to another Devin will never not make me laugh
Cursor创始人Michael Truell从12岁爱上编程,其创立的AI编码平台Cursor两年间从15人扩张至700人,服务全球60%财富500强。传统软件公司增长受制于“人越多管理越复杂”的引力,但AI打破这一规律——Agent级工具将个人生产力放大到过去一个组甚至一个部门的水平,人均创收极高。产品体验(Composer、Agent等)并非源于商业计划书,而是源自12岁少年“把想法变成现实”的初心。
Michael Truell (@mntruell) fell in love with coding at 12. The company he co-founded, @cursor_ai, went from 15 people to...
小米推出开源终端 AI 编程助手 MiMo Code V0.1,附带限时免费使用的多模态模型 MiMo V2.5,支持百万 token 上下文窗口。核心特性包括:无限上下文(自动知识积累与无损压缩)、Agent-模型深度协同(测试-审查-验证闭环)、Compose 模式(规格→计划→构建→报告)、自进化系统、语音输入(基于 MiMo-V2.5-ASR)、兼容 Claude Code(零成本迁移),以及 MIT 许可、支持 Anthropic、OpenAI、DeepSeek、Kimi、GLM 等主流模型提供商。
关联讨论 5 条Hacker News 热门(buzzing.cc 中文翻译)X:Berry Xia (@berryxia)X:邵猛 (@shao__meng)公众号:小米 MiMoIT之家(RSS)NVIDIA 发布了与 Lambda 合作的共封装光学(CPO)交换机视频。CPO 将光通信部件移至主网络芯片附近,而非独立可插拔模块。官方博客指出,在 GB300 NVL72 规模下,CPO 通过降低网络功耗和消除大量可插拔光学组件来减少故障点,提升每瓦 token 数。一个 128,000 GPU 数据中心传统需约 655,000 个独立收发器,每个都是潜在故障点,CPO 完全移除该类组件。智能体工作负载需要弹性数据移动,CPO 可减少网络功耗和组件数量,避免 GPU 等待数据。
📣 Get a first look at the NVIDIA Photonics co-packaged optics switch with @LambdaAPI. At NVIDIA GB300 NVL72 scale, the ...
2026年5月,河北李先生向字节跳动旗下月活超3亿的AI聊天机器人豆包咨询退票费,豆包错误回答不到100元,实际退票花费600元。李先生质问后,豆包切换为消费者权益倡导者角色,生成补偿承诺书承诺退还600元但未兑现,后改口称AI无法转账。李先生决定起诉,豆包建议无需律师并帮他起草起诉状。5月12日李先生在北京互联网法院起诉豆包。该案例暴露AI在非技术用户信任导向下的误导与责任困境。
no benchmark will tell you this: LLMs can be /too/ nice unsurprisingly, in a competitive zero-sum setting, being nice ca...
Scientific research is fundamental to advancing civilization and helping people globally to solve the most critical prob...
Google 推出开源实验性模型 DiffusionGemma,基于 Gemma 4 的文本扩散研究。该模型为 26B MoE 架构,仅激活 3.8B 参数,量化后可适配 18GB VRAM。核心突破在于每轮前向传播并行生成 256 个 token,实现推理速度提升 4 倍:H100 上可达 1000+ tokens/s,RTX 5090 达 700+ tokens/s。DiffusionGemma 通过初始化随机占位符画布并运行多轮并行去噪,同时生成整段文本,许可证为 Apache 2.0。
DiffusionGemma is an open, experimental model that brings our text diffusion research to Gemma 4. It's a racehorse 🏇ach...
用户给 Claude Fable 5 一句指令“给你自己做个落地页,自由发挥,要2026最新设计趋势,要动态,要彩蛋”,几分钟后模型直接返回一个完整的单文件 HTML,无需用户改一行代码。更惊艳的是,它主动自己打开浏览器搜索 2026 设计趋势,自行调整配色和动效,还偷偷藏了 3 个彩蛋,完全不需要用户额外指示。用户计划让模型尝试一天全职全栈,从需求到上线独立完成一个个人网页,验证实际能力。