Anthropic呼吁全球减缓前沿AI：其模型可能接近递归自我改进

Rohan Paul@rohanpaul_ai

2026-06-05 20:30·9天前

AI 摘要

Anthropic公开呼吁全球采取行动减缓前沿AI发展，因其Claude模型可能接近递归自我改进（系统无需人类控制即帮助构建更强版本）。目前尚未发生，但跳跃可能突然到来，且AI训练运行比武器库更难隐藏。Claude现已编写超80%合并生产代码，工程师产出达2024年基线8倍；可靠任务长度每4个月翻倍，Mythos Preview可连续工作超16小时；训练代码加速从3x跃至52x（人类仅4x）。剩余人类优势仅剩研究判断力。Anthropic估值约1万亿美元，年化收入或达500亿美元，与OpenAI激烈竞争。

Anthropic just called for a global way to slow frontier AI because its own models may be approaching recursive self-improvement， where a system helps build a stronger version of itself without direct human control.

Future models will become so good at research， experiments， debugging， and training design that humans will stop being the main bottleneck.

Once that loop starts， progress could shift from human-paced engineering to machine-assisted improvement， which makes every safety test， law， and lab policy feel late by default.

Anthropic says this has not happened yet， but warns that the jump may arrive before governments， companies， and researchers have a trusted way to measure or restrain it.

The hard part is verification， because a huge AI training run is easier to hide than a weapons site， and any lab that secretly keeps training while others pause could gain the lead.

Anthropic is now ~$1T， may reach $50B annualized revenue， and competes fiercely with OpenAI， so every safety claim also lands inside a giant business fight.

---

anthropic .com/institute/recursive-self-improvement

Rohan PaulAnthropic just disclosed that Claude now writes more than 80% of the production code it merges. Before Claude Code reached research preview in 02-25, Claude wro...

Anthropic安全/对齐推理政策/监管

在 X 查看原推

Rohan Paul@rohanpaul_ai · X