48
AI 摘要
Anthropic 在遭受强烈反对后,撤销了 Claude Fable 5 针对竞争 AI 研究人员秘密降低性能的政策。该公司向 WIRED 表示将修改前沿 LLM 开发的安全措施,使其透明可见,并致歉称做出了错误的权衡。AI 研究员 Nathan Lambert 赞扬 Anthropic 的快速行动,认为他们不会在不告知用户的情况下悄悄降级性能。
Props to Anthropic for quick action here. I'm okay with this outcome.
Some people may, but I don't think they'd silently degrade performance without telling users.
NEW: Anthropic is walking back Claude Fable 5's policy to covertly degrade performance for competing AI researchers, after facing fierce backlash. "We're changi...