Rohan Paul 今日简报要点:Anthropic 终于公开了此前被认为“太危险”的 Claude AI 模型,但存在使用限制;Cognition 推出 FrontierCode 编程基准,用于评估 AI 代码是否达到可合并维护的水平;Claude Fable 5 的隐形限制是不能用于高级 AI 研究;Anthropic 新研究显示 AI 智能体在代码领域表现亮眼,但在生物任务中可能连科学探索第一步都无法完成;此外,Claude Code 团队成员 Thariq 给出了最大化利用 Claude Code 的实用建议。
Today's edition of my newsletter just went out.
🔗 https://www.rohan-paul.com/p/anthropic-finally-released-claude
🗞️ Claude's 'too dangerous' AI model is finally public. But there's a catch
🗞️ Cognition is introducing FrontierCode, a coding benchmark built to test whether AI code is good enough for a real maintainer to merge, not just whether it passes tests.
🗞️ This is the silent limiter on Claude Fable 5 - It cannot be used for really advanced AI research stuff.
🗞️ New Anthropic research shows AI agents may look brilliant at code, but in biology they can fail before the science starts.
🗞️ Very useful recommendation for pushing Claude Code to its full potential. by Thariq, from Claude Code team.