总体来说: “你不应该再提示编码智能体了。你应该设计循环来提示你的智能体。” 循环是临时解决方案:如今的大语言模型判断力很差。它们难以判断何时继续、何时停止或何时调用工具。循环强制智能体更长时间地工作。 对于目前可验证的目标,循环非常强大,正如AutoResearch所示。
On the whole:
"You shouldn't be prompting coding agents anymore. You should be designing loops that prompt your agents."
Loops are the temporary workaround: today's LLMs have poor judgment. They struggle to know when to keep going, when to stop, or when to call a tool. Loops force agents to work longer.
Loops are incredibly powerful for verifiable goals for now, as AutoResearch shows.