Cursor 正用当前版 Composer 训练下一代，形成递归自我改进循环

Lee Robinson@leerob

2026-06-11 21:50·3天前

AI 摘要

Cursor 正用当前版 Composer 训练下一代 Composer，形成递归自我改进循环。训练大型模型需要大量 RL 数据（模型通过“游戏”提升能力），新模型能自动配置开发环境（如自动安装依赖、修复故障）。Composer 2 在环境配置能力上显著优于版本 1，模型越强，越擅长创造训练其继任者的条件。Cursor 的 autoinstall 系统让前代 Composer 设置 RL 训练环境，使下一代专注于解决更难题，每一代都解锁先前版本不具备的能力。

We're training the next version of Composer… with Composer！

The model is always learning from itself. This kind of "recursive self-improvement" might sound new， but it's been happening for many months！

For example， training big models requires creating *lots* of data for RL - essentially games the model plays to improve at any task you can grade.

The newest models can configure their own environments to make those games playable （auto-installing dependencies， fixing broken setups）.

Composer 2 was *dramatically* better at this than version 1. So the better the model gets， the better it gets at creating the conditions to train its successor.

Each generation unlocks capabilities the previous one didn't have！ So cool.

https：//x.com/cursor_ai/status/2052116064474161556

CursorWe use previous generations of Composer to train future ones. Our autoinstall system has earlier Composer models set up dev environments for RL training. That w...

智能体大佬观点编码

在 X 查看原推

Lee Robinson@leerob · X