商汤把 SenseNova-U1 的训练代码全量开源,支持多模态任务和 MoE,还给了完整的并行策略,做多模态训练的可以直接 fork 过去用,Apache-2.0 很友好。
商汤开源了SenseNova-U1(8B dense + A3B MoE)的完整训练代码库。这是一个统一的框架,支持文本到图像、图像编辑、交错生成、文本与视觉理解等多种多模态任务的训练。其设计注重实用性与大规模训练,采用混合并行、流式可恢复数据管道、环境变量配置、解耦模块化设计,并支持从1×8 GPU扩展到多节点集群的规模。代码库以Apache-2.0协议开源。
🚀 We've open-sourced the full training codebase for SenseNova-U1 (8B dense + A3B MoE).
One stack for training multimodal tasks across: text-to-image · editing · interleaved generation · text & vision understanding.
Built for practical large-scale training: ⚙ Hybrid WP/TP/PP + ISP parallelism 🌊 Streaming, resumable, packed data pipeline 🎛 Env-var driven configs for easy experimentation 🧱 Decoupled backbone, data, and objective modules 📈 Scales from 1×8 GPUs to multi-node clusters
Apache-2.0 👇 https://github.com/OpenSenseNova/SenseNova-U1 Discord: https://discord.gg/BuTXPHmQub
@GitHub