OpenSenseNova开源了SenseNova-U1的完整训练代码库,支持其8B密集模型与A3B MoE架构。该代码库使用一个统一的框架,可同时训练多种多模态任务,包括文本到图像生成、图像编辑、交错生成及文本与视觉理解。工程上为大规模训练设计,支持混合并行、流式可恢复数据管线、环境变量驱动配置以及从1×8 GPUs到多节点集群的扩展能力。代码已在GitHub开源,采用Apache-2.0协议。
🚀 We've open-sourced the full training codebase for SenseNova-U1 (8B dense + A3B MoE).
One stack for training multimodal tasks across: text-to-image · editing · interleaved generation · text & vision understanding.
Built for practical large-scale training:
⚙ Hybrid WP/TP/PP + ISP parallelism 🌊 Streaming, resumable, packed data pipeline 🎛 Env-var driven configs for easy experimentation 🧱 Decoupled backbone, data, and objective modules 📈 Scales from 1×8 GPUs to multi-node clusters
Apache-2.0 👇 https://github.com/OpenSenseNova/SenseNova-U1 Discord: https://discord.gg/BuTXPHmQub
@github