v0.12.3

Latest

jerryli1981 released this 31 Oct 08:26

· 4 commits to main since this release

v0.12.3

4f7ae50

--支持Qwen3-VL系列模型使用Mcore进行微调。
--支持Qwen3-Next-80B-A3B使用Chatlearn进行强化学习。
--通过上下文并行(Context Parallel)与序列打包(Sequence Packing)提升Moonlight/DeepSeek-V3等MLA模型的强化学习训练稳定性和效率。
--修复已知的issues。

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.12.3

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!