Fix multi‑turn loss masking, clarify qwen25/qwen3 types, and strengthen mask tests by Daucloud · Pull Request #1418 · THUDM/slime

Daucloud · 2026-01-14T11:01:26Z

Summary

Fixed gen_multi_turn_loss_mask_distill_qwen to properly handle multi‑turn masking.
Renamed loss‑mask type from "qwen" to "qwen25" and set the default to "qwen3".
Enhanced loss‑mask correctness tests with additional assertions and scenarios.

Changes

slime/utils/mask_utils.py: corrected multi‑turn masking and adjusted type names.
slime/utils/arguments.py: updated loss‑mask type options and default value.
tests/utils/test_mask_utils.py: added more comprehensive correctness checks.

Testing

Ran python tests/utils/test_mask_utils.py.
Running results:

$ python tests/utils/test_mask_utils.py
AMEM [INFO] amem_nccl.cpp:x_init:685 groupID:0 pid:2560859 build:Nov 27 2025 09:15:06 NCCL plugin loaded. pause func:Off offload_free_tag:-1 cuMemEnabled:1
/usr/local/lib/python3.12/dist-packages/torch/cuda/__init__.py:63: FutureWarning: The pynvml package is deprecated. Please install nvidia-ml-py instead. If you did not install pynvml directly, please report this to the maintainers of the package that installed pynvml for you.
  import pynvml  # type: ignore[import]
==== Single Turn Test Qwen/Qwen3-0.6B (qwen3) ====
text =  ['<|im_start|>system\nSYSTEM MESSAGE FOR TESTING ONLY<|im_end|>\n<|im_start|>user\nUSER CONTENT FOR TESTING ONLY<|im_end|>\n<|im_start|>assistant\n<think>\n\n</think>\n\nASSISTANT RESPONSE FOR TESTING ONLY<|im_end|>\n']
token_ids =  [151644, 8948, 198, 46487, 40758, 4613, 95535, 26687, 151645, 198, 151644, 872, 198, 6448, 35768, 4613, 95535, 26687, 151645, 198, 151644, 77091, 198, 151667, 271, 151668, 271, 4939, 3846, 2821, 76173, 4613, 95535, 26687, 151645, 198]
loss_mask =  [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]
selected_texts =  ['<think>\n\n</think>\n\nASSISTANT RESPONSE FOR TESTING ONLY<|im_end|>\n']
==== Single Turn Test Qwen/Qwen2.5-0.5B-Instruct (qwen25) ====
text =  ['<|im_start|>system\nSYSTEM MESSAGE FOR TESTING ONLY<|im_end|>\n<|im_start|>user\nUSER CONTENT FOR TESTING ONLY<|im_end|>\n<|im_start|>assistant\nASSISTANT RESPONSE FOR TESTING ONLY<|im_end|>\n']
token_ids =  [151644, 8948, 198, 46487, 40758, 4613, 95535, 26687, 151645, 198, 151644, 872, 198, 6448, 35768, 4613, 95535, 26687, 151645, 198, 151644, 77091, 198, 4939, 3846, 2821, 76173, 4613, 95535, 26687, 151645, 198]
loss_mask =  [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1]
selected_texts =  ['ASSISTANT RESPONSE FOR TESTING ONLY<|im_end|>\n']
==== Single Turn Test deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B (distill_qwen) ====
text =  ['<｜begin▁of▁sentence｜>SYSTEM MESSAGE FOR TESTING<｜User｜>USER CONTENT FOR TESTING ONLY<｜Assistant｜>ASSISTANT RESPONSE FOR TESTING ONLY<｜end▁of▁sentence｜>']
token_ids =  [151646, 46487, 40758, 4613, 95535, 151644, 6448, 35768, 4613, 95535, 26687, 151645, 4939, 3846, 2821, 76173, 4613, 95535, 26687, 151643]
loss_mask =  [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1]
selected_texts =  ['ASSISTANT RESPONSE FOR TESTING ONLY<｜end▁of▁sentence｜>']
==== Multi-turn with Tools Test Qwen/Qwen3-0.6B (qwen3) ====
text =  ['<|im_start|>system\nSYSTEM MESSAGE FOR TESTING ONLY\n\n# Tools\n\nYou may call one or more functions to assist with the user query.\n\nYou are provided with function signatures within <tools></tools> XML tags:\n<tools>\n{"type": "function", "function": {"name": "terminal", "description": "Perform operations from the terminal.", "parameters": {"type": "object", "properties": {"command": {"type": "string", "description": "The bash command to execute as `bash -c <command>`"}, "description": {"type": "string", "description": "Brief description of the command for the user."}}, "required": ["command"]}}}\n{"type": "function", "function": {"name": "read_file", "description": "Read the content of a file given its path.", "parameters": {"type": "object", "properties": {"file_path": {"type": "string", "description": "The absolute path to the file to be read."}}, "required": ["file_path"]}}}\n</tools>\n\nFor each function call, return a json object with function name and arguments within <tool_call></tool_call> XML tags:\n<tool_call>\n{"name": <function-name>, "arguments": <args-json-object>}\n</tool_call><|im_end|>\n<|im_start|>user\nUSER CONTENT FOR TESTING ONLY<|im_end|>\n<|im_start|>assistant\n<think>\n\n</think>\n\nI WILL CALL terminal\n<tool_call>\n{"name": "terminal", "arguments": {"command": "ls"}}\n</tool_call>\n<tool_call>\n{"name": "terminal", "arguments": {"command": "ls"}}\n</tool_call><|im_end|>\n<|im_start|>user\n<tool_response>\nLICENSE  README.md  README_zh.md\n</tool_response><|im_end|>\n<|im_start|>user\n<tool_response>\nLICENSE  README.md  README_zh.md\n</tool_response><|im_end|>\n<|im_start|>assistant\n<think>\n\n</think>\n\nASSISTANT RESPONSE FOR TESTING ONLY<|im_end|>\n']
token_ids =  [151644, 8948, 198, 46487, 40758, 4613, 95535, 26687, 271, 2, 13852, 271, 2610, 1231, 1618, 825, 476, 803, 5746, 311, 7789, 448, 279, 1196, 3239, 382, 2610, 525, 3897, 448, 729, 32628, 2878, 366, 15918, 1472, 15918, 29, 11874, 9492, 510, 27, 15918, 397, 4913, 1313, 788, 330, 1688, 497, 330, 1688, 788, 5212, 606, 788, 330, 36327, 497, 330, 4684, 788, 330, 46951, 7525, 504, 279, 15022, 10465, 330, 13786, 788, 5212, 1313, 788, 330, 1700, 497, 330, 13193, 788, 5212, 5631, 788, 5212, 1313, 788, 330, 917, 497, 330, 4684, 788, 330, 785, 27023, 3210, 311, 9026, 438, 1565, 46216, 481, 66, 366, 5631, 43626, 14345, 330, 4684, 788, 5212, 1313, 788, 330, 917, 497, 330, 4684, 788, 330, 85984, 4008, 315, 279, 3210, 369, 279, 1196, 1189, 38154, 330, 6279, 788, 4383, 5631, 1341, 3417, 532, 4913, 1313, 788, 330, 1688, 497, 330, 1688, 788, 5212, 606, 788, 330, 878, 2458, 497, 330, 4684, 788, 330, 4418, 279, 2213, 315, 264, 1034, 2661, 1181, 1815, 10465, 330, 13786, 788, 5212, 1313, 788, 330, 1700, 497, 330, 13193, 788, 5212, 1192, 2638, 788, 5212, 1313, 788, 330, 917, 497, 330, 4684, 788, 330, 785, 10740, 1815, 311, 279, 1034, 311, 387, 1349, 1189, 38154, 330, 6279, 788, 4383, 1192, 2638, 1341, 3417, 532, 522, 15918, 1339, 2461, 1817, 729, 1618, 11, 470, 264, 2951, 1633, 448, 729, 829, 323, 5977, 2878, 220, 151657, 151658, 11874, 9492, 510, 151657, 198, 4913, 606, 788, 366, 1688, 11494, 8066, 330, 16370, 788, 366, 2116, 56080, 40432, 31296, 151658, 151645, 198, 151644, 872, 198, 6448, 35768, 4613, 95535, 26687, 151645, 198, 151644, 77091, 198, 151667, 271, 151668, 271, 40, 27915, 25752, 15022, 198, 151657, 198, 4913, 606, 788, 330, 36327, 497, 330, 16370, 788, 5212, 5631, 788, 330, 4730, 95642, 151658, 198, 151657, 198, 4913, 606, 788, 330, 36327, 497, 330, 16370, 788, 5212, 5631, 788, 330, 4730, 95642, 151658, 151645, 198, 151644, 872, 198, 151665, 198, 64368, 220, 61945, 21324, 220, 61945, 6415, 71, 21324, 198, 151666, 151645, 198, 151644, 872, 198, 151665, 198, 64368, 220, 61945, 21324, 220, 61945, 6415, 71, 21324, 198, 151666, 151645, 198, 151644, 77091, 198, 151667, 271, 151668, 271, 4939, 3846, 2821, 76173, 4613, 95535, 26687, 151645, 198]
loss_mask =  [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]
selected_texts =  ['<think>\n\n</think>\n\nI WILL CALL terminal\n<tool_call>\n{"name": "terminal", "arguments": {"command": "ls"}}\n</tool_call>\n<tool_call>\n{"name": "terminal", "arguments": {"command": "ls"}}\n</tool_call><|im_end|>\n', '<think>\n\n</think>\n\nASSISTANT RESPONSE FOR TESTING ONLY<|im_end|>\n']
==== Multi-turn with Tools Test Qwen/Qwen2.5-0.5B-Instruct (qwen25) ====
text =  ['<|im_start|>system\nSYSTEM MESSAGE FOR TESTING ONLY\n\n# Tools\n\nYou may call one or more functions to assist with the user query.\n\nYou are provided with function signatures within <tools></tools> XML tags:\n<tools>\n{"type": "function", "function": {"name": "terminal", "description": "Perform operations from the terminal.", "parameters": {"type": "object", "properties": {"command": {"type": "string", "description": "The bash command to execute as `bash -c <command>`"}, "description": {"type": "string", "description": "Brief description of the command for the user."}}, "required": ["command"]}}}\n{"type": "function", "function": {"name": "read_file", "description": "Read the content of a file given its path.", "parameters": {"type": "object", "properties": {"file_path": {"type": "string", "description": "The absolute path to the file to be read."}}, "required": ["file_path"]}}}\n</tools>\n\nFor each function call, return a json object with function name and arguments within <tool_call></tool_call> XML tags:\n<tool_call>\n{"name": <function-name>, "arguments": <args-json-object>}\n</tool_call><|im_end|>\n<|im_start|>user\nUSER CONTENT FOR TESTING ONLY<|im_end|>\n<|im_start|>assistant\nI WILL CALL terminal\n<tool_call>\n{"name": "terminal", "arguments": {"command": "ls"}}\n</tool_call>\n<tool_call>\n{"name": "terminal", "arguments": {"command": "ls"}}\n</tool_call><|im_end|>\n<|im_start|>user\n<tool_response>\nLICENSE  README.md  README_zh.md\n</tool_response><|im_end|>\n<|im_start|>user\n<tool_response>\nLICENSE  README.md  README_zh.md\n</tool_response><|im_end|>\n<|im_start|>assistant\nASSISTANT RESPONSE FOR TESTING ONLY<|im_end|>\n']
token_ids =  [151644, 8948, 198, 46487, 40758, 4613, 95535, 26687, 271, 2, 13852, 271, 2610, 1231, 1618, 825, 476, 803, 5746, 311, 7789, 448, 279, 1196, 3239, 382, 2610, 525, 3897, 448, 729, 32628, 2878, 366, 15918, 1472, 15918, 29, 11874, 9492, 510, 27, 15918, 397, 4913, 1313, 788, 330, 1688, 497, 330, 1688, 788, 5212, 606, 788, 330, 36327, 497, 330, 4684, 788, 330, 46951, 7525, 504, 279, 15022, 10465, 330, 13786, 788, 5212, 1313, 788, 330, 1700, 497, 330, 13193, 788, 5212, 5631, 788, 5212, 1313, 788, 330, 917, 497, 330, 4684, 788, 330, 785, 27023, 3210, 311, 9026, 438, 1565, 46216, 481, 66, 366, 5631, 43626, 14345, 330, 4684, 788, 5212, 1313, 788, 330, 917, 497, 330, 4684, 788, 330, 85984, 4008, 315, 279, 3210, 369, 279, 1196, 1189, 38154, 330, 6279, 788, 4383, 5631, 1341, 3417, 532, 4913, 1313, 788, 330, 1688, 497, 330, 1688, 788, 5212, 606, 788, 330, 878, 2458, 497, 330, 4684, 788, 330, 4418, 279, 2213, 315, 264, 1034, 2661, 1181, 1815, 10465, 330, 13786, 788, 5212, 1313, 788, 330, 1700, 497, 330, 13193, 788, 5212, 1192, 2638, 788, 5212, 1313, 788, 330, 917, 497, 330, 4684, 788, 330, 785, 10740, 1815, 311, 279, 1034, 311, 387, 1349, 1189, 38154, 330, 6279, 788, 4383, 1192, 2638, 1341, 3417, 532, 522, 15918, 1339, 2461, 1817, 729, 1618, 11, 470, 264, 2951, 1633, 448, 729, 829, 323, 5977, 2878, 220, 151657, 151658, 11874, 9492, 510, 151657, 198, 4913, 606, 788, 366, 1688, 11494, 8066, 330, 16370, 788, 366, 2116, 56080, 40432, 31296, 151658, 151645, 198, 151644, 872, 198, 6448, 35768, 4613, 95535, 26687, 151645, 198, 151644, 77091, 198, 40, 27915, 25752, 15022, 198, 151657, 198, 4913, 606, 788, 330, 36327, 497, 330, 16370, 788, 5212, 5631, 788, 330, 4730, 95642, 151658, 198, 151657, 198, 4913, 606, 788, 330, 36327, 497, 330, 16370, 788, 5212, 5631, 788, 330, 4730, 95642, 151658, 151645, 198, 151644, 872, 198, 27, 14172, 9655, 397, 64368, 220, 61945, 21324, 220, 61945, 6415, 71, 21324, 198, 522, 14172, 9655, 29, 151645, 198, 151644, 872, 198, 27, 14172, 9655, 397, 64368, 220, 61945, 21324, 220, 61945, 6415, 71, 21324, 198, 522, 14172, 9655, 29, 151645, 198, 151644, 77091, 198, 4939, 3846, 2821, 76173, 4613, 95535, 26687, 151645, 198]
loss_mask =  [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1]
selected_texts =  ['I WILL CALL terminal\n<tool_call>\n{"name": "terminal", "arguments": {"command": "ls"}}\n</tool_call>\n<tool_call>\n{"name": "terminal", "arguments": {"command": "ls"}}\n</tool_call><|im_end|>\n', 'ASSISTANT RESPONSE FOR TESTING ONLY<|im_end|>\n']
==== Multi-turn with Tools Test deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B (distill_qwen) ====
text =  ['<｜begin▁of▁sentence｜>SYSTEM MESSAGE FOR TESTING<｜User｜>USER CONTENT FOR TESTING ONLY<｜Assistant｜><｜tool▁calls▁begin｜><｜tool▁call▁begin｜>function<｜tool▁sep｜>terminal\n```json\n{"command": "ls"}\n```<｜tool▁call▁end｜>\n<｜tool▁call▁begin｜>function<｜tool▁sep｜>terminal\n```json\n{"command": "ls"}\n```<｜tool▁call▁end｜><｜tool▁calls▁end｜><｜end▁of▁sentence｜><｜tool▁outputs▁begin｜><｜tool▁output▁begin｜>LICENSE  README.md  README_zh.md<｜tool▁output▁end｜><｜tool▁outputs▁end｜><｜tool▁outputs▁begin｜><｜tool▁output▁begin｜>LICENSE  README.md  README_zh.md<｜tool▁output▁end ｜><｜tool▁outputs▁end｜><｜Assistant｜>ASSISTANT RESPONSE FOR TESTING ONLY<｜end▁of▁sentence｜>']
token_ids =  [151646, 46487, 40758, 4613, 95535, 151644, 6448, 35768, 4613, 95535, 26687, 151645, 27, 130957, 14172, 10417, 223, 44620, 10417, 223, 7265, 130957, 1784, 130957, 14172, 10417, 223, 6659, 10417, 223, 7265, 130957, 29, 1688, 27, 130957, 14172, 10417, 223, 28036, 130957, 29, 36327, 198, 73594, 2236, 198, 4913, 5631, 788, 330, 4730, 16707, 73594, 27, 130957, 14172, 10417, 223, 6659, 10417, 223, 408, 130957, 397, 27, 130957, 14172, 10417, 223, 6659, 10417, 223, 7265, 130957, 29, 1688, 27, 130957, 14172, 10417, 223, 28036, 130957, 29, 36327, 198, 73594, 2236, 198, 4913, 5631, 788, 330, 4730, 16707, 73594, 27, 130957, 14172, 10417, 223, 6659, 10417, 223, 408, 130957, 1784, 130957, 14172, 10417, 223, 44620, 10417, 223, 408, 130957, 29, 151643, 27, 130957, 14172, 10417, 223, 41006, 10417, 223, 7265, 130957, 1784, 130957, 14172, 10417, 223, 3006, 10417, 223, 7265, 130957, 29, 64368, 220, 61945, 21324, 220, 61945, 6415, 71, 21324, 27, 130957, 14172, 10417, 223, 3006, 10417, 223, 408, 130957, 1784, 130957, 14172, 10417, 223, 41006, 10417, 223, 408, 130957, 29, 27, 130957, 14172, 10417, 223, 41006, 10417, 223, 7265, 130957, 1784, 130957, 14172, 10417, 223, 3006, 10417, 223, 7265, 130957, 29, 64368, 220, 61945, 21324, 220, 61945, 6415, 71, 21324, 27, 130957, 14172, 10417, 223, 3006, 10417, 223, 408, 130957, 1784, 130957, 14172, 10417, 223, 41006, 10417, 223, 408, 130957, 29, 151645, 4939, 3846, 2821, 76173, 4613, 95535, 26687, 151643]
loss_mask =  [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1]
selected_texts =  ['<｜tool▁calls▁begin｜><｜tool▁call▁begin｜>function<｜tool▁sep｜>terminal\n```json\n{"command": "ls"}\n```<｜tool▁call▁end｜>\n<｜tool▁call▁begin｜>function<｜tool▁sep｜>terminal\n```json\n{"command": "ls"}\n```<｜tool▁call▁end｜><｜tool▁calls▁end｜><｜end▁of▁sentence｜>', 'ASSISTANT RESPONSE FOR TESTING ONLY<｜end▁of▁sentence｜>']
AMEM [INFO] amem_nccl.cpp:x_fini:693 groupID:0 pid:2560859 exit

…t to qwen3

Surya-Gunukula · 2026-02-05T06:24:13Z

#1159, also implemented similar loss masking

Daucloud added 3 commits January 14, 2026 11:06

fix: gen_multi_turn_loss_mask_distill_qwen() is not multi-turn

882433b

refactor: rename the loss-mask-type qwen to qwen25 and set the defaul…

d57b42d

…t to qwen3

test: correctness check for loss masks

4f3e4c4

Daucloud force-pushed the fix/loss-mask branch from e191d5a to 4f3e4c4 Compare January 14, 2026 11:07

Daucloud added 2 commits January 15, 2026 19:34

test: assert the loss mask using ground_truth

43e1fb5

ci: test_loss_mask

98b3cec

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix multi‑turn loss masking, clarify qwen25/qwen3 types, and strengthen mask tests#1418

Fix multi‑turn loss masking, clarify qwen25/qwen3 types, and strengthen mask tests#1418
Daucloud wants to merge 5 commits intoTHUDM:mainfrom
Daucloud:fix/loss-mask

Daucloud commented Jan 14, 2026 •

edited

Loading

Uh oh!

Surya-Gunukula commented Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Daucloud commented Jan 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Testing

Uh oh!

Surya-Gunukula commented Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Daucloud commented Jan 14, 2026 •

edited

Loading