-
Notifications
You must be signed in to change notification settings - Fork 728
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add support for NVIDIA DGX Spark (GB10 / sm_121a, arm64)
#1835
opened Apr 15, 2026 by
boots-coder
Loading…
Fix missing activation checkpointing (recompute) parameters in bridge mode
#1833
opened Apr 14, 2026 by
XJL010622
Loading…
[build] Add A100 support: patch set, offline-friendly conda build, and examples
#1832
opened Apr 14, 2026 by
jason9693
Loading…
[Fix] Fix cuda-python pin in build_conda.sh
#1827
opened Apr 12, 2026 by
kaysonyu
Contributor
Loading…
fix(gemma3): use GeGLU activation instead of SwiGLU
#1825
opened Apr 10, 2026 by
leofan-lab
Loading…
feat: add GLM-5 SFT loss mask support
#1824
opened Apr 10, 2026 by
stevewx
Contributor
Loading…
4 tasks done
fix: auto-fallback to flash_attn for Qwen3.5 on pre-Hopper GPUs (head_dim=256)
#1808
opened Apr 6, 2026 by
dadiaomengmeimei
Loading…
feat: delta compression for weight sync
#1806
opened Apr 5, 2026 by
nanjiangwill
Collaborator
•
Draft
Supporting FIPO (Future-KL Influenced Policy Optimization)
#1801
opened Apr 3, 2026 by
SeungyounShin
Loading…
feat: add checkpoint retention limit to automatically clean up old checkpoints
#1798
opened Apr 2, 2026 by
stevewx
Contributor
Loading…
4 tasks done
Add rollout sampling-mask support
run-ci-short
#1795
opened Apr 2, 2026 by
yitianlian
Collaborator
Loading…
Enhanced Off-Policy Async Rollout with Staleness Control and Partial Rollout Support
#1781
opened Mar 30, 2026 by
huang3eng
Contributor
Loading…
[kimi25 rl part4] Support K25 HF weight conversion between BF16\FP8\INT4
#1757
opened Mar 23, 2026 by
Gao016
Contributor
Loading…
[kimi25 rl part2] pass megatron bridge provider args from slime config
#1754
opened Mar 23, 2026 by
GeLee-Q
Contributor
Loading…
[kimi25 rl part1.2] support kimi25 q-lora pairing in bridge update path (weight update for train-infer colocate)
#1753
opened Mar 23, 2026 by
GeLee-Q
Contributor
Loading…
Add Mooncake Backend for Rollout Data Transfer
run-ci-megatron
#1709
opened Mar 11, 2026 by
zxpdemonio
Loading…
6 tasks done
fix: normalize rewards per-group when sample counts are unequal
#1655
opened Mar 2, 2026 by
dubin555
Loading…
2 of 3 tasks
feat: Add knowledge distillation example with offline support
#1654
opened Mar 2, 2026 by
tourzhao
Loading…
3 tasks
Fix the Rotary Position Embedding (RoPE) parameter passing in the GLM5 mode
#1650
opened Mar 2, 2026 by
hanxdmech-ship-it
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.