
Pull requests: deepspeedai/DeepSpeed

Pull requests list

fix gemma4 num attention head bugs
#7975 opened Apr 15, 2026 by mingxiang1006
[Blog] Muon Optimizer Support in DeepSpeed
#7962 opened Apr 8, 2026 by delock Collaborator
Add Gram Newton-Schulz orthogonalization for Muon optimizer
#7953 opened Apr 3, 2026 by delock Collaborator
Fix/warnings stacklevel mvapich runner
#7949 opened Apr 2, 2026 by nathon-lee Contributor Draft
Refactor/torch autocast encapsulate global state
#7946 opened Apr 2, 2026 by nathon-lee Contributor
feat(zero2): add CPU offload support for Muon optimizer
#7939 opened Mar 31, 2026 by delock Collaborator
Add AutoEP
#7938 opened Mar 31, 2026 by tohtana Collaborator Draft
[Feature] Enable AutoEP Compatibility with ZeRO-3
#7928 opened Mar 28, 2026 by nathon-lee Contributor
Add torch_xla TPU support for ZeRO-1/2
#7917 opened Mar 21, 2026 by PKUWZP Collaborator
doc: Remove suggestion to build extensions in parallel
#7899 opened Mar 12, 2026 by Flamefire Contributor
Fix subgroup optimizer metadata inconsistency
#7820 opened Jan 27, 2026 by st-bang97