-
Notifications
You must be signed in to change notification settings - Fork 3.8k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Dev] Revert code owner changes from pull main
#4354
opened Apr 17, 2026 by
yaox12
Member
Loading…
5 tasks
Fix incorrect bias display in extra_repr of Column/RowParallelLinear
community-request
Final Review
PR is in the "final review" stage
#4330
opened Apr 16, 2026 by
HelloWorldBeginner
Loading…
5 tasks
Fix activation_func check and MLP sharded_state_dict
complexity: low
#4325
opened Apr 15, 2026 by
gdengk
Contributor
Loading…
5 tasks
Fix checkpoint loading with
load_main_params_from_ckpt=True for grouped weight
#4324
opened Apr 15, 2026 by
ksivaman
Member
Loading…
5 tasks
remove legacy GPT code
complexity: high
Expert Review
[deprecated] Apply this label to indicate that your PR is ready for expert review.
SafeUnpickler class for safe pickle usage
complexity: low
Expert Review
[deprecated] Apply this label to indicate that your PR is ready for expert review.
Final Review
PR is in the "final review" stage
Run functional tests
Eliminate QPs during checkpoint loading, no need for review or merge
#4317
opened Apr 15, 2026 by
CarlosGomes98
Contributor
•
Draft
fix: correct 'Seperate'/'Seperated' typo in comments
community-request
#4313
opened Apr 15, 2026 by
MukundaKatta
•
Draft
feat: Optimize memory footprint of long-context training via fused kernel and chunking
community-request
#4312
opened Apr 15, 2026 by
terminator123
Loading…
Fix fused grouped MLP wgrad hooks for DDP reduce-scatter
complexity: low
#4311
opened Apr 15, 2026 by
gdengk
Contributor
Loading…
5 tasks
Add quantization type debug logging.
#4308
opened Apr 14, 2026 by
kwyss-nvidia
Contributor
•
Draft
5 tasks
Move inference context bookkeeping to CPU with ContextGPUView
#4306
opened Apr 14, 2026 by
lmcafee-nvidia
Contributor
•
Draft
8 tasks
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.