Pull requests: NVIDIA/Megatron-LM


Pull requests list

[Dev] Revert code owner changes from pull main
#4354 opened Apr 17, 2026 by yaox12 (Member)
5 tasks
Fix incorrect bias display in extra_repr of Column/RowParallelLinear
Labels: community-request, Final Review
#4330 opened Apr 16, 2026 by HelloWorldBeginner
5 tasks
Load MTP and latent MoE layers in bf16 for mxfp8 inference
#4326 opened Apr 15, 2026 by santhnm2 Contributor Draft
5 tasks
Fix activation_func check and MLP sharded_state_dict
Labels: complexity: low
#4325 opened Apr 15, 2026 by gdengk (Contributor)
5 tasks
Fix inference graph override in RL flow
Labels: complexity: low
#4323 opened Apr 15, 2026 by tdene (Contributor)
5 tasks
Core 0.16
remove legacy GPT code
Labels: complexity: high, Expert Review
#4322 opened Apr 15, 2026 by dimapihtar (Contributor)
5 tasks
Core 0.16
Overlap engine bookkeeping with step
Labels: complexity: high
#4320 opened Apr 15, 2026 by tdene (Contributor)
5 tasks
Core 0.16
SafeUnpickler class for safe pickle usage
Labels: complexity: low, Expert Review, Final Review, Run functional tests
#4319 opened Apr 15, 2026 by dimapihtar (Contributor)
5 tasks
Core 0.16
Fix fused grouped MLP wgrad hooks for DDP reduce-scatter
Labels: complexity: low
#4311 opened Apr 15, 2026 by gdengk (Contributor)
5 tasks
Add quantization type debug logging.
#4308 opened Apr 14, 2026 by kwyss-nvidia Contributor Draft
5 tasks