-
Notifications
You must be signed in to change notification settings - Fork 209
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
cp: Run doctests, unit tests, and functional tests
Run CICD
fix: log metrics that can be coerced to scalars (1723) into r0.5.0
cherry-pick
CI:L1
#1730
opened Jan 6, 2026 by
chtruong814
Loading…
cp: Run doctests, unit tests, and functional tests
Run CICD
fix: Disable cudnn sdpa backend when using activation checkpointing (1717) into r0.5.0
cherry-pick
CI:L1
#1727
opened Jan 6, 2026 by
chtruong814
Loading…
fix: apply offloading change from v2 to v1
CI:L0
Run doctests and unit tests
r0.5.0
#1726
opened Jan 6, 2026 by
terrykong
Loading…
4 tasks
fix: fix several nightly tests that were flaky
CI:L0
Run doctests and unit tests
r0.5.0
#1724
opened Jan 6, 2026 by
terrykong
Loading…
fix: use median instead of mean for logprob error for stability in nightlies
CI:L1
Run doctests, unit tests, and functional tests
r0.5.0
#1722
opened Jan 6, 2026 by
terrykong
Loading…
4 tasks
fix: gemma3 27b must now have skip_tokenizer_init=False in vllm
CI:L1
Run doctests, unit tests, and functional tests
r0.5.0
#1721
opened Jan 6, 2026 by
terrykong
Loading…
4 tasks
fix: mcore generation config restored in nightly test
r0.5.0
#1720
opened Jan 6, 2026 by
terrykong
Loading…
4 tasks
feat: refactor common data utilities of dtensor policy v2
#1710
opened Jan 5, 2026 by
hemildesai
•
Draft
4 tasks
Update index.md
documentation
Improvements or additions to documentation
#1706
opened Jan 2, 2026 by
snowmanwwg
Loading…
4 tasks
Create model-support.md for NeMoRL
documentation
Improvements or additions to documentation
#1705
opened Jan 2, 2026 by
snowmanwwg
Loading…
4 tasks
feat: RL support for custom moe models in dtensor v2
CI:L1
Run doctests, unit tests, and functional tests
#1695
opened Dec 24, 2025 by
hemildesai
Loading…
[don't merge] support multiple datasets for response dataset
CI:L1
Run doctests, unit tests, and functional tests
feat: Add SGLang rollout backend and tests
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
community-request
#1674
opened Dec 21, 2025 by
RolaoDenthu
Loading…
4 tasks
fix: Add debug parameter to reduce verbose output
community-request
#1664
opened Dec 19, 2025 by
sahgerlad
Loading…
refactor: Order node by IP in GRPO
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
GB200
#1655
opened Dec 18, 2025 by
guyueh1
Loading…
4 tasks
chore: update Megatron-LM submodule to ed804b4
#1653
opened Dec 17, 2025 by
yaoyu-33
Loading…
4 tasks
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.