Skip to content

Pull requests: radixark/miles

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add packed SFT rollout for pretokenized blocks
#1515 opened Jun 29, 2026 by samaritan1998 Loading…
feat: Dynamo integration
#1514 opened Jun 29, 2026 by AndyDai-nv Draft
Megatron e2e: weight-check skip-list, Qwen3.5 MTP cases run-ci-qwen35 Run qwen3.5 e2e CI
#1512 opened Jun 29, 2026 by guapisolo Collaborator Loading…
refactor: extract session core with direct HTTP responses
#1510 opened Jun 29, 2026 by guapisolo Collaborator Loading…
[test] mv deepseek v4 to /manual
#1508 opened Jun 29, 2026 by yueming-yuan Collaborator Loading…
Ci history gate
#1507 opened Jun 29, 2026 by guapisolo Collaborator Draft
[AMD] update ROCm sglang base image
#1506 opened Jun 29, 2026 by XinyuJiangCMU Contributor Loading…
fsdp: keep fp32 master for nemotron_h (mixed-dtype checkpoint)
#1502 opened Jun 29, 2026 by Zhichenzzz Contributor Loading…
fsdp: clear stale GDN packing boundaries on non-packed forwards
#1501 opened Jun 29, 2026 by Zhichenzzz Contributor Loading…
fsdp: force flash attention for attention-sink models (gpt-oss)
#1500 opened Jun 28, 2026 by Zhichenzzz Contributor Loading…
feat(loss): support pg_loss aggregation modes
#1498 opened Jun 27, 2026 by EazyReal Loading…
fix(update_weight): skip flush_cache for retract pause mode
#1497 opened Jun 27, 2026 by Shi-Dong Contributor Loading…
1 task
fix(opd): score teacher at rollout temperature
#1496 opened Jun 27, 2026 by EazyReal Loading…
fix(dist): preserve new_group options across reload
#1495 opened Jun 27, 2026 by EazyReal Loading…
fix(train): support eval-only mode (--num-rollout 0)
#1494 opened Jun 27, 2026 by EazyReal Loading…
fix(ppo): preserve raw KL metric tensor
#1493 opened Jun 27, 2026 by EazyReal Loading…
fix(weights): handle empty colocated tensor buckets
#1492 opened Jun 27, 2026 by EazyReal Loading…
docs: fix cli-reference defaults and advantage-estimator choices
#1490 opened Jun 26, 2026 by Shi-Dong Contributor Loading…
1 task
[OPD] Add Qwen3.5-35B-A3B single-node self-distillation example
#1488 opened Jun 26, 2026 by maocheng23 Contributor Loading…
Add OpenEnv example: miles <-> HuggingFace OpenEnv integration
#1487 opened Jun 26, 2026 by Shi-Dong Contributor Draft
1 of 5 tasks
ProTip! Filter pull requests by the default branch with base:main.