Improve moe grouped gemm logging#652

Open
araina-amd wants to merge 9 commits into main from
araina/dev/projection-moe-grouped-gemm-logging

Conversation

@araina-amd
Contributor

Improve moe grouped gemm logging

root and others added 9 commits April 1, 2026 09:43
… scheduler comparison

Performance projection fixes:
- Fix double-counting of DeepEP A2A overlap when EP is unchanged
- Correctly reconstruct sequential compute time when EP changes with DeepEP ON
- Fix VPP handling: use interleaved_1f1b when zero-bubble + VPP>1
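The overlap-accounting fixes above can be sketched as follows. This is a minimal illustration, not the PR's actual code: the function name, arguments, and the EP scaling rule are all assumptions made for the example.

```python
def projected_time_ms(measured_ms, a2a_ms, deepep_on, ep_changed, ep_scale=1.0):
    """Illustrative per-layer time projection (hypothetical helper).

    measured_ms: measured layer time; with DeepEP ON, A2A is already
                 overlapped (hidden) inside this number.
    a2a_ms:      DeepEP all-to-all communication time.
    deepep_on:   whether DeepEP A2A/compute overlap is enabled.
    ep_changed:  whether the projected EP degree differs from the measured one.
    ep_scale:    illustrative compute scaling factor for the new EP degree.
    """
    if deepep_on and not ep_changed:
        # The measurement already reflects the overlap; subtracting a2a_ms
        # again would double-count it (the bug this PR fixes).
        return measured_ms
    if deepep_on and ep_changed:
        # Reconstruct the sequential (no-overlap) compute time by adding
        # back the hidden A2A, rescale both sides for the new EP degree,
        # then re-apply overlap: the longer of the two dominates.
        sequential_ms = measured_ms + a2a_ms
        new_compute = sequential_ms * ep_scale
        new_a2a = a2a_ms / ep_scale  # illustrative scaling only
        return max(new_compute, new_a2a)
    # DeepEP OFF: compute and A2A run sequentially.
    return measured_ms * ep_scale + a2a_ms
```

The key distinction is the first two branches: with EP unchanged the measurement is used as-is, while an EP change forces the sequential time to be reconstructed before overlap is re-applied.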

Scheduler comparison (--pipeline-schedule-algorithm):
- Thread scheduler_algorithm from CLI through projection engine
- Add zbv-formatted and zbv-greedy as CLI choices
- Add _print_scheduler_comparison for multi-scheduler results table
- 'all' mode runs all applicable schedulers + SeaAILab ILP and picks the best
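The 'all' mode and comparison table might look like the sketch below. Function name, table layout, and result shape are assumptions for illustration; the PR's `_print_scheduler_comparison` may differ.

```python
def pick_best_scheduler(results):
    """Print a scheduler comparison table and return the fastest scheduler.

    results: dict mapping scheduler name -> projected iteration time (ms),
             e.g. the output of running every applicable scheduler in
             'all' mode (hypothetical data shape).
    """
    best = min(results, key=results.get)
    width = max(len(name) for name in results)
    # Sort fastest-first and mark the winner, comparison-table style.
    for name, ms in sorted(results.items(), key=lambda kv: kv[1]):
        marker = "  <- best" if name == best else ""
        print(f"{name:<{width}}  {ms:8.2f} ms{marker}")
    return best
```

Usage with made-up numbers: `pick_best_scheduler({"1f1b": 120.0, "zbv-greedy": 110.0, "seaailab-ilp": 105.0})` prints the three rows fastest-first and returns `"seaailab-ilp"`.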

CLI fixes:
- Re-add --pipeline-schedule-algorithm argument with full choices
- Rename megatron-ilp to seaailab-ilp
- Log Megatron grouped-GEMM flags and Origami-style M/H/F, grouped_batch,
  token counts, and layer pattern from MoE block config (GPU benchmark
  and simulation).
- Simulation: print expert routed GEMM-only fwd/bwd ms before router overhead.
- Add training_config_debug_one_line() for a compact one-line config in
  profiler logs.
- Add optional backward autograd label/args and a CUDA profiler hook in
  utils; wire them through the layer profilers.
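Re-adding the argument with its full choice list could look like this. Only the choices named in this PR description are used; the dest, default, and help text are assumptions, and the real Megatron argument registration may differ.

```python
import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--pipeline-schedule-algorithm",
    dest="scheduler_algorithm",
    # Choice list assembled from this PR's description: zbv-formatted and
    # zbv-greedy are newly added; seaailab-ilp is the renamed megatron-ilp.
    choices=["1f1b", "interleaved_1f1b", "zbv-formatted", "zbv-greedy",
             "seaailab-ilp", "all"],
    default="1f1b",  # assumed default
    help="Pipeline scheduler to project; 'all' runs every applicable "
         "scheduler plus the SeaAILab ILP and picks the best.",
)

# The chosen value is then threaded from the CLI into the projection engine.
args = parser.parse_args(["--pipeline-schedule-algorithm", "all"])
```

An unrecognized value such as the old `megatron-ilp` would now be rejected by argparse at parse time, which is the point of keeping the rename and the choice list in sync.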
