Releases: sgl-project/sgl-kernel-npu
Releases · sgl-project/sgl-kernel-npu
20251209
What's Changed
- Fix notify dispatch cann8.3 by @oagniqgnat in #245
- fix normal and low_latency layerd rdma_data_size when mixed running by @zuje123 in #246
- fixing release ci by @BourneSun0527 in #248
Full Changelog: 2025120...2025120
20251206
What's Changed
- add catlass ops demo by @ltcs11 in #200
- Add release package dependencies by @monkeyLoveding in #215
- bugfix: The rint interface causes ub to be unaligned. by @chenxu140 in #208
- add pybind11 by @monkeyLoveding in #217
- Upload wheel to release by @BourneSun0527 in #218
- Upload wheel to release (#218) by @BourneSun0527 in #220
- optimizer internode_dispatch hostbound by @zuje123 in #211
- Release workflow switch to cpu by @BourneSun0527 in #222
- switch release machine by @BourneSun0527 in #224
- Add cann path to LD_LIBRARY_PATH by @BourneSun0527 in #225
- [Feat] Lightning indexer op & GE helper engineering by @randgun in #203
- Fixing missing.so files by @monkeyLoveding in #226
- try Fixing missing.so files by @monkeyLoveding in #228
- add sinks_attenton for GPT-OSS by @Todobe in #216
- add zero_experts_compute_identity by @Todobe in #214
- Add performance testing section to the moe script by @goosj in #198
- normal_dispatch num_recv_tokens_per_expert_list support prefixSum by @zuje123 in #221
- A2 dispatch/combine layered operator adaptation for SGLang interface by @oagniqgnat in #209
- Add swiglu_oai for GPT-OSS by @Todobe in #233
- [DFX] Adaptable to multiple model validations for fused moe by @kaniel-outis in #229
- [Bugfix] add padding cases for causal_conv1d_update by @ltcs11 in #235
- [Feat] add chunk_gated_delta_rule triton support by @ltcs11 in #232
- Add two mixed-race tests: normal and low latency, normal and fused deep moe. by @goosj in #206
- debug deepep build by @BourneSun0527 in #231
- rework release build by @iforgetmyname in #237
New Contributors
- @ltcs11 made their first contribution in #200
- @monkeyLoveding made their first contribution in #215
Full Changelog: 2025112...2025120