Skip to content

Issue/1308: Add MoE inference operators and EP communication primitives#1309

Merged
wooway777 merged 1 commit into
InfiniTensor:mainfrom
qinyiqun:moe
Jun 22, 2026
Merged

Issue/1308: Add MoE inference operators and EP communication primitives#1309
wooway777 merged 1 commit into
InfiniTensor:mainfrom
qinyiqun:moe

Conversation

@qinyiqun

Copy link
Copy Markdown
Collaborator

No description provided.

@qinyiqun qinyiqun requested a review from a team June 18, 2026 02:06
@qinyiqun qinyiqun force-pushed the moe branch 3 times, most recently from 99d3f38 to d11396a Compare June 18, 2026 09:30
- add MoE topk, align, fused dense, fused gate, sum, and prepare input operators
- expose allgather and reduce-scatter distributed ops for MoE EP paths
- update runtime and module ownership handling needed by graph replay
@wooway777 wooway777 merged commit 0de4454 into InfiniTensor:main Jun 22, 2026
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants