finegrained-fp8: fused moe kernels#530

Draft
IlyasMoutawwakil wants to merge 3 commits into huggingface:main from IlyasMoutawwakil:fp8-fused-moe

IlyasMoutawwakil (Member) commented Apr 7, 2026

[Figures: moe_tflops_grouped, moe_tflops_batched]

Faster and more accurate fused FP8 MoE kernels for both long-context (grouped) and decode (batched) workloads.
