Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add test case for Qwen3N
#2532 opened Feb 10, 2026 by samuellees Loading…
3 of 5 tasks
Add parallel testing to unit test script
#2531 opened Feb 9, 2026 by dierksen Loading…
2 tasks
Chore: Cute dsl moe update (TMA.RED implementation)
#2529 opened Feb 9, 2026 by nv-yunzheq Loading…
5 tasks
feat: BF16 GEMM benchmarking support
#2525 opened Feb 9, 2026 by raayandhar Loading…
5 tasks done
Feat/gdn decode pooled
#2521 opened Feb 8, 2026 by xutizhou Loading…
4 of 10 tasks
Support NVFP4 KV cache decode on SM120
#2520 opened Feb 8, 2026 by Tom-Zheng Loading…
5 tasks done
perf: avoid redundant shared writes in RMSNorm reductions
#2519 opened Feb 7, 2026 by JackeyLove1 Loading…
5 tasks
ci: add cleanup step to nightly release self-hosted runner jobs
#2510 opened Feb 6, 2026 by yongwww Loading…
4 of 5 tasks
bugfix: fix the enum/int type mismatch mentioned in #2507
#2508 opened Feb 6, 2026 by yzh119 Loading…
4 of 5 tasks
mxfp8 trtllm integration
#2505 opened Feb 6, 2026 by IwakuraRein Draft
5 tasks
Ameyn/gdn decode cutedsl kernel
#2498 opened Feb 5, 2026 by ameynaik-hub Loading…
5 tasks done
ci: Add CUDA 13.1 CI container support
#2465 opened Feb 2, 2026 by bkryu Draft
5 tasks done
feat: Add MXFP8 GEMM mm_mxfp8 (cutlass)
#2464 opened Feb 2, 2026 by danisereb Loading…
5 tasks done
fix: RMSNorm/FusedRMSNorm + Quant kernels cuda graph fixes
#2459 opened Feb 1, 2026 by BLaZeKiLL Loading…
5 tasks done
ProTip! no:milestone will show everything without a milestone.