-
Notifications
You must be signed in to change notification settings - Fork 698
Pull requests: flashinfer-ai/flashinfer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
pick fa2 for BatchDecodeWithPagedKVCacheWrapper auto backend
#2530
opened Feb 9, 2026 by
saltyminty
Loading…
2 tasks
Chore: Cute dsl moe update (TMA.RED implementation)
#2529
opened Feb 9, 2026 by
nv-yunzheq
Loading…
5 tasks
Update Docker CI tags to 20260209-a2d3b39
automated
docker
#2528
opened Feb 9, 2026 by
flashinfer-bot
Loading…
Revert "ci: refactor PR tests to hide failed spot jobs from PR status…
#2524
opened Feb 8, 2026 by
yongwww
Loading…
5 tasks
perf: avoid redundant shared writes in RMSNorm reductions
#2519
opened Feb 7, 2026 by
JackeyLove1
Loading…
5 tasks
Improve small size performance in cutedsl fp4
#2517
opened Feb 7, 2026 by
vincentzed
•
Draft
5 tasks
benchmarks: Add microbenchmark support for Mamba selective_state_update
#2512
opened Feb 6, 2026 by
bkryu
Loading…
5 tasks
ci: add cleanup step to nightly release self-hosted runner jobs
#2510
opened Feb 6, 2026 by
yongwww
Loading…
4 of 5 tasks
bugfix: fix the enum/int type mismatch mentioned in #2507
#2508
opened Feb 6, 2026 by
yzh119
Loading…
4 of 5 tasks
fix: update block size and tile size heuristic for 16+ num of token
#2492
opened Feb 4, 2026 by
nv-yunzheq
Loading…
5 tasks
[WIP][bugfix]Correct chunk_end calculation in multi-CTA collaboration when max_len > length
#2489
opened Feb 4, 2026 by
huangzhilin-hzl
Loading…
5 tasks
feat: Add TRTLLM-Gen Skip-Softmax kernels for prefill and decode
run-ci
#2477
opened Feb 3, 2026 by
DomBrown
Loading…
5 tasks done
fix: RMSNorm/FusedRMSNorm + Quant kernels cuda graph fixes
#2459
opened Feb 1, 2026 by
BLaZeKiLL
Loading…
5 tasks done
Previous Next
ProTip!
no:milestone will show everything without a milestone.