-
Notifications
You must be signed in to change notification settings - Fork 157
Pull requests: SemiAnalysisAI/InferenceX
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Bump actions/setup-python from 5 to 6 in the github-actions group
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#1274
opened May 4, 2026 by
dependabot
Bot
Loading…
[codex] Use PR41015 fix setup for GB200 MTP2 high throughput
full-sweep-enabled
#1273
opened May 4, 2026 by
alec-flowers
Collaborator
Loading…
[WIP] Updated DSv4 vllm B300 MTP
full-sweep-enabled
#1271
opened May 4, 2026 by
wzhao18
Collaborator
Loading…
[DNM - test PR, waiting for nvidia/tensorrtllm to accept the fusedmhc patch] Update DSV4 TRT fused MHC image
full-sweep-enabled
#1270
opened May 3, 2026 by
Oseltamivir
Collaborator
Loading…
Clean up DSv4 ATOM AITER PR2998 overlay
full-sweep-enabled
#1260
opened May 2, 2026 by
Oseltamivir
Collaborator
Loading…
1 task
[WIP] [NV] qwen3.5 fp4 b200 sglang mtp
full-sweep-enabled
NVIDIA
#1257
opened May 1, 2026 by
hshrivastava-droid
Collaborator
Loading…
[AMD][ROCM] Add MI355X Config: glm5-fp4-mi355x-sglang-mtp
#1254
opened May 1, 2026 by
ChangLiu0709
Collaborator
•
Draft
4 of 5 tasks
[AMD][ROCM] Fix benchmark_serving Rust Tokenizer Crash via Direct transformers AutoTokenizer
#1253
opened May 1, 2026 by
ChangLiu0709
Collaborator
•
Draft
3 of 4 tasks
Add SGLANG_OPT_USE_MULTI_STREAM_OVERLAP=1 to SGLang DSv4 launch configs
full-sweep-enabled
#1246
opened May 1, 2026 by
yhyang201
Collaborator
Loading…
2 tasks
[AMD] Update MI355x Deepseek-R1 FP4 SGLang Image to v0.5.10
#1237
opened Apr 30, 2026 by
ppalanga
Collaborator
Loading…
[AMD] improve dsr1 fp4 disagg perf on mi355x
AMD
sweep-enabled
#1236
opened Apr 30, 2026 by
billishyahao
Collaborator
Loading…
Adjust MiniMax MI355X block size for TP8 EP8
#1228
opened Apr 29, 2026 by
jiacao-amd
Collaborator
Loading…
[waiting for bug fix to land in v0.20.1] Add DSv4 FP8 H200 vLLM MTP benchmark
sweep-enabled
vllm/sglang release broken -need to wait
#1222
opened Apr 29, 2026 by
functionstackx
Contributor
Loading…
4 tasks
chore: upstream srt-slurm recipes + first-class recipe field + custom-bench wrapper
#1211
opened Apr 28, 2026 by
cquil11
Collaborator
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.