-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: [AutoDeploy] generalizing cudagraph to multiple dynamic inputs
#3589
opened Apr 16, 2025 by
lucaslie
Loading…
fix: check for config architectures in model config
#3586
opened Apr 15, 2025 by
vanshilshah97
Loading…
feat: Integrate GPUDirect Storage (GDS) into Executor API
#3582
opened Apr 15, 2025 by
DomBrown
Loading…
fix : release torch-managed memory as soon as it's not needed
#3579
opened Apr 15, 2025 by
peaceh-nv
Loading…
[TRTLLM-4051] Support only run some backend type test
#3578
opened Apr 15, 2025 by
ZhanruiSunCh
•
Draft
chore: add assertion for devices to avoid underlying errors
#3558
opened Apr 15, 2025 by
Superjomn
Loading…
move the reset models into
examples/models/core
directory
#3555
opened Apr 15, 2025 by
QiJune
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.