File tree 1 file changed +7
-2
lines changed
1 file changed +7
-2
lines changed Original file line number Diff line number Diff line change @@ -12,14 +12,18 @@ The pre-built image includes:
12
12
13
13
- ROCm™ 6.3.1
14
14
- vLLM 0.6.6
15
- - PyTorch 2.6dev (nightly)
15
+ - PyTorch 2.7dev (nightly)
16
16
17
17
## Pull latest Docker Image
18
18
19
19
Pull the most recent validated docker image with ` docker pull rocm/vllm-dev:main `
20
20
21
21
## What is New
22
22
23
+ 20250205_aiter:
24
+ - [ AITER] ( https://github.com/ROCm/aiter ) support
25
+ - Performance improvement for custom paged attention
26
+ - Reduced memory overhead bug fix
23
27
20250124:
24
28
- Fix accuracy issue with 405B FP8 Triton FA
25
29
- Fixed accuracy issue with TP8
@@ -475,8 +479,9 @@ To reproduce the release docker:
475
479
``` bash
476
480
git clone https://github.com/ROCm/vllm.git
477
481
cd vllm
478
- git checkout 8e87b08c2a284c1a20eb3d8e0fbdc84918bf27dc
482
+ git checkout 9dc3394c9ee4da250be28d7bd08babf098d51081
479
483
docker build -f Dockerfile.rocm -t < your_tag> --build-arg BUILD_HIPBLASLT=1 --build-arg USE_CYTHON=1 .
484
+ export VLLM_USE_AITER=0
480
485
```
481
486
482
487
### AITER
You can’t perform that action at this time.
0 commit comments