File tree
5 files changed
+205
-98
lines changed- 3rdparty
- cmake
- include/tvm/runtime
- src/runtime/relax_vm
5 files changed
+205
-98
lines changedSubmodule flashinfer updated 55 files
- .github/workflows/release_wheel.yml+12-5
- .gitignore+4-1
- .release-please-manifest.json+1-1
- CHANGELOG.md+16
- CMakeLists.txt+80-39
- cmake/config.cmake+10
- docs/api/python/decode.rst+3
- docs/api/python/norm.rst+13
- docs/api/python/sampling.rst+15
- docs/conf.py+6-6
- docs/index.rst+2-1
- docs/installation.rst+17-2
- include/flashinfer/attention/decode.cuh-67
- include/flashinfer/attention/handler.cuh+183-81
- include/flashinfer/attention/prefill.cuh+46-89
- include/flashinfer/decode_attention_decl.cuh-146
- include/flashinfer/prefill_attention_decl.cuh-95
- include/flashinfer/sampling.cuh+109-99
- include/flashinfer/utils.cuh+12-19
- python/MANIFEST.in+1
- python/csrc/batch_decode.cu+63-28
- python/csrc/batch_prefill.cu+18-8
- python/csrc/flashinfer_ops.cu+23-3
- python/csrc/flashinfer_ops.h+45-19
- python/csrc/norm.cu+43
- python/csrc/pytorch_extension_utils.h+4-4
- python/csrc/sampling.cu+98
- python/flashinfer/__init__.py+7
- python/flashinfer/decode.py+284-1
- python/flashinfer/norm.py+49
- python/flashinfer/prefill.py+4-2
- python/flashinfer/sampling.py+190
- python/generate_batch_paged_prefill_inst.py+3-6
- python/generate_dispatch_inc.py+40-16
- python/setup.py+12-3
- python/tests/test_batch_decode_kernels.py+143-4
- python/tests/test_norm.py+47
- python/tests/test_sampling.py+101
- src/bench_batch_decode.cu+3-4
- src/bench_cascade.cu+5-6
- src/bench_sampling.cu+4-4
- src/bench_single_decode.cu+2-2
- src/bench_single_prefill.cu+2-1
- src/cpu_reference.h+62-63
- src/flashinfer_ops.cuh+314
- src/test_batch_decode.cu+5-5
- src/test_batch_prefill.cu+9-8
- src/test_cascade.cu+9-10
- src/test_page.cu+2-2
- src/test_sampling.cu+1.7k-9
- src/test_single_decode.cu+1-1
- src/test_single_prefill.cu+1-2
- src/tvm_wrapper.cu+23-45
- src/utils.h+43
- version.txt+1-1
+3-3
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
960 | 960 |
| |
961 | 961 |
| |
962 | 962 |
| |
963 |
| - | |
964 |
| - | |
965 |
| - | |
| 963 | + | |
966 | 964 |
| |
967 | 965 |
| |
968 | 966 |
| |
969 | 967 |
| |
| 968 | + | |
| 969 | + | |
970 | 970 |
| |
971 | 971 |
| |
972 | 972 |
| |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
444 | 444 |
| |
445 | 445 |
| |
446 | 446 |
| |
| 447 | + | |
| 448 | + | |
| 449 | + | |
| 450 | + | |
| 451 | + | |
| 452 | + | |
| 453 | + | |
| 454 | + | |
| 455 | + | |
| 456 | + | |
| 457 | + | |
| 458 | + | |
| 459 | + | |
447 | 460 |
| |
448 | 461 |
| |
449 | 462 |
| |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
534 | 534 |
| |
535 | 535 |
| |
536 | 536 |
| |
| 537 | + | |
| 538 | + | |
| 539 | + | |
| 540 | + | |
| 541 | + | |
| 542 | + | |
| 543 | + | |
| 544 | + | |
| 545 | + | |
| 546 | + | |
| 547 | + | |
| 548 | + | |
| 549 | + | |
| 550 | + | |
| 551 | + | |
| 552 | + | |
| 553 | + | |
537 | 554 |
| |
538 | 555 |
| |
539 | 556 |
| |
|
0 commit comments