feat: Implement FP8 functionality #2763

Merged: 147 commits, May 30, 2024
Commits
9ad87ac
chore: Upgrade to TRT 10.0
peri044 Mar 12, 2024
a655c9a
chore: updates to trt api
peri044 Mar 12, 2024
cd86660
feat: Add save API for torch-trt compiled models
peri044 Mar 14, 2024
31285e5
feat: Add FP8 support including dtype and converters
peri044 Mar 5, 2024
7c9c646
chore: minor fixes
peri044 Mar 15, 2024
4eabeb0
Merge branch 'main' into trt_10
peri044 Mar 15, 2024
a320e56
Merge branch 'trt_10' into fp8_trt10
peri044 Mar 15, 2024
3ece71b
chore: resolve merge conflicts
peri044 Mar 15, 2024
eab0dba
chore: Fix save failures
peri044 Mar 18, 2024
b191d62
chore: update to 2.3 rc build
peri044 Mar 18, 2024
ce606fe
chore: rebase with release/2.3 branch
peri044 Mar 19, 2024
8674a3c
chore: minor fixes
peri044 Mar 19, 2024
f4e8fe9
chore: remove duplicate bert test case
peri044 Mar 20, 2024
4ae6ab9
chore: remove comments
peri044 Mar 20, 2024
fff1b80
chore: Upgrade to TRT 10.0
peri044 Mar 12, 2024
39ca77d
chore: more fixes
peri044 Mar 21, 2024
5431ee3
chore: update trt version
peri044 Mar 25, 2024
0c03de5
chore: more updates
peri044 Mar 26, 2024
982dbd2
parent f39e89e3964bc3d6ea3a6989b1e4099e1bb3e6dd
peri044 Mar 25, 2024
1ae46e9
chore: more updates
peri044 Mar 27, 2024
ae87fba
chore: rebase with save
peri044 Mar 27, 2024
beb5920
chore: Update versions
peri044 Mar 27, 2024
f0068c6
chore: update tensorrt version in CI
peri044 Mar 27, 2024
39261b9
chore: more updates
peri044 Mar 27, 2024
3753150
chore: more fixes
peri044 Apr 2, 2024
16a191c
Merge branch 'release/2.3' into trt_10
peri044 Apr 2, 2024
c355766
chore: remove NvUtils.h
peri044 Apr 2, 2024
2d237dc
chore: more updates
peri044 Apr 2, 2024
e4b4429
chore: change lib64 to lib in rhel BUILD file
peri044 Apr 2, 2024
fa4fb9c
chore: more updates
peri044 Apr 2, 2024
e11eb60
chore: fix TRT version
peri044 Apr 2, 2024
092feb2
chore: more updates
peri044 Apr 2, 2024
09ecf26
fix shape bug in bitwise ops
zewenli98 Apr 3, 2024
85e04c5
chore: update to rhel9
peri044 Apr 3, 2024
6a3664e
Merge branch 'trt_10' of github.com:pytorch/TensorRT into trt_10
peri044 Apr 3, 2024
41229d6
chore: change trt version
peri044 Apr 3, 2024
9d7a656
fix test bug and add more tests
zewenli98 Apr 3, 2024
5e911a9
chore: delete mirror of rules_pkg
peri044 Apr 3, 2024
dae0eb2
chore: fix conv test
peri044 Apr 3, 2024
2a32b13
Merge branch 'trt_10' of github.com:pytorch/TensorRT into trt_10
peri044 Apr 3, 2024
4676cd2
chore: fix trt version range
peri044 Apr 4, 2024
88efe8e
chore: fix trt rangfe
peri044 Apr 4, 2024
f9b40e6
chore: minor fix
peri044 Apr 4, 2024
b86aec2
chore: update rules_pkg
peri044 Apr 4, 2024
6630281
chore: minor fixes
peri044 Apr 4, 2024
fca55fe
chore: expt
peri044 Apr 4, 2024
1ca01e7
chore: update WORKSPACE tmpl
peri044 Apr 5, 2024
cdf5d07
chore: rebase with 2.3
peri044 Apr 5, 2024
6ffb85e
chore: fix
peri044 Apr 5, 2024
76af510
chore: remove cudnn dep
peri044 Apr 6, 2024
f9cf75a
chore: fix
peri044 Apr 6, 2024
33ba8b2
chore: updates
peri044 Apr 8, 2024
923377c
chore: update post-build script
peri044 Apr 9, 2024
89f04db
chore: remove trt dep
peri044 Apr 9, 2024
7620acc
chore: updates
peri044 Apr 9, 2024
62332fb
chore: set ld_library path in post script
peri044 Apr 9, 2024
96a8bf6
chore: updates
peri044 Apr 9, 2024
041f6a3
chore: updates
peri044 Apr 9, 2024
83e9a0b
chore: disable smoke test
peri044 Apr 9, 2024
e8529b0
chore: updates
peri044 Apr 9, 2024
1357112
chore: updates
peri044 Apr 9, 2024
608a6d2
chore: updates
peri044 Apr 10, 2024
1b34b32
chore: updates
peri044 Apr 10, 2024
89cb55a
chore: updates
peri044 Apr 10, 2024
4323e36
chore: updates
peri044 Apr 10, 2024
60b3e51
chore: update hw_compat
peri044 Apr 10, 2024
05627cd
chore: updates
peri044 Apr 12, 2024
d16585f
chore: update streams
peri044 Apr 12, 2024
16088e6
chore: updates
peri044 Apr 12, 2024
3d149ef
chore: updates
peri044 Apr 13, 2024
3addcae
chore: updates
peri044 Apr 13, 2024
b0e92d8
chore: update hw_compat.ts
peri044 Apr 15, 2024
d285d27
fix dynamic shape bugs for test_binary_ops_aten
zewenli98 Apr 15, 2024
d78a846
chore: revert layer_norm test
peri044 Apr 16, 2024
ba8a424
chore: rebase
peri044 Apr 16, 2024
097d887
Merge branch 'trt_10' of github.com:pytorch/TensorRT into trt_10
zewenli98 Apr 16, 2024
faaa0fa
chore: rebase with trt_10
peri044 Apr 17, 2024
68aab70
chore: updates
peri044 Apr 17, 2024
ffe7a52
chore: rebase with release/2.3
peri044 Apr 18, 2024
bac409a
chore: rebase
peri044 Apr 18, 2024
38642bb
chore: updates
peri044 Apr 18, 2024
c70c6dc
Merge branch 'trt_10' into fp8_trt10
peri044 Apr 18, 2024
ba286bd
chore: add fp8 test
peri044 Apr 18, 2024
d15dd72
chore: updates
peri044 Apr 19, 2024
dda88ee
Merge branch 'trt_10' into fp8_trt10
peri044 Apr 19, 2024
dee9aa0
chore: updates
peri044 Apr 19, 2024
fc6078b
Merge branch 'trt_10' into fp8_trt10
peri044 Apr 19, 2024
c05d675
chore: update stream in python runtime
peri044 Apr 19, 2024
2329657
chore: update hw_compat.ts
peri044 Apr 19, 2024
b8a8709
chore: updates
peri044 Apr 20, 2024
44778e1
chore: updates
peri044 Apr 20, 2024
0dbbcd7
chore: updates
peri044 Apr 20, 2024
55e4a1b
Merge branch 'trt_10' into fp8_trt10
peri044 Apr 22, 2024
89c3d76
chore: updates
peri044 Apr 22, 2024
bd70ef8
Merge branch 'trt_10' into fp8_trt10
peri044 Apr 22, 2024
3956749
chore: updates
peri044 Apr 23, 2024
0a2337b
chore: updates
peri044 Apr 23, 2024
358255d
chore: updates
peri044 Apr 23, 2024
dad5399
chore: rebase
peri044 Apr 23, 2024
e3e1d85
chore: rebase
peri044 May 7, 2024
7e717d6
chore: updates
peri044 May 14, 2024
c6d2f2a
chore: update to modelopt
peri044 May 14, 2024
ceec39d
chore: updates
peri044 May 14, 2024
a7e566b
chore: updates
peri044 May 14, 2024
707b10a
chore: updates
peri044 May 15, 2024
22066c5
chore: minor fix
peri044 May 15, 2024
6eed383
chore: fixes
peri044 May 15, 2024
ff231b5
chore: fixes
peri044 May 16, 2024
2f167c6
chore: updates
peri044 May 16, 2024
367eaf0
chore: updates
peri044 May 16, 2024
8cb6b91
chore: updates
peri044 May 16, 2024
4d38368
chore: updates
peri044 May 16, 2024
ee54da6
chore: updates
peri044 May 17, 2024
f4ccd62
chore: updates
peri044 May 17, 2024
681a6d1
chore: fixes
peri044 May 17, 2024
44071aa
chore: updates
peri044 May 17, 2024
3f6999d
chore: updates
peri044 May 17, 2024
5de9325
chore: updates
peri044 May 17, 2024
c677ef9
refactor vgg16 with fp8 and ptq example
zewenli98 May 21, 2024
f0b8d47
fix bugs
zewenli98 May 22, 2024
3ce9bed
chore: rebase
peri044 May 22, 2024
beb888d
chore: updates
peri044 May 23, 2024
e7989a0
chore: address review comments
peri044 May 23, 2024
96fd462
chore: updates
peri044 May 23, 2024
4030344
chore: updates
peri044 May 24, 2024
ad9d825
chore: updates
peri044 May 24, 2024
0059c1c
Update build-test-windows.yml
narendasan May 24, 2024
f98abd6
Update build-test-linux.yml
narendasan May 24, 2024
0d2021d
chore: updates
peri044 May 24, 2024
1940267
chore: updates
peri044 May 27, 2024
5814402
chore: disable all lower_linear tests
peri044 May 27, 2024
338a92b
chore: updates
peri044 May 27, 2024
59d0bd0
chore: fixes
peri044 May 27, 2024
020fe63
chore: updates
peri044 May 27, 2024
3f8297e
chore: updates
peri044 May 27, 2024
5ce0ee1
chore: updates
peri044 May 27, 2024
65c5c3e
chore: updates
peri044 May 28, 2024
d99989d
chore: updates
peri044 May 28, 2024
99dfbdc
chore: updates
peri044 May 28, 2024
ad996a5
chore: updates
peri044 May 28, 2024
6ada351
chore: updates
peri044 May 28, 2024
88fd7ee
chore: fixes
peri044 May 28, 2024
2511095
chore: updates
peri044 May 28, 2024
5346a45
chore: updates
peri044 May 29, 2024
d284b8f
chore: updates
peri044 May 29, 2024
c71c017
chore: updates
peri044 May 29, 2024
a983064
chore: updates
peri044 May 29, 2024
18 changes: 8 additions & 10 deletions .github/scripts/install-torch-tensorrt-windows.sh
@@ -1,14 +1,12 @@
#!/usr/bin/env bash
set -eou pipefail
source "${BUILD_ENV_FILE}"
# Source conda so it's available to the script environment
source ${BUILD_ENV_FILE}
export EXTRA_INDEX_URL="https://download.pytorch.org/whl/test/${CU_VERSION}"
# Install all the dependencies required for Torch-TensorRT
${CONDA_RUN} pip install --pre -r ${PWD}/tests/py/requirements.txt --use-deprecated=legacy-resolver --extra-index-url=${EXTRA_INDEX_URL}

# Install test index version of Torch and Torchvision
${CONDA_RUN} ${PIP_INSTALL_TORCH} torchvision
${CONDA_RUN} pip install pyyaml mpmath==1.3.0

# Install TRT 10 from PyPi
${CONDA_RUN} pip install tensorrt==10.0.0b6 tensorrt-${CU_VERSION::4}-bindings==10.0.0b6 tensorrt-${CU_VERSION::4}-libs==10.0.0b6 --extra-index-url https://pypi.nvidia.com

# Install pre-built Torch-TRT
# Install Torch-TensorRT via pre-built wheels. On windows, the location of wheels is not fixed.
${CONDA_RUN} pip install ${RUNNER_ARTIFACT_DIR}/torch_tensorrt*.whl

echo -e "Running test script";
echo -e "Running test script";
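The `${CU_VERSION::4}` expansion in the TensorRT install line above keeps only the first four characters of `CU_VERSION`, so a value such as `cu121` selects the `cu12` package family. A minimal Python sketch of the same name construction follows. It assumes `CU_VERSION` has the form `cu` followed by digits; the helper name `trt_pip_requirements` and the sample value `cu121` are illustrative and not part of the repository.

```python
def trt_pip_requirements(cu_version: str, trt_version: str) -> list[str]:
    """Build the three pinned requirement strings used in the install line."""
    # Equivalent of bash ${CU_VERSION::4}: keep the first four characters.
    cuda_major = cu_version[:4]
    return [
        f"tensorrt=={trt_version}",
        f"tensorrt-{cuda_major}-bindings=={trt_version}",
        f"tensorrt-{cuda_major}-libs=={trt_version}",
    ]


if __name__ == "__main__":
    # "cu121" is an assumed example; in CI the value comes from the environment.
    print(" ".join(trt_pip_requirements("cu121", "10.0.0b6")))
```

Slicing by character count only works while the CUDA major version has two digits, which holds for the `cu11x` and `cu12x` values this sketch assumes.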
12 changes: 6 additions & 6 deletions .github/scripts/install-torch-tensorrt.sh
@@ -2,11 +2,11 @@
set -eou pipefail
# Source conda so it's available to the script environment
source ${BUILD_ENV_FILE}
${CONDA_RUN} ${PIP_INSTALL_TORCH} torchvision
${CONDA_RUN} python -m pip install pyyaml mpmath==1.3.0
export TRT_VERSION=$(${CONDA_RUN} python -c "import versions; versions.tensorrt_version()")
export EXTRA_INDEX_URL="https://download.pytorch.org/whl/test/${CU_VERSION}"
# Install all the dependencies required for Torch-TensorRT
${CONDA_RUN} pip install --pre -r ${PWD}/tests/py/requirements.txt --use-deprecated=legacy-resolver --extra-index-url=${EXTRA_INDEX_URL}

# Install Torch-TensorRT
${CONDA_RUN} python -m pip install /opt/torch-tensorrt-builds/torch_tensorrt*+${CU_VERSION}*.whl tensorrt~=${TRT_VERSION} --extra-index-url=https://pypi.ngc.nvidia.com
# Install Torch-TensorRT via pre-built wheels. On windows, the location of wheels is not fixed.
${CONDA_RUN} pip install /opt/torch-tensorrt-builds/torch_tensorrt*.whl

echo -e "Running test script";
echo -e "Running test script";
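The `tensorrt~=${TRT_VERSION}` specifier in the install line above is a PEP 440 compatible-release clause: `~=10.0.1` accepts any `10.0.*` release that is at least `10.0.1`. A rough sketch of that rule is shown here, assuming purely numeric release segments. Pre-release tags such as `10.0.0b6` are not handled, and `compatible_release` is a hypothetical helper, not pip's implementation.

```python
def compatible_release(candidate: str, base: str) -> bool:
    """Return True if `candidate` satisfies `~= base` for numeric versions."""
    cand = tuple(int(part) for part in candidate.split("."))
    ref = tuple(int(part) for part in base.split("."))
    if len(ref) < 2:
        raise ValueError("'~=' requires at least two release segments")

    # Pad with zeros so that "10.1" compares equal to "10.1.0".
    width = max(len(cand), len(ref))
    cand += (0,) * (width - len(cand))
    padded_ref = ref + (0,) * (width - len(ref))

    # All segments except the last one of `base` must match exactly,
    # and the candidate must not be older than `base`.
    prefix = ref[:-1]
    return cand[: len(prefix)] == prefix and cand >= padded_ref


if __name__ == "__main__":
    for version in ("10.0.0", "10.0.1", "10.0.9", "10.1.0"):
        print(version, compatible_release(version, "10.0.1"))
```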
19 changes: 9 additions & 10 deletions .github/workflows/build-test-linux.yml
@@ -66,6 +66,7 @@ jobs:
package-name: torch_tensorrt
pre-script: packaging/pre_build_script.sh
post-script: packaging/post_build_script.sh
smoke-test-script: packaging/smoke_test_script.sh
uses: pytorch/tensorrt/.github/workflows/linux-test.yml@release/2.3
with:
job-name: tests-py-torchscript-fe
@@ -80,13 +81,10 @@ jobs:
export LD_LIBRARY_PATH=/usr/lib64:$LD_LIBRARY_PATH
pushd .
cd tests/modules
# Don't use requirements.txt here as it contains tensorrt and torch which should have been installed by now.
${CONDA_RUN} python -m pip install numpy packaging pyyaml transformers==4.39.3 timm==0.9.16 pybind11==2.6.2
${CONDA_RUN} python hub.py
popd
pushd .
cd tests/py/ts
${CONDA_RUN} python -m pip install --pre -r ../requirements.txt
${CONDA_RUN} python -m pytest --junitxml=${RUNNER_TEST_RESULTS_DIR}/ts_api_test_results.xml api/
${CONDA_RUN} python -m pytest --junitxml=${RUNNER_TEST_RESULTS_DIR}/ts_models_test_results.xml models/
${CONDA_RUN} python -m pytest --junitxml=${RUNNER_TEST_RESULTS_DIR}/ts_integrations_test_results.xml integrations/
@@ -103,6 +101,7 @@ jobs:
package-name: torch_tensorrt
pre-script: packaging/pre_build_script.sh
post-script: packaging/post_build_script.sh
smoke-test-script: packaging/smoke_test_script.sh
uses: pytorch/tensorrt/.github/workflows/linux-test.yml@release/2.3
with:
job-name: tests-py-dynamo-converters
@@ -116,7 +115,6 @@ jobs:
export USE_HOST_DEPS=1
pushd .
cd tests/py/dynamo
${CONDA_RUN} python -m pip install --pre -r ../requirements.txt
${CONDA_RUN} python -m pytest --junitxml=${RUNNER_TEST_RESULTS_DIR}/dynamo_converters_test_results.xml -n 10 conversion/
popd

@@ -131,6 +129,7 @@ jobs:
package-name: torch_tensorrt
pre-script: packaging/pre_build_script.sh
post-script: packaging/post_build_script.sh
smoke-test-script: packaging/smoke_test_script.sh
uses: pytorch/tensorrt/.github/workflows/linux-test.yml@release/2.3
with:
job-name: tests-py-dynamo-fe
@@ -144,7 +143,6 @@ jobs:
export USE_HOST_DEPS=1
pushd .
cd tests/py/dynamo
${CONDA_RUN} python -m pip install --pre -r ../requirements.txt
${CONDA_RUN} python -m pytest --junitxml=${RUNNER_TEST_RESULTS_DIR}/dynamo_fe_test_results.xml --ir dynamo models/test_models_export.py
${CONDA_RUN} python -m pytest --junitxml=${RUNNER_TEST_RESULTS_DIR}/dyn_models_export.xml --ir dynamo models/test_dyn_models.py
popd
@@ -160,6 +158,7 @@ jobs:
package-name: torch_tensorrt
pre-script: packaging/pre_build_script.sh
post-script: packaging/post_build_script.sh
smoke-test-script: packaging/smoke_test_script.sh
uses: pytorch/tensorrt/.github/workflows/linux-test.yml@release/2.3
with:
job-name: tests-py-dynamo-serde
@@ -173,7 +172,6 @@ jobs:
export USE_HOST_DEPS=1
pushd .
cd tests/py/dynamo
${CONDA_RUN} python -m pip install --pre -r ../requirements.txt
${CONDA_RUN} python -m pytest --junitxml=${RUNNER_TEST_RESULTS_DIR}/export_serde_test_results.xml --ir dynamo models/test_export_serde.py
popd

@@ -188,6 +186,7 @@ jobs:
package-name: torch_tensorrt
pre-script: packaging/pre_build_script.sh
post-script: packaging/post_build_script.sh
smoke-test-script: packaging/smoke_test_script.sh
uses: pytorch/tensorrt/.github/workflows/linux-test.yml@release/2.3
with:
job-name: tests-py-torch-compile-be
@@ -201,7 +200,6 @@ jobs:
export USE_HOST_DEPS=1
pushd .
cd tests/py/dynamo
${CONDA_RUN} python -m pip install --pre -r ../requirements.txt
${CONDA_RUN} python -m pytest -n 10 --junitxml=${RUNNER_TEST_RESULTS_DIR}/torch_compile_be_test_results.xml backend/
${CONDA_RUN} python -m pytest -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/torch_comple_be_e2e_test_results.xml --ir torch_compile models/test_models.py
${CONDA_RUN} python -m pytest --junitxml=${RUNNER_TEST_RESULTS_DIR}/torch_compile_dyn_models_export.xml --ir torch_compile models/test_dyn_models.py
@@ -218,6 +216,7 @@ jobs:
package-name: torch_tensorrt
pre-script: packaging/pre_build_script.sh
post-script: packaging/post_build_script.sh
smoke-test-script: packaging/smoke_test_script.sh
uses: pytorch/tensorrt/.github/workflows/linux-test.yml@release/2.3
with:
job-name: tests-py-dynamo-core
@@ -231,7 +230,6 @@ jobs:
export USE_HOST_DEPS=1
pushd .
cd tests/py/dynamo
${CONDA_RUN} python -m pip install --pre -r ../requirements.txt
${CONDA_RUN} python -m pytest -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_runtime_test_results.xml runtime/
${CONDA_RUN} python -m pytest -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_partitioning_test_results.xml partitioning/
${CONDA_RUN} python -m pytest -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_lowering_test_results.xml lowering/
@@ -247,7 +245,9 @@ jobs:
- repository: pytorch/tensorrt
package-name: torch_tensorrt
pre-script: packaging/pre_build_script.sh
uses: pytorch/tensorrt/.github/workflows/linux-test.yml@main
post-script: packaging/post_build_script.sh
smoke-test-script: packaging/smoke_test_script.sh
uses: pytorch/tensorrt/.github/workflows/linux-test.yml@release/2.3
with:
job-name: tests-py-core
repository: "pytorch/tensorrt"
@@ -260,6 +260,5 @@ jobs:
export USE_HOST_DEPS=1
pushd .
cd tests/py/core
${CONDA_RUN} python -m pip install --pre -r ../requirements.txt
${CONDA_RUN} python -m pytest -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_core_test_results.xml .
popd
4 changes: 0 additions & 4 deletions .github/workflows/build-test-windows.yml
@@ -72,7 +72,6 @@ jobs:
export USE_HOST_DEPS=1
pushd .
cd tests/py/dynamo
${CONDA_RUN} python -m pip install --pre -r ../requirements.txt
${CONDA_RUN} python -m pytest --junitxml=${RUNNER_TEST_RESULTS_DIR}/dynamo_converters_test_results.xml -n 10 conversion/
popd

@@ -98,7 +97,6 @@ jobs:
export USE_HOST_DEPS=1
pushd .
cd tests/py/dynamo
${CONDA_RUN} python -m pip install --pre -r ../requirements.txt
${CONDA_RUN} python -m pytest --junitxml=${RUNNER_TEST_RESULTS_DIR}/dynamo_fe_test_results.xml --ir dynamo models/test_models_export.py
${CONDA_RUN} python -m pytest --junitxml=${RUNNER_TEST_RESULTS_DIR}/dyn_models_export.xml --ir dynamo models/test_dyn_models.py
popd
@@ -125,7 +123,6 @@ jobs:
export USE_HOST_DEPS=1
pushd .
cd tests/py/dynamo
${CONDA_RUN} python -m pip install --pre -r ../requirements.txt
${CONDA_RUN} python -m pytest -n 10 --junitxml=${RUNNER_TEST_RESULTS_DIR}/torch_compile_be_test_results.xml backend/
${CONDA_RUN} python -m pytest -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/torch_comple_be_e2e_test_results.xml --ir torch_compile models/test_models.py
popd
@@ -152,7 +149,6 @@ jobs:
export USE_HOST_DEPS=1
pushd .
cd tests/py/dynamo
${CONDA_RUN} python -m pip install --pre -r ../requirements.txt
${CONDA_RUN} python -m pytest -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_runtime_test_results.xml runtime/
${CONDA_RUN} python -m pytest -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_partitioning_test_results.xml partitioning/
${CONDA_RUN} python -m pytest -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_lowering_test_results.xml lowering/
4 changes: 4 additions & 0 deletions dev_dep_versions.yml
@@ -2,3 +2,7 @@ __version__: "2.3.0"
__cuda_version__: "12.1"
__cudnn_version__: "8.9"
__tensorrt_version__: "10.0.1"
__torch_version__: "2.3.0"
# torchvision version here is not a direct dependency but the one used during testing
__torchvision_version__: "0.18.0"
__index_url__: "https://download.pytorch.org/whl/test/"
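The install scripts in this PR set `EXTRA_INDEX_URL` to `https://download.pytorch.org/whl/test/${CU_VERSION}`, and the `__index_url__` entry above supplies the same base URL. A minimal sketch of how the two could be combined is given below. It assumes the file stays a flat list of `key: "value"` lines and that `CU_VERSION` is `cu` followed by the CUDA version without the dot. The helpers `parse_versions` and `extra_index_url` are hypothetical and are not the repository's own tooling.

```python
SAMPLE = '''\
__version__: "2.3.0"
__cuda_version__: "12.1"
__cudnn_version__: "8.9"
__tensorrt_version__: "10.0.1"
__torch_version__: "2.3.0"
# torchvision version here is not a direct dependency but the one used during testing
__torchvision_version__: "0.18.0"
__index_url__: "https://download.pytorch.org/whl/test/"
'''


def parse_versions(text: str) -> dict[str, str]:
    """Parse flat `key: "value"` lines, skipping blanks and comments."""
    versions = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the first colon only, so URLs in the value stay intact.
        key, _, value = line.partition(":")
        versions[key.strip()] = value.strip().strip('"')
    return versions


def extra_index_url(versions: dict[str, str]) -> str:
    """Append an assumed CU_VERSION tag (e.g. "12.1" -> "cu121") to the base URL."""
    cu_version = "cu" + versions["__cuda_version__"].replace(".", "")
    return versions["__index_url__"] + cu_version


if __name__ == "__main__":
    print(extra_index_url(parse_versions(SAMPLE)))
```

A real YAML parser would be the safer choice if the file ever gains nested keys; the line-based parser here only keeps the sketch dependency-free.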
1 change: 1 addition & 0 deletions docsrc/index.rst
@@ -111,6 +111,7 @@ Tutorials
tutorials/_rendered_examples/dynamo/torch_compile_transformers_example
tutorials/_rendered_examples/dynamo/torch_compile_advanced_usage
tutorials/_rendered_examples/dynamo/torch_compile_stable_diffusion
tutorials/_rendered_examples/dynamo/vgg16_fp8_ptq

Python API Documentation
------------------------
1 change: 1 addition & 0 deletions examples/dynamo/README.rst
@@ -10,3 +10,4 @@ a number of ways you can leverage this backend to accelerate inference.
* :ref:`torch_compile_transformer`: Compiling a Transformer model using ``torch.compile``
* :ref:`torch_compile_advanced_usage`: Advanced usage including making a custom backend to use directly with the ``torch.compile`` API
* :ref:`torch_compile_stable_diffusion`: Compiling a Stable Diffusion model using ``torch.compile``
* :ref:`vgg16_fp8_ptq`: Compiling a VGG16 model with FP8 and PTQ using ``torch.compile``