Commit cc1e02a

Merge pull request #1126 from pytorch/tmp_tvarshney
amending triton deployment docs
2 parents bb79f74 + 53a8504 commit cc1e02a

2 files changed, +7 -5 lines changed

docsrc/index.rst

+2 -2

@@ -28,7 +28,7 @@ Getting Started
 * :ref:`use_from_pytorch`
 * :ref:`runtime`
 * :ref:`using_dla`
-* :ref:`deploy_torch_tensorrt_to_triton`
+* :ref:`serving_torch_tensorrt_with_triton`
 
 .. toctree::
    :caption: Getting Started
@@ -44,7 +44,7 @@ Getting Started
    tutorials/use_from_pytorch
    tutorials/runtime
    tutorials/using_dla
-   tutorials/deploy_torch_tensorrt_to_triton
+   tutorials/serving_torch_tensorrt_with_triton
 
 .. toctree::
    :caption: Notebooks

docsrc/tutorials/deploy_torch_tensorrt_to_triton.rst renamed to docsrc/tutorials/serving_torch_tensorrt_with_triton.rst

+5 -3

@@ -1,5 +1,7 @@
-Deploying a Torch-TensorRT model (to Triton)
-============================================
+.. _serving_torch_tensorrt_with_triton:
+
+Serving a Torch-TensorRT model with Triton
+==========================================
 
 Optimization and deployment go hand in hand in a discussion about Machine
 Learning infrastructure. Once network level optimzation are done
@@ -210,5 +212,5 @@ The output of the same should look like below:
 b'8.234375:11']
 
 The output format here is ``<confidence_score>:<classification_index>``.
-To learn how to map these to the label names and more, refer to our
+To learn how to map these to the label names and more, refer to Triton Inference Server's
 `documentation <https://github.com/triton-inference-server/server/blob/main/docs/protocol/extension_classification.md>`__.
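Note on the ``<confidence_score>:<classification_index>`` format mentioned in the tutorial text: a client could split each returned entry into its two fields roughly as sketched below. This is a minimal illustration and not part of the commit. The ``Classification`` type and ``parse_classification`` helper are hypothetical names, and the optional third ``label`` field is an assumption based on Triton's classification extension; the tutorial itself only shows the score and index.

```python
from typing import NamedTuple, Optional, Union


class Classification(NamedTuple):
    """One classification result (hypothetical helper type)."""
    score: float
    index: int
    label: Optional[str] = None


def parse_classification(entry: Union[bytes, str]) -> Classification:
    """Parse an entry such as b'8.234375:11' into its fields.

    Assumes the layout '<score>:<index>' with an optional ':<label>' suffix.
    """
    text = entry.decode("utf-8") if isinstance(entry, bytes) else entry
    # Split at most twice so that a label containing ':' stays intact.
    parts = text.split(":", 2)
    if len(parts) < 2:
        raise ValueError(f"unexpected classification entry: {text!r}")
    label = parts[2] if len(parts) == 3 else None
    return Classification(float(parts[0]), int(parts[1]), label)


# Example using the value shown in the tutorial output.
result = parse_classification(b"8.234375:11")
print(result.score, result.index)
```

The index can then be looked up in whatever label list the model was trained with; that mapping is outside the scope of this sketch.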
