2 files changed, +7 −5

@@ -28,7 +28,7 @@ Getting Started
 * :ref:`use_from_pytorch`
 * :ref:`runtime`
 * :ref:`using_dla`
-* :ref:`deploy_torch_tensorrt_to_triton`
+* :ref:`serving_torch_tensorrt_with_triton`

 .. toctree::
    :caption: Getting Started
@@ -44,7 +44,7 @@ Getting Started
    tutorials/use_from_pytorch
    tutorials/runtime
    tutorials/using_dla
-   tutorials/deploy_torch_tensorrt_to_triton
+   tutorials/serving_torch_tensorrt_with_triton

 .. toctree::
    :caption: Notebooks
@@ -1,5 +1,7 @@
-Deploying a Torch-TensorRT model (to Triton)
-============================================
+.. _serving_torch_tensorrt_with_triton:
+
+Serving a Torch-TensorRT model with Triton
+==========================================

 Optimization and deployment go hand in hand in a discussion about Machine
 Learning infrastructure. Once network level optimizations are done
@@ -210,5 +212,5 @@ The output of the same should look like below:
 b'8.234375:11']

 The output format here is ``<confidence_score>:<classification_index>``.
-To learn how to map these to the label names and more, refer to our
+To learn how to map these to the label names and more, refer to Triton Inference Server's
 `documentation <https://github.com/triton-inference-server/server/blob/main/docs/protocol/extension_classification.md>`__.
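As a minimal sketch (not part of the tutorial's own code), here is how one such result string could be split into a numeric confidence and a class index, assuming the ``<confidence_score>:<classification_index>`` byte-string format shown above; `parse_classification` is a hypothetical helper name:

```python
def parse_classification(raw: bytes) -> tuple[float, int]:
    """Split one Triton classification result, e.g. b'8.234375:11',
    into (confidence_score, classification_index)."""
    score, index = raw.decode("utf-8").split(":", 1)
    return float(score), int(index)

# Using the value shown in the tutorial's example output:
confidence, class_index = parse_classification(b"8.234375:11")
print(confidence, class_index)  # prints: 8.234375 11
```

Mapping `class_index` to a human-readable label is model-specific; the linked Triton classification-extension documentation covers that step.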