You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/ContribOperators.md
+2
Original file line number
Diff line number
Diff line change
@@ -1597,6 +1597,8 @@ This version of the operator has been available since version 1 of the 'com.micr
1597
1597
<dd>Usually each single EPContext associate with a graph partition.But for some case like QNN, it has single EPContext contains all partitions.In that case, the node with ep_cache_context should set main_context=1. Other nodes set main_context=0 and skip ep_cache_context.The path is relative to this Onnx file. Default is 1.</dd>
1598
1598
<dt><tt>notes</tt> : string</dt>
1599
1599
<dd>(Optional) Some notes for the model</dd>
1600
+
<dt><tt>onnx_model_filename</tt> : string</dt>
1601
+
<dd>(Optional) Filename of the original ONNX model.</dd>
* - if "trt_engine_cache_path" is "" -> the engine cache will be saved to "./context_model_dir"
65
65
* - if "trt_engine_cache_path" is "engine_dir" -> the engine cache will be saved to "./context_model_dir/engine_dir"
66
66
*
67
+
* 3. In the case of building weight-stripped engines, the same security reasons as listed in 1) apply to the
68
+
* "onnx_model_filename" node attribute of EP context node, which contains a filename of the ONNX model with the
69
+
* weights needed for the refit process. User can specify a folder path relative to the current working
70
+
* directory by means of the "trt_onnx_model_folder_path" option.
71
+
*
67
72
*/
68
-
inttrt_dump_ep_context_model{0}; // Dump EP context node model
69
-
constchar*trt_ep_context_file_path{nullptr}; // Specify file name to dump EP context node model. Can be a path or a file name or a file name with path.
70
-
inttrt_ep_context_embed_mode{0}; // Specify EP context embed mode. Default 0 = context is engine cache path, 1 = context is engine binary data
73
+
inttrt_dump_ep_context_model{0}; // Dump EP context node model
74
+
constchar*trt_ep_context_file_path{nullptr}; // Specify file name to dump EP context node model. Can be a path or a file name or a file name with path.
75
+
inttrt_ep_context_embed_mode{0}; // Specify EP context embed mode. Default 0 = context is engine cache path, 1 = context is engine binary data
0 commit comments