Make partitioning utils QDQ aware so it does not break up QDQ node units #19723

skottmckay · 2024-02-29T12:36:15Z

Description

If the EP handles QDQ node units, we need to make sure we do not split those into different partitions.

Update the partitioning utils to be QDQ aware. If there are node units, we process the logical nodes they represent instead of individual nodes. This ensures we process all nodes in a QDQ node unit at the same time, so they always end up in the same partition.
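As a rough illustration of the idea described above, here is a minimal sketch that walks nodes in topological order but makes the supported/unsupported decision once per node unit. It assumes a heavily simplified model: `NodeIndex`, `NodeUnit`, `PartitionByUnit` and the `unit_of` lookup are hypothetical stand-ins, not the ONNX Runtime types or the actual partitioning algorithm, which also has to account for graph dependencies.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <utility>
#include <vector>

// Hypothetical, simplified stand-ins; these are not the ONNX Runtime types.
using NodeIndex = std::size_t;

struct NodeUnit {
  std::vector<NodeIndex> nodes;  // e.g. {DQ, target, Q}, in execution order
};

// Walk nodes in topological order, but decide per node unit.
// unit_of[n] is the index into `units` of the unit that owns node n.
std::vector<std::vector<NodeIndex>> PartitionByUnit(
    const std::vector<NodeIndex>& topo_order,
    const std::vector<NodeUnit>& units,
    const std::vector<std::size_t>& unit_of,
    const std::function<bool(const NodeUnit&)>& is_supported) {
  std::vector<std::vector<NodeIndex>> partitions;
  std::vector<NodeIndex> current;
  std::vector<bool> unit_done(units.size(), false);

  for (NodeIndex n : topo_order) {
    assert(n < unit_of.size());
    const std::size_t u = unit_of[n];
    if (unit_done[u]) continue;  // this node was already handled with its unit
    unit_done[u] = true;

    if (is_supported(units[u])) {
      // Every node of the unit is added together, so the unit cannot be split.
      current.insert(current.end(), units[u].nodes.begin(), units[u].nodes.end());
    } else if (!current.empty()) {
      // An unsupported unit ends the current run of supported units.
      partitions.push_back(std::move(current));
      current.clear();
    }
  }
  if (!current.empty()) partitions.push_back(std::move(current));
  return partitions;
}
```

Because a unit is marked as done the first time any of its nodes is visited, the remaining nodes of that unit are skipped later in the walk rather than being assigned on their own.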

Motivation and Context

Fix one of the issues in #19590

HectorSVC · 2024-03-06T07:11:44Z

CreateSupportedPartitions(const GraphViewer& graph_viewer,

The QNN EP uses this one.

Refers to: onnxruntime/core/providers/partitioning_utils.cc:409 in 956412b.

HectorSVC · 2024-03-06T07:19:03Z

  if (!Contains(node_outputs, input)) {

Is it guaranteed that the nodes are topologically sorted? If not, I think it's safer to build up node_outputs first, before processing any inputs.

Refers to: onnxruntime/core/providers/partitioning_utils.cc:328 in 956412b.

skottmckay · 2024-03-06T09:20:56Z

  if (!Contains(node_outputs, input)) {

It is currently, as we process nodes in topological order when doing partitioning.

We could process all outputs first before inputs. Is there a benefit apart from not requiring things to be topologically sorted if we do that?

In reply to: 1980232432

Refers to: onnxruntime/core/providers/partitioning_utils.cc:328 in 956412b.
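For reference, the "outputs first" alternative discussed in this thread could look roughly like the sketch below. It is only an illustration under simplified assumptions: `SimpleNode` and `ExternalInputs` are hypothetical names, values are identified by plain strings, and this is not the code in partitioning_utils.cc.

```cpp
#include <set>
#include <string>
#include <vector>

// Hypothetical minimal node: only the names of the values it consumes and produces.
struct SimpleNode {
  std::vector<std::string> inputs;
  std::vector<std::string> outputs;
};

// Returns the values that a group of nodes needs from outside the group.
// The result does not depend on the order of `nodes`.
std::vector<std::string> ExternalInputs(const std::vector<SimpleNode>& nodes) {
  // Pass 1: record every value produced inside the group.
  std::set<std::string> node_outputs;
  for (const SimpleNode& node : nodes) {
    node_outputs.insert(node.outputs.begin(), node.outputs.end());
  }

  // Pass 2: any input not produced inside the group must come from outside.
  std::vector<std::string> external;
  std::set<std::string> seen;  // avoids listing the same value twice
  for (const SimpleNode& node : nodes) {
    for (const std::string& input : node.inputs) {
      if (node_outputs.count(input) == 0 && seen.insert(input).second) {
        external.push_back(input);
      }
    }
  }
  return external;
}
```

A single-pass version that checks each input against the outputs seen so far gives the same answer only when producers are visited before consumers, which is the topological-order assumption being asked about.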

HectorSVC · 2024-03-06T16:44:52Z

  if (!Contains(node_outputs, input)) {

It happened when I was trying to fix this in my PR, which left the nodes no longer in topological order. That's why I was asking. I was concerned that someone else might do the same again without being aware that the code here requires nodes in topological order.

In reply to: 1980424111

Refers to: onnxruntime/core/providers/partitioning_utils.cc:328 in 956412b.

- The whole QDQ setup needs a rethink at some point as it's currently spread across too many places (framework, optimizer, base providers lib, EP specific providers lib)
- move NodeGroup to framework/node_unit.h and ValidateNodeGroupQDQNodes to NodeGroup::CanCreateNodeGroup so it's in the framework lib as it's used by NodeUnit
- move GetAllNodeUnits to optimizer - doesn't quite belong there but this works with all the current EPs that use it.

skottmckay · 2024-03-07T22:13:34Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

skottmckay · 2024-03-07T22:13:36Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline

skottmckay · 2024-03-07T22:13:37Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-03-07T22:13:53Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-03-07T22:14:11Z

Azure Pipelines successfully started running 10 pipeline(s).

azure-pipelines · 2024-03-07T22:14:14Z

Azure Pipelines successfully started running 10 pipeline(s).

edgchen1

Haven't finished reviewing, but it looks good so far.

Make partitioning utils QDQ aware so it does not break up QDQ node units.

d2bc196

skottmckay requested a review from edgchen1 February 29, 2024 12:36

github-advanced-security bot found potential problems Feb 29, 2024

onnxruntime/test/testdata/ort_github_issue_19590.py (dismissed)

github-advanced-security bot found potential problems Feb 29, 2024

onnxruntime/test/providers/partitioning_utils_test.cc (fixed)

onnxruntime/test/testdata/ort_github_issue_19590.py (fixed)

skottmckay added 2 commits March 1, 2024 09:39

Lint

32fec58

Merge remote-tracking branch 'origin/main' into skottmckay/MakePartitioningUtilsQDQAware

577acce

jywu-msft requested a review from HectorSVC March 1, 2024 04:33

skottmckay added 2 commits March 3, 2024 11:48

Fix build error

956412b

Merge remote-tracking branch 'origin/main' into skottmckay/MakePartitioningUtilsQDQAware

0dcb98f

skottmckay mentioned this pull request Mar 6, 2024

improve the partition logic #19789

Closed

skottmckay added 2 commits March 6, 2024 15:35

Merge remote-tracking branch 'origin/main' into skottmckay/MakePartitioningUtilsQDQAware

2e0fbb5

Move NodeUnit to framework, and update include paths.

1a0336f

Fix order of nodes in partition. DQ -> target -> Q must be added sequentially. Fix x86 build error. Fix android issue with gtest flags not being processed (couldn't debug failing test easily when --gtest_filter didn't work)
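The ordering requirement in this commit message (DQ -> target -> Q) can be pictured with the small sketch below. It is an assumption-laden illustration, not the actual change: `QdqUnit` and `NodesInExecutionOrder` are hypothetical names and nodes are represented by plain integer ids.

```cpp
#include <vector>

// Hypothetical simplified QDQ node unit; nodes are identified by integer ids.
struct QdqUnit {
  std::vector<int> dq_nodes;  // DequantizeLinear nodes feeding the target
  int target_node = 0;        // the operator the unit wraps
  std::vector<int> q_nodes;   // QuantizeLinear nodes on the outputs; may be empty
};

// Emit the nodes of a unit so that producers always precede consumers:
// all DQ nodes first, then the target, then any Q nodes.
std::vector<int> NodesInExecutionOrder(const QdqUnit& unit) {
  std::vector<int> ordered;
  ordered.reserve(unit.dq_nodes.size() + 1 + unit.q_nodes.size());
  ordered.insert(ordered.end(), unit.dq_nodes.begin(), unit.dq_nodes.end());
  ordered.push_back(unit.target_node);
  ordered.insert(ordered.end(), unit.q_nodes.begin(), unit.q_nodes.end());
  return ordered;
}
```

A unit without Q nodes simply contributes its DQ nodes followed by the target.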

Add ability to pass in node unit map to other CreateSupportedPartitions variant. Fix issue with QDQ node group that has no Q nodes. TODO: Fix QnnHTPBackendTests.TopK_LargestFloats_U8_LastAxis

20ba3eb

HectorSVC previously approved these changes Mar 6, 2024

skottmckay added 2 commits March 7, 2024 12:19

Fix issue with handling QDQ node group that has output edge from target node (int64_t indices output that is not quantized) as well as edge through Q node (values output)

4df61e9

Merge remote-tracking branch 'origin/main' into skottmckay/MakePartitioningUtilsQDQAware

cb41315

skottmckay dismissed HectorSVC’s stale review via cb41315 March 7, 2024 02:20

skottmckay added 2 commits March 7, 2024 22:20

Exclude node unit and partitioning utils from minimal build.

c89504c

edgchen1 reviewed Mar 8, 2024

skottmckay added 2 commits March 8, 2024 18:15

Address PR comments

8841083

Merge remote-tracking branch 'origin/main' into skottmckay/MakePartitioningUtilsQDQAware

6785671

edgchen1 previously approved these changes Mar 9, 2024

onnxruntime/core/providers/qnn/qnn_execution_provider.cc (resolved)

onnxruntime/test/providers/partitioning_utils_test.cc (outdated, resolved)

onnxruntime/core/providers/partitioning_utils.cc (outdated, resolved)

Merge remote-tracking branch 'origin/main' into skottmckay/MakePartitioningUtilsQDQAware

fdf8c6a

skottmckay dismissed edgchen1’s stale review via 2e6800d March 11, 2024 08:17

Apply suggestions from code review

2e6800d

Co-authored-by: Edward Chen <[email protected]>

edgchen1 approved these changes Mar 11, 2024

skottmckay merged commit 978c40d into main Mar 12, 2024
91 of 94 checks passed

skottmckay deleted the skottmckay/MakePartitioningUtilsQDQAware branch March 12, 2024 00:55

HectorSVC mentioned this pull request Mar 15, 2024

Enable code in QNN UT to verify the fix for partition issue #19939

Merged

HectorSVC added a commit that referenced this pull request Mar 16, 2024

Enable code in QNN UT to verify the fix for partition issue (#19939)

d5c6a2c

### Description Enable code in QNN UT to verify the fix for the partition issue related to QDQ models. #19723
