[GPU] to use original dims for fc_onednn_impl #30087

e-ddykim · 2025-04-11T11:39:54Z

Details:

This PR updates fc_onednn_impl to use original dims instead of squashing input layout to 2D.
It resolves layout mismatch problems between fc and post-ops.
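
A rough illustration of the kind of mismatch described above. This is a sketch under assumptions, not the plugin's code: the `Dims4` type and the `squash_to_2d` and `broadcastable` helpers are hypothetical names introduced only for this example, and the shapes are made up. It shows how an operand that broadcasts cleanly against the original 4D dims can stop lining up once both sides are collapsed to 2D.

```cpp
#include <array>
#include <cstddef>

// Hypothetical 4D shape, outermost dim first; NOT the plugin's layout type.
using Dims4 = std::array<long, 4>;

// Collapse all leading dims into one and keep the innermost dim:
// [a, b, c, d] -> [a*b*c, d].
constexpr std::array<long, 2> squash_to_2d(const Dims4& d) {
    return {d[0] * d[1] * d[2], d[3]};
}

// Same-rank broadcast check: every pair of dims must match or one side must be 1.
template <std::size_t N>
constexpr bool broadcastable(const std::array<long, N>& a,
                             const std::array<long, N>& b) {
    for (std::size_t i = 0; i < N; ++i) {
        if (a[i] != b[i] && a[i] != 1 && b[i] != 1) {
            return false;
        }
    }
    return true;
}
```

With an assumed fc output of `{2, 3, 4, 8}` and a post-op operand of `{1, 3, 1, 1}`, the 4D shapes broadcast, while their squashed forms `{24, 8}` and `{3, 1}` do not.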

Tickets:

164104, 164106

yeonbok · 2025-04-15T17:09:01Z

src/plugins/intel_gpu/src/graph/fully_connected.cpp

+                                           << orig_impl_param.typed_desc<fully_connected>()->id
+                                           << " " << input_shape
+                                           << " + " << fuse_op_input_shape << std::endl;
+                    can_apply_fake_alignment = false;


NOTE: I think we can add a reshape for this case in the future

I'll try that in the future, as you suggested. Thank you!

isanghao · 2025-04-16T04:20:46Z

src/plugins/intel_gpu/src/graph/fully_connected.cpp

@@ -66,14 +66,14 @@ format::type get_preferred_format(fully_connected_node const& node, const kernel
    }

    if (input_layout.data_type == data_types::f32 &&
-        (input_layout.format == format::bfyx || input_layout.format == format::bfzyx) &&
+        (input_layout.format == format::bfyx || input_layout.format == format::bfzyx || input_layout.format == format::bfwzyx) &&


nit: You can use one_of(input_layout.format, format::bfyx, format::bfzyx, format::bfwzyx).

Updated as you suggested. Thank you.
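
For readers unfamiliar with the pattern, here is a minimal standalone sketch of a variadic `one_of` helper with the call shape used in the nit. It is an assumption-laden stand-in, not the plugin's actual helper: the `fmt` enum below is hypothetical and only mirrors the three format names from the diff. It requires C++17 for the fold expression.

```cpp
// Hypothetical stand-in for the plugin's format enum; illustration only.
enum class fmt { bfyx, bfzyx, bfwzyx, other };

// Returns true if `value` compares equal to any of `candidates`.
// The fold expression expands to (value == c1) || (value == c2) || ...
template <typename T, typename... Ts>
constexpr bool one_of(const T& value, const Ts&... candidates) {
    return ((value == candidates) || ...);
}

// The chained comparison from the diff, rewritten with the helper.
constexpr bool is_plain_format(fmt f) {
    return one_of(f, fmt::bfyx, fmt::bfzyx, fmt::bfwzyx);
}
```

Adding another accepted format then means appending one argument rather than another `||` clause.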

src/plugins/intel_gpu/src/graph/impls/onednn/fully_connected_onednn.cpp

src/plugins/intel_gpu/tests/unit/test_cases/fully_connected_gpu_test.cpp

isanghao · 2025-04-16T04:39:07Z

src/plugins/intel_gpu/src/graph/impls/onednn/fully_connected_onednn.cpp

                group_size = ifm / ngroups;
                if (!is_four_bit_weight) {
                    // 8-bit quantized weight
-                    attr->set_scales(DNNL_ARG_WEIGHTS, PER_OC, dnnl::memory::dims{}, ds_data_type);
+                    attr->set_scales(DNNL_ARG_WEIGHTS, (PER_OC << shift_size), dnnl::memory::dims{}, ds_data_type);


nit: What about introducing a local variable per_oc = PER_OC << shift_size? That would remove the duplicated expression.

Updated as you suggested. Thank you.
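
A small sketch of the suggested refactor, written outside oneDNN so it stands alone. oneDNN scale masks are bitmasks over tensor dimensions, so keeping extra leading dims moves the per-output-channel bit to the left. The concrete values here are assumptions for illustration: `kPerOc` and the `rank - 2` rule for `shift_size` are hypothetical and may differ from what fully_connected_onednn.cpp actually computes.

```cpp
// Assumed value: bit 1 marks the output-channel dim of a 2D tensor.
// The real PER_OC constant is defined in the plugin source.
constexpr int kPerOc = 1 << 1;

// Assumption: every dim kept beyond 2D shifts the channel bit left by one.
constexpr int shift_size_for_rank(int rank) {
    return rank > 2 ? rank - 2 : 0;
}

// Compute the shifted mask once and reuse it, as the nit proposes,
// instead of repeating (PER_OC << shift_size) at every call site.
constexpr int per_oc_mask(int rank) {
    const int shift_size = shift_size_for_rank(rank);
    const int per_oc = kPerOc << shift_size;
    return per_oc;
}
```

Under these assumptions a 2D tensor keeps mask 2, while a 4D tensor gets mask 8.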

src/plugins/intel_gpu/src/graph/fully_connected.cpp

src/plugins/intel_gpu/src/graph/impls/onednn/fully_connected_onednn.cpp

isanghao

No perf issue in the dGPU static-shape daily test.

e-ddykim added WIP work in progress do not merge do_not_review do_not_merge labels Apr 11, 2025

e-ddykim requested review from a team as code owners April 11, 2025 11:39

github-actions bot added the category: GPU OpenVINO GPU plugin label Apr 11, 2025

e-ddykim force-pushed the gpu_onednn_fc_fix branch from 956fba1 to f9a5278 Compare April 14, 2025 09:58

e-ddykim removed WIP work in progress do not merge do_not_review do_not_merge labels Apr 15, 2025

yeonbok reviewed Apr 15, 2025

yeonbok approved these changes Apr 15, 2025

yeonbok added this to the 2025.2 milestone Apr 15, 2025

ahnyoung-paul approved these changes Apr 15, 2025

isanghao reviewed Apr 16, 2025

e-ddykim force-pushed the gpu_onednn_fc_fix branch 2 times, most recently from a1e95d1 to 4d8f7dd Compare April 16, 2025 08:44

e-ddykim added 8 commits April 17, 2025 22:06

use original dims for fc_onednn_impl

6e773c0

fix mistake

f814092

updated not to apply fake_alignment when weights_rank is higher than 2

c379a0f

fixed typos

4c79e21

updated per reviews

0919748

fixed errors in weights calculation of fc

bd440c3

updated static fc to have 4 dims at least

32a821b

updated hash key for fc

4b0097b

e-ddykim force-pushed the gpu_onednn_fc_fix branch from 4d8f7dd to 4b0097b Compare April 17, 2025 13:43

isanghao approved these changes Apr 18, 2025

isanghao added this pull request to the merge queue Apr 18, 2025

Merged via the queue into openvinotoolkit:master with commit 95122bd Apr 18, 2025
169 checks passed
