tpch10000 q18. WideCombiner single bucket don't fit into memory #11416

lll-phill-lll · 2024-11-08T13:02:52Z

ydb/ydb/library/yql/minikql/comp_nodes/mkql_wide_combine.cpp

Line 481 in 65dc86e

bool isNew = SpilledBuckets.front().InMemoryProcessingState->TasteIt();

Bt:
https://paste.yandex-team.ru/d7a045b0-8309-4f2f-bc6d-56c514da5928

vladl2802 · 2024-11-13T13:32:38Z

What we tried for now (in #11471):

When extracting data from combiner first process in-memory buckets and only after those process spilled buckets. But this change most likely don't have any impact because all buckets will be spilled before extracting.
Add extra hashing function on top of existing Hasher. Motivation: without this there are only 1/4 unique hash values on q18 (Hasher for integers returns just its value as hash and for input data %32 < 8 is true. So for ex if we want to split values in 128 buckets with hash % 128 we will get only 1/4 non-empty buckets). But with extra hash function values are distributed evenly between buckets

As a result execution time of q18 on scale 100 with 1 task (pragma ydb.MaxTasksPerStage="1") jump from average 297s to 357s. But judging by flame graphs (placed lower), additional hash function does not have any huge impact on performance (0.08% for both last combiners). So my guess is that performance drop is caused by more smaller buckets to process then before.

According to @lll-phill-lll, on 10k scale this fixes memory limit exception that was firing and query got timeouted after 1 hour of execution.

Without hashing

With hashing

vladl2802 · 2024-11-14T12:15:29Z

For point (2) I've tried xxh64 (those flame graphs are in previous comment and below also) and fibonacci hashing (its flame graph is below).

So for some reason (that I can't explain for now) xxh64 seems faster than fibonacci even so fibonacci should require less operations to compute. But I am not sure about that point, because execution time is really unstable.

So we will use xxh64 for now. Further progress can be made in block combine or/and after #11591

Fibonacci hashing (as here)

xxh64

lll-phill-lll · 2024-11-14T18:57:03Z

2h run results:
https://nda.ya.ru/t/qWfckDan79ezxP
spilling plot: https://nda.ya.ru/t/wp-yyyx779ezvp

Looks suspicious. Like it worked only for 30 mins

lll-phill-lll · 2024-11-14T20:08:09Z

https://nda.ya.ru/t/dlVeCoho79f4uA

Error after 32 mins

lll-phill-lll added the area/runtime YDB runtime issues label Nov 8, 2024

lll-phill-lll self-assigned this Nov 8, 2024

vladl2802 mentioned this issue Nov 11, 2024

[Not for merge] Fix order of bucket processing in wide combine spilling #11471

Draft

vladl2802 self-assigned this Nov 11, 2024

lll-phill-lll removed their assignment Nov 11, 2024

lll-phill-lll linked a pull request Nov 12, 2024 that will close this issue

[Not for merge] Fix order of bucket processing in wide combine spilling #11471

Draft

lll-phill-lll mentioned this issue Nov 14, 2024

get rid of std::hash #11591

Open

lll-phill-lll closed this as completed Dec 5, 2024

lll-phill-lll mentioned this issue May 23, 2025

Scalar aggregation. Buckets repartitioning algorithm #18754

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

tpch10000 q18. WideCombiner single bucket don't fit into memory #11416

tpch10000 q18. WideCombiner single bucket don't fit into memory #11416

lll-phill-lll commented Nov 8, 2024 •

edited

Loading

vladl2802 commented Nov 13, 2024 •

edited

Loading

Uh oh!

vladl2802 commented Nov 14, 2024 •

edited

Loading

Uh oh!

lll-phill-lll commented Nov 14, 2024

Uh oh!

lll-phill-lll commented Nov 14, 2024

Uh oh!

tpch10000 q18. WideCombiner single bucket don't fit into memory #11416

tpch10000 q18. WideCombiner single bucket don't fit into memory #11416

Comments

lll-phill-lll commented Nov 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

vladl2802 commented Nov 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Without hashing

With hashing

Uh oh!

vladl2802 commented Nov 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Fibonacci hashing (as here)

xxh64

Uh oh!

lll-phill-lll commented Nov 14, 2024

Uh oh!

lll-phill-lll commented Nov 14, 2024

Uh oh!

lll-phill-lll commented Nov 8, 2024 •

edited

Loading

vladl2802 commented Nov 13, 2024 •

edited

Loading

vladl2802 commented Nov 14, 2024 •

edited

Loading