BUGFIX: NodeDataRepository uses PersistenceObjectIdentifier to improve query speed #49

gjwnc · 2021-09-15T12:19:40Z

This PR reduces the runtime of the buildCommand by around 90%.

On a larger project with ~350,000 nodes in the nodedata table, filling the build queue with ./flow nodeindexqueue:build --workspace live takes around 50 minutes. With this PR, it takes around 4 minutes.

What I did

Instead of using OFFSET, I use a condition on the persistence object identifier (POI) and sort the nodedata table by it. This lets the DB return each batch in roughly the same time, no matter how far into the table the batch starts.

Why does it work

The previous query uses OFFSET, which means the database has to step through and discard every row before the offset on each query. That is why the build command slows down as the offset increases.
To avoid this problem, we can use the primary key instead of the offset and start each batch right after the last POI of the previous batch.
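The difference between the two batching strategies can be sketched in a few lines. This is a minimal illustration, not the code from this PR: it uses Python with an in-memory SQLite database instead of the Flow/Doctrine repository, and the table layout, the zero-padded `poi` values, and the function names are assumptions made for the example. An ORDER BY is added to the OFFSET variant so that both variants return rows in the same order.

```python
import sqlite3


def make_db(n):
    """Create a hypothetical nodedata table with n rows (illustration only)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE nodedata (poi TEXT PRIMARY KEY, path TEXT)")
    conn.executemany(
        "INSERT INTO nodedata VALUES (?, ?)",
        ((f"{i:08d}", f"/sites/node-{i}") for i in range(n)),
    )
    return conn


def offset_batches(conn, batch_size):
    """Old approach: the DB must skip `offset` rows again on every query."""
    offset = 0
    while True:
        rows = conn.execute(
            "SELECT poi, path FROM nodedata ORDER BY poi LIMIT ? OFFSET ?",
            (batch_size, offset),
        ).fetchall()
        if not rows:
            return
        yield rows
        offset += len(rows)


def keyset_batches(conn, batch_size):
    """New approach: continue right after the last identifier already seen."""
    last_poi = ""  # the empty string sorts before every non-empty identifier
    while True:
        rows = conn.execute(
            "SELECT poi, path FROM nodedata WHERE poi > ? ORDER BY poi LIMIT ?",
            (last_poi, batch_size),
        ).fetchall()
        if not rows:
            return
        yield rows
        last_poi = rows[-1][0]  # remember where this batch ended
```

Both generators yield the same rows in the same order. The difference is that the keyset variant only has to locate `last_poi` in the primary key index, while the OFFSET variant walks past all previously returned rows each time.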

gjwnc · 2022-03-30T15:24:54Z

@kdambekalns @daniellienert Maybe you can have a look at this PR please. It would greatly improve the speed of filling the ES build queue.

daniellienert · 2022-03-30T18:22:23Z

Hey @gjwnc - interesting approach! Didn't expect that seeking is more expensive than sorting, but we have the same issues - I will give it a try.

kdambekalns

What is a "POD"?

Should that be "POI" (Persistence Object Identifier)? Or do I need to learn a new term here? 🤷‍♂️

gjwnc · 2022-03-31T05:32:29Z

> What is a "POD"?
>
> Should that be "POI" (Persistence Object Identifier)? Or do I need to learn a new term here? 🤷‍♂️

@kdambekalns Arrgh, you're right. I've been using this abbreviation for years and never noticed it 🤦. I've renamed it to POI.

gjwnc · 2022-03-31T05:46:59Z

@daniellienert Yeah, this should work because the primary key is usually indexed as a B-tree, so its values are already stored in sorted order. That is the reason it is faster: the DB does not need to sort anything for the primary key. The query tells the DB engine to start at a given primary key value and to reuse the ordering that the unique primary key index already provides.
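One way to check this claim is to look at the query plan. The sketch below uses Python with SQLite as a stand-in, so the plan text is SQLite's and will look different on MySQL/MariaDB or PostgreSQL. The table layout and the helper name `query_plan` are assumptions for illustration.

```python
import sqlite3


def query_plan(conn, sql, params=()):
    """Return the human-readable detail column of SQLite's EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[3] for row in rows]


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE nodedata (poi TEXT PRIMARY KEY, path TEXT)")

# Condition and ordering on the primary key: the index supplies both.
keyset_plan = query_plan(
    conn,
    "SELECT poi, path FROM nodedata WHERE poi > ? ORDER BY poi LIMIT 100",
    ("",),
)

# Ordering by a column without an index: SQLite has to sort the rows itself.
unindexed_plan = query_plan(
    conn,
    "SELECT poi, path FROM nodedata ORDER BY path LIMIT 100",
)

print(keyset_plan)
print(unindexed_plan)
```

In the first plan SQLite searches the automatic primary key index and reports no separate sorting step. In the second it reports a temporary B-tree for the ORDER BY, which is the extra sort that the primary key query avoids.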

kdambekalns

by reading…

gjwnc · 2022-05-03T14:18:43Z

@kdambekalns We had a short talk about this issue during Neos Con. Maybe you can have another look at it during the sprint, please?

daniellienert

Great performance boost!! Tested with 200k nodes.

BUGFIX: NodeDataRepository uses PersistenceObjectIdentifier to improve query speed

275b82d

daniellienert self-requested a review March 30, 2022 18:21

kdambekalns reviewed Mar 30, 2022

TASK: Rename abbreviation for persistence object identifier

468e5bc

kdambekalns approved these changes Mar 31, 2022

daniellienert approved these changes May 4, 2022

daniellienert merged commit bfc80c4 into Flowpack:master May 4, 2022
