NC | lifecycle | continue last run #8925

nadavMiz · 2025-04-01T16:53:39Z

Describe the Problem

lifecycle run might fail or timeout before all the objects were processed. add a flag to continue the ran from where it last stopped

Explain the Changes

add new flag continue to resume the last run. in case last run finished. this will end the ran. in case there was no last run. start a new run
move function to load previous run status to a new file lifecycle_utils and use it both in nc_lifecycle and health scripts
create init function for bucket and rule status, and init objects if they don't exist
in case we run with continue. load the previous run states to this run status object

Issues: Fixed #xxx / Gap #xxx

Testing Instructions:

automatic tests:
sudo npx jest test_nc_lifecycle_cli
manual test:

set timeout less then run time, but more then a single cycle
keep starting new runs with continue flag, until a lifecycle finishes successfully
all new lifecycle runs with continue should pass without doing anything

Doc added/updated
Tests added

src/util/lifecycle_utils.js

src/manage_nsfs/nc_lifecycle.js

romayalon · 2025-04-02T09:48:36Z

src/manage_nsfs/nc_lifecycle.js

+        for (const [bucket_name, prev_bucket_status] of Object.entries(previous_run.buckets_statuses)) {
+            if (!buckets.includes(bucket_name)) continue;
+            const bucket_json = await config_fs.get_bucket_by_name(bucket_name, config_fs_options);
+            if (!bucket_json.lifecycle_configuration_rules) continue;


why bucket json and lifecycle_configuration_rules check is needed? AFAIU you can just copy and nullify/undefindify ( :) ) the stats and times of each bucket/rule
I believe we care about the general structure of buckets and rules and state

buckets and rules can change between runs. since we save this in the log file. we will have empty object for rule/ bucket with irrelevant state. there may still be concurrency issue. but I think we should at least check it between runs

yes we can nullify the state if it doesn't exist. I can change it work like that

romayalon · 2025-04-02T09:55:57Z

src/test/unit_tests/jest_tests/test_nc_lifecycle_cli.test.js

+            //ignore error
+        }
+        await exec_manage_cli(TYPES.LIFECYCLE, '', {continue: 'true', disable_service_validation: 'true', disable_runtime_validation: 'true', config_root}, undefined, undefined);
+        const object_list2 = await object_sdk.list_objects({bucket: test_bucket});


where is the assert that the run didn't do anything?

this is not the test that the run didn't do anything. its a test that it did finish the job. might be a redundant test. but I though it might find general bugs with continue flow. I can test that the lifecycle finished successfully. the test that test that it didn't is the previous one

src/test/unit_tests/jest_tests/test_nc_lifecycle_cli.test.js

src/manage_nsfs/nc_lifecycle.js

Signed-off-by: nadav mizrahi <[email protected]>

nadavMiz requested a review from romayalon April 1, 2025 16:53

pull-request-size bot added the size/L label Apr 1, 2025

nadavMiz force-pushed the lifecycle-continue-argument branch from 4fd3474 to f735ef9 Compare April 1, 2025 16:58

nadavMiz self-assigned this Apr 1, 2025

nadavMiz force-pushed the lifecycle-continue-argument branch 3 times, most recently from 1d8f8da to d04893f Compare April 2, 2025 09:25

romayalon mentioned this pull request Apr 2, 2025

NC | Lifecycle | GPFS ILM integration #8923

Merged

2 tasks

romayalon reviewed Apr 2, 2025

View reviewed changes

nadavMiz force-pushed the lifecycle-continue-argument branch from d04893f to d59c586 Compare April 2, 2025 12:51

NC | lifecycle | continue last run

73edf4c

Signed-off-by: nadav mizrahi <[email protected]>

nadavMiz force-pushed the lifecycle-continue-argument branch from d59c586 to 73edf4c Compare April 2, 2025 12:54

romayalon approved these changes Apr 2, 2025

View reviewed changes

nadavMiz merged commit af53218 into noobaa:master Apr 2, 2025
11 of 13 checks passed

romayalon mentioned this pull request May 15, 2025

NC | 5.18.4 backports #9027

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

NC | lifecycle | continue last run #8925

NC | lifecycle | continue last run #8925

Uh oh!

nadavMiz commented Apr 1, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

romayalon Apr 2, 2025

Uh oh!

nadavMiz Apr 2, 2025

Uh oh!

nadavMiz Apr 2, 2025

Uh oh!

romayalon Apr 2, 2025

Uh oh!

nadavMiz Apr 2, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

NC | lifecycle | continue last run #8925

NC | lifecycle | continue last run #8925

Uh oh!

Conversation

nadavMiz commented Apr 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Describe the Problem

Explain the Changes

Issues: Fixed #xxx / Gap #xxx

Testing Instructions:

Uh oh!

Uh oh!

Uh oh!

Uh oh!

romayalon Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

nadavMiz Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

nadavMiz Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

romayalon Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

nadavMiz Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nadavMiz commented Apr 1, 2025 •

edited

Loading