Increase Azure client timeout on tests #67210

fcofdez · 2021-01-08T15:21:51Z

Additionally, this commit improves the error messages provided as
previously we weren't including the blob name on deletion failures.

Closes #67119

Instead of executing all the delete requests in parallel, this commit introduces a change that executes delete requests in batches of 100 parallel deletions. The reason for this change is to avoid timeout failures when a large number of files must be deleted: if we execute all of them in parallel, a few slow requests can cause the rest to fail with timeouts, since there is an effective limit at the connection pool level. Additionally, this commit improves the error messages, as previously we weren't including the blob name on deletion failures. Closes elastic#67119
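For readers following along, here is a minimal sketch of the batching idea described above. It is not the code from this PR: the class `BatchedDeleteSketch`, the `deleteBlob` callback, and the use of `CompletableFuture` are assumptions made for illustration, and the real Azure client code is structured differently. It shows deletions running in parallel within a batch of 100, batches running one after another, and the blob name being attached to any failure.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CompletionException;
import java.util.function.Function;

// Hypothetical illustration only; names and structure are not taken from the PR.
class BatchedDeleteSketch {
    static final int BATCH_SIZE = 100;

    // Split the blob names into consecutive chunks of at most `size` entries.
    static List<List<String>> partition(List<String> names, int size) {
        List<List<String>> batches = new ArrayList<>();
        for (int from = 0; from < names.size(); from += size) {
            int to = Math.min(from + size, names.size());
            batches.add(new ArrayList<>(names.subList(from, to)));
        }
        return batches;
    }

    // Run one batch at a time; deletions inside a batch run in parallel.
    static void deleteInBatches(List<String> blobNames,
                                Function<String, CompletableFuture<Void>> deleteBlob) {
        for (List<String> batch : partition(blobNames, BATCH_SIZE)) {
            List<CompletableFuture<Void>> inFlight = new ArrayList<>();
            for (String name : batch) {
                // Attach the blob name to any failure so the error says which blob failed.
                CompletableFuture<Void> wrapped = deleteBlob.apply(name).handle((ignored, error) -> {
                    if (error != null) {
                        throw new CompletionException(
                            new IOException("Unable to delete blob " + name, error));
                    }
                    return ignored;
                });
                inFlight.add(wrapped);
            }
            // Wait for the whole batch before starting the next one.
            CompletableFuture.allOf(inFlight.toArray(new CompletableFuture[0])).join();
        }
    }
}
```

In this sketch a failed deletion aborts the remaining batches, which is one possible design; collecting failures and continuing would be another.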

elasticmachine · 2021-01-08T15:21:54Z

Pinging @elastic/es-distributed (Team:Distributed)

fcofdez · 2021-01-12T09:34:58Z

@original-brownbear would you mind taking a look at this when you have the time? Thanks!

original-brownbear

Thanks Francisco, this makes sense I think.
My understanding is that the problem we're facing is a client-side request timeout, because the requests have to wait so long to actually go out over the limited connections we have.
I wonder if we couldn't just way increase the timeout to work around this instead?
That would keep the code simpler for one, but it would also run faster overall I guess, since we get more parallelism from deleting.

The risk of failing a bulk delete every now and then because of a bunch of slow-running deletes isn't so bad IMO; all our repo operations will clean the left-overs up during subsequent delete operations anyway. In the real world it seems very unlikely we'd see what failed the test here, since the test timeouts are so absurdly short.
Also, in practice we don't even have a timeout by default for Azure, do we? So this is a test-only issue for the most part, it seems.
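
To make the reasoning above concrete, here is a deliberately simplified arithmetic model, not a measurement and not code from this PR. It assumes that every request takes the same time once it holds a connection, that the timeout clock starts when the request is submitted (so time spent waiting for a free connection counts), and that connections are handed out in submission order. The class `QueuedTimeoutModel` and all of the numbers are hypothetical.

```java
// Hypothetical model: how many requests exceed a client-side timeout when
// they all queue behind a limited number of connections.
class QueuedTimeoutModel {
    static int countTimedOut(int requests, int connections, long serviceMillis, long timeoutMillis) {
        int timedOut = 0;
        for (int i = 0; i < requests; i++) {
            // Number of full rounds of requests that run before this one gets a connection.
            long wave = i / connections;
            // Completion time measured from submission: queue wait plus own service time.
            long finishedAt = (wave + 1) * serviceMillis;
            if (finishedAt > timeoutMillis) {
                timedOut++;
            }
        }
        return timedOut;
    }
}
```

Under these assumptions, 1000 deletes over 10 connections at 50 ms each need 5 seconds in total, so a 1 second timeout fails most of them purely because of queueing. Either submitting only 100 at a time or raising the timeout above 5 seconds brings the count to zero, which is why both approaches address the test failure.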

fcofdez · 2021-01-12T13:49:48Z

Thanks for the review Armin!

I wonder if we couldn't just way increase the timeout to work around this instead?

Yes, that should solve the issue. I was a bit worried about the consequences of a bunch of failed delete
requests, but as you point out, that should be resolved during subsequent delete operations.

I'll increase the timeout and keep the improvements around the error messages.


original-brownbear

LGTM thanks Francisco!


fcofdez added the >bug, :Distributed Coordination/Snapshot/Restore, v8.0.0, Team:Distributed (Obsolete), v7.11.0, and v7.12.0 labels Jan 8, 2021

fcofdez requested a review from original-brownbear January 8, 2021 15:57

original-brownbear reviewed Jan 12, 2021


fcofdez added 2 commits January 12, 2021 14:51

Merge remote-tracking branch 'origin/master' into execute-azure-deletes-in-batches

19c6aac

Increase test client timeouts and remove delete batching

e5ce84d

fcofdez changed the title ~~Execute azure blob deletions in batches~~ Increase Azure client timeout on tests Jan 12, 2021

fcofdez requested a review from original-brownbear January 12, 2021 14:56

fcofdez added the >test label and removed the >bug label Jan 12, 2021

original-brownbear approved these changes Jan 12, 2021


fcofdez added 2 commits January 13, 2021 09:45

Increase AzureBlobContainerRetriesTests too

8697b6c

Merge remote-tracking branch 'origin/master' into execute-azure-deletes-in-batches

265419c

fcofdez merged commit 4b9f2e9 into elastic:master Jan 13, 2021

fcofdez added the backport pending label Jan 13, 2021

This was referenced Jan 13, 2021

[7.x] Increase Azure client timeout on tests #67428

Merged

[7.11] Increase Azure client timeout on tests #67429

Merged

fcofdez added a commit that referenced this pull request Jan 13, 2021

Increase Azure client timeout on tests (#67428)

6e58051

Additionally, this commit improves the error messages provided as previously we weren't including the blob name on deletion failures. Closes #67119 Backport of #67210

fcofdez added a commit that referenced this pull request Jan 13, 2021

Increase Azure client timeout on tests (#67429)

dba4ee0

Additionally, this commit improves the error messages provided as previously we weren't including the blob name on deletion failures. Closes #67119 Backport of #67210

fcofdez removed the backport pending label Jan 13, 2021

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021
