Options for processing ROIs in parallel within wells. #44

Open
jluethi opened this issue Aug 31, 2022 · 4 comments
Labels: Backlog (Backlog issues we may eventually fix, but aren't a priority), ngio (OME-Zarr reader/writer), Overview, Tables (AnnData and ROI/feature tables)

Comments

jluethi (Collaborator) commented Aug 31, 2022

The current vision for ROI processing (see #27) is to run all the ROIs in a well sequentially. This will be a very useful first implementation: many operations inherently parallelize, and we can run many wells in parallel.

Nevertheless, we may eventually want to parallelize some ROI processing within a well. This becomes hard when ROIs need to write to the same chunk of the zarr array, which would not be safe. But we can think of ways to handle this; I think the options roughly go in the following order:

  1. Run all ROIs sequentially (current plan to implement)
  2. Run ROIs in parallel if they are all saved to independent chunks of the zarr array. Basically, when we have a grid of fields of view, we want to run some processing by field of view at level 0 (e.g. illumination correction), and the fields of view are saved as chunks in the zarr file. If we can verify that this is the case, we can run the tasks in parallel.
  3. Run batches of ROIs that write to independent chunks in parallel. We may have some ROIs that need to write to the same chunk, but most ROIs don't overlap. In that case, we could check which ROIs write to independent zarr chunks and batch them in a clever way that groups ROIs that can be processed independently. These batches can then be processed in parallel.

We will implement 1 now. 2 should be fairly doable and useful. 3 is more of an option we could eventually pursue; this issue is mostly here so we don't forget we have it. A rough sketch of what 2/3 could look like is below.
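
A minimal sketch of the batching idea behind options 2 and 3, assuming ROIs are given as one (start, stop) pair per axis and that we know the chunk shape of the target zarr array (all function names here are hypothetical, not existing Fractal code):

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import product


def touched_chunks(roi, chunk_shape):
    """Chunk indices covered by an ROI given as one (start, stop) pair per axis."""
    ranges = [
        range(start // size, (stop - 1) // size + 1)
        for (start, stop), size in zip(roi, chunk_shape)
    ]
    return set(product(*ranges))


def batch_rois(rois, chunk_shape):
    """Greedily group ROIs so that no two ROIs in the same batch share a chunk."""
    batches = []  # each entry: (set of chunks used by the batch, ROIs in the batch)
    for roi in rois:
        chunks = touched_chunks(roi, chunk_shape)
        for used, batch in batches:
            if used.isdisjoint(chunks):
                used.update(chunks)
                batch.append(roi)
                break
        else:
            batches.append((chunks, [roi]))
    return [batch for _, batch in batches]


def process_rois(rois, chunk_shape, process_roi, max_workers=4):
    """Run the ROIs of each batch in parallel; batches run one after the other."""
    for batch in batch_rois(rois, chunk_shape):
        with ThreadPoolExecutor(max_workers=max_workers) as pool:
            list(pool.map(process_roi, batch))
```

Option 2 would then be the special case where every ROI touches exactly one chunk and no chunk is shared, so the batching returns a single batch and all ROIs can run in parallel.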

jluethi transferred this issue from fractal-analytics-platform/fractal-client on Sep 2, 2022
tcompa (Collaborator) commented Sep 16, 2022

Following ongoing discussions with @jluethi and @mfranzon (related to #72, #27 and #75), we are now only implementing strategy 1, where all ROIs are computed sequentially (as in: define the computation for a certain ROI, execute it, free up memory, move on to the next ROI).

This allows us to simplify the function that gets mapped onto the well array, and to make its input and output plain numpy arrays (rather than delayed arrays).

Moving towards strategies 2 or 3 will clearly require a refactor of the relevant tasks, because within the in-progress work (see 5b61cd9 and #75) each ROI computation is blocking - and nothing else happens until it's over.
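
For reference, a minimal sketch of what strategy 1 boils down to, assuming a zarr array for the well and hard-coded ROI slices (the path, the slices and the `process` function are placeholders, not the actual task code):

```python
import numpy as np
import zarr


# Placeholder ROI processing, e.g. illumination correction on one field of view
def process(img: np.ndarray) -> np.ndarray:
    return img


# Hypothetical path to a well image inside an OME-Zarr plate
well = zarr.open("plate.zarr/B/03/0/0", mode="r+")

# Hard-coded example ROIs (czyx slices); in practice these come from the ROI table
rois = [
    (slice(0, 1), slice(0, 1), slice(0, 2160), slice(0, 2560)),
    (slice(0, 1), slice(0, 1), slice(0, 2160), slice(2560, 5120)),
]

for region in rois:
    img = well[region]            # blocking read: a plain numpy array in memory
    well[region] = process(img)   # blocking write, then move on to the next ROI
```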

jluethi (Collaborator, Author) commented Sep 16, 2022

Fully agree, thanks for the summary Tommaso.

When we eventually want to tackle this, we'll have to find a way to call the functions with numpy arrays while somehow keeping the call delayed. At the moment, converting the dask region to a numpy array forces computation.
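
To illustrate the constraint (shapes are made up for the example):

```python
import dask.array as da
import numpy as np

# Made-up czyx shape and per-plane chunking, just to illustrate the point
data = da.zeros((1, 10, 2160, 2560), chunks=(1, 1, 2160, 2560))

region = data[0, 0, 0:540, 0:640]  # still a lazy dask array, nothing computed yet
img = np.array(region)             # the conversion to numpy forces computation here
```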

I think this sequential per-well approach should be fine for quite a while, because we parallelize over the wells. I see three scenarios in which we may need to reconsider this trade-off:

  1. We want to process large OME-Zarr images, especially ones that aren't in the HCS spec (=> no wells to parallelize over). That's not on our current roadmap, but eventually interesting.
  2. We want to parallelize GPU operations more: the current implementation also means only one thing runs on a GPU at a time. If we e.g. do segmentation at level 2, we could often fit multiple jobs on a typical GPU at the same time, but the current implementation likely won't allow for that (because sending multiple well jobs to the same GPU can become tricky).
  3. We want to optimize for processing HCS datasets with very few wells (similar to 1) => eventually interesting, but not on the immediate roadmap.

jluethi added the Backlog label on Sep 16, 2022
jluethi (Collaborator, Author) commented Sep 28, 2022

Another thing to consider: I've started processing the 23-well dataset again, and the parsing to OME-Zarr now seems to take about 10 hours. That looks a bit slower than before. I think the biggest bottleneck is parallel I/O performance, which is not something Fractal can optimize. But given that it seems to have slowed down (I remember this being in the 6-hour range before), there may be some optimization potential.

One thing we could consider: currently we're parsing all the channels sequentially. A potentially easy way to get more parallelization, without having to process multiple ROIs in parallel, would be to process the different channels in parallel for a given FOV.
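
Something like the following could work, assuming each channel of a FOV can be converted independently (function and argument names are hypothetical, not an existing Fractal API):

```python
from concurrent.futures import ThreadPoolExecutor


def convert_fov(channel_paths, convert_channel, max_workers=3):
    """channel_paths maps channel name -> image path; convert_channel does the actual work."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {
            channel: pool.submit(convert_channel, path)
            for channel, path in channel_paths.items()
        }
        # Wait for all channels of this FOV before moving on to the next FOV
        return {channel: future.result() for channel, future in futures.items()}
```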

jluethi (Collaborator, Author) commented Jul 3, 2024

cc @lorenzocerrone on this issue. This will be something that we eventually cover in the OME-Zarr reader/writer class :)
