`prepare_label_group` should handle mismatches of label_name & the zattrs name of the label #619

jluethi · 2023-11-30T15:29:57Z

Issue originally reported by @nrepina

If we copy a label image and give it a new name, the zattrs we get from an old label image also contain a name attribute => potential mismatches.

Example code

# Get the label_attrs correctly
label_attrs = get_zattrs(zarr_url=f"{rx_zarr_path}/labels/{label_name}")

rx_zarr_out = Path(output_path) / component
new_label_name = label_name + '_consensus'

# useful check for overwriting, adds metadata to labels group
_ = prepare_label_group(
    image_group=zarr.group(rx_zarr_out),
    label_name=new_label_name,
    overwrite=True,
    label_attrs=label_attrs,
    logger=logger,
)

Now the label image will be named label_name + '_consensus' in the folder, but the metadata still contains only label_name.

We can debate who's responsibility this should be. The easy fix is to add a line like this:

label_attrs['multiscales'][0]['name'] = new_label_name

But we tend to want to remove more of this metadata handling from tasks and put it into the helper functions.
At a minimum, prepare_label_group should give a warning if there is this mismatch, but maybe even just fix it and give priority to the explicit label_name parameter.

The text was updated successfully, but these errors were encountered:

jluethi · 2023-11-30T15:31:34Z

Broader discussion: Should the prepare_label_group do any kind of attrs validation to ensure it's a valid OME-Zarr attr? Would also make sense

jluethi · 2023-12-06T13:54:40Z

label_attrs need to be valid NGFF attrs => check whether this is tested

tcompa · 2023-12-11T09:38:45Z

As of 9e11b56:

We validate the label_attrs with our NgffImageMeta Pydantic model, and fail if attributes do not comply with specs
If the multiscale name in label_attrs does not match with the label_name function argument, we update the multiscale name.

As of 48a014b:

We make the label_attrs required, for prepare_label_group. @jluethi: we had not explicitly mentioned this change, but it seems natural to me that this function should not write a Zarr group without attributes. I can revert this change if we want the flexibility of preparing a group without attributes that can be modified later from within the task.

tcompa · 2023-12-11T09:54:19Z

Closed within #613. I'm re-opening, to make sure we agree on point 3 above:

We make the label_attrs required, for prepare_label_group. @jluethi: we had not explicitly mentioned this change, but it seems natural to me that this function should not write a Zarr group without attributes. I can revert this change if we want the flexibility of preparing a group without attributes that can be modified later from within the task.

jluethi · 2023-12-12T08:16:53Z

We make the label_attrs required, for prepare_label_group. @jluethi: we had not explicitly mentioned this change, but it seems natural to me that this function should not write a Zarr group without attributes. I can revert this change if we want the flexibility of preparing a group without attributes that can be modified later from within the task.

Seems fair to me. Is there a reasonable behavior we'd do if a user doesn't provide zattrs? We couldn't know things like pixel sizes etc.

tcompa · 2023-12-12T14:24:22Z

Is there a reasonable behavior we'd do if a user doesn't provide zattrs? We couldn't know things like pixel sizes etc.

This is the current function signature:

def prepare_label_group(
    image_group: zarr.hierarchy.Group,
    label_name: str,
    label_attrs: dict[str, Any],
    overwrite: bool = False,
    logger: Optional[logging.Logger] = None,
) -> zarr.group:

Given these arguments, there's no attribute that we can directly infer. This is not about things like pixel sizes, but really about any attribute.

Things change if we make a further assumption, namely that we have access to the image being labeled (either by assuming we can go up two levels in the Zarr hierarchy and find it, or by having an additional function argument).
In that case, we could get access to the image attributes; if we then also had additional information (mainly the target pyramid level), we could re-build the full attributes of the label image, as is currently done e.g. in the cellpose task (or equivalently in the napari-workflows task):

    new_datasets = rescale_datasets(
        datasets=[ds.dict() for ds in ngff_image_meta.datasets],
        coarsening_xy=coarsening_xy,
        reference_level=level,
        remove_channel_axis=True,
    )

    label_attrs = {
        "image-label": {
            "version": __OME_NGFF_VERSION__,
            "source": {"image": "../../"},
        },
        "multiscales": [
            {
                "name": output_label_name,
                "version": __OME_NGFF_VERSION__,
                "axes": [
                    ax.dict()
                    for ax in ngff_image_meta.multiscale.axes
                    if ax.type != "channel"
                ],
                "datasets": new_datasets,
            }
        ],
    }

This option for sure looks interesting, in view of factoring out a functionality which is already repeated in two tasks (note however that this is not how things happen e.g. in https://github.com/fmi-basel/gliberal-scMultipleX/blob/nar-fractal/src/scmultiplex/fractal/relabel_by_linking_consensus.py#L157-L174).

If we take this latter option, then the prepare_label_group function scope becomes broader, as it will handle both its current feature (mostly overwrite-related checks) and the feature of preparing Zarr attributes for the new group. We could then decide whether this is internally split in two functions, or a single one takes care of all of it (a renaming of the function could then be appropriate).

jluethi · 2023-12-12T15:21:51Z

Thanks for the further explanations. I'd say we take this on board for a future refactor of label writing. For now, we made the prepare_label_group function more robust, that should be the limit of the 0.14.0 scope.

jluethi added this to Fractal Project Management Nov 30, 2023

github-project-automation bot moved this to TODO in Fractal Project Management Nov 30, 2023

jluethi mentioned this issue Dec 1, 2023

Comply with table specs v1 #613

Merged

1 task

tcompa added a commit that referenced this issue Dec 11, 2023

Update test_prepare_label_group (ref #619)

f53c039

tcompa added a commit that referenced this issue Dec 11, 2023

Make label_attrs argument of prepare_label_group required (ref #619)

48a014b

tcompa added a commit that referenced this issue Dec 11, 2023

Update test_prepare_label_group (ref #619)

5ad9680

tcompa closed this as completed in 9e11b56 Dec 11, 2023

github-project-automation bot moved this from TODO to Done in Fractal Project Management Dec 11, 2023

tcompa reopened this Dec 11, 2023

github-project-automation bot moved this from Done to TODO in Fractal Project Management Dec 11, 2023

tcompa mentioned this issue Dec 12, 2023

Refactor prepare_label_group? #634

Open

tcompa closed this as completed Dec 12, 2023

github-project-automation bot moved this from TODO to Done in Fractal Project Management Dec 12, 2023

jluethi removed this from Fractal Project Management Apr 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`prepare_label_group` should handle mismatches of label_name & the zattrs name of the label #619

`prepare_label_group` should handle mismatches of label_name & the zattrs name of the label #619

jluethi commented Nov 30, 2023

jluethi commented Nov 30, 2023

Uh oh!

jluethi commented Dec 6, 2023

Uh oh!

tcompa commented Dec 11, 2023

Uh oh!

tcompa commented Dec 11, 2023

Uh oh!

jluethi commented Dec 12, 2023

Uh oh!

tcompa commented Dec 12, 2023 •

edited

Loading

Uh oh!

jluethi commented Dec 12, 2023

Uh oh!

prepare_label_group should handle mismatches of label_name & the zattrs name of the label #619

prepare_label_group should handle mismatches of label_name & the zattrs name of the label #619

Comments

jluethi commented Nov 30, 2023

jluethi commented Nov 30, 2023

Uh oh!

jluethi commented Dec 6, 2023

Uh oh!

tcompa commented Dec 11, 2023

Uh oh!

tcompa commented Dec 11, 2023

Uh oh!

jluethi commented Dec 12, 2023

Uh oh!

tcompa commented Dec 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jluethi commented Dec 12, 2023

Uh oh!

`prepare_label_group` should handle mismatches of label_name & the zattrs name of the label #619

`prepare_label_group` should handle mismatches of label_name & the zattrs name of the label #619

tcompa commented Dec 12, 2023 •

edited

Loading