-
Notifications
You must be signed in to change notification settings - Fork 35
VCF Zarr improvements #56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi there. I started on this today. @alimanfoo gave us some access to vfczarr data. I'm using that as the basis to read in data. It looks like it the data is for all contigs you have:
Where It can also be the case that you have 1 zarr file per contig, in which case it looks like this:
So I broke this functionality down into 2 parts. The first is defining an Xarray dataset that we can use as a very minimal interface.
The justification for having an additional dataset is that we can have a grouped Then I have a WIP
Which a user would then call as:
|
Thanks @jerowe - just updating to say I think we should wait until @alimanfoo is back before making any decisions on what we do with this. |
It would be good to make VCF Zarr reading more flexible:
samples
is a group (https://github.com/pystatgen/sgkit/pull/40#issuecomment-658280113)Mapping
object (not just a string orPath
)The text was updated successfully, but these errors were encountered: