Requirements for UKB GWAS #67

eric-czech · 2020-07-24T20:02:55Z

tomwhite · 2020-07-27T10:49:42Z

I'd be happy to work on variant/sample stats (#29) if no one else is working on them.
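For context, here is a rough sketch of what per-variant and per-sample summary statistics can look like in plain NumPy. It is not the sgkit API. The function name `variant_and_sample_stats`, the `(variants, samples, ploidy)` array layout, and the use of `-1` for missing alleles are assumptions made for this illustration.

```python
import numpy as np


def variant_and_sample_stats(calls):
    """Summarise a biallelic genotype call array (hypothetical helper).

    calls: integer array of shape (variants, samples, ploidy) with
    alleles coded 0 (ref) / 1 (alt) and -1 for a missing allele
    (an assumed encoding for this sketch).
    """
    calls = np.asarray(calls)
    allele_called = calls >= 0
    # A genotype counts as called only if every allele in it is present.
    genotype_called = allele_called.all(axis=2)

    variant_call_rate = genotype_called.mean(axis=1)
    sample_call_rate = genotype_called.mean(axis=0)

    # Count alt alleles over non-missing alleles only.
    alt_count = np.where(allele_called, calls, 0).sum(axis=(1, 2))
    allele_total = allele_called.sum(axis=(1, 2))
    # Leave the frequency as NaN where a variant has no called alleles.
    alt_frequency = np.divide(
        alt_count,
        allele_total,
        out=np.full(alt_count.shape, np.nan),
        where=allele_total > 0,
    )
    return {
        "variant_call_rate": variant_call_rate,
        "sample_call_rate": sample_call_rate,
        "variant_alt_frequency": alt_frequency,
    }
```

At UK Biobank scale the same reductions would have to run over chunked arrays rather than a single in-memory array.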

hammer · 2020-07-27T11:28:14Z

@eric-czech how are you thinking about LD estimation/pruning, population structure estimation/pruning, and relatedness estimation/pruning? Does the REGENIE implementation include some means of estimating these as covariates for the regression, or are you just thinking of those operations as optimizations that can be implemented later?

hammer · 2020-07-27T19:06:40Z

To answer my own question, there are 3 stages to our work with UK Biobank:

Stage 1: per-variant linear regression with provided population structure and kinship estimates. No LD pruning needed.
Stage 2: whole-genome regression with provided population structure and kinship estimates. LD pruning needed.
Stage 3: do our own population structure and kinship estimation.
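As an illustration of Stage 1, here is a minimal NumPy sketch of per-variant linear regression with supplied covariates (for example, provided principal components). It is not the sgkit implementation. The function name `per_variant_linear_regression` and the input shapes are assumptions. It projects the covariates out of both phenotype and genotypes once (the Frisch-Waugh-Lovell approach), so each variant then needs only a one-dimensional fit.

```python
import numpy as np


def per_variant_linear_regression(dosage, covariates, phenotype):
    """Fit phenotype ~ variant + covariates + intercept per variant.

    dosage:     (n_samples, n_variants) float array of genotype dosages
    covariates: (n_samples, n_covariates) array, without an intercept
    phenotype:  (n_samples,) continuous trait
    Returns (beta, t_stat), each of shape (n_variants,).
    Assumes every variant still varies after covariate adjustment.
    """
    n_samples = phenotype.shape[0]
    # Covariate matrix including an intercept column.
    C = np.column_stack([np.ones(n_samples), covariates])
    # Orthonormal basis for the covariate space.
    Q, _ = np.linalg.qr(C)

    # Residualise phenotype and all variants against the covariates.
    y_res = phenotype - Q @ (Q.T @ phenotype)
    G_res = dosage - Q @ (Q.T @ dosage)

    # Per-variant sums of squares and effect estimates.
    gg = np.einsum("ij,ij->j", G_res, G_res)
    beta = (G_res.T @ y_res) / gg

    # Residual sum of squares of the full model, per variant.
    rss = y_res @ y_res - beta**2 * gg
    dof = n_samples - C.shape[1] - 1
    se = np.sqrt(rss / dof / gg)
    return beta, beta / se
```

The effect estimates and t-statistics match those from fitting the full model separately for each variant, while the covariate projection is shared across all variants.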

hammer · 2020-07-27T19:08:13Z

> A phenotype normalization pipeline.

I've been thinking about this one too. I think we're going to feel Dask's poor handling of nested data when working with phenotypes. I'd prefer to keep Spark out of this project as a dependency, so if we find we do need Spark, I think we should put that code into a separate repo.

hammer · 2020-08-12T20:00:30Z

> A variant annotation function like vep. There are plenty of other ways to get this but an internal function would be great.

File an issue to track?

eric-czech · 2020-08-13T13:36:00Z

> File an issue to track?

https://github.com/pystatgen/sgkit/issues/112

hammer · 2020-09-04T18:04:51Z

> A phenotype normalization pipeline.

@eric-czech I just had a nice chat with @zietzm and @ntatonetti who are at Columbia and are experts in handling complex phenotypes and running many GWAS against them.

They're interested in using sgkit and possibly contributing back, particularly on the phenotype side.

Would you be open to making https://github.com/related-sciences/ukb-gwas-pipeline-nealelab public soon and potentially working with @zietzm to factor the phenotype handling code into its own repo, maybe something like sgkit-pheno or phenokit?

eric-czech · 2020-09-08T12:39:05Z

> Would you be open to making https://github.com/related-sciences/ukb-gwas-pipeline-nealelab public soon and potentially working with @zietzm to factor the phenotype handling code into its own repo, maybe something like sgkit-pheno or phenokit?

For sure! Looking forward to seeing how we can better integrate phenotypes.

eric-czech · 2020-09-14T14:09:42Z

FYI @zietzm / @ntatonetti (cc: @hammer) the phenotype prep code we're currently using (via PHESANT) is here: ukb-gwas-pipeline-nealelab#phenotype_prep.smk.

There is little to it yet other than running some messy, very inefficient R code to produce ~75 phenotypes that I wanted to attempt to validate against first. It would be great to hear your thoughts on how we might better define these, as well as how to improve the mechanics of creating them. I'm particularly interested in ICD code management, since this pipeline doesn't address that.

hammer added the "core operations" label (issues related to domain-specific functionality such as LD pruning, PCA, association testing, etc.) on Jul 27, 2020

eric-czech mentioned this issue Aug 3, 2020

Create bitpacking filter for biallelic diploid datasets #80

Open

jeromekelleher mentioned this issue Sep 1, 2020

Requirements for population structure analysis #226

Open

4 tasks

eric-czech mentioned this issue Sep 1, 2020

GWAS reproduction tasks related-sciences/ukb-gwas-pipeline-nealelab#16

Open

28 tasks

jeromekelleher mentioned this issue Sep 2, 2020

Genome-wide selection scans #232

Closed

3 tasks

eric-czech mentioned this issue Oct 22, 2020

Add function to create genotype calls from genotype probabilities #346

Closed
