-
Notifications
You must be signed in to change notification settings - Fork 35
[WIP] Popgen stats #100
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] Popgen stats #100
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks @daletovar! I think your plan looks great. For the parameter tree sequences, I think having sample sizes of (say) [2, 3, 10, 100] would be a good start. I can follow up with another PR with some more examples that'll exercise various corner cases afterwards.
@daletovar thanks for your work on this PR so far! In the future, could you be sure to mention the issue in the PR? In this case, https://github.com/pystatgen/sgkit/issues/94. We should update our contributing docs with this request... |
@hammer, thanks. Yes, I'll be sure to do that in the future. |
@daletovar, can you rebase this so that it's up to date with upstream/master please? |
This PR has conflicts, @daletovar please rebase and push updated version 🙏 |
This branch looks pretty messed up @daletovar - I think it would be simplest to rebase against upstream/master and force push. |
@jeromekelleher, sorry for the radio silence. I'll see what I can do about that. I think for now I'll leave the Patterson F stats for another time. I've had a hard time finding where they're implemented in tskit and there are differences between the tskit and scikit allele outputs. |
I'm still not able to see a clean diff here @daletovar making it very difficult to see what's being done - can you rebase against |
227a195
to
59102c6
Compare
802193b
to
acccf5f
Compare
This PR has conflicts, @daletovar please rebase and push updated version 🙏 |
4e5736f
to
5fa9a8f
Compare
@daletovar - there's more issues with linting here. Can you clean up please? |
@jeromekelleher, when it's installing dependencies. it looks like there's an issue building msprime. I'm not sure what's causing it. Do you know of a fix or is there something else we can use to test against tskit? |
The simulator should work fine — at least it does here. I've sent you an email so we can figure this out. :) |
Got an update from @jd about the mergify's simulator issue:
Soooo, if someone could please check the configuration from #244 on this PR and report back, that would be helpful. Simulator link. @jeromekelleher ? |
@Mergifyio update |
Command
|
Looks like this is breaking because of #245 |
I just merged #246 which should fix the build. @Mergifyio update |
Command
|
@jeromekelleher @ravwojdyla how do we get mergify to actually merge this?! |
@tomwhite we need to merge #244 first, but I can't validate that configuration because https://github.com/pystatgen/sgkit/pull/100#issuecomment-691276250. |
Trying another update here, to see if the changes in #244 will unplug things. |
@Mergifyio update |
Command
|
This PR has conflicts, @daletovar please rebase and push updated version 🙏 |
@Mergifyio rebase |
Command
|
Pull request has been modified.
remove ts_to_dataset from public api make divergence take in two datasets add minimal fst Add read_vcfzarr (sgkit-dev#40) add ts_to_dataset add minimal diversity and divergence remove ts_to_dataset from public api add tajimas d add ts_to_dataset add minimal diversity and divergence remove ts_to_dataset from public api make divergence take in two datasets add minimal fst add tajimas d add ts_to_dataset add minimal diversity and divergence remove ts_to_dataset from public api add minimal fst add tajimas d fix allele count update cfg remove spaces add msprime and use np.testing add libgsl-dev dependency add docstrings ignore dep warning add ts_to_dataset add minimal diversity and divergence remove ts_to_dataset from public api make divergence take in two datasets add minimal fst Add read_vcfzarr (sgkit-dev#40) add ts_to_dataset add minimal diversity and divergence remove ts_to_dataset from public api add tajimas d add ts_to_dataset add minimal diversity and divergence remove ts_to_dataset from public api make divergence take in two datasets add minimal fst add tajimas d add ts_to_dataset add minimal diversity and divergence remove ts_to_dataset from public api add minimal fst add tajimas d fix allele count update cfg remove spaces add msprime and use np.testing add libgsl-dev dependency add docstrings fix divide by zero
85d447c
to
f09fac3
Compare
Pull request has been modified.
Github workflows have been modified, so this has be to be merged manually. |
Going forward in this PR I'd like to add:
I might save Fst and the others for a different PR.