-
-
Notifications
You must be signed in to change notification settings - Fork 18.4k
ENH: allow saving wide dataframes to hdf with format table #26135
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…as vlarray with object atom. pre-cleanup.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
in principle this change is ok, but this must be able to read existing arrays; so would need to commit some samples to the repo to test for this. once you can pass all of these tests I can have a look.
…64 explicitly to match stored table.
Codecov Report
@@ Coverage Diff @@
## master #26135 +/- ##
==========================================
+ Coverage 91.99% 92% +0.01%
==========================================
Files 175 175
Lines 52387 52414 +27
==========================================
+ Hits 48191 48222 +31
+ Misses 4196 4192 -4
Continue to review full report at Codecov.
|
Codecov Report
@@ Coverage Diff @@
## master #26135 +/- ##
==========================================
- Coverage 93% 91.74% -1.27%
==========================================
Files 182 174 -8
Lines 50311 50808 +497
==========================================
- Hits 46793 46615 -178
- Misses 3518 4193 +675
Continue to review full report at Codecov.
|
@jreback
|
can you merge master and update |
…as vlarray with object atom. pre-cleanup.
…64 explicitly to match stored table.
@jreback Updated.
|
something is not right with your patch; merge upstream/master |
Hi Jeff, |
maybe u didn’t push |
I see, thanks for the help. You were right, i didn't push. |
can you merge master; note we moved the test_pytables to a subdirectory |
@P-Tillmann can you rebase |
@jreback @jbrockmendel I updated to current master. But travis stil has an environment with pytables "cannot set WRITEABLE flag" bug for vlarrays. And since this PR uses a lot of vlarrays it fails almost all tests. |
@P-Tillmann is there a min version for pytables where that was fixed? |
@P-Tillmann is this still active? |
Closing as stale but if this is still relevant please ping and can reopen |
based on #11788
closes #6245
This PR allows to save wide dataframes to hdf. It will break forward compatibility, old versions of pandas will not be able to read hdfs from new versions.
The column is saved to a vlarray with object atomic. This is rather slow for very wide dfs but does not require any type checks.
One test fails because the saved column is not compressed but the compression level is checked for all nodes. I think this is fine but i wanted to consult with you before rewriting the test.
Two tests were added to check if a wide df can be saved and if it is appendable.
Please let me know what you think and if there is additional need for test coverage.