Skip to content

Replace parquet.js with hyparquet #105

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

platypii
Copy link

@platypii platypii commented Dec 9, 2024

Uses hyparquet for javascript parquet parsing. It is a small, pure js implementation of parquet parsing with no dependencies. Parquet.js that this replaces is unmaintained and has not been updated in 5+ years.

Fixes #102 and #104 by using a well-maintained parquet library that supports modern parquet files.

I tested this with the parquet file generated by together-python and confirmed that upload works and fixes issue #104.

Let me know if I can help with anything!

@platypii
Copy link
Author

@Nutlope? Anyone?

@nicolasembleton
Copy link

This should be straightforward to review.

@platypii
Copy link
Author

platypii commented Jan 2, 2025

@samselikoff?

@platypii
Copy link
Author

Here's a video showing that instructions on together.ai website fail due to parquetjs parsing error. This PR fixes this issue:

together-upload.mp4

@Nutlope @samselikoff anything I can do to help move this along?

@Nutlope
Copy link
Collaborator

Nutlope commented Jan 24, 2025

@platypii thanks so much for reporting and for the PR! We're in the process of fixing some things with the upload. @yogishbaliga, mind taking at this PR when you do your other PR on the upload functionality too?

@platypii
Copy link
Author

@yogishbaliga thoughts? happy to contribute if there's more work that needs done

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Replace parquetjs for better deno compatibility?
3 participants