Support data files compaction #1092

Open
Tracked by #1065
sungwy opened this issue Aug 22, 2024 · 2 comments
Comments

@sungwy
Collaborator

sungwy commented Aug 22, 2024

Introduce an API to compact data files. The first version of the API will do the following:

  • take a predicate expression as an input parameter and use it to select the data files that will be rewritten
  • group the selected data files by partition and rewrite each group using the same bin-packing constraints as the writer
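
The two steps above can be sketched in plain Python. This is only an illustration under stated assumptions, not the proposed API: `DataFile`, `plan_compaction`, the `matches` callable (standing in for the predicate expression), and `target_size_bytes` are all hypothetical names, and the simple greedy next-fit packing used here is an assumption that may differ from the writer's actual bin-packing constraints.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class DataFile:
    """Hypothetical stand-in for a data file's metadata."""
    path: str
    partition: Tuple  # partition values, e.g. ("2024-08-22",)
    size_bytes: int


def plan_compaction(
    files: Iterable[DataFile],
    matches: Callable[[DataFile], bool],
    target_size_bytes: int,
) -> Dict[Tuple, List[List[DataFile]]]:
    """Select files with `matches`, group them by partition, and pack each
    group into bins. Each bin lists the files that would be rewritten
    together into one output file."""
    # Step 1: keep only files selected by the predicate, keyed by partition.
    by_partition: Dict[Tuple, List[DataFile]] = defaultdict(list)
    for f in files:
        if matches(f):
            by_partition[f.partition].append(f)

    # Step 2: greedy next-fit packing per partition (an assumed strategy).
    plan: Dict[Tuple, List[List[DataFile]]] = {}
    for partition, group in by_partition.items():
        bins: List[List[DataFile]] = []
        current: List[DataFile] = []
        current_size = 0
        for f in sorted(group, key=lambda d: d.size_bytes, reverse=True):
            # Close the current bin when adding this file would exceed the target.
            if current and current_size + f.size_bytes > target_size_bytes:
                bins.append(current)
                current, current_size = [], 0
            current.append(f)
            current_size += f.size_bytes
        if current:
            bins.append(current)
        plan[partition] = bins
    return plan
```

Files never cross partition boundaries in this sketch, so each rewritten file still belongs to exactly one partition. A file larger than the target ends up alone in its own bin rather than being split.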
@sungwy sungwy self-assigned this Aug 22, 2024
@sungwy sungwy changed the title Compact data files Support data files compaction Aug 22, 2024
@sungwy sungwy removed their assignment Sep 24, 2024
@sungwy
Collaborator Author

sungwy commented Sep 24, 2024

Unassigning to work on other near-term priorities

This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in the next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.

@github-actions github-actions bot added the stale label Mar 24, 2025
@Fokko Fokko removed the stale label Mar 24, 2025