Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multi-filetype Support via Markitdown #269

Merged
merged 20 commits into from
Apr 8, 2025

Conversation

KennyZhang1
Copy link
Contributor

This PR adds native support for multiple filetypes using markitdown. The markitdown library is used to convert uploaded files to text format before uploading them to the specified blob container. Additionally, a caching mechanism was added to the data endpoint log successfully converted files and avoid duplicate conversions.

@KennyZhang1 KennyZhang1 requested a review from a team as a code owner March 20, 2025 16:18
@jgbradley1 jgbradley1 requested a review from a team as a code owner April 3, 2025 03:15
@nievespg1 nievespg1 merged commit 004fc65 into main Apr 8, 2025
8 checks passed
@nievespg1 nievespg1 deleted the kennyzhang/markitdown-integration branch April 8, 2025 15:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants