Switch to the recommended model (text-embedding-ada-002) #495
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
NOTE: NLP newbie here, happy to take any feedback; have written about my thought-process for this change below.
Was looking at the documentation for embedding model in Python. Then, when I had a sample working, the length of the embedding at
12,288
was too long to what the current trend is and made me look into this in a little more detail. Then came across relevant blog posts and documentation based on which I think this is the right change.Ref: https://openai.com/blog/new-and-improved-embedding-model
Ref: https://platform.openai.com/docs/guides/embeddings/what-are-embeddings