-
Notifications
You must be signed in to change notification settings - Fork 4.7k
Port to gpt-4o-mini as default #2443
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
@@ -2,26 +2,26 @@ | |||
"gpt_groundedness": { | |||
"pass_count": 49, | |||
"pass_rate": 0.98, | |||
"mean_rating": 4.94 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In a future PR - we may want to add an additional set of prompts for each version of the model
I have done a test azd up with my old environment and a brand new environment, all seems well. Merging. |
Default azureaideploymentregion is canadaeast ( azure-search-openai-demo/infra/main.bicep Line 80 in c5bfb22
It doesn't make sense to default to a region that won't work. |
@Jbrocket Ah! Great catch, I'll send a fix for that now. |
Purpose
This PR upgrades to gpt-4o-mini as the default chat completion model.
See my analysis here:
https://blog.pamelafox.org/2025/03/gpt-4o-mini-vs-gpt-35-turbo-for-rag.html
The responses are longer, but it should be overall cheaper,
and it's a more recent model.
Does this introduce a breaking change?
When developers merge from main and run the server, azd up, or azd deploy, will this produce an error?
If you're not sure, try it out on an old environment.
Does this require changes to learn.microsoft.com docs?
This repository is referenced by this tutorial
which includes deployment, settings and usage instructions. If text or screenshot need to change in the tutorial,
check the box below and notify the tutorial author. A Microsoft employee can do this for you if you're an external contributor.
Type of change
Code quality checklist
See CONTRIBUTING.md for more details.
python -m pytest
).python -m pytest --cov
to verify 100% coverage of added linespython -m mypy
to check for type errorsruff
andblack
manually on my code.