Port to gpt-4o-mini as default #2443

pamelafox · 2025-03-22T00:30:47Z

Purpose

This PR upgrades to gpt-4o-mini as the default chat completion model.
See my analysis here:
https://blog.pamelafox.org/2025/03/gpt-4o-mini-vs-gpt-35-turbo-for-rag.html
The responses are longer, but it should be overall cheaper,
and it's a more recent model.

Does this introduce a breaking change?

When developers merge from main and run the server, azd up, or azd deploy, will this produce an error?
If you're not sure, try it out on an old environment.

[X] Yes - With existing environments, it will keep gpt-35-turbo
[ ] No

Does this require changes to learn.microsoft.com docs?

This repository is referenced by this tutorial
which includes deployment, settings and usage instructions. If text or screenshot need to change in the tutorial,
check the box below and notify the tutorial author. A Microsoft employee can do this for you if you're an external contributor.

[X] Yes - Maybe? It might mention the model
[ ] No

Type of change

[ ] Bugfix
[ ] Feature
[ ] Code style update (formatting, local variables)
[ ] Refactoring (no functional changes, no api changes)
[ ] Documentation content changes
[X] Other... Please describe:

Code quality checklist

See CONTRIBUTING.md for more details.

The current tests all pass (python -m pytest).
I added tests that prove my fix is effective or that my feature works
I ran python -m pytest --cov to verify 100% coverage of added lines
I ran python -m mypy to check for type errors
I either used the pre-commit hooks or ran ruff and black manually on my code.

mattgotteiner · 2025-03-22T20:50:14Z

evals/results/baseline/summary.json

@@ -2,26 +2,26 @@
    "gpt_groundedness": {
        "pass_count": 49,
        "pass_rate": 0.98,
-        "mean_rating": 4.94


In a future PR - we may want to add an additional set of prompts for each version of the model

pamelafox · 2025-03-24T23:34:46Z

I have done a test azd up with my old environment and a brand new environment, all seems well. Merging.

Jbrocket · 2025-03-28T15:50:32Z

Default azureaideploymentregion is canadaeast (

azure-search-openai-demo/infra/main.bicep

Line 80 in c5bfb22

'canadaeast'

), which doesn't support gpt-4o-mini. https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models?tabs=global-standard%2Cstandard-chat-completions#assistants-preview

It doesn't make sense to default to a region that won't work.

pamelafox · 2025-03-28T15:52:54Z

@Jbrocket Ah! Great catch, I'll send a fix for that now.

Port to gpt-4o-mini as default

d1599da

mattgotteiner reviewed Mar 22, 2025

View reviewed changes

Merge branch 'main' into gpt4omini

6506528

pamelafox marked this pull request as ready for review March 24, 2025 21:07

mattgotteiner approved these changes Mar 24, 2025

View reviewed changes

pamelafox added 2 commits March 24, 2025 14:37

Update snapshot for 128K model

d092ef3

Fix markdown

48ffa41

pamelafox merged commit 236b592 into Azure-Samples:main Mar 24, 2025
18 checks passed

pamelafox deleted the gpt4omini branch March 24, 2025 23:34

pamelafox mentioned this pull request Mar 28, 2025

Reduce list to only the available ones for gpt-4o-mini/Standard #2459

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Port to gpt-4o-mini as default #2443

Port to gpt-4o-mini as default #2443

pamelafox commented Mar 22, 2025 •

edited

Loading

mattgotteiner Mar 22, 2025 •

edited

Loading

pamelafox commented Mar 24, 2025

Jbrocket commented Mar 28, 2025

pamelafox commented Mar 28, 2025

Port to gpt-4o-mini as default #2443

Port to gpt-4o-mini as default #2443

Conversation

pamelafox commented Mar 22, 2025 • edited Loading

Purpose

Does this introduce a breaking change?

Does this require changes to learn.microsoft.com docs?

Type of change

Code quality checklist

mattgotteiner Mar 22, 2025 • edited Loading

Choose a reason for hiding this comment

pamelafox commented Mar 24, 2025

Jbrocket commented Mar 28, 2025

pamelafox commented Mar 28, 2025

pamelafox commented Mar 22, 2025 •

edited

Loading

mattgotteiner Mar 22, 2025 •

edited

Loading