Skip to content

Commit 236b592

Browse files
authored
Port to gpt-4o-mini as default (#2443)
* Port to gpt-4o-mini as default * Update snapshot for 128K model * Fix markdown
1 parent 88f987e commit 236b592

File tree

85 files changed

+293
-286
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

85 files changed

+293
-286
lines changed

Diff for: README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -51,7 +51,7 @@ This template, the application code and configuration it contains, has been buil
5151

5252
[📺 Watch a video overview of the app.](https://youtu.be/3acB0OWmLvM)
5353

54-
This sample demonstrates a few approaches for creating ChatGPT-like experiences over your own data using the Retrieval Augmented Generation pattern. It uses Azure OpenAI Service to access a GPT model (gpt-35-turbo), and Azure AI Search for data indexing and retrieval.
54+
This sample demonstrates a few approaches for creating ChatGPT-like experiences over your own data using the Retrieval Augmented Generation pattern. It uses Azure OpenAI Service to access a GPT model (gpt-4o-mini), and Azure AI Search for data indexing and retrieval.
5555

5656
The repo includes sample data so it's ready to try end to end. In this sample application we use a fictitious company called Contoso Electronics, and the experience allows its employees to ask questions about the benefits, internal policies, as well as job descriptions and roles.
5757

Diff for: docs/deploy_existing.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ You should set these values before running `azd up`. Once you've set them, retur
2727
1. Run `azd env set AZURE_OPENAI_RESOURCE_GROUP {Name of existing resource group that OpenAI service is provisioned to}`
2828
1. Run `azd env set AZURE_OPENAI_LOCATION {Location of existing OpenAI service}`
2929
1. Run `azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT {Name of existing chat deployment}`. Only needed if your chat deployment is not the default 'chat'.
30-
1. Run `azd env set AZURE_OPENAI_CHATGPT_MODEL {Model name of existing chat deployment}`. Only needed if your chat model is not the default 'gpt-35-turbo'.
30+
1. Run `azd env set AZURE_OPENAI_CHATGPT_MODEL {Model name of existing chat deployment}`. Only needed if your chat model is not the default 'gpt-4o-turbo'.
3131
1. Run `azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT_VERSION {Version string for existing chat deployment}`. Only needed if your chat deployment model version is not the default '0125'. You definitely need to change this if you changed the model.
3232
1. Run `azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT_SKU {Name of SKU for existing chat deployment}`. Only needed if your chat deployment SKU is not the default 'Standard', like if it is 'GlobalStandard' instead.
3333
1. Run `azd env set AZURE_OPENAI_EMB_DEPLOYMENT {Name of existing embedding deployment}`. Only needed if your embeddings deployment is not the default 'embedding'.

Diff for: docs/deploy_features.md

+38-29
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@
44
This document covers optional features that can be enabled in the deployed Azure resources.
55
You should typically enable these features before running `azd up`. Once you've set them, return to the [deployment steps](../README.md#deploying).
66

7-
* [Using GPT-4](#using-gpt-4)
7+
* [Using different chat completion models](#using-different-chat-completion-models)
88
* [Using text-embedding-3 models](#using-text-embedding-3-models)
99
* [Enabling GPT-4 Turbo with Vision](#enabling-gpt-4-turbo-with-vision)
1010
* [Enabling media description with Azure Content Understanding](#enabling-media-description-with-azure-content-understanding)
@@ -22,15 +22,11 @@ You should typically enable these features before running `azd up`. Once you've
2222
* [Deploying with private endpoints](#deploying-with-private-endpoints)
2323
* [Using local parsers](#using-local-parsers)
2424

25-
## Using GPT-4
25+
## Using different chat completion models
2626

27-
(Instructions for **GPT-4**, **GPT-4o**, and **GPT-4o mini** models are also included here.)
27+
As of late March 2025, the default chat completion model is `gpt-4o-mini`. If you deployed this sample before that date, the default model is `gpt-3.5-turbo`. You can change the chat completion model to any Azure OpenAI chat model that's available in your Azure OpenAI resource region by following these steps:
2828

29-
We generally find that most developers are able to get high-quality answers using GPT-3.5. However, if you want to try GPT-4, GPT-4o, or GPT-4o mini, you can do so by following these steps:
30-
31-
Execute the following commands inside your terminal:
32-
33-
1. To set the name of the deployment, run this command with a unique name in your Azure OpenAI account. You can use any deployment name, as long as it's unique in your Azure OpenAI account.
29+
1. To set the name of the deployment, run this command with a unique name in your Azure OpenAI account. You can use any deployment name, as long as it's unique in your Azure OpenAI account. For convenience, many developers use the same deployment name as the model name, but this is not required.
3430

3531
```bash
3632
azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT <your-deployment-name>
@@ -39,10 +35,10 @@ Execute the following commands inside your terminal:
3935
For example:
4036

4137
```bash
42-
azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT chat4
38+
azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT gpt-4o
4339
```
4440

45-
1. To set the GPT model name to a **gpt-4**, **gpt-4o**, or **gpt-4o mini** version from the [available models](https://learn.microsoft.com/azure/ai-services/openai/concepts/models), run this command with the appropriate GPT model name.
41+
1. To set the GPT model to a different [available model](https://learn.microsoft.com/azure/ai-services/openai/concepts/models), run this command with the appropriate model name.
4642

4743
For GPT-4:
4844

@@ -62,19 +58,13 @@ Execute the following commands inside your terminal:
6258
azd env set AZURE_OPENAI_CHATGPT_MODEL gpt-4o-mini
6359
```
6460

65-
1. To set the Azure OpenAI deployment SKU name, run this command with [the desired SKU name](https://learn.microsoft.com/azure/ai-services/openai/how-to/deployment-types#deployment-types).
66-
67-
```bash
68-
azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT_SKU GlobalStandard
69-
```
70-
71-
1. To set the Azure OpenAI deployment capacity, run this command with the desired capacity.
61+
For gpt-3.5-turbo:
7262

7363
```bash
74-
azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT_CAPACITY 10
64+
azd env set AZURE_OPENAI_CHATGPT_MODEL gpt-35-turbo
7565
```
7666

77-
1. To set the Azure OpenAI deployment version from the [available versions](https://learn.microsoft.com/azure/ai-services/openai/concepts/models), run this command with the appropriate version.
67+
1. To set the Azure OpenAI model version from the [available versions](https://learn.microsoft.com/azure/ai-services/openai/concepts/models), run this command with the appropriate version string.
7868

7969
For GPT-4:
8070

@@ -94,23 +84,42 @@ Execute the following commands inside your terminal:
9484
azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT_VERSION 2024-07-18
9585
```
9686

87+
For gpt-3.5-turbo:
88+
89+
```bash
90+
azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT_VERSION 0125
91+
```
92+
93+
1. To set the Azure OpenAI deployment SKU name, run this command with [the desired SKU name](https://learn.microsoft.com/azure/ai-services/openai/how-to/deployment-types#deployment-types).
94+
95+
For GlobalStandard:
96+
97+
```bash
98+
azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT_SKU GlobalStandard
99+
```
100+
101+
For Standard:
102+
103+
```bash
104+
azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT_SKU Standard
105+
```
106+
107+
1. To set the Azure OpenAI deployment capacity (TPM, measured in thousands of tokens per minute), run this command with the desired capacity. This is not necessary if you are using the default capacity of 30.
108+
109+
```bash
110+
azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT_CAPACITY 20
111+
```
112+
97113
1. To update the deployment with the new parameters, run this command.
98114

99115
```bash
100116
azd up
101117
```
102118

119+
This process does *not* delete your previous model deployment. If you want to delete previous deployments, go to your Azure OpenAI resource in Azure AI Foundry and delete it there.
120+
103121
> [!NOTE]
104-
> To revert back to GPT 3.5, run the following commands:
105-
>
106-
> * `azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT chat` to set the name of your old GPT 3.5 deployment.
107-
> * `azd env set AZURE_OPENAI_CHATGPT_MODEL gpt-35-turbo` to set the name of your old GPT 3.5 model.
108-
> * `azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT_CAPACITY 30` to set the capacity of your old GPT 3.5 deployment.
109-
> * `azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT_SKU Standard` to set the Sku name back to Standard.
110-
> * `azd env set AZURE_OPENAI_CHATGPT_DEPLOYMENT_VERSION 0125` to set the version number of your old GPT 3.5.
111-
> * `azd up` to update the provisioned resources.
112-
>
113-
> Note that this does not delete your GPT-4 deployment; it just makes your application create a new or reuse an old GPT 3.5 deployment. If you want to delete it, you can go to your Azure OpenAI studio and do so.
122+
> To revert back to a previous model, run the same commands with the previous model name and version.
114123

115124
## Using text-embedding-3 models
116125

Diff for: docs/gpt4v.md

+2-2
Original file line numberDiff line numberDiff line change
@@ -23,9 +23,9 @@ For more details on how this feature works, read [this blog post](https://techco
2323
* The ability to deploy a gpt-4o model in the [supported regions](https://learn.microsoft.com/azure/ai-services/openai/concepts/models#standard-deployment-model-availability). If you're not sure, try to create a gpt-4o deployment from your Azure OpenAI deployments page.
2424
* Ensure that you can deploy the Azure OpenAI resource group in [a region where all required components are available](https://learn.microsoft.com/azure/cognitive-services/openai/concepts/models#model-summary-table-and-region-availability):
2525
* Azure OpenAI models
26-
* gpt-35-turbo
26+
* gpt-4o-mini
2727
* text-embedding-ada-002
28-
* gpt-4o
28+
* gpt-4o (for vision/evaluation features)
2929
* [Azure AI Vision](https://learn.microsoft.com/azure/ai-services/computer-vision/)
3030

3131
### Deployment

0 commit comments

Comments
 (0)