Add support for reasoning models and token usage display #2448

mattgotteiner · 2025-03-24T20:29:09Z

Purpose

Add support for o3-mini and o1, AOAI reasoning models
Add token usage graph in thought process

Does this introduce a breaking change?

When developers merge from main and run the server, azd up, or azd deploy, will this produce an error?
If you're not sure, try it out on an old environment.

[ ] Yes
[X] No

Does this require changes to learn.microsoft.com docs?

This repository is referenced by this tutorial
which includes deployment, settings and usage instructions. If text or screenshot need to change in the tutorial,
check the box below and notify the tutorial author. A Microsoft employee can do this for you if you're an external contributor.

[ ] Yes
[X] No

Type of change

[ ] Bugfix
[ ] Feature
[X] Code style update (formatting, local variables)
[ ] Refactoring (no functional changes, no api changes)
[ ] Documentation content changes
[ ] Other... Please describe:

Code quality checklist

See CONTRIBUTING.md for more details.

The current tests all pass (python -m pytest).
I added tests that prove my fix is effective or that my feature works
I ran python -m pytest --cov to verify 100% coverage of added lines
I ran python -m mypy to check for type errors
I either used the pre-commit hooks or ran ruff and black manually on my code.

app/backend/app.py

app/backend/approaches/chatreadretrieveread.py

app/backend/approaches/approach.py

app/backend/approaches/chatapproach.py

app/frontend/src/components/AnalysisPanel/AnalysisPanel.module.css

app/frontend/src/components/AnalysisPanel/TokenUsageGraph.tsx

app/frontend/src/components/OptionSlider/OptionSlider.tsx

tests/conftest.py

app/backend/approaches/approach.py

app/backend/approaches/chatreadretrieveread.py

pamelafox · 2025-04-02T19:10:02Z

app/backend/approaches/retrievethenread.py

@@ -33,6 +33,7 @@ def __init__(
        query_language: str,
        query_speller: str,
        prompt_manager: PromptManager,
+        reasoning_effort: Optional[str] = None,


Is there a str enum from openai library for reasoning effort? Would it be good to use that if so?

They have a literal for it:
Optional[ChatCompletionReasoningEffort]

But its not clear we should use it for what comes from the frontend.

app/backend/approaches/chatreadretrieveread.py

app/backend/approaches/chatreadretrievereadvision.py

pamelafox · 2025-04-02T19:12:20Z

app/frontend/src/locales/en/translation.json

@@ -142,6 +148,8 @@
            "Enables the Azure AI Search semantic ranker, a model that re-ranks search results based on semantic similarity to the user's query.",
        "useQueryRewriting":
            "Enables Azure AI Search query rewriting, a process that modifies the user's query to improve search results. Requires semantic ranker to be enabled.",
+        "reasoningEffort":


Please translate for other files. (Perhaps we need a script for this..but its not that often yet)

docs/reasoning.md

tests/snapshots/test_app/test_ask_prompt_template/client0/result.json

docs/reasoning.md

mattgotteiner added 14 commits March 22, 2025 15:51

WIP

f59979d

WIP

9ba2353

ruff, black

a2d6e31

adding usage

ab27f5e

mypy

80bcfb4

ruff, black

79b682b

mypy, ruff, black, and update generate thought steps

28d5bfd

fix comments, set answer thought tag on streaming approaches

4612ccc

fixing frontend

aae8966

fixing backend + frontend

84d3e94

token graph fixup

cef2cea

fix token usage for non-streaming response

403e294

re-style token graph

a22d993

updates

1136185

mattgotteiner commented Mar 24, 2025

View reviewed changes

app/backend/app.py Outdated Show resolved Hide resolved

mattgotteiner commented Mar 24, 2025

View reviewed changes

app/backend/approaches/chatreadretrieveread.py Show resolved Hide resolved

mattgotteiner commented Mar 24, 2025

View reviewed changes

app/backend/approaches/chatreadretrieveread.py Show resolved Hide resolved

mattgotteiner commented Mar 24, 2025

View reviewed changes

app/backend/approaches/chatreadretrieveread.py Outdated Show resolved Hide resolved

mattgotteiner commented Mar 24, 2025

View reviewed changes

app/backend/approaches/approach.py Outdated Show resolved Hide resolved

mattgotteiner commented Mar 24, 2025

View reviewed changes

app/backend/approaches/approach.py Outdated Show resolved Hide resolved

mattgotteiner commented Mar 24, 2025

View reviewed changes

app/backend/approaches/chatapproach.py Show resolved Hide resolved

mattgotteiner commented Mar 24, 2025

View reviewed changes

app/frontend/src/components/AnalysisPanel/AnalysisPanel.module.css Show resolved Hide resolved

mattgotteiner commented Mar 24, 2025

View reviewed changes

app/frontend/src/components/AnalysisPanel/TokenUsageGraph.tsx Outdated Show resolved Hide resolved

mattgotteiner commented Mar 24, 2025

View reviewed changes

app/frontend/src/components/OptionSlider/OptionSlider.tsx Outdated Show resolved Hide resolved

Merge branch 'Azure-Samples:main' into matt/reasoning

e429ac6

pamelafox mentioned this pull request Mar 26, 2025

get_token_limit, build_messages for o series model pamelafox/openai-messages-token-helper#24

Open

mattgotteiner added 4 commits March 26, 2025 17:07

adddressing feedback

8420090

merging

2d00599

ruff, black

e5e462d

prettify

c332053

mattgotteiner commented Apr 1, 2025

View reviewed changes

tests/conftest.py Outdated Show resolved Hide resolved

mattgotteiner added 5 commits April 1, 2025 18:24

add tests; ruff, black

ca0ae46

run prettier

41a3d2f

update docs

8c98686

fix test

21e9958

fix linter

8eef1a3

mattgotteiner marked this pull request as ready for review April 2, 2025 04:57