Feature: Async handling of sampling calls #840

wreed4 · 2025-05-29T16:43:36Z

added change and formatted with ruff
added tests

Dispatch _received_request in asynchronous tasks inside the session's _receive_loop.

Motivation and Context

When writing mcp servers that have the potential to return large amounts of data, one workable pattern is to "map/reduce" the results by chunking up the backend response and summarizing it with an LLM, then combining the summaries before returning that combined summary as the tool response. Sampling is the perfect tool for this, but it is locked into a sequential execution. Meaning if I break my data up into 10 chunks I have to sequentially summarize all of those results before my tool can respond. This can lead to very long runtimes which is not necessary since each sampling call only needs its own data.

This change should allow much more efficient "map/reduce" using sampling from MCP Servers (without them implementing their own LLM integration server-side).

How Has This Been Tested?

I've tested this with fast-agent which is one of the only mcp clients that implement sampling. It greatly speeds up my applications.

Breaking Changes

No. Unless a client sends sampling requests concurrently (vs immediately awaiting which is more standard), the behavior will not change.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation update

Checklist

I have read the MCP Documentation
My code follows the repository's style guidelines
New and existing tests pass locally (There is one existing test that does not pass currently regarding OAuth, but it doesn't seem related. will see if fails in this PR and continue debugging)
I have added appropriate error handling
I have added or updated documentation as needed

Additional context

wreed4 · 2025-05-29T19:42:50Z

the one failing test in the one case, I believe fails due to a race condition. When adding sleeps into the test to try to force order of execution, it reliably fails even without my change as far as I can tell. Will continue to see if I can figure it out, but from what I can tell, the second "await session.send_request" returns without ever triggering the message handler. Removing the sleeps allows it to succeed every time on my machine, but adding the sleeps shows that session.send_request is returning before the message handler is called.

… sleeps and prints and I have no idea why?

wreed4 · 2025-05-29T21:30:44Z

OKAY! I've tracked down the test_streamablehettp_client_resumption failure to the following behavior (seemingly)...

General layout of the test:

create a tool called "long_running_with_checkpoints" which we are intending to break our connection with before it finishes (and resume it later).
register a message handler which is called from shared/session.py::_receive_loop by calling self._handle_incoming which calls the client/session.py version of that function which calls our message handler
Create a client
Start the tool
Wait for it to send the first notification and then disconnect
THE TOOL WILL CONTINUE RUNNING

then, the intended flow seems to be

reconnect to the tool
pick up the remaining notifications it sent us
profit

However, this only seems to work correctly if the tool has sent another notification before we reconnect to it. If the tool has yet to send another notification, the call to send_request hangs forever.

This is, at this point, very outside the scope of my change I'm trying to make, as I've verified that this happens both with and without my change.. But in the spirit of not breaking everyone else, I'll try to fix this as well. If folks want to keep this out and put it in another bug, that works for me too. I can revert whatever I do to these files as they're unrelated to the change I wanted to present.

wreed4 · 2025-05-30T14:40:32Z

I've tracked this down as far as I can and determined, ultimately it should be out of scope of this PR even if I could find out what's happening.. which I haven't been successful in. So I've added a bit more logic to make the test more reliable and opened another issue to capture this error case.

wreed4 · 2025-05-30T15:05:48Z

https://github.com/orgs/modelcontextprotocol/discussions/406

wreed4 added 5 commits May 29, 2025 10:46

added change and formatted with ruff

e895e52

added tests

9065a68

tried another implementation

afe9dfa

formatting

1f3eb27

Add type assertions for TextContent in tests

ed21943

wreed4 added 3 commits May 29, 2025 16:18

I think this works but the test is randomly hanging once I've removed…

aff40b2

… sleeps and prints and I have no idea why?

reduced time a bit

c02a73d

remove prints

dca806b

wreed4 mentioned this pull request May 30, 2025

Resumption of streamable HTTP session has potential for deadlock #860

Open

wreed4 added 2 commits May 30, 2025 10:35

adding sleep to avoid deadlock

49cbc29

formatting

41ac446

wreed4 mentioned this pull request May 30, 2025

added slow llm to test parallel sampling evalstate/fast-agent#197

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature: Async handling of sampling calls #840

Feature: Async handling of sampling calls #840

wreed4 commented May 29, 2025 •

edited

Loading

Uh oh!

wreed4 commented May 29, 2025

Uh oh!

wreed4 commented May 29, 2025

Uh oh!

wreed4 commented May 30, 2025

Uh oh!

wreed4 commented May 30, 2025

Uh oh!

Uh oh!

Feature: Async handling of sampling calls #840

Are you sure you want to change the base?

Feature: Async handling of sampling calls #840

Conversation

wreed4 commented May 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation and Context

How Has This Been Tested?

Breaking Changes

Types of changes

Checklist

Additional context

Uh oh!

wreed4 commented May 29, 2025

Uh oh!

wreed4 commented May 29, 2025

Uh oh!

wreed4 commented May 30, 2025

Uh oh!

wreed4 commented May 30, 2025

Uh oh!

Uh oh!

wreed4 commented May 29, 2025 •

edited

Loading