Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: Watch startup errors for stdio client #289

Open
wants to merge 12 commits into
base: main
Choose a base branch
from

Conversation

mattzh72
Copy link

@mattzh72 mattzh72 commented Mar 14, 2025

Summary

Improves stdio_client error handling to prevent indefinite hangs when subprocess failures occur, particularly when wrapping MCP servers with a synchronous interface. Adds structured error propagation to make failures catchable instead of failing silently inside anyio.

This is a minimal fix, but more long term, it would be great if either:

  • MCP can have a native synchronous Python client (happy to contribute this)
  • MCP can establish a handshake protocol with the server

Motivation and Context

When integrating MCP servers with a sync interface, failures in anyio.open_process() or unexpected subprocess exits could result in the process hanging indefinitely instead of raising an exception. Previously, errors inside anyio's process handling were not properly surfaced, making debugging difficult.

This PR ensures:

  • Immediate errors (missing binaries, bad arguments) raise RuntimeError instead of hanging.
  • Background process failures (e.g., subprocess exits early) are surfaced via a structured exception.
  • Minimal startup wait (startup_wait_time) allows for quick failure detection instead of silent hangs.

This is particularly useful when MCP servers are used in synchronous mode, where an uncatchable error would otherwise stall execution.

How Has This Been Tested?

  • Added and refined tests:
    • Happy path: Ensures valid subprocesses work as expected.
    • Spawn failure: Verifies missing binaries raise RuntimeError immediately.
    • Nonzero exit: Ensures subprocess errors propagate correctly instead of hanging.
  • Ran tests locally with pytest and uv run pytest.
  • Verified behavior in a real MCP client with a sync wrapper.

Breaking Changes

  • No API-breaking changes, but:
    • Subprocess failures that previously resulted in silent hangs now raise structured RuntimeError exceptions.
    • Callers relying on silent failures may need to adjust their error handling.

Note that the field defaults to 0.0, so should not have any effect unless explicitly set.

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation update

Checklist

  • I have read the MCP Documentation
  • My code follows the repository's style guidelines
  • New and existing tests pass locally
  • I have added appropriate error handling
  • I have added or updated documentation as needed

Additional context

  • Introduces startup_wait_time (default 0.0) to allow detection of immediate failures without forcing all cases to wait.
  • Improves test coverage to prevent regression.
  • Avoids ExceptionGroup issues by ensuring a single RuntimeError is raised when subprocess failures occur.

dsp-ant and others added 11 commits February 13, 2025 16:40
The character encoding of the stdin/stdout streams in Python is platform-
dependent. On Windows it will be something weird, like CP437 or CP1252,
depending on the locale. This change ensures that no matter the platform,
UTF-8 is used.
…col#218)

Adds sampling and list roots callbacks to the ClientSession, allowing the client to handle requests from the server.

Co-authored-by: TerminalMan <[email protected]>
Co-authored-by: David Soria Parra <[email protected]>
…/jerome/fix/request-context-typing

Updated typing on request context for the server to use server session
…ontextprotocol#222)

* feat: allow lowlevel servers to return a list of resources

The resource/read message in MCP allows of multiple resources
to be returned. However, in the SDK we do not allow this. This
change is such that we allow returning multiple resource in
the lowlevel API if needed. However in FastMCP we stick to
one, since a FastMCP resource defines the mime_type in the decorator
and hence a resource cannot dynamically return different mime_typed resources.
It also is just the better default to only return one resource.
However in the lowlevel API we will allow this.

Strictly speaking this is not a BC break since the new return value
is additive, but if people subclassed server, it will break them.

* feat: lower the type requriements for call_tool to Iterable
@mattzh72
Copy link
Author

mattzh72 commented Mar 14, 2025

We encountered this bad behavior while writing a synchronous MCP client for Letta. The uncatchable error stalls our server event loop.

cc @dsp-ant

@mattzh72
Copy link
Author

cc @Kludex @allenporter for suggestions here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants