[Feature] MCP bridge support #7934

Closed
James4Ever0 opened this issue Jan 23, 2025 · 22 comments

@James4Ever0
Contributor

James4Ever0 commented Jan 23, 2025

The Feature

Act as an MCP bridge to utilize tools and provide enhanced responses.

LiteLLM will read an MCP server config and act as a middleman between the MCP server, the LLM server, and the LLM client.

There is an existing project for bridging MCP servers to OpenAI-compatible clients.

Motivation, pitch

MCP is a general protocol for LLM agents. By bridging it, MCP can be integrated into existing chat systems and gain wider use cases.

Are you an ML Ops Team?

Yes

@James4Ever0 James4Ever0 changed the title [Feature] MCP Proxy support [Feature] MCP bridge support Jan 23, 2025
@turian

turian commented Feb 19, 2025

I would love this

@ishaan-jaff
Contributor

added to March 2025 roadmap 🔥

@ishaan-jaff
Contributor

Hi @James4Ever0, @turian, and everyone else on this thread. The initial implementation is done; is this what you wanted? #9426

I'd love your feedback on this as it's in beta. You can centrally define all MCP tools in the litellm config, and your MCP clients can list + call tools on litellm.

(h/t @wagnerjt )

@forpr1093

Hi @ishaan-jaff, thanks for your contribution to this. However, I think what @James4Ever0 is looking for is to have LiteLLM act as a bridge between the clients and the MCP Server, utilizing the tools from the existing MCP Server (similar to SecretiveShell/MCP-Bridge), rather than having LiteLLM function as an MCP Server itself. That said, this approach might still be useful in certain use cases.

@wagnerjt
Contributor

Thanks @ishaan-jaff for this. I will be testing it when I get a moment!

I just wanted to share with @forpr1093 and @James4Ever0 that an MCP-bridge solution would be really great as a single, easy-to-use entry point for the various deployed MCP servers. The problem is that MCP provides a number of things (Resources, Tools, and Prompts), and the specification is changing quickly around transport and authentication, so it is probably best to start small.

While this first PR is only an MCP server on top of the litellm proxy, another win would be to simply incorporate the MCP client for tools into the litellm SDK so it can bring in tools from various MCP-backed servers.

@ishaan-jaff
Contributor

ishaan-jaff commented Mar 21, 2025

What do you think about this interface, @wagnerjt @forpr1093? This is an OpenAI MCP bridge.

LiteLLM Python SDK with MCP Tools

import asyncio

import litellm
from litellm import experimental_createMCPClient
from litellm.mcp_stdio import Experimental_StdioMCPTransport


async def main():
    client_one = None

    try:
        # Initialize an MCP client to connect to a `stdio` MCP server:
        transport = Experimental_StdioMCPTransport(
            command="node",
            args=["src/stdio/dist/server.js"]
        )
        client_one = await experimental_createMCPClient(
            transport=transport
        )

        # Get tools from the MCP client
        tools = await client_one.list_tools()

        response = await litellm.acompletion(
            model="gpt-4o",
            tools=tools,
            messages=[
                {
                    "role": "user",
                    "content": "Find products under $100"
                }
            ]
        )

        print(response.choices[0].message.content)
    except Exception as error:
        print(error)
    finally:
        if client_one:
            await client_one.close()


if __name__ == "__main__":
    asyncio.run(main())

LiteLLM Proxy with MCP Tools

import asyncio

from openai import OpenAI

from litellm import experimental_createMCPClient
from litellm.mcp_stdio import Experimental_StdioMCPTransport


async def main():
    client_one = None
    try:
        # Initialize an MCP client to connect to a `stdio` MCP server:
        transport = Experimental_StdioMCPTransport(
            command="node",
            args=["src/stdio/dist/server.js"]
        )
        client_one = await experimental_createMCPClient(
            transport=transport
        )

        # Get tools from the MCP client
        tools = await client_one.list_tools()

        # Initialize an OpenAI client pointed at the LiteLLM proxy
        openai_client = OpenAI(
            base_url="http://localhost:4000",
            api_key="your-api-key"
        )

        # Create a completion with the MCP tools
        response = openai_client.chat.completions.create(
            model="gpt-4o",
            tools=tools,
            messages=[
                {
                    "role": "user",
                    "content": "Find products under $100"
                }
            ]
        )

        print(response.choices[0].message.content)
    except Exception as error:
        print(error)
    finally:
        if client_one:
            await client_one.close()


if __name__ == "__main__":
    asyncio.run(main())

@rawwerks
Contributor

fwiw, vercel just added mcp to ai sdk, might be a good place to look for clues => https://github.com/search?q=repo%3Avercel%2Fai%20mcp&type=code

@ishaan-jaff
Contributor

Hi everyone, here's our initial implementation of an MCP bridge with the litellm Python SDK. Is this what you wanted, @wagnerjt @forpr1093 @turian @James4Ever0? #9436

Overview

LiteLLM acts as an MCP bridge to utilize MCP tools with all LiteLLM-supported models. LiteLLM offers the following features for using MCP:

  • List Available MCP Tools: OpenAI clients can view all available MCP tools
    • litellm.experimental_mcp_client.load_mcp_tools to list all available MCP tools
  • Call MCP Tools: OpenAI clients can call MCP tools
    • litellm.experimental_mcp_client.call_openai_tool to call an OpenAI tool on an MCP server

Usage

1. List Available MCP Tools

In this example we'll use litellm.experimental_mcp_client.load_mcp_tools to list all available MCP tools on any MCP server. This method can be used in two ways:

  • format="mcp" - (default) Return MCP tools
    • Returns: mcp.types.Tool
  • format="openai" - Return MCP tools converted to OpenAI API-compatible tools, allowing use with OpenAI endpoints.
    • Returns: openai.types.chat.ChatCompletionToolParam
import asyncio
import json
import os

import litellm
from litellm import experimental_mcp_client
from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

# Create server parameters for stdio connection
server_params = StdioServerParameters(
    command="python3",
    # Make sure to update this to the full absolute path of your mcp_server.py file
    args=["./mcp_server.py"],
)


async def main():
    async with stdio_client(server_params) as (read, write):
        async with ClientSession(read, write) as session:
            # Initialize the connection
            await session.initialize()

            # Get tools in OpenAI format
            tools = await experimental_mcp_client.load_mcp_tools(session=session, format="openai")
            print("MCP TOOLS: ", tools)

            messages = [{"role": "user", "content": "what's (3 + 5)"}]
            llm_response = await litellm.acompletion(
                model="gpt-4o",
                api_key=os.getenv("OPENAI_API_KEY"),
                messages=messages,
                tools=tools,
            )
            print("LLM RESPONSE: ", json.dumps(llm_response, indent=4, default=str))


if __name__ == "__main__":
    asyncio.run(main())

2. List and Call MCP Tools

In this example we'll use

  • litellm.experimental_mcp_client.load_mcp_tools to list all available MCP tools on any MCP server
  • litellm.experimental_mcp_client.call_openai_tool to call an OpenAI tool on an MCP server

The first LLM response returns a list of OpenAI tool calls. We take the first tool call from the LLM response and pass it to litellm.experimental_mcp_client.call_openai_tool to call the tool on the MCP server.

How litellm.experimental_mcp_client.call_openai_tool works

  • Accepts an OpenAI Tool Call from the LLM response
  • Converts the OpenAI Tool Call to an MCP Tool
  • Calls the MCP Tool on the MCP server
  • Returns the result of the MCP Tool call
import asyncio
import json
import os

import litellm
from litellm import experimental_mcp_client
from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

# Create server parameters for stdio connection
server_params = StdioServerParameters(
    command="python3",
    # Make sure to update this to the full absolute path of your mcp_server.py file
    args=["./mcp_server.py"],
)


async def main():
    async with stdio_client(server_params) as (read, write):
        async with ClientSession(read, write) as session:
            # Initialize the connection
            await session.initialize()

            # Get tools in OpenAI format
            tools = await experimental_mcp_client.load_mcp_tools(session=session, format="openai")
            print("MCP TOOLS: ", tools)

            messages = [{"role": "user", "content": "what's (3 + 5)"}]
            llm_response = await litellm.acompletion(
                model="gpt-4o",
                api_key=os.getenv("OPENAI_API_KEY"),
                messages=messages,
                tools=tools,
            )
            print("LLM RESPONSE: ", json.dumps(llm_response, indent=4, default=str))

            # Take the first tool call from the LLM response
            openai_tool = llm_response["choices"][0]["message"]["tool_calls"][0]

            # Call the tool using the MCP client
            call_result = await experimental_mcp_client.call_openai_tool(
                session=session,
                openai_tool=openai_tool,
            )
            print("MCP TOOL CALL RESULT: ", call_result)

            # Send the tool result back to the LLM
            messages.append(llm_response["choices"][0]["message"])
            messages.append(
                {
                    "role": "tool",
                    "content": str(call_result.content[0].text),
                    "tool_call_id": openai_tool["id"],
                }
            )
            print("final messages with tool result: ", messages)
            llm_response = await litellm.acompletion(
                model="gpt-4o",
                api_key=os.getenv("OPENAI_API_KEY"),
                messages=messages,
                tools=tools,
            )
            print(
                "FINAL LLM RESPONSE: ", json.dumps(llm_response, indent=4, default=str)
            )


if __name__ == "__main__":
    asyncio.run(main())

@ishaan-jaff
Contributor

ishaan-jaff commented Mar 22, 2025

docs here: https://docs.litellm.ai/docs/mcp

I'd love feedback on this from the litellm community!

Bonus - Contributor issue - Can we get help with this?

Task: LiteLLM should maintain a JSON file of all known MCP servers. Can we get help with a script that scrapes all servers here: https://github.com/modelcontextprotocol/servers/tree/main/src, stores them as JSON, and adds the result here: https://github.com/BerriAI/litellm/blob/main/mcp_servers.json

The benefit of this is that we can then allow litellm users to easily reference well-known MCP servers.

Thoughts, @wagnerjt @rawwerks @James4Ever0?

Each server can be stored as follows:

{
  "brave-search": {
    "command": "docker",
    "args": [
      "run",
      "-i",
      "--rm",
      "-e",
      "BRAVE_API_KEY",
      "mcp/brave-search"
    ],
    "env": {
      "BRAVE_API_KEY": "YOUR_API_KEY_HERE"
    }
  }
}
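
For anyone picking this up, here is a rough sketch of what such a scraper could look like. It only uses the public GitHub contents API; the npm package naming (@modelcontextprotocol/server-<name>) is an assumption that does not hold for every server (some are Python), so the emitted entries are stubs to curate by hand or to refine by parsing each server's README.

# Rough sketch: list the server directories under src/ in modelcontextprotocol/servers
# and emit stub entries in the format proposed above. The npm package naming below is
# an assumption and needs manual curation for non-npm servers.
import json
import urllib.request

CONTENTS_URL = "https://api.github.com/repos/modelcontextprotocol/servers/contents/src"


def scrape_known_mcp_servers(output_path: str = "mcp_servers.json") -> None:
    with urllib.request.urlopen(CONTENTS_URL) as resp:
        entries = json.load(resp)

    servers = {}
    for entry in entries:
        if entry.get("type") != "dir":
            continue
        name = entry["name"]
        servers[name] = {
            "command": "npx",
            "args": ["-y", f"@modelcontextprotocol/server-{name}"],
            "env": {},
        }

    with open(output_path, "w") as f:
        json.dump(servers, f, indent=2)


if __name__ == "__main__":
    scrape_known_mcp_servers()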

@wagnerjt
Contributor

Hey @ishaan-jaff. I want to start by saying that I have spent some time today experimenting with the MCP client. I will test out the other parts I was fumbling with, and the proxy aspect, next week when I have more time.

First, I really like that you went with making the MCP client config and passing in the session as an argument. This gives us flexibility across the various transport layers (SSE vs stdio) and the security measures that will come in the future. Very nice 👏!

I tested a Go-based MCP server over SSE with the MCP client and was able to load_mcp_tools, but wasn't able to execute call_openai_tool just yet. I can update the docs next week to show how to connect via the SSE transport.

All code with examples can be found here.
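
Until then, here is a minimal sketch of what connecting over SSE could look like, assuming the mcp Python SDK's sse_client and the experimental_mcp_client helpers from the examples above (the server URL below is a placeholder):

# Minimal sketch: connect to an already-running SSE MCP server and load its tools for litellm.
import asyncio
import os

import litellm
from litellm import experimental_mcp_client
from mcp import ClientSession
from mcp.client.sse import sse_client


async def main():
    async with sse_client("http://localhost:8000/sse") as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()

            # Same call as in the stdio examples, just over the SSE transport
            tools = await experimental_mcp_client.load_mcp_tools(session=session, format="openai")
            print("MCP TOOLS: ", tools)

            response = await litellm.acompletion(
                model="gpt-4o",
                api_key=os.getenv("OPENAI_API_KEY"),
                messages=[{"role": "user", "content": "what's (3 + 5)"}],
                tools=tools,
            )
            print(response.choices[0].message.content)


if __name__ == "__main__":
    asyncio.run(main())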

Side note on the script to maintain the MCP servers out there: maybe it is sufficient to add a link to https://mcp-get.com/ to the docs until the official mcp-registry comes through.

@wagnerjt
Contributor

Wanted to bump the thread: the new spec came out today! I'm still reviewing it myself, but wanted to highlight the SSE -> HTTP transport change as well as the authentication. Once I get my hands on it more, I'll come back with additional feedback and proposals.

https://spec.modelcontextprotocol.io/specification/2025-03-26/

@ishaan-jaff
Contributor

Thanks for sharing the new spec, @wagnerjt. If I'm not mistaken, we'll need to wait for the MCP Python SDK to support the new HTTP transport too.

@jloganolson

Having an example mcp_server.py would have been helpful when I was trying this out, e.g.

# mcp_server.py
from mcp.server.fastmcp import FastMCP

# Create an MCP server
mcp = FastMCP("Demo")

# Add an addition tool
@mcp.tool()
def add(a: int, b: int) -> int:
    """Add two numbers"""
    return a + b


# Add a dynamic greeting resource
@mcp.resource("greeting://{name}")
def get_greeting(name: str) -> str:
    """Get a personalized greeting"""
    return f"Hello, {name}!"


if __name__ == "__main__":
    mcp.run()

@antonkulaga

So, when will such support be part of litellm?

@nbailey-sigtech

Hi @ishaan-jaff, thank you for all the work on MCP support so far! Is it correct that currently (as of v1.65.0) the only way to 'onboard' new servers onto litellm is by setting the SSE url, e.g.

mcp_servers:
  {
    "zapier_mcp": {
      "url": "https://actions.zapier.com/mcp/sk-akxxxxx/sse"
    },
    ...
  }

Do you have any plans to add support for the Claude Desktop spec with command, args, and env? e.g.

"mcpServers": {
  "filesystem": {
    "command": "npx",
    "args": [
      "-y",
      "@modelcontextprotocol/server-filesystem",
      "/Users/username/Desktop",
      "/Users/username/Downloads"
    ]
  }
}

We're looking at integrating our app with some MCP servers that we plan to pull as Docker images, so ideally we could just specify "command": "docker" and run the container in interactive mode.

@ishaan-jaff
Contributor

Hi @nbailey-sigtech - help me understand this a bit better. If given a non-SSE server, would litellm then need to run npx -y ...?

In my SSE implementation, we're forwarding requests to the SSE server.

(cc @wagnerjt - any thoughts on how we can support this?)

@wagnerjt
Contributor

wagnerjt commented Apr 1, 2025

Support for LiteLLM Proxy to act as the MCP bridge with multiple servers over SSE is now in v1.65.1-nightly. You can see the respective PR here.

I personally think LiteLLM should not take on the responsibility of starting and running the various MCP servers from commands, @nbailey-sigtech. There are too many languages, execution patterns, etc. For development, you can write a shell script that loops over and starts the servers rather than embedding all of this within LiteLLM. If you really wanted to, you could technically bake all of the dependencies into the Dockerfile.
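
For illustration, a rough sketch of that workaround in Python rather than shell (to match the rest of this thread): it reads a Claude-Desktop-style config (the mcp_config.json path is hypothetical) and starts each configured server as a child process.

# Rough sketch of the do-it-yourself workaround: start the configured MCP servers
# outside of LiteLLM from a Claude-Desktop-style config like the one shown above.
import json
import os
import subprocess


def start_mcp_servers(config_path: str = "mcp_config.json") -> list:
    with open(config_path) as f:
        config = json.load(f)

    processes = []
    for name, server in config.get("mcpServers", {}).items():
        cmd = [server["command"], *server.get("args", [])]
        # Merge any per-server env vars on top of the current environment
        env = {**os.environ, **server.get("env", {})}
        print(f"starting {name}: {' '.join(cmd)}")
        processes.append(subprocess.Popen(cmd, env=env))
    return processes


if __name__ == "__main__":
    start_mcp_servers()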

@nbailey-sigtech

nbailey-sigtech commented Apr 1, 2025

Makes sense, just wanted to know if that was on the roadmap. Interested to see what direction you take this in. Thanks both for the speedy response!

@krrishdholakia
Contributor

Is this issue now complete?

@owent

owent commented Apr 8, 2025

I love this. Can we declare the MCP server handler as an async function?

@bendavis78

@wagnerjt Given that a majority of MCP servers currently use stdio, it would be helpful to have a way to opt in to supporting stdio servers without having to fire up an HTTP server.

@wagnerjt
Contributor

wagnerjt commented May 2, 2025

@wagnerjt Given that a majority of MCP servers currently use stdio, it would be helpful to have a way to opt in to supporting stdio servers without having to fire up an HTTP server.

I get where you are coming from, @bendavis78, but with the introduction of Docker's MCP toolkit, everything will more than likely run and communicate over HTTP (although there is still a way to use stdio; it just sort of lives in a different process).

This isn't my decision, but I am of the opinion that the maintainers do not need to worry about the upkeep of stdio as a feature.

Come join the discussion on more mcp features here.
