The combination `stream=True, tool_choice="auto"` raises an exception right now, which means that developers are stuck with one of two unfortunate choices:

- developing an application that streams the response but cannot use tools, or
- developing an LLM application that can use tools but cannot stream the response.

Relevant discussion: #1615
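For reference, here is a minimal sketch of the failing combination. The model path and the tool definition are placeholders; this assumes any local GGUF chat model with tool-call support:

```python
from llama_cpp import Llama

# Placeholder model path; substitute any local GGUF chat model.
llm = Llama(model_path="./models/model.gguf", n_ctx=4096)

# Hypothetical example tool, just to exercise tool calling.
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

# Streaming without tools works, and tools without streaming work,
# but this combination currently raises an exception.
stream = llm.create_chat_completion(
    messages=[{"role": "user", "content": "What's the weather in Paris?"}],
    tools=tools,
    tool_choice="auto",
    stream=True,
)
for chunk in stream:
    print(chunk)
```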
Admittedly this is the wrong place to ask this question, but as a beginner I feel like you're the right person to answer:

Does something need to be done to llama.cpp directly in order to handle streaming tool calling? I see from your feature branch that you added a RAG layer to this Python implementation. I ask because I built llama.cpp from source, figuring it would be better optimized for my system, but I am stuck with this server error: `{"code":500,"message":"Cannot use tools with stream","type":"server_error"}` (a request that reproduces it is sketched after this comment).

Would this error go away if I installed the pre-built Python version?
Edit: I see here that there's a PR in draft. We're too close to the bleeding edge!
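For completeness, a minimal sketch of a request that triggers the server error quoted above, sent through the OpenAI-compatible endpoint. This assumes a llama.cpp or llama-cpp-python server already running locally; the base URL, port, model name, and tool definition are placeholders to adjust for your setup:

```python
import openai
from openai import OpenAI

# Assumes a local server with an OpenAI-compatible API; adjust base_url
# and model to match how the server was started.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="sk-no-key-required")

# Hypothetical example tool, just to exercise tool calling.
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

try:
    # With stream=False this request succeeds; with stream=True the
    # server rejects it with a 500 error.
    response = client.chat.completions.create(
        model="local-model",
        messages=[{"role": "user", "content": "What's the weather in Paris?"}],
        tools=tools,
        tool_choice="auto",
        stream=True,
    )
    for chunk in response:
        print(chunk)
except openai.InternalServerError as exc:
    # Surfaces as: {"code":500,"message":"Cannot use tools with stream",
    # "type":"server_error"}
    print(exc)
```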