[Bug]: Function calling with Qwen & Streaming ('NoneType' object has no attribute 'get') #9874

Closed · 1 task done
githebs opened this issue Oct 31, 2024 · 12 comments
Labels: bug (Something isn't working), stale (Over 90 days of inactivity)

Comments

@githebs commented Oct 31, 2024

Your current environment

The output of `python collect_env.py`
Your output of `python collect_env.py` here

Model Input Dumps

No response

🐛 Describe the bug

vLLM Version

v0.6.3.post1

Model

Qwen2.5-7B-Instruct

Docker command for vLLM

command: --host 0.0.0.0 --model /hf/Qwen-Qwen2.5-7B-Instruct --max-model-len 32768 --gpu_memory_utilization 0.9 --enable-auto-tool-choice --tool-call-parser hermes

Parsing from my own FastAPI app:

# Imports assumed by this snippet; RequestLogger, VLLM_API_BASE, and
# add_numbers are defined elsewhere in the app.
import json
from typing import AsyncGenerator

import httpx


async def stream_response(payload: dict, log: RequestLogger) -> AsyncGenerator[str, None]:
    """Handle streaming response from vLLM."""
    async with httpx.AsyncClient() as client:
        try:
            async with client.stream(
                'POST',
                VLLM_API_BASE,
                json=payload,
                headers={"Content-Type": "application/json"},
                timeout=30.0
            ) as response:
                if response.status_code != 200:
                    error_msg = f"vLLM API error: {response.status_code}"
                    log(error_msg, level='error')
                    yield f"data: {json.dumps({'error': error_msg})}\n\n"
                    return

                async for line in response.aiter_lines():
                    if not line or not line.startswith('data: '):
                        continue
                        
                    line = line.removeprefix('data: ')
                    if line.strip() == '[DONE]':
                        log("Stream completed")
                        yield 'data: [DONE]\n\n'
                        break
                    
                    try:
                        parsed = json.loads(line)
                        log("Streaming chunk", parsed)

                        # Handle tool calls in streaming response
                        if 'choices' in parsed and parsed['choices']:
                            choice = parsed['choices'][0]
                            if 'delta' in choice and 'tool_calls' in choice['delta']:
                                tool_call = choice['delta']['tool_calls'][0]
                                
                                if ('function' in tool_call and 
                                    'name' in tool_call['function'] and 
                                    'arguments' in tool_call['function']):
                                    
                                    func_name = tool_call['function']['name']
                                    args = json.loads(tool_call['function']['arguments'])
                                    
                                    if func_name == 'add_numbers':
                                        result = add_numbers(args['a'], args['b'])
                                        yield f'data: {json.dumps({"choices": [{"delta": {"content": str(result)}}]})}\n\n'
                                        continue

                        yield f'data: {line}\n\n'
                    except json.JSONDecodeError as e:
                        log(f"Failed to parse streaming response: {str(e)}", level='error')
                        continue

        except httpx.RequestError as e:
            error_msg = f"Streaming request failed: {str(e)}"
            log(error_msg, level='error')
            yield f"data: {json.dumps({'error': error_msg})}\n\n"
        
    log("Stream connection closed")

vLLM error

vllm | ERROR 10-30 14:55:01 hermes_tool_parser.py:337] Error trying to handle streaming tool call.
vllm | ERROR 10-30 14:55:01 hermes_tool_parser.py:337] Traceback (most recent call last):
vllm | ERROR 10-30 14:55:01 hermes_tool_parser.py:337]   File "/usr/local/lib/python3.12/dist-packages/vllm/entrypoints/openai/tool_parsers/hermes_tool_parser.py", line 226, in extract_tool_calls_streaming
vllm | ERROR 10-30 14:55:01 hermes_tool_parser.py:337]     function_name: Union[str, None] = current_tool_call.get("name")
vllm | ERROR 10-30 14:55:01 hermes_tool_parser.py:337]                                       ^^^^^^^^^^^^^^^^^^^^^
vllm | ERROR 10-30 14:55:01 hermes_tool_parser.py:337] AttributeError: 'NoneType' object has no attribute 'get'
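
The traceback points at hermes_tool_parser.py reading current_tool_call.get("name") while current_tool_call is None. A self-contained illustration of the failure mode and the kind of guard that avoids it (a sketch only, not the actual vLLM patch):

# Sketch only, not the actual vLLM fix: when the partially-parsed tool
# call is still None (e.g. no argument JSON has arrived yet), .get()
# raises AttributeError; checking the type first avoids that.
from typing import Optional, Union

def extract_name(current_tool_call: Optional[dict]) -> Union[str, None]:
    if not isinstance(current_tool_call, dict):
        return None  # nothing parseable in this chunk yet
    return current_tool_call.get("name")

print(extract_name(None))                      # None, no AttributeError
print(extract_name({"name": "add_numbers"}))   # add_numbers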

Please note that everything works if

  1. Streaming with no tools
  2. Not streaming with tools

Any guidance?
Thanks in advance, everyone.

PS: I have seen the posts from #9693, but my issue seems different since I actually use a "supported" model.
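
For anyone reproducing this, a request along these lines exercises the failing path (a sketch: the base_url, prompt, and tool schema are illustrative, matching the add_numbers handler above):

# Illustrative repro request; base_url, prompt, and schema are assumptions.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")

tools = [{
    "type": "function",
    "function": {
        "name": "add_numbers",
        "description": "Add two numbers.",
        "parameters": {
            "type": "object",
            "properties": {
                "a": {"type": "number"},
                "b": {"type": "number"},
            },
            "required": ["a", "b"],
        },
    },
}]

stream = client.chat.completions.create(
    model="/hf/Qwen-Qwen2.5-7B-Instruct",
    messages=[{"role": "user", "content": "Add 2 and 3 using the tool."}],
    tools=tools,
    tool_choice="auto",
    stream=True,
    temperature=0,  # deterministic sampling for a stable repro
)
for chunk in stream:
    print(chunk)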

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
githebs added the bug label on Oct 31, 2024
@DarkLight1337 (Member)

cc @K-Mistele

@K-Mistele (Contributor)

Thanks for the ping @DarkLight1337
@githebs can you share a request configuration that reproduces the issue consistently (temperature=0 is great for reproducibility, but no worries if you need a higher temp and it only happens sometimes) so that I can debug and take a look?

@K-Mistele (Contributor)

Hi @githebs - we have had a discussion on this issue in #9693. Please see my comment here and let me know if this seems like a good path forward for you.

@K-Mistele (Contributor)

Please check #9908 :)

@frei-x commented Nov 5, 2024

With streaming output, if the function has no parameters, an error is raised immediately.
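
To make that concrete, a parameter-less tool definition like the following is the shape that trips the streaming parser (a sketch; the function name is an illustrative assumption):

# A tool that takes no arguments -- the reported failure case under
# streaming. The name "get_current_time" is an illustrative assumption.
tools = [{
    "type": "function",
    "function": {
        "name": "get_current_time",
        "description": "Return the current time; takes no arguments.",
        "parameters": {"type": "object", "properties": {}},
    },
}]
# With stream=True, the model emits a tool call whose arguments are empty,
# and the hermes parser hits the 'NoneType' error shown above.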

@K-Mistele (Contributor)

> With streaming output, if the function has no parameters, an error is raised immediately.

Yeah, this is what I'm thinking too. #9908 (comment)

@githebs (Author) commented Nov 15, 2024

@frei-x @K-Mistele

Thanks for the answer, and sorry for the delay. I answered in the PR here: #9908 (comment). Basically, yes: if the arguments are blank, it doesn't work.

@wangluyi

In my case, when I use a tool call and the function needs no arguments, the code throws an exception like this:
[screenshot: vllm-code-error]

It appears that the bug is here:
chat_utils.py line 498:

def _postprocess_messages(messages: List[ConversationMessage]) -> None:
    # per the Transformers docs & maintainers, tool call arguments in
    # assistant-role messages with tool_calls need to be dicts not JSON str -
    # this is how tool-use chat templates will expect them moving forwards
    # so, for messages that have tool_calls, parse the string (which we get
    # from openAI format) to dict
    for message in messages:
        if (message["role"] == "assistant" and "tool_calls" in message
                and isinstance(message["tool_calls"], list)):

            for item in message["tool_calls"]:
                item["function"]["arguments"] = json.loads(
                    item["function"]["arguments"])

I modified the code like this (simply handling the blank-arguments case) and it works:

def _postprocess_messages(messages: List[ConversationMessage]) -> None:
    # per the Transformers docs & maintainers, tool call arguments in
    # assistant-role messages with tool_calls need to be dicts not JSON str -
    # this is how tool-use chat templates will expect them moving forwards
    # so, for messages that have tool_calls, parse the string (which we get
    # from openAI format) to dict
    for message in messages:
        if (message["role"] == "assistant" and "tool_calls" in message
                and isinstance(message["tool_calls"], list)):

            for item in message["tool_calls"]:
                if item["function"] is not None and item["function"]["arguments"] is not None and len(item["function"]["arguments"]) > 0:
                    item["function"]["arguments"] = json.loads(item["function"]["arguments"])
                else:
                    item["function"]["arguments"] = {}

@K-Mistele (Contributor)

Related #11522

@wey-gu commented Jan 27, 2025

Are any heroes working on this, please? 🙏

Thanks!
cc @K-Mistele

@github-actions bot

This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!

github-actions bot added the stale label on Apr 28, 2025
@github-actions bot

This issue has been automatically closed due to inactivity. Please feel free to reopen if you feel it is still relevant. Thank you!

github-actions bot closed this as not planned on May 28, 2025