Pydantic Issue when running Ollama + FastAPI backend #244

Closed
BastianSpatz opened this issue Aug 19, 2024 · 9 comments · Fixed by #275

@BastianSpatz

When using Ollama as a model source, I get the error:

ERROR: Error when generating next question: 1 validation error for LLMStructuredPredictEndEvent output value is not a valid dict (type=type_error.dict)

when it tries to generate the NextQuestions.

output: NextQuestions = await Settings.llm.astructured_predict(
    NextQuestions,
    prompt=NEXT_QUESTIONS_SUGGESTION_PROMPT,
    conversation=conversation,
    number_of_questions=number_of_questions,
)

I think this is a llama-index/pydantic problem triggered when astructured_predict dispatches dispatcher.event(LLMStructuredPredictEndEvent(output=result)).

Has anybody seen or fixed this error?
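
For reference, here is roughly the setup that hits the error. This is a minimal sketch, not the exact create-llama code: the NextQuestions model, the prompt, and the Ollama settings are my simplified assumptions.

import asyncio
from typing import List

from pydantic import BaseModel
from llama_index.core import Settings
from llama_index.core.prompts import PromptTemplate
from llama_index.llms.ollama import Ollama


class NextQuestions(BaseModel):
    """Structured output the prediction is expected to return."""

    questions: List[str]


NEXT_QUESTIONS_SUGGESTION_PROMPT = PromptTemplate(
    "Conversation:\n{conversation}\n"
    "Suggest {number_of_questions} follow-up questions the user might ask."
)

# Assumed Ollama configuration (llama 3.1 8B served locally).
Settings.llm = Ollama(model="llama3.1", request_timeout=120.0)


async def main() -> None:
    # The validation error surfaces inside llama-index's instrumentation, which
    # wraps the structured output in an LLMStructuredPredictEndEvent.
    output: NextQuestions = await Settings.llm.astructured_predict(
        NextQuestions,
        prompt=NEXT_QUESTIONS_SUGGESTION_PROMPT,
        conversation="User: What is RAG?\nAssistant: ...",
        number_of_questions=3,
    )
    print(output.questions)


asyncio.run(main())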

@marcusschiesser
Collaborator

  1. The astructured_predict call requires good function calling. What model are you using?
  2. The code generated by create-llama only tries this call and shouldn't show the next questions if it fails. Is that the behavior you're seeing?

@BastianSpatz
Author

Thanks for the reply.

  1. I'm using llama 3.1 8B
  2. And yes, it just throws the error and doesn't generate questions, but the app works fine nonetheless.

@marcusschiesser
Collaborator

@BastianSpatz
I guess that the model is not capable enough to use structured_predict.

As TypeScript doesn't have structured_predict, it's using a simple LLM call whose output is parsed; see:

export async function generateNextQuestions(
  conversation: ChatMessage[],
  numberOfQuestions: number = N_QUESTIONS_TO_GENERATE,
) {
  const llm = Settings.llm;
  // Format conversation
  const conversationText = conversation
    .map((message) => `${message.role}: ${message.content}`)
    .join("\n");
  const message = NEXT_QUESTION_PROMPT_TEMPLATE.replace(
    "$conversation",
    conversationText,
  ).replace("$number_of_questions", numberOfQuestions.toString());
  try {
    const response = await llm.complete({ prompt: message });
    const questions = extractQuestions(response.text);
    return questions;
  } catch (error) {
    console.error("Error when generating the next questions: ", error);
    return [];
  }
}

Can you try using the Next.js template first with your Ollama model? If that works, you could modify suggestion.py accordingly.
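
If it helps, a quick standalone check of the completion-plus-parsing approach against an Ollama model could look like the sketch below. The model name, prompt wording, and extract_questions helper are placeholders for illustration, not create-llama code.

import re
from typing import List

from llama_index.llms.ollama import Ollama

PROMPT = (
    "Here is a conversation:\n{conversation}\n"
    "Suggest {n} follow-up questions, wrapped in triple backticks, one per line."
)


def extract_questions(text: str) -> List[str]:
    # Keep only the lines between the triple backticks that look like questions.
    match = re.search(r"```(.*?)```", text, re.DOTALL)
    if not match:
        return []
    return [line.strip() for line in match.group(1).splitlines() if "?" in line]


llm = Ollama(model="llama3.1", request_timeout=120.0)
response = llm.complete(
    PROMPT.format(conversation="User: What is RAG?\nAssistant: ...", n=3)
)
print(extract_questions(response.text))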

@BastianSpatz
Author

Thank you for the help, I'll check it out :)

@marcusschiesser
Collaborator

Great. Can you let me know the result? We can keep the ticket open till then.

@BastianSpatz
Author

Using the same approach as in the TypeScript version, it works.

@marcusschiesser
Collaborator

Cool. Can you send a PR or post your changes here?

@BastianSpatz
Author

Sorry, here is what I changed in suggestion.py:

# Imports needed by this snippet; Message, N_QUESTION_TO_GENERATE and logger
# come from the rest of the existing suggestion.py.
import re
from typing import List

from llama_index.core import Settings
from llama_index.core.prompts import PromptTemplate

NEXT_QUESTIONS_SUGGESTION_PROMPT = PromptTemplate(
    "You're a helpful assistant! Your task is to suggest the next question that the user might ask. "
    "\nHere is the conversation history:"
    "\n---------------------\n{conversation}\n---------------------\n"
    "Given the conversation history, please give me {number_of_questions} questions that you might ask next! "
    "Keep the answers relevant to the conversation history and its context. "
    "Your answer should be wrapped in triple backticks and follow this format:\n"
    "```\n"
    "<question 1>\n"
    "<question 2>\n```"
)

class NextQuestionSuggestion:
    @staticmethod
    def suggest_next_questions(
        messages: List[Message],
        number_of_questions: int = N_QUESTION_TO_GENERATE,
    ) -> List[str]:
        """
        Suggest the next questions that the user might ask based on the conversation history.
        Return an empty list if there is an error.
        """
        try:
            # Reduce the cost by only using the last two messages
            last_user_message = None
            last_assistant_message = None
            for message in reversed(messages):
                if message.role == "user":
                    last_user_message = f"User: {message.content}"
                elif message.role == "assistant":
                    last_assistant_message = f"Assistant: {message.content}"
                if last_user_message and last_assistant_message:
                    break
            conversation: str = f"{last_user_message}\n{last_assistant_message}"

            # output: NextQuestions = await Settings.llm.astructured_predict(
            #     NextQuestions,
            #     prompt=NEXT_QUESTIONS_SUGGESTION_PROMPT,
            #     conversation=conversation,
            #     number_of_questions=number_of_questions,
            # )
            prompt = (
                NEXT_QUESTIONS_SUGGESTION_PROMPT.get_template()
                .replace("{conversation}", conversation)
                .replace("{number_of_questions}", str(number_of_questions))
            )
            output = Settings.llm.complete(prompt)
            questions = extract_questions_from_text(output.text)

            return questions
        except Exception as e:
            logger.error(f"Error when generating next question: {e}")
            return []


def extract_questions_from_text(text: str) -> List[str]:
    # Regular expression to match content within triple backticks
    pattern = r"```(.*?)```"

    # Find the content inside the backticks
    match = re.search(pattern, text, re.DOTALL)

    if match:
        # Split the content by newlines and strip any leading/trailing whitespace
        questions = [
            line.strip() for line in match.group(1).splitlines() if line.strip()
        ]
        questions = [question for question in questions if "?" in question]
        return questions

    return []

I have noticed that after a few questions, the format of the questions output by the LLM seems to deteriorate.

@marcusschiesser
Collaborator

Thanks @BastianSpatz

@marcusschiesser marcusschiesser moved this from Todo to In Progress in Framework Sep 6, 2024
@github-project-automation github-project-automation bot moved this from In Progress to Done in Framework Sep 9, 2024