Skip to content

feat: TensorRT-LLM Support for logits_prob #54

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
hiro-v opened this issue Jul 9, 2024 · 1 comment
Closed

feat: TensorRT-LLM Support for logits_prob #54

hiro-v opened this issue Jul 9, 2024 · 1 comment
Labels
status: wontfix This will not be worked on

Comments

@hiro-v
Copy link

hiro-v commented Jul 9, 2024

Hi there,

Currently I'm using cortex.tensorrt-llm in order to benchmark for MMLU and TruthfulQA using https://github.com/EleutherAI/lm-evaluation-harness/tree/main

However, the current API does not return logits_prob

Please add logits_prob in chat_completion API as it's supported from 0.8.0 https://github.com/NVIDIA/TensorRT-LLM/issues/983

Thank you

@hiro-v hiro-v added P2: nice to have Nice to have feature type: feature request A new feature labels Jul 9, 2024
@Van-QA Van-QA added this to Menlo Sep 5, 2024
@dan-menlo dan-menlo changed the title [Request] Support for logits_prob feat: TensorRT-LLM Support for logits_prob Sep 8, 2024
@dan-menlo dan-menlo moved this to Investigating in Menlo Oct 13, 2024
@gabrielle-ong
Copy link

gabrielle-ong commented Nov 28, 2024

Closing all open Tensorrt-llm stories due to TensorRT-LLM not supporting Desktop
Parent issue: menloresearch/cortex.cpp#1742

@github-project-automation github-project-automation bot moved this from Investigating to Review + QA in Menlo Nov 28, 2024
@gabrielle-ong gabrielle-ong moved this from Review + QA to Completed in Menlo Nov 28, 2024
@gabrielle-ong gabrielle-ong added status: wontfix This will not be worked on and removed P2: nice to have Nice to have feature type: feature request A new feature labels Nov 28, 2024
@gabrielle-ong gabrielle-ong moved this from Completed to Discontinued in Menlo Dec 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
status: wontfix This will not be worked on
Projects
Archived in project
Development

No branches or pull requests

2 participants