Skip to content

Add EPP micro benchmark #462

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
liu-cong opened this issue Mar 7, 2025 · 1 comment
Open

Add EPP micro benchmark #462

liu-cong opened this issue Mar 7, 2025 · 1 comment

Comments

@liu-cong
Copy link
Contributor

liu-cong commented Mar 7, 2025

What would you like to be added:

https://github.com/kubernetes-sigs/gateway-api-inference-extension/pull/460/files removed the outdated benchmark.go file, which wasn't using the integration framework. We should add it back but do it properly.

  • Run all the reconcilers and don't inject objects to the datastore.
  • Create the inference pool and model CRs for stress testing

Why is this needed:

This helps us understand performance characteristics of EPP, such as how much QPS it can handle.

@kfswain
Copy link
Collaborator

kfswain commented Apr 22, 2025

We seem to have other benchmarking solutions proposed, we need to converge on an agreed upon solution. I think we can hold on this for now and build some direction alignment

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants