In hermetic test, add additional test cases and move k8sClient object creation so it's called once for all tests #278

BenjaminBraunDev · 2025-02-03T22:59:59Z

Adds additional testing for EPP logic for selecting model servers for an inference request. The two new test cases check:

If no model server has the requested LoRA active, selection is based solely on queue size and KV-cache utilization.
If a model server has the requested LoRA active but its queue size is above the scheduler's threshold value queueingThresholdLoRA, then a model server with a smaller queue is chosen instead, even though the busier one already has the LoRA loaded, to avoid overloading a server that is already handling heavy traffic.
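The two cases above can be sketched in Go as follows. This is only an illustrative sketch, not the scheduler's actual code: the `podMetrics` type, the `pickPod` and `leastLoaded` helpers, the pod and adapter names, and the threshold value of 50 are all assumptions made for the example. Only the name `queueingThresholdLoRA` comes from the description.

```go
package main

// podMetrics is a hypothetical, simplified view of one model server.
type podMetrics struct {
	Name         string
	QueueSize    int
	KVCacheUsage float64         // fraction in [0, 1]
	ActiveLoRAs  map[string]bool // adapters currently loaded on the server
}

// queueingThresholdLoRA reuses the name from the PR description.
// The value 50 is an assumption for this sketch.
const queueingThresholdLoRA = 50

// leastLoaded returns the pod with the smallest queue, breaking ties by
// lower KV-cache usage. pods must be non-empty.
func leastLoaded(pods []podMetrics) podMetrics {
	best := pods[0]
	for _, p := range pods[1:] {
		if p.QueueSize < best.QueueSize ||
			(p.QueueSize == best.QueueSize && p.KVCacheUsage < best.KVCacheUsage) {
			best = p
		}
	}
	return best
}

// pickPod prefers pods that already have the adapter active and whose queue
// is below the threshold. If none qualify, it falls back to load alone.
func pickPod(pods []podMetrics, adapter string) podMetrics {
	var affine []podMetrics
	for _, p := range pods {
		if p.ActiveLoRAs[adapter] && p.QueueSize < queueingThresholdLoRA {
			affine = append(affine, p)
		}
	}
	if len(affine) > 0 {
		return leastLoaded(affine)
	}
	return leastLoaded(pods)
}
```

With this sketch, a pod that holds the adapter but has a queue of 60 is skipped in favor of the least-loaded pod, while the same pod with a queue of 10 is chosen even if another pod has a shorter queue.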

This also moves the logic that adds the test InferenceModel and InferencePool objects to the k8sClient into BeforeSuit(), so it runs only once. Previously it ran for every test case and failed when it tried to create objects that already existed.
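The duplicate-object problem and the run-once fix can be illustrated with a small Go sketch. It does not reproduce the test's BeforeSuit() code: `fakeClient`, `setupSuite`, and `errAlreadyExists` are hypothetical stand-ins, and `sync.Once` is used here only as a simple way to show the same "create shared objects exactly once" idea.

```go
package main

import (
	"errors"
	"sync"
)

// errAlreadyExists is a hypothetical error standing in for the API server's
// "already exists" response.
var errAlreadyExists = errors.New("object already exists")

// fakeClient is a hypothetical stand-in for the k8sClient used by the tests.
type fakeClient struct {
	objects map[string]bool
}

// Create stores an object by name and fails if it was created before.
func (c *fakeClient) Create(name string) error {
	if c.objects[name] {
		return errAlreadyExists
	}
	c.objects[name] = true
	return nil
}

var (
	setupOnce sync.Once
	setupErr  error
)

// setupSuite creates the shared objects the first time it is called.
// Later calls do nothing and return the result of the first run.
func setupSuite(c *fakeClient, names []string) error {
	setupOnce.Do(func() {
		for _, n := range names {
			if err := c.Create(n); err != nil {
				setupErr = err
				return
			}
		}
	})
	return setupErr
}
```

Calling `setupSuite` from every test case is then safe, whereas calling `Create` directly a second time with the same name returns `errAlreadyExists`.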

Related Issue: #80

… setup to BeforeSuit() so it is set up once for all test cases. Add getter function to scheduling to reference queue threshold for lora affinity inside integration tests.

k8s-ci-robot · 2025-02-03T23:00:10Z

Hi @BenjaminBraunDev. Thanks for your PR.

I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

netlify · 2025-02-03T23:00:46Z

✅ Deploy Preview for gateway-api-inference-extension ready!

Name	Link
🔨 Latest commit	`68c91e4`
🔍 Latest deploy log	https://app.netlify.com/sites/gateway-api-inference-extension/deploys/67a54f6ecc89590008566f27
😎 Deploy Preview	https://deploy-preview-278--gateway-api-inference-extension.netlify.app

To edit notification comments on pull requests, go to your Netlify site configuration.

pkg/ext-proc/scheduling/scheduler.go

test/integration/hermetic_test.go

…ts, remove unreachable error check.

ahg-g · 2025-02-06T22:38:32Z

/ok-to-test

ahg-g · 2025-02-06T22:39:44Z

/approve

leaving lgtm to @liu-cong

k8s-ci-robot · 2025-02-06T22:39:51Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ahg-g, BenjaminBraunDev

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [ahg-g]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

test/integration/hermetic_test.go

… non-lora test case to use a different model name.

test/integration/hermetic_test.go

liu-cong · 2025-02-07T00:11:59Z

/lgtm

BenjaminBraunDev added 2 commits February 3, 2025 21:18

merge with main

e3a2015

k8s-ci-robot requested review from danehans and robscott February 3, 2025 23:00

k8s-ci-robot added cncf-cla: yes needs-ok-to-test labels Feb 3, 2025

k8s-ci-robot added the size/L label Feb 3, 2025

Merge branch 'kubernetes-sigs:main' into test-cases

3eb5845

liu-cong reviewed Feb 3, 2025

remove vestigial unit test from hermetic test, minor change to commen…

a89b080

…ts, remove unreachable error check.

k8s-ci-robot added the needs-rebase label Feb 4, 2025

k8s-ci-robot added ok-to-test and removed needs-ok-to-test needs-rebase labels Feb 6, 2025

k8s-ci-robot added the approved label Feb 6, 2025

liu-cong reviewed Feb 6, 2025

test/integration/hermetic_test.go (outdated, resolved)

test/integration/hermetic_test.go (outdated, resolved)

test/integration/hermetic_test.go (resolved)

BenjaminBraunDev added 2 commits February 6, 2025 23:31

Add test-case for sheddable that is not shed, fix nits and rename the…

cc89e72

… non-lora test case to use a different model name.

Merge branch 'kubernetes-sigs:main' into test-cases

fb2dcf5

liu-cong reviewed Feb 7, 2025

test/integration/hermetic_test.go (outdated, resolved)

Fix small typo.

68c91e4

k8s-ci-robot assigned liu-cong Feb 7, 2025

k8s-ci-robot added the lgtm label Feb 7, 2025

k8s-ci-robot merged commit 3ff0af8 into kubernetes-sigs:main Feb 7, 2025
8 checks passed

BenjaminBraunDev deleted the test-cases branch February 7, 2025 00:40
