Add running request gauge metric #604

JeffLuoo · 2025-03-28T18:51:26Z

Tested locally and compare the result with vLLM requests count gauge metrics:

Blue Line: vLLM

Green Line: EPP

Resolve #593

netlify · 2025-03-28T18:51:43Z

✅ Deploy Preview for gateway-api-inference-extension ready!

Name	Link
🔨 Latest commit	`319587c`
🔍 Latest deploy log	https://app.netlify.com/sites/gateway-api-inference-extension/deploys/67eb0eff228a460008f298bf
😎 Deploy Preview	https://deploy-preview-604--gateway-api-inference-extension.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

JeffLuoo · 2025-03-28T18:53:45Z

cc: @ahg-g @courageJ

ahg-g · 2025-03-28T19:01:22Z

pkg/epp/handlers/streamingserver.go

@@ -322,6 +323,7 @@ func (s *StreamingServer) HandleRequestBody(
 	if !ok {
 		return reqCtx, errutil.Error{Code: errutil.BadRequest, Msg: "model not found in request"}
 	}
+	metrics.IncRunningRequests(model)


Do we want to increment this only if we successfully responded with a picked endpoint? @smarterclayton wdyt?

If an endpoint is not picked, the gauge will decrease immediately in the error handling so I think it should be fine to increase the request number it here.

pkg/epp/handlers/streamingserver.go

pkg/epp/metrics/metrics.go

ahg-g · 2025-03-31T21:57:46Z

/approve
/lgtm

k8s-ci-robot · 2025-03-31T21:57:54Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ahg-g, JeffLuoo

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [ahg-g]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Mar 28, 2025

k8s-ci-robot requested review from danehans and kfswain March 28, 2025 18:51

k8s-ci-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Mar 28, 2025

ahg-g reviewed Mar 28, 2025

View reviewed changes

pkg/epp/handlers/streamingserver.go Outdated Show resolved Hide resolved

JeffLuoo force-pushed the request-gauge branch from 352f3a2 to 092769a Compare March 31, 2025 18:15

kfswain reviewed Mar 31, 2025

View reviewed changes

pkg/epp/metrics/metrics.go Show resolved Hide resolved

JeffLuoo force-pushed the request-gauge branch from 092769a to cd1c90c Compare March 31, 2025 21:52

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Mar 31, 2025

JeffLuoo requested review from kfswain and ahg-g March 31, 2025 21:52

[Metrics] Add running requests gauge metric

319587c

JeffLuoo force-pushed the request-gauge branch from cd1c90c to 319587c Compare March 31, 2025 21:54

k8s-ci-robot assigned ahg-g Mar 31, 2025

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Mar 31, 2025

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Mar 31, 2025

k8s-ci-robot merged commit 4d392ce into kubernetes-sigs:main Mar 31, 2025
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add running request gauge metric #604

Add running request gauge metric #604

JeffLuoo commented Mar 28, 2025 •

edited

Loading

netlify bot commented Mar 28, 2025 •

edited

Loading

JeffLuoo commented Mar 28, 2025

ahg-g Mar 28, 2025 •

edited

Loading

JeffLuoo Mar 31, 2025

ahg-g commented Mar 31, 2025

k8s-ci-robot commented Mar 31, 2025

Add running request gauge metric #604

Add running request gauge metric #604

Conversation

JeffLuoo commented Mar 28, 2025 • edited Loading

netlify bot commented Mar 28, 2025 • edited Loading

✅ Deploy Preview for gateway-api-inference-extension ready!

JeffLuoo commented Mar 28, 2025

ahg-g Mar 28, 2025 • edited Loading

Choose a reason for hiding this comment

JeffLuoo Mar 31, 2025

Choose a reason for hiding this comment

ahg-g commented Mar 31, 2025

k8s-ci-robot commented Mar 31, 2025

JeffLuoo commented Mar 28, 2025 •

edited

Loading

netlify bot commented Mar 28, 2025 •

edited

Loading

ahg-g Mar 28, 2025 •

edited

Loading