You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Signed-off-by: Jie WU <[email protected]>
add request metrics
Signed-off-by: Jie WU <[email protected]>
rename api and metrics
fix go mod
Adding metrics handler
Signed-off-by: Jie WU <[email protected]>
Adding metrics handler
Signed-off-by: Jie WU <[email protected]>
add request metrics
rename api and metrics
fix mod
Updated request metrics to be handled in server processing loop
Signed-off-by: Jie WU <[email protected]>
Updated request metrics to be handled in server processing loop
Signed-off-by: Jie WU <[email protected]>
fix go mod
Signed-off-by: Jie WU <[email protected]>
fix go mod
Signed-off-by: Jie WU <[email protected]>
remove preconfigured buffered response
Signed-off-by: Jie WU <[email protected]>
Add streamed response
Signed-off-by: Jie WU <[email protected]>
Handle latency with response
Signed-off-by: Jie WU <[email protected]>
refactor
Signed-off-by: Jie WU <[email protected]>
fmt
Signed-off-by: Jie WU <[email protected]>
fmt
Signed-off-by: Jie WU <[email protected]>
fmt
Signed-off-by: Jie WU <[email protected]>
refactor server
Signed-off-by: Jie WU <[email protected]>
metrics auth
// RecordRequstLatencies records duration of request.
78
+
funcRecordRequestLatencies(modelName, targetModelNamestring, received time.Time, complete time.Time) bool {
79
+
if!complete.After(received) {
80
+
klog.Errorf("request latency value error for model name %v, target model name %v: complete time %v is before received time %v", modelName, targetModelName, complete, received)
0 commit comments