Provide Flexability in InferencePool Assignment #252

danehans · 2025-01-29T16:49:40Z

Currently, EPP will only reconcile an InferencePool that matches its configured poolName CLI flag. A more flexible approach should be considered since changing InferencePools requires a restart. One approach to consider is using labels, where EPP can use a predicate to filter reconciling all InferencePools in its namespace to only one that matches its configured poolLabel matcher. EPP can follow Gateway API guidelines for conflict resolution.

The text was updated successfully, but these errors were encountered:

danehans · 2025-02-03T15:07:02Z

@ahg-g @kfswain @robscott PTAL and provide your thoughts when you have a moment.

ahg-g · 2025-02-03T20:34:35Z

I am supportive, would you like to make a concrete proposal? We need to resolve a few things:

If we are going with a label, define a label key
How to ensure that this aligns with the configuration api that defines a service reference? Should the EPP look at that instead? or spec that the EPP service reference and the label must be aligned, and if not the behavior is unknown?
Since our current implementation of EPP assumes a single inferencePool, we need to also define a predictable behavior as to what exact pool the EPP will serve until we decide whether or not we want EPP to support multiple pools.

This was referenced Jan 29, 2025

InferencePool Ownership #117

Closed

Remove EndpointSlice dependency #256

Closed

ahg-g mentioned this issue Feb 2, 2025

Replace EndpointSlice reconciler with pod list backed by informer #271

Merged

kfswain added the kind/feature Categorizes issue or PR as related to a new feature. label Feb 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Provide Flexability in InferencePool Assignment #252

Provide Flexability in InferencePool Assignment #252

danehans commented Jan 29, 2025

danehans commented Feb 3, 2025

ahg-g commented Feb 3, 2025 •

edited

Loading

Provide Flexability in InferencePool Assignment #252

Provide Flexability in InferencePool Assignment #252

Comments

danehans commented Jan 29, 2025

danehans commented Feb 3, 2025

ahg-g commented Feb 3, 2025 • edited Loading

ahg-g commented Feb 3, 2025 •

edited

Loading