Separate client vs provider APIEndpoints fields #1197
I'd also like to target this change for v1alpha2.
I think splitting them into Internal and External is a good option.
Yes, they SHOULD, not MUST. We still need to cater for scenarios where it is not possible.
With a client-side load balancer it is always possible.
If the APIEndpoints respect "cordoned" masters, then this shouldn't be an issue. When using a single endpoint, if that endpoint fails or is upgraded, then clients are left unable to connect at all.
Without HA and a LB you can't support this. At best you can only provide a deterministic means of selecting a control plane node, which CAPV selects as the oldest member of the control plane. It doesn't matter what MUST occur, but rather what the cluster and its operators design. If they design a cluster with no external HA and multiple control plane nodes, then it's possible those nodes don't stay static: one may be ejected, and if that's the oldest, then the cluster is unreachable.
Which part are you referring to that can't be supported?
How do you plan on designing this on-premise (without an ELB, as on VMC)? Most CAPV users will be on-premise, where a client-side load balancer makes the most sense.
@moshloop you keep mentioning a client-side load balancer. How would that work with external clients and integrations (such as Jenkins, etc.)?
DNS round-robin would be suitable for most external clients. If APIEndpoints are the source of truth, then different out-of-band controllers (client-side, DNS, F5, etc.) can be created that work across CAPI providers. A client-side load balancer could also be used for bootstrapping, and then pivot to a real load balancer once the load balancer is healthy via removal of a
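To illustrate the out-of-band idea, here is a minimal sketch of a controller step that rebuilds a round-robin DNS record from the endpoint list. Everything here is hypothetical: the `DNSRecord` type and `updateRoundRobin` helper are not part of Cluster API, and a real controller would watch Cluster objects and call a DNS provider's API instead of returning a value.

```go
package main

import "fmt"

// APIEndpoint mirrors the shape of the v1alpha1 status field: a host/port
// pair for a control plane member.
type APIEndpoint struct {
	Host string
	Port int
}

// DNSRecord is a made-up stand-in for a provider's round-robin A record.
type DNSRecord struct {
	Name  string
	Hosts []string
}

// updateRoundRobin rebuilds the record from the current set of APIEndpoints,
// treating the Cluster status as the source of truth.
func updateRoundRobin(name string, endpoints []APIEndpoint) DNSRecord {
	rec := DNSRecord{Name: name}
	for _, ep := range endpoints {
		rec.Hosts = append(rec.Hosts, ep.Host)
	}
	return rec
}

func main() {
	endpoints := []APIEndpoint{{Host: "10.0.0.1", Port: 6443}, {Host: "10.0.0.2", Port: 6443}}
	fmt.Println(updateRoundRobin("api.cluster.example.com", endpoints))
}
```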
/area api
Have we reached some consensus around this? I'd prefer to move to a single endpoint and have additional ones in a different slice.
How about:

```go
// APIEndpoints refers to all valid/healthy API Endpoints at their InternalIP address
APIEndpoints []string

// ClientEndpoint refers to a URL or IP that load balances across all APIEndpoints,
// or a random healthy APIEndpoint if no load balancer is available
ClientEndpoint string
```
@moshloop I'm good with that proposed suggestion, but I would remove the limitation that APIEndpoints refer to InternalIP, since how they are consumed by whatever provides the ClientEndpoint may be implementation dependent. (For example, DNS-based configuration would require public IPs.)
We should keep using a struct type for endpoints. I'd propose the following:

```go
// AdditionalAPIEndpoints refers to all valid/healthy API Endpoints to communicate with the control plane.
AdditionalAPIEndpoints []APIEndpoint `json:"additionalApiEndpoints,omitempty"`

// APIEndpoint refers to a URL or IP that load balances a control plane,
// or a random healthy endpoint if no load balancer is available.
APIEndpoint APIEndpoint `json:"apiEndpoint,omitempty"`
```
Hi @vincepri, I truly don't see how the above proposal is any different from having a single slice like there is today. It appears to add an additional field for no real reason other than avoiding accessing the first element of the original slice.
@akutz it provides a single canonical source for generating things like a kubeconfig, rather than just randomly taking the first field from a slice that is used for various different reasons. This gives us a way to not have to interpret provider intent when creating the client for setting NodeRefs or generating the kubeconfig secret for users.
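As a rough sketch of what a single canonical field buys (the `kubeconfigServerURL` helper below is a made-up name, not existing Cluster API code; the Host/Port shape is assumed from the proposal above):

```go
package main

import "fmt"

// APIEndpoint as in the proposal above: a single canonical host/port.
type APIEndpoint struct {
	Host string
	Port int
}

// kubeconfigServerURL is a hypothetical helper. With one canonical field
// there is nothing to interpret: the kubeconfig secret and the NodeRef
// client are always built from the same address.
func kubeconfigServerURL(ep APIEndpoint) string {
	return fmt.Sprintf("https://%s:%d", ep.Host, ep.Port)
}

func main() {
	fmt.Println(kubeconfigServerURL(APIEndpoint{Host: "api.example.com", Port: 6443}))
}
```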
We've seen lots of folks using a single API endpoint; others, like @moshloop, have use cases for more than one, which is OK. This proposal would simplify the 80% use case and allow for others as well.
Except that is what the doc on the single field now says, short of an LB. So to me this just separates things into two fields when all that's required is more explicit documentation on the existing field.
So this is intentionally random rather than merely unspecified: in theory, if you have a controller looping trying to get to a ControlPlaneReady state, and you always use the same endpoint (e.g. the first) and that endpoint happens to be down, then it will never reconcile, even though it could potentially reconcile with a random endpoint.
I am going to document this in the Load Balancer Provider CAEP.
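For illustration, a minimal sketch of that random selection (`pickEndpoint` is a hypothetical helper, and the Host/Port shape is assumed from the proposals above):

```go
package main

import (
	"fmt"
	"math/rand"
)

// APIEndpoint as assumed above: a host/port pair per control plane member.
type APIEndpoint struct {
	Host string
	Port int
}

// pickEndpoint chooses a random endpoint. If the controller always took
// endpoints[0] and that node were down, every reconcile attempt would fail
// the same way; a random choice means repeated loops eventually land on a
// healthy member.
func pickEndpoint(endpoints []APIEndpoint) (APIEndpoint, error) {
	if len(endpoints) == 0 {
		return APIEndpoint{}, fmt.Errorf("no API endpoints available")
	}
	return endpoints[rand.Intn(len(endpoints))], nil
}

func main() {
	eps := []APIEndpoint{{"10.0.0.1", 6443}, {"10.0.0.2", 6443}, {"10.0.0.3", 6443}}
	ep, _ := pickEndpoint(eps)
	fmt.Printf("https://%s:%d\n", ep.Host, ep.Port)
}
```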
I agree with the premises. Given that we are modifying the API used by all providers, I think good usability should also be a goal: both human (i.e. reading/editing the manifest) and programmatic. Have we considered the approach taken by corev1.NodeAddress? For example:

```go
type APIEndpointType string

const (
	ExternalAPIEndpoint APIEndpointType = "ExternalEndpoint"
	InternalAPIEndpoint APIEndpointType = "InternalEndpoint"
)

type APIEndpoint struct {
	Type APIEndpointType
	Host string
	Port int
}
```

With this approach, we can assign explicit meanings to API Endpoints (ExternalEndpoint, InternalEndpoint, etc.), and we can implement helper functions that provide the right semantics (e.g. set semantics) that all providers can use.
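For example, a sketch of one such shared helper, assuming the types above (the `EndpointsOfType` name is hypothetical, not an existing Cluster API function):

```go
package main

import "fmt"

type APIEndpointType string

const (
	ExternalAPIEndpoint APIEndpointType = "ExternalEndpoint"
	InternalAPIEndpoint APIEndpointType = "InternalEndpoint"
)

type APIEndpoint struct {
	Type APIEndpointType
	Host string
	Port int
}

// EndpointsOfType filters a Cluster's endpoints by meaning, so every
// provider resolves "external" vs "internal" the same way instead of
// interpreting a shared, untyped slice.
func EndpointsOfType(endpoints []APIEndpoint, t APIEndpointType) []APIEndpoint {
	var out []APIEndpoint
	for _, ep := range endpoints {
		if ep.Type == t {
			out = append(out, ep)
		}
	}
	return out
}

func main() {
	eps := []APIEndpoint{
		{Type: ExternalAPIEndpoint, Host: "api.example.com", Port: 6443},
		{Type: InternalAPIEndpoint, Host: "10.0.0.1", Port: 6443},
	}
	fmt.Println(EndpointsOfType(eps, ExternalAPIEndpoint))
}
```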
Does anyone use an authenticating proxy? If I did, I might want to list it under APIEndpoints. (I'm not sure if it merits its own type, e.g.
/close

Multiple endpoints become possible with Machine Load Balancers, but only the control plane endpoint which other masters/workers connect to is stored on the cluster object.
@moshloop: Closing this issue.
/kind feature
Describe the solution you'd like
Currently both external consumers of Cluster API and internal providers can use the existing APIEndpoints field in different ways. I'd like to propose that we separate these uses into at least two separate fields, so that each field has a single purpose and the semantics of how and when to consume the fields are well known.
External consumers are looking for an endpoint to pass to client tooling or a client SDK, which only accepts a single endpoint. In most cases, implementations here just take the head of the APIEndpoints list (sketched below); this is currently the behavior of clusterctl and https://github.com/vmware/cluster-api-upgrade-tool, for example.

Internal provider use cases vary by provider. The AWS provider populates this value with the load balancer address that is created as part of Cluster instantiation, while some other providers use it to track endpoints for each Control Plane Machine, in order to populate a load balancer or VIP configuration that does not exist prior to Machines being created.
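A minimal sketch of the head-of-the-slice pattern described above (`firstEndpoint` is a hypothetical stand-in for what external consumers effectively do today; the Host/Port shape mirrors the existing field's element type):

```go
package main

import "fmt"

// APIEndpoint mirrors the existing field's element type.
type APIEndpoint struct {
	Host string
	Port int
}

// firstEndpoint takes the head of the shared slice and hopes it is the
// client-facing address rather than a per-Machine bookkeeping entry --
// exactly the ambiguity this issue proposes to remove.
func firstEndpoint(endpoints []APIEndpoint) (string, error) {
	if len(endpoints) == 0 {
		return "", fmt.Errorf("cluster has no API endpoints yet")
	}
	return fmt.Sprintf("https://%s:%d", endpoints[0].Host, endpoints[0].Port), nil
}

func main() {
	url, err := firstEndpoint([]APIEndpoint{{Host: "10.0.0.1", Port: 6443}})
	fmt.Println(url, err)
}
```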
The dual use of this field causes quite a few potential pain points.