Commit Graph

3376 Commits

Author SHA1 Message Date
Rafael Fernández López
14fe7225c1 kubeadm: Improve resiliency in CreateOrMutateConfigMap
CreateOrMutateConfigMap was not resilient when it was trying to Create
the ConfigMap. If this operation returned an unknown error the whole
operation would fail, because it was strict in what error it was
expecting right afterwards: if the error returned by the Create call
was a IsAlreadyExists error, it would work fine. However, if an
unexpected error (such as an EOF) happened, this call would fail.

We are seeing this error specially when running control plane node
joins in an automated fashion, where things happen at a relatively
high speed pace.

It was specially easy to reproduce with kind, with several control
plane instances. E.g.:

```
[upload-config] Storing the configuration used in ConfigMap "kubeadm-config" in the "kube-system" Namespace
I1130 11:43:42.788952     887 round_trippers.go:443] POST https://172.17.0.2:6443/api/v1/namespaces/kube-system/configmaps?timeout=10s  in 1013 milliseconds
Post https://172.17.0.2:6443/api/v1/namespaces/kube-system/configmaps?timeout=10s: unexpected EOF
unable to create ConfigMap
k8s.io/kubernetes/cmd/kubeadm/app/util/apiclient.CreateOrMutateConfigMap
	/go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/cmd/kubeadm/app/util/apiclient/idempotency.go:65
```

This change makes this logic more resilient to unknown errors. It will
retry on the light of unknown errors until some of the expected error
happens: either `IsAlreadyExists`, in which case we will mutate the
ConfigMap, or no error, in which case the ConfigMap has been created.
2019-11-30 22:48:16 +01:00
Antonio Ojea
335a3e9efb kubeadm: use correct IP family for etcd localhost
kubeadm always use the IPv4 localhost address by defaultA for etcd

The probe hostname is obtained before the generation of the etcd
parameters, so it can't detect the right IP familiy for the
host of the probe.
This causes that with IPv6 clusters doesn't work because the probe
uses the IPv4 localhost address.

This patchs configures the right localhost address based on the used
AdvertiseAddress IP family.
2019-11-30 17:25:30 +01:00
Antonio Ojea
798d2fb75a Fix comment typo 2019-11-29 13:38:47 +01:00
Lucas Käldström
b46e541eee Add a YAML MetaFactory
- Use `schema.TypeMeta` instead of custom `struct` for VK
- More strict check on GVK after `Interpret` in `SplitYAMLDocuments`
- Adjust `Interpret` comment to include JSON
2019-11-27 17:44:59 +08:00
Lubomir I. Ivanov
ce68a89d26 kubeadm: simplify discover/token and add detailed unit tests
- Add retrieveValidatedConfigInfo to be able to better unit
test the function.
- Break some of the logic in RetrieveValidatedConfigInfo into
helper functions.
- Pass JoinConfiguration.Discovery to RetrieveValidatedConfigInfo
instead of JoinConfiguration.
- Use the discovery timeout per API call to fetch cluster-info
(optionally the user value can be slit in 2).
- Add detailed unit tests for retrieveValidatedConfigInfo.
2019-11-27 01:16:43 +02:00
Kubernetes Prow Robot
2bc3804a0e Merge pull request #81319 from neolit123/1.16-kubeadm-upgrade-health-check
kubeadm: add a upgrade health check that deploys a Job
2019-11-26 10:52:51 -08:00
Kubernetes Prow Robot
e51caab732 Merge pull request #85603 from bart0sh/PR008-kubeadm-dont-check-if-image-exists
kubeadm: don't check if image exists before pulling
2019-11-26 05:27:10 -08:00
Rostislav M. Georgiev
b881f19c8b kubeadm: Group centric component configs
kubeadm's current implementation of component config support is "kind" centric.
This has its downsides. Namely:
- Kind names and numbers can change between config versions.
  Newer kinds can be ignored. Therefore, detection of a version change is
  considerably harder.
- A component config can have only one kind that is managed by kubeadm.
Thus a more appropriate way to identify component configs is required.

Probably the best solution identified so far is a config group.
A group name is unlikely to change between versions, while the kind names and
structure can.
Tracking component configs by group name allows us to:
- Spot more easily config version changes and manage alternate versions.
- Support more than one kind in a config group/version.
- Abstract component configs by hiding their exact structure.

Hence, this change rips off the old kind based support for component configs
and replaces it with a group name based one. This also has the following
extra benefits:
- More tests were added.
- kubeadm now errors out if an unsupported version of a known component group
  is used.

Signed-off-by: Rostislav M. Georgiev <rostislavg@vmware.com>
2019-11-26 13:55:28 +02:00
Arvinderpal Wander
e8ee8622b8 Add support for new dual-stack flags for kubernetes-controller-
manager in kubeadm:
 - node-cidr-mask-size-ipv4
 - node-cidr-mask-size-ipv6
2019-11-25 09:11:16 -08:00
Ed Bartosh
7898b3f0a8 kubeadm: update image pull tests 2019-11-25 15:04:28 +02:00
Ed Bartosh
f6f3738a00 kubeadm: don't check if image exists before pulling
Removed image existence as kubeadm may miss
image tags if they're updated.
2019-11-25 13:00:50 +02:00
Kubernetes Prow Robot
f01a4d0608 Merge pull request #85265 from SataQiu/fix-kubeadm-20191114
kubeadm: reset raises warnings if it cannot delete folders
2019-11-22 17:30:17 -08:00
Lubomir I. Ivanov
906d3159b1 kubeadm: add a upgrade health check that deploys a Job
- Add a new preflight check for upgrade that runs the pause container
with -v in a Job.
- Wait for the Job to complete and return an error after N seconds.
- Manually clean the Job because we don't have the TTL controller
enabled in kubeadm yet (it's still alpha).
2019-11-22 18:04:40 +02:00
Arvinderpal Wander
c025cfcd9c Initialize FeatureGate map for KubeProxy config. #1929 2019-11-21 20:46:25 -08:00
Ted Yu
5be960c34d Fill in default node cidr mask size when dual stack is not enabled 2019-11-20 19:50:33 -08:00
Kubernetes Prow Robot
01e1dcf429 Merge pull request #85351 from rosti/kubeadm-fix-kubelet-version-check
kubeadm: Use only stdout when calling kubelet for its version
2019-11-15 19:09:41 -08:00
Kubernetes Prow Robot
b8c29966cb Merge pull request #85348 from ereslibre/validator-names
kubeadm: use the right name for the no validator function
2019-11-15 16:17:42 -08:00
Rostislav M. Georgiev
d425520806 kubeadm: Use only stdout when calling kubelet for its version
Currently this uses the combined kubelet output (stdout + stderr), but this
causes parsing issues if the kubelet logs something on stderr.
Thus we ignore the entire stderr and use stdout only.

We do disable a couple of tests here. That is because the fakeexecer only
supports combined output and return a "not supported" error if `.Output()`
gets invoked thus permanently failing those.

Signed-off-by: Rostislav M. Georgiev <rostislavg@vmware.com>
2019-11-15 18:50:40 +02:00
Rafael Fernández López
d049eb71fa kubeadm: use the right name for the no validator function 2019-11-15 16:03:40 +01:00
SataQiu
b7b10fc899 kubeadm: reset raises warnings if it cannot delete folders 2019-11-15 22:54:05 +08:00
Kubernetes Prow Robot
d1e8702d36 Merge pull request #85201 from fabriziopandini/add-retry-to-etcd
kubeadm: add retry to etcd calls
2019-11-14 07:07:34 -08:00
fabriziopandini
0573a2227f add retry to etcd operations 2019-11-14 09:27:03 +01:00
Kubernetes Prow Robot
c6a38a9700 Merge pull request #85229 from liggitt/kubeadm-imports
guard kubeadm dependencies on k8s.io/kubernetes
2019-11-13 17:29:05 -08:00
Kubernetes Prow Robot
f5df681b80 Merge pull request #85156 from chuckha/remove-infinte-poll
[kubeadm] Remove nested retries
2019-11-13 13:04:23 -08:00
Kubernetes Prow Robot
c33af5b2fe Merge pull request #85108 from rajansandeep/prepcorednsfor1.17-kubeadm
Bump CoreDNS version to 1.6.5 and update manifest
2019-11-13 13:03:22 -08:00
Jordan Liggitt
752cda4fc4 guard kubeadm dependencies on k8s.io/kubernetes 2019-11-13 15:05:11 -05:00
Kubernetes Prow Robot
66e0ca800d Merge pull request #85137 from rosti/kubeadm-amend-hyperkube-deprecation
kubeadm: Amend the hyperkube deprecation change
2019-11-13 09:28:09 -08:00
Kubernetes Prow Robot
4f6aa5a921 Merge pull request #85091 from fabriziopandini/alpha-certs-reads-incluster-config
kubeadm: alpha certs command should read in-cluster config
2019-11-13 09:27:42 -08:00
Kubernetes Prow Robot
9648d56765 Merge pull request #84118 from neolit123/1.17-kubeadm-add-kubelet-post-phase
kubeadm: enable kubelet client certificate rotation on primary CP nodes
2019-11-13 09:25:13 -08:00
Chuck Ha
8fcf892498 Remove an infinite poll
This infinite poll can make kubeadm init/join hang forever due
to a lack of function cancelling.

Signed-off-by: Chuck Ha <chuckh@vmware.com>
2019-11-13 10:10:41 -05:00
Lubomir I. Ivanov
431fd630ff kubeadm: enable kubelet client certificate rotation on primary CP nodes
- Don't always print to stdout that the kubelet is starting.
instead delegate this to the callers of TryStartKubelet.
- Add a new root kubeadm init phase called "kubelet-finalize"
- Add a sub-phase to "kubelet-finalize"
called "experimental-cert-rotation"
- "cert-rotation" performs the following actions:
  - tries to guess if kubelet client cert rotation is enabled
  - update the kubelet.conf to use the rotatable cert/key
2019-11-13 15:49:33 +02:00
Kubernetes Prow Robot
eedfb6bc7a Merge pull request #79223 from rosti/kubeadm-component-configs-versioned
kubeadm: component configs versioned
2019-11-12 15:38:11 -08:00
Kubernetes Prow Robot
ea2750eaa6 Merge pull request #84718 from yastij/remove-validators
remove system validators package from kubeadm and use k8s.io/system-validators
2019-11-12 11:50:53 -08:00
Sandeep Rajan
f931dad722 bump coredns version and update manifest 2019-11-12 13:05:09 -05:00
Rostislav M. Georgiev
ec4840f0b2 kubeadm: Amend the hyperkube deprecation change
The PR introducing 5bb8069 got merged accidentally (the CI robot not
respecting a hold). Hence, the feedback to that PR is merged separately.

Signed-off-by: Rostislav M. Georgiev <rostislavg@vmware.com>
2019-11-12 16:23:05 +02:00
fabriziopandini
a372dff648 alpha certs tries to read in-cluster config 2019-11-12 10:35:31 +01:00
Yassine TIJANI
b9fe59c93e remove system validators package from kubeadm and use k8s.io/system-validators instead
Signed-off-by: Yassine TIJANI <ytijani@vmware.com>
2019-11-12 01:09:56 +01:00
Kubernetes Prow Robot
e008523f76 Merge pull request #85092 from fabriziopandini/alpha-certs-skips-missing-certs
kubeadm: alpha certs should skip missing files
2019-11-11 14:46:47 -08:00
Kubernetes Prow Robot
fc0bf06983 Merge pull request #85044 from neolit123/1.17-deprecate-cri-socket-upgrade
kubeadm: remove the deprecated "--cri-socket" flag for "upgrade apply"
2019-11-11 12:56:37 -08:00
Rostislav M. Georgiev
5bb80694ec kubeadm: Deprecate hyperkube use
As the hyperkube image is itself deprecated and moved out of tree, its use with
kubeadm gets deprecated too. Hence, deprecation messages will be printed when
it is used.

Signed-off-by: Rostislav M. Georgiev <rostislavg@vmware.com>
2019-11-11 17:03:25 +02:00
fabriziopandini
7d986a982c alpha certs skip missing files 2019-11-11 15:48:49 +01:00
Kubernetes Prow Robot
939e1e6488 Merge pull request #85043 from neolit123/1.17-enable-secure-ports
kubeadm: use the secure ports for kube-scheduler and kcm health checks
2019-11-10 13:27:01 -08:00
Lubomir I. Ivanov
ebfdb25c05 kubeadm: always use a short timeout for clientset creation
ToClientSet() in kubeconfig.go creates a clientset from
the passed Config object (kubeconfig). For IP addresses
that are not reachable e.g. Get() calls for ConfigMaps
can block for a few minutes with the default timeout.

Modify the timeout to a shorter value by passing an override.
2019-11-10 05:39:38 +02:00
Kubernetes Prow Robot
01676976c9 Merge pull request #85017 from LouisJackman/remove-goroutine-leak-in-kubeadm-wait
Remove potential Goroutine leak in kubeadm wait.go
2019-11-09 14:03:40 -08:00
Lubomir I. Ivanov
efc0ec5ffe kubeadm: remove the deprecated "--cri-socket" flag for "upgrade apply"
Remove the flag "--cri-socket" from the "kubeadm upgrade apply"
command. It has been marked as deprecated since 1.14.
2019-11-09 23:34:56 +02:00
Lubomir I. Ivanov
23ba857b50 kubeadm: use the secure ports for kube-scheduler and kcm health checks
The insecure ports were deprecated in 1.12 and 1.13.
2019-11-09 23:21:39 +02:00
Kubernetes Prow Robot
9920b733ba Merge pull request #85024 from neolit123/1.17-fix-etcd-node-upgrade
kubeadm: fix skipped etcd upgrade on secondary CP nodes
2019-11-09 07:13:39 -08:00
Kubernetes Prow Robot
49a9b6cadf Merge pull request #84727 from danwinship/ipv6-bind
fix apiserver to advertise IPv6 endpoints if bound to IPv6
2019-11-08 18:44:24 -08:00
Lubomir I. Ivanov
cc0320d41a kubeadm: fix skipped etcd upgrade on secondary cp nodes 2019-11-09 03:59:59 +02:00
Kubernetes Prow Robot
3c7706bf9c Merge pull request #84591 from TheLastProject/feature/print-join-as-admin-command
Let token create help with joining a master
2019-11-08 13:48:43 -08:00