# Cluster Management

This doc is in progress.

## Upgrading a cluster

The `cluster/kube-push.sh` script performs a rudimentary update; a robust live cluster update system is a 1.0 roadmap item.
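
As a minimal sketch, assuming a release has already been built and `KUBERNETES_PROVIDER` is set to the same provider used when the cluster was brought up:

```
# Run from the root of the kubernetes repository; pushes the current release
# to the existing cluster for the configured provider.
$ cluster/kube-push.sh
```
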
## Upgrading to a different API version

There is a sequence of steps to upgrade to a new API version:

1. Turn on the new API version.
2. Upgrade the cluster's storage to use the new API version.
3. Upgrade all config files. Identify users of the old API version endpoints.
4. Turn off the old API version.

### Turn on or off an API version for your cluster

TODO: There's an apiserver flag for this.
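
The flag here is an assumption based on how later releases expose this; verify against your build with `kube-apiserver --help`. A hypothetical sketch:

```
# Assumed flag and format: enable v1beta3 and disable v1beta1 at apiserver startup.
# (Other required apiserver flags elided.)
$ kube-apiserver --runtime-config=api/v1beta3=true,api/v1beta1=false ...
```
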
### Switching your cluster's storage API version

TODO: This functionality hasn't been written yet.
### Switching your config files to a new API version

You can use the `kube-version-change` utility to convert config files between different API versions.

```
$ hack/build-go.sh cmd/kube-version-change
$ _output/local/go/bin/kube-version-change -i myPod.v1beta1.yaml -o myPod.v1beta3.yaml
```
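
If you have many config files to convert, a shell loop over the utility works; the `configs/` directory and file-naming scheme here are hypothetical:

```
# Hypothetical layout: convert every *.v1beta1.yaml under configs/ to a v1beta3 twin.
$ for f in configs/*.v1beta1.yaml; do _output/local/go/bin/kube-version-change -i "$f" -o "${f%.v1beta1.yaml}.v1beta3.yaml"; done
```
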
### Maintenance on a Node

If you need to reboot a node (such as for a kernel upgrade, libc upgrade, hardware repair, etc.), and the downtime is brief, then when the Kubelet restarts, it will attempt to restart the pods scheduled to it. If the reboot takes longer, then the node controller will terminate the pods that are bound to the unavailable node. If there is a corresponding replication controller, then a new copy of the pod will be started on a different node. So, in the case where all pods are replicated, upgrades can be done without special coordination.

If you want more control over the upgrading process, you may use the following workflow (a consolidated sketch of these commands appears after the list):

1. Mark the node to be rebooted as unschedulable:
   `kubectl update nodes $NODENAME --patch='{"apiVersion": "v1beta1", "unschedulable": true}'`.
   This keeps new pods from landing on the node while you are trying to get them off.
1. Get the pods off the machine, via any of the following strategies:
   1. Wait for finite-duration pods to complete.
   1. Delete pods with `kubectl delete pods $PODNAME`.
   1. For pods with a replication controller, the pod will eventually be replaced by a new pod which will be scheduled to a new node. Additionally, if the pod is part of a service, then clients will automatically be redirected to the new pod.
   1. For pods with no replication controller, you need to bring up a new copy of the pod and, assuming it is not part of a service, redirect clients to it.
1. Work on the node.
1. Make the node schedulable again:
   `kubectl update nodes $NODENAME --patch='{"apiVersion": "v1beta1", "unschedulable": false}'`.
   If you deleted the node's VM instance and created a new one, then a new schedulable node resource will be created automatically when you create a new VM instance (if you're using a cloud provider that supports node discovery; currently this is only GCE, not including CoreOS on GCE using kube-register). See [Node](node.md).
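
Putting these steps together, a minimal console sketch of the reboot workflow, assuming every pod on the node is managed by a replication controller (`$NODENAME` and `$PODNAME` are placeholders):

```
# Cordon: keep new pods from landing on the node.
$ kubectl update nodes $NODENAME --patch='{"apiVersion": "v1beta1", "unschedulable": true}'

# Evict: delete each pod on the node; its replication controller recreates it elsewhere.
$ kubectl delete pods $PODNAME

# ... reboot the node, do the kernel upgrade, hardware repair, etc. ...

# Uncordon: let the scheduler place new pods on the node again.
$ kubectl update nodes $NODENAME --patch='{"apiVersion": "v1beta1", "unschedulable": false}'
```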
|