kubernetes/cluster/addons
Kubernetes Submit Queue 4f91113075
Merge pull request #54826 from mindprince/addon-manager
Automatic merge from submit-queue (batch tested with PRs 54826, 53576, 55591, 54946, 54825). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Run nvidia-gpu device-plugin daemonset as an addon on GCE nodes that have nvidia GPUs attached

- Instead of the old `Accelerators` feature that added `alpha.kubernetes.io/nvidia-gpu` resource, use the new `DevicePlugins` feature that adds vendor specific resources. (In case of nvidia GPUs it will
add `nvidia.com/gpu` resource.)

- Add node label to GCE nodes with accelerators attached. This node label is the same as what GKE attaches to node pools with accelerators attached. (For example, for nvidia-tesla-p100 GPU, the label would be `cloud.google.com/gke-accelerator=nvidia-tesla-p100`) This will help us target accelerator specific
daemonsets etc. to these nodes.

- Run nvidia-gpu device-plugin daemonset as an addon on GCE nodes that have nvidia GPUs attached.

- Some minor documentation improvements in addon manager.

**Release note**:
```release-note
GCE nodes with NVIDIA GPUs attached now expose `nvidia.com/gpu` as a resource instead of `alpha.kubernetes.io/nvidia-gpu`.
```

/sig cluster-lifecycle
/sig scheduling
/area hw-accelerators

https://github.com/kubernetes/features/issues/368
2017-11-13 14:46:55 -08:00
..
addon-manager Remove redundant comment and improve documentation. 2017-11-02 12:58:29 -07:00
calico-policy-controller Disable the grace termination period for the calico/node pod 2017-11-02 17:31:35 +00:00
cluster-loadbalancing Update OWNERS files for networking components 2017-08-03 11:08:54 -07:00
cluster-monitoring Bump version of prometheus-to-sd to 0.2.2. 2017-10-26 15:54:54 +02:00
dashboard update dashboard image version 2017-07-31 11:08:08 +08:00
device-plugins/nvidia-gpu Run nvidia-gpu device-plugin daemonset as an addon on GCE nodes that have nvidia GPUs attached. 2017-11-02 12:58:29 -07:00
dns Update kube-dns 1.14.7 2017-10-23 14:37:13 -07:00
dns-horizontal-autoscaler Merge kube-dns-autoscaler templates into a single file 2017-10-03 09:43:57 -07:00
etcd-empty-dir-cleanup make all static system pods critical 2017-06-12 15:22:04 -07:00
fluentd-elasticsearch Merge pull request #55506 from Random-Liu/fix-cri-fluentd 2017-11-11 10:45:27 -08:00
fluentd-gcp Fix CRI fluentd config. 2017-11-10 20:55:56 +00:00
ip-masq-agent Update OWNERS files for networking components 2017-08-03 11:08:54 -07:00
kube-proxy Merge pull request #52003 from vfreex/mount-lib-modules 2017-10-25 11:38:36 -07:00
metadata-proxy Add prometheus-to-sd-exporter to metadata-proxy addon; bump to proxy to v0.1.4 and e2e to v0.0.2; remove configmag 2017-11-03 10:23:05 -07:00
metrics-server Adjust resources for Metrics Server 2017-10-31 10:42:00 +01:00
node-problem-detector update related files 2017-08-24 17:49:18 +08:00
python-image Always --pull in docker build to ensure recent base images 2017-01-10 16:21:05 -08:00
rbac don't add kubelet legacy binding if we aren't registering the master kubelet 2017-10-26 13:30:59 -07:00
registry fix invalid url link 2017-08-24 18:50:14 +08:00
storage-class [addon/storage-class] update storageclass groupversion in storage-class 2017-10-22 19:50:47 +08:00
BUILD Run hack/update-bazel.sh to generate BUILD files 2017-08-02 18:33:25 -07:00
README.md Updated cluster/addons readme to match and point to docs 2017-10-18 10:36:24 -04:00

Legacy Cluster add-ons

For more information on add-ons see the documentation.

Overview

Cluster add-ons are resources like Services and Deployments (with pods) that are shipped with the Kubernetes binaries and are considered an inherent part of the Kubernetes clusters.

There are currently two classes of add-ons:

  • Add-ons that will be reconciled.
  • Add-ons that will be created if they don't exist.

More details could be found in addon-manager/README.md.

Cooperating Horizontal / Vertical Auto-Scaling with "reconcile class addons"

"Reconcile" class addons will be periodically reconciled to the original state given by the initial config. In order to make Horizontal / Vertical Auto-scaling functional, the related fields in config should be left unset. More specifically, leave replicas in ReplicationController / Deployment / ReplicaSet unset for Horizontal Scaling, leave resources for container unset for Vertical Scaling. The periodic reconcile won't clobbered these fields, hence they could be managed by Horizontal / Vertical Auto-scaler.

Add-on naming

The suggested naming for most of the resources is <basename> (with no version number). Though resources like Pod, ReplicationController and DaemonSet are exceptional. It would be hard to update Pod because many fields in Pod are immutable. For ReplicationController and DaemonSet, in-place update may not trigger the underlying pods to be re-created. You probably need to change their names during update to trigger a complete deletion and creation.

Analytics