kubernetes

Author	SHA1	Message	Date
vikaschoudhary16	cedbd93255	Make 'pod' package to use unified checkpointManager Signed-off-by: vikaschoudhary16 <choudharyvikas16@gmail.com>	2018-04-16 01:30:20 -04:00
Antonin Stefanutti	a82c84e087	Init Kubelet runtime cache before dependent stats provider	2018-04-13 16:30:23 +02:00
Yu-Ju Hong	9a76f73978	Move hairpin mode logic to dockershim Also moves the CNI binary directory parsing logic into dockerhsim.	2018-04-11 09:21:17 -07:00
Yu-Ju Hong	37d30a0815	Remove outdated network plugin code The code was added to support rktnetes and non-CRI docker integrations. These legacy integrations have already been removed from the codebase. This change removes the compatibility code existing soley for the legacy integrations.	2018-04-11 09:21:17 -07:00
Rohit Agarwal	87dda3375b	Delete in-tree support for NVIDIA GPUs. This removes the alpha Accelerators feature gate which was deprecated in 1.10. The alternative feature DevicePlugins went beta in 1.10.	2018-04-02 20:17:01 -07:00
Mike Danese	7354bbe5ac	certs: only append locally discovered addresses when we got none from the cloudprovider The cloudprovider is right, and only cloudprovider addresses can be verified centrally, so don't add any extra.	2018-03-30 09:22:12 -07:00
Kubernetes Submit Queue	7a946e6fb0	Merge pull request #61870 from mikedanese/serverauth2 Automatic merge from submit-queue (batch tested with PRs 57658, 61304, 61560, 61859, 61870). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. certs: exclude more nonsensical addresses from SANs I noticed this when I saw 169.254.* SANs using server TLS bootstrap. This change excludes more nonsensical addresses from being requested as SANs in that flow.	2018-03-29 15:03:16 -07:00
Mike Danese	473d34eff6	certs: exclude more nonsensical addresses from SANs I noticed this when I saw 169.254.* SANs using server TLS bootstrap. This change excludes more nonsensical addresses from being requested as SANs in that flow.	2018-03-28 19:03:18 -07:00
Filipe Brandenburger	8df9274e02	Remove rktnetes code rktnetes is scheduled to be deprecated in 1.10 (#53601). According to the deprecation policy for beta CLI and flags, we can remove the feature in 1.11. Fixes #58721	2018-03-27 09:29:35 -07:00
Kubernetes Submit Queue	971c97af35	Merge pull request #61078 from hzxuzhonghu/kubelet-clean Automatic merge from submit-queue (batch tested with PRs 61487, 58353, 61078, 61219, 60792). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. remove dead code in kubelet clean up dead code /kind cleanup /sig node Release note: ```release-note NONE ```	2018-03-21 14:15:13 -07:00
Kubernetes Submit Queue	7bd2263566	Merge pull request #58714 from dcbw/cni-plugin-dirs Automatic merge from submit-queue (batch tested with PRs 59740, 59728, 60080, 60086, 58714). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. kubelet: make --cni-bin-dir accept a comma-separated list of CNI plugin directories Allow CNI-related network plugin drivers (kubenet, cni) to search a list of directories for plugin binaries instead of just one. This allows using an administrator-provided path and fallbacks to others (like the previous default of /opt/cni/bin) for backwards compatibility. ```release-note kubelet's --cni-bin-dir option now accepts multiple comma-separated CNI binary directory paths, which are search for CNI plugins in the given order. ``` @kubernetes/rh-networking @kubernetes/sig-network-misc @freehan @pecameron @rajatchopra	2018-03-19 21:34:39 -07:00
Kubernetes Submit Queue	b2ace84fc3	Merge pull request #51423 from jiaxuanzhou/imageGC Automatic merge from submit-queue (batch tested with PRs 51423, 53880). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Disable ImageGC when high threshold is set to 100 What this PR does / why we need it: Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes #51268 Special notes for your reviewer: Release note: ```release-note NONE ```	2018-03-19 19:35:22 -07:00
hzxuzhonghu	80872881ed	remove dead code in kubelet	2018-03-13 11:57:02 +08:00
Jing Xu	b2e744c620	Promote LocalStorageCapacityIsolation feature to beta The LocalStorageCapacityIsolation feature added a new resource type ResourceEphemeralStorage "ephemeral-storage" so that this resource can be allocated, limited, and consumed as the same way as CPU/memory. All the features related to resource management (resource request/limit, quota, limitrange) are avaiable for local ephemeral storage. This local ephemeral storage represents the storage for root file system, which will be consumed by containers' writtable layer and logs. Some volumes such as emptyDir might also consume this storage.	2018-03-02 15:10:08 -08:00
Dan Williams	8778e50083	kubelet: make --cni-bin-dir accept a comma-separated list of CNI plugin directories Allow CNI-related network plugin drivers (kubenet, cni) to search a list of directories for plugin binaries instead of just one. This allows using an administrator-provided path and fallbacks to others (like the previous default of /opt/cni/bin) for backwards compatibility.	2018-03-01 10:51:18 -06:00
Kubernetes Submit Queue	729f691d74	Merge pull request #60246 from mtaufen/backoff-pleg Automatic merge from submit-queue (batch tested with PRs 60157, 60337, 60246, 59714, 60467). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. backoff runtime errors in kubelet sync loop The runtime health check can race with PLEG's first relist, and this often results in an unnecessary 5 second wait during Kubelet bootstrap. This change aims to improve the performance. ```release-note NONE ```	2018-02-27 12:05:37 -08:00
Michael Taufen	b4bddcc998	expunge the word 'manifest' from Kubelet's config API The word 'manifest' technically refers to a container-group specification that predated the Pod abstraction. We should avoid using this legacy terminology where possible. Fortunately, the Kubelet's config API will be beta in 1.10 for the first time, so we still had the chance to make this change. I left the flags alone, since they're deprecated anyway. I changed a few var names in files I touched too, but this PR is the just the first shot, not the whole campaign (`git grep -i manifest \| wc -l -> 1248`).	2018-02-23 11:44:06 -08:00
Lantao Liu	d7b21a3358	Use container log manager in kubelet	2018-02-23 01:42:35 +00:00
Michael Taufen	7290313dfd	backoff runtime errors in kubelet sync loop The runtime health check can race with PLEG's first relist, and this often results in an unnecessary 5 second wait during Kubelet bootstrap. This change aims to improve the performance.	2018-02-22 11:54:31 -08:00
Kubernetes Submit Queue	742c9b158d	Merge pull request #59906 from abhi/log_stats Automatic merge from submit-queue (batch tested with PRs 54191, 59374, 59824, 55032, 59906). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Adding per container stats for CRI runtimes What this PR does / why we need it This commit aims to collect per container log stats. The change was proposed as a part of #55905. The change includes change the log path from /var/pod/<pod uid>/containername_attempt.log to /var/pod/<pod uid>/containername/containername_attempt.log. The logs are collected by reusing volume package to collect metrics from the log path. Fixes #55905 Special notes for your reviewer: cc @Random-Liu Release note: ``` Adding container log stats for CRI runtimes. ```	2018-02-21 19:40:42 -08:00
abhi	6649d38c96	Adding per container stats for CRI runtimes This commit aims to collect per container log stats. The change was proposed as a part of #55905. The change includes change of the log path from /var/pod/<pod uid>/containername_attempt.log to /var/pod/<pod uid>/containername/containername_attempt.log. The logs are collected by reusing volume package to collect metrics from the log path. Signed-off-by: abhi <abhi@docker.com>	2018-02-20 19:50:47 -08:00
jiaxuanzhou	039b695e29	Disable image GC when high threshold is set to 100	2018-02-20 14:07:19 +08:00
David Ashpole	960856f4e8	collect metrics on the /kubepods cgroup on-demand	2018-02-17 12:32:40 -08:00
Kubernetes Submit Queue	244549f02a	Merge pull request #59769 from dashpole/capacity_ephemeral_storage Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Collect ephemeral storage capacity on initialization What this PR does / why we need it: We have had some node e2e flakes where a pod can be rejected if it requests ephemeral storage. This is because we don't set capacity and allocatable for ephemeral storage on initialization. This PR causes cAdvisor to do one round of stats collection during initialization, which will allow it to get the disk capacity when it first sets the node status. It also sets the node to NotReady if capacities have not been initialized yet. Special notes for your reviewer: Release note: ```release-note NONE ``` /assign @jingxu97 @Random-Liu /sig node /kind bug /priority important-soon	2018-02-16 11:17:02 -08:00
Kubernetes Submit Queue	bfdd94c6a0	Merge pull request #59170 from cofyc/fix_kubelet_volume_metrics Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix kubelet PVC stale metrics What this PR does / why we need it: Volumes on each node changes, we should not only add PVC metrics into gauge vector. It's better use a collector to collector metrics from internal stats. Currently, if a PV (bound to a PVC `testpv`) is attached and used by node A, then migrated to node B or just deleted from node A later. `testpvc` metrics will not disappear from kubelet on node A. After a long running time, `kubelet` process will keep a lot of stale volume metrics in memory. For these dynamic metrics, it's better to use a collector to collect metrics from a data source (`StatsProvider` here), like [kube-state-metrics](https://github.com/kubernetes/kube-state-metrics) scraping metrics from kube-apiserver. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes https://github.com/kubernetes/kubernetes/issues/57686 Special notes for your reviewer: Release note: ```release-note Fix kubelet PVC stale metrics ```	2018-02-15 18:44:08 -08:00
David Ashpole	b259543985	collect ephemeral storage capacity on initialization	2018-02-15 17:33:22 -08:00
Yecheng Fu	fecff55c59	Fix kubelet PVC metrics using a volume stats collector. Volumes on each node changes, we should not only add PVC metrics into gauge vector. It's better use a collector to collector metrics from stats.	2018-02-11 23:48:06 +08:00
Kubernetes Submit Queue	475457537b	Merge pull request #59276 from roboll/roboll/kubelet-fix Automatic merge from submit-queue (batch tested with PRs 59276, 51042, 58973, 59377, 59472). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. kubelet: only register api source when connecting What this PR does / why we need it: before this change, an api source was always registered, even when there was no kubeclient. this lead to some operations blocking waiting for podConfig.SeenAllSources to pass, which it never would. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #59275 Special notes for your reviewer: Release note: ```release-note NONE ```	2018-02-07 12:00:40 -08:00
Walter Fender	e18e8ec3c0	Add context to all relevant cloud APIs This adds context to all the relevant cloud provider interface signatures. Callers of those APIs are currently satisfied using context.TODO(). There will be follow on PRs to push the context through the stack. For an idea of the full scope of this change please look at PR #58532.	2018-02-06 12:49:17 -08:00
rob boll	7da7b750fd	kubelet: only register api source when connecting before this change, an api source was always registered, even when there was no kubeclient. this lead to some operations blocking waiting for podConfig.SeenAllSources to pass, which it never would.	2018-02-01 15:28:02 -05:00
Kubernetes Submit Queue	06472a054a	Merge pull request #58930 from smarterclayton/background_rotate Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Only rotate certificates in the background Change the Kubelet to not block until the first certs have rotated (we didn't act on it anyway) and fall back to the bootstrap cert if the most recent rotated cert is expired on startup. The certificate manager originally had a "block on startup" rotation behavior to ensure at least one rotation happened on startup. However, since rotation may not succeed within the first time window the code was changed to simply print the error rather than return it. This meant that the blocking rotation has no purpose - it cannot cause the kubelet to fail, and it does block the kubelet from starting static pods before the api server becomes available. The current block behavior causes a bootstrapped kubelet that is also set to run static pods to wait several minutes before actually launching the static pods, which means self-hosted masters using static pods have a pointless delay on startup. Since blocking rotation has no benefit and can't actually fail startup, this commit removes the blocking behavior and simplifies the code at the same time. The goroutine for rotation now completely owns the deadline, the shouldRotate() method is removed, and the method that sets rotationDeadline now returns it. We also explicitly guard against a negative sleep interval and omit the message. Should have no impact on bootstrapping except the removal of a long delay on startup before static pods start. The other change is that an expired certificate from the cert manager is not considered a valid cert, which triggers an immediate rotation. This causes the cert manager to fall back to the original bootstrap certificate until a new certificate is issued. This allows the bootstrap certificate on masters to be "higher powered" and allow the node to function prior to initial approval, which means someone configuring the masters with a pre-generated client cert can be guaranteed that the kubelet will be able to communicate to report self-hosted static pod status, even if the first client rotation hasn't happened. This makes master self-hosting more predictable for static configuration environments. ```release-note When using client or server certificate rotation, the Kubelet will no longer wait until the initial rotation succeeds or fails before starting static pods. This makes running self-hosted masters with rotation more predictable. ```	2018-02-01 12:05:15 -08:00
Kubernetes Submit Queue	a18f086220	Merge pull request #59020 from brendandburns/kubelet-hang Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Remove setInitError. What this PR does / why we need it: Removes setInitError, it's not sure it was ever really used, and it causes the kubelet to hang and get wedged. Which issue(s) this PR fixes Fixes #46086 Special notes for your reviewer: If `initializeModules()` in `kubelet.go` encounters an error, it calls `runtimeState.setInitError(...)` `47d61ef472/pkg/kubelet/kubelet.go (L1339)` The trouble with this is that `initError` is never cleared, which means that `runtimeState.runtimeErrors()` always returns this `initError`, and thus pods never start sync-ing. In normal operation, this is expected and desired because eventually the runtime is expected to become healthy, but in this case, `initError` is never updated, and so the system just gets wedged. `47d61ef472/pkg/kubelet/kubelet.go (L1751)` We could add some retry to `initializeModules()` but that seems unnecessary, as eventually we'd want to just die anyway. Instead, just log fatal and die, a supervisor will restart us. Note, I'm happy to add some retry here too, if that makes reviewers happier. Release note: ```release-note Prevent kubelet from getting wedged if initialization of modules returns an error. ``` @feiskyer @dchen1107 @janetkuo @kubernetes/sig-node-bugs	2018-01-30 14:56:28 -08:00
Lantao Liu	68dadcfd15	Make eviction manager work with CRI container runtime. Signed-off-by: Lantao Liu <lantaol@google.com>	2018-01-30 17:57:46 +00:00
Brendan Burns	3a23c678c5	Remove setInitError.	2018-01-29 21:44:54 -08:00
Clayton Coleman	44493de195	Only rotate certificates in the background The certificate manager originally had a "block on startup" rotation behavior to ensure at least one rotation happened on startup. However, since rotation may not succeed within the first time window the code was changed to simply print the error rather than return it. This meant that the blocking rotation has no purpose - it cannot cause the kubelet to fail, and it does block the kubelet from starting static pods before the api server becomes available. The current block behavior causes a bootstrapped kubelet that is also set to run static pods to wait several minutes before actually launching the static pods, which means self-hosted masters using static pods have a pointless delay on startup. Since blocking rotation has no benefit and can't actually fail startup, this commit removes the blocking behavior and simplifies the code at the same time. The goroutine for rotation now completely owns the deadline, the shouldRotate() method is removed, and the method that sets rotationDeadline now returns it. We also explicitly guard against a negative sleep interval and omit the message. Should have no impact on bootstrapping except the removal of a long delay on startup before static pods start. Also add a guard condition where if the current cert in the store is expired, we fall back to the bootstrap cert initially (we use the bootstrap cert to communicate with the server). This is consistent with when we don't have a cert yet.	2018-01-28 17:48:17 -05:00
Kubernetes Submit Queue	47d61ef472	Merge pull request #58418 from yujuhong/deprecate-rktnetes Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add deprecation warnings for rktnetes flags What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #53601 Special notes for your reviewer: Release note: ```release-note rktnetes has been deprecated in favor of rktlet. Please see https://github.com/kubernetes-incubator/rktlet for more information. ```	2018-01-24 08:54:49 -08:00
Yu-Ju Hong	0957afbbd9	dockershim: clean up the legacy interface	2018-01-19 17:09:40 -08:00
Yu-Ju Hong	9728c56a5a	dockershim: call DockerService.Start() during grpc server startup	2018-01-19 16:31:18 -08:00
Yu-Ju Hong	794f03e0ad	Add deprecation warnings for rktnetes flags	2018-01-17 14:05:51 -08:00
Seth Jennings	19a546758c	kubelet: imagegc: exempt sandbox image	2018-01-17 15:10:44 -06:00
Jonathan Basseri	30b89d830b	Move scheduler code out of plugin directory. This moves plugin/pkg/scheduler to pkg/scheduler and plugin/cmd/kube-scheduler to cmd/kube-scheduler. Bulk of the work was done with gomvpkg, except for kube-scheduler main package.	2018-01-05 15:05:01 -08:00
Rohit Agarwal	f52628db60	Deprecate the alpha Accelerators feature gate. Encourage people to use DevicePlugins instead.	2017-12-19 13:38:56 -08:00
Kubernetes Submit Queue	94327c5f72	Merge pull request #56754 from dims/remove-hacks-for-mesos Automatic merge from submit-queue (batch tested with PRs 57127, 57011, 56754, 56601, 56483). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Remove hacks added for mesos What this PR does / why we need it: Since Mesos is no longer in your main repository and since we have things like dynamic kubelet configuration in progress, we should drop these undocumented, untested, private hooks. cmd/kubelet/app/server.go::CreateAPIServerClientConfig CreateAPIServerClientConfig::getRuntime pkg/kubelet/kubelet_pods.go::getPhase Also remove stuff from Dependencies struct that were specific to the Mesos integration (ContainerRuntimeOptions and Options) Also remove stale references in test/e2e and and test owners file Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note Drop hacks used for Mesos integration that was already removed from main kubernetes repository ```	2017-12-17 06:25:56 -08:00
Kubernetes Submit Queue	d936754269	Merge pull request #56287 from stewart-yu/removeDeprecatedCode Automatic merge from submit-queue (batch tested with PRs 54902, 56831, 56702, 56287, 56878). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Remove the kubelet's `--cloud-provider=auto-detect` feature What this PR does / why we need it: Set no cloud provider as the default in kubelet, remove deprecated explain and variable. This PR covers step 3: `v1.10 - completely remove the option to use auto-detect` For more details [https://github.com/kubernetes/kubernetes/issues/50986](https://github.com/kubernetes/kubernetes/issues/50986) Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes [https://github.com/kubernetes/kubernetes/issues/50986](https://github.com/kubernetes/kubernetes/issues/50986) Special notes for your reviewer: Release note: ```release-note [action required] Remove the kubelet's `--cloud-provider=auto-detect` feature ```	2017-12-16 09:33:42 -08:00
Kubernetes Submit Queue	2d57d9b1ea	Merge pull request #56146 from jiulongzaitian/style_code Automatic merge from submit-queue (batch tested with PRs 57172, 55382, 56147, 56146, 56158). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. delete useless params containerized Signed-off-by: zhangjie <zhangjie0619@yeah.net> What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note delete useless params containerized ```	2017-12-14 12:38:19 -08:00
Davanum Srinivas	7568462ec3	Remove hacks added for mesos Since Mesos is no longer in your main repository and since we have things like dynamic kubelet configuration in progress, we should drop these undocumented, untested, private hooks. cmd/kubelet/app/server.go::CreateAPIServerClientConfig CreateAPIServerClientConfig::getRuntime pkg/kubelet/kubelet_pods.go::getPhase Also remove stuff from Dependencies struct that were specific to the Mesos integration (ContainerRuntimeOptions and Options) Also remove stale references in test/e2e and and test owners file	2017-12-03 13:52:30 -05:00
stewart-yu	50520be649	completely remove the option to use auto-detect	2017-11-28 09:54:28 +08:00
Kubernetes Submit Queue	277d866111	Merge pull request #50984 from timothysc/checkpoint Automatic merge from submit-queue (batch tested with PRs 55812, 55752, 55447, 55848, 50984). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Initial basic bootstrap-checkpoint support What this PR does / why we need it: Adds initial support for Pod checkpointing to allow for controlled recovery of the control plane during self host failure conditions. fixes #49236 xref https://github.com/kubernetes/features/issues/378 Special notes for your reviewer: Proposal is here: https://docs.google.com/document/d/1hhrCa_nv0Sg4O_zJYOnelE8a5ClieyewEsQM6c7-5-o/edit?ts=5988fba8# 1. Controlled tests work, but I have not tested the self hosted api-server recovery, that requires validation and logs. /cc @luxas 2. In adding hooks for checkpoint manager much of the tests around basicpodmanager appears to be stub'd. This has become an anti-pattern in the code and should be avoided. 3. I need a node-e2e to ensure consistency of behavior. Release note: ``` Add basic bootstrap checkpointing support to the kubelet for control plane recovery ``` /cc @kubernetes/sig-cluster-lifecycle-misc @kubernetes/sig-node-pr-reviews	2017-11-21 17:57:40 -08:00
zhangjie	226f8b3c73	delete useless params containerized Signed-off-by: zhangjie <zhangjie0619@yeah.net>	2017-11-21 18:21:59 +08:00
Jiaying Zhang	1eb4e79453	Extends deviceplugin to gracefully handle full device plugin lifecycle. - Instead of using cm.capacity field to communicate device plugin resource capacity, this PR changes to use an explicit cm.GetDevicePluginResourceCapacity() function that returns device plugin resource capacity as well as any inactive device plugin resource. Kubelet syncNodeStatus call this function during its periodic run to update node status capacity and allocatable. After this call, device plugin can remove the inactive device plugin resource from its allDevices field as the update is already pushed to API server. - Extends device plugin checkpoint data to record registered resources so that we can finish resource removing even upon kubelet restarts. - Passes sourcesReady from kubelet to device plugin to avoid removing inactive pods during grace period of kubelet restart.	2017-11-20 23:40:14 -08:00
Timothy St. Clair	ed4401c126	Addition of bootstrap checkpointing	2017-11-20 21:54:15 -06:00
Kubernetes Submit Queue	563edef707	Merge pull request #55983 from mtaufen/seccomp-is-alpha Automatic merge from submit-queue (batch tested with PRs 55839, 54495, 55884, 55983, 56069). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. seccomp is an alpha feature and not feature gated Move SeccompProfileRoot to KubeletFlags and document flag as alpha. wrt https://github.com/kubernetes/kubernetes/pull/53833#issuecomment-345396575, seccomp is an alpha feature, but this isn't clearly documented anywhere (the annotation just has the word "alpha" in it, and that's your signal that it's alpha). Since seccomp was around before feature gates, it doesn't have one. Thus SeccompProfileRoot should not be part of KubeletConfiguration, and this PR moves it to KubeletFlags, and amends the help text to note the alpha state of the feature. fixes: #56087 ```release-note NONE ```	2017-11-20 13:08:12 -08:00
Kubernetes Submit Queue	ef3b27cbd4	Merge pull request #55642 from dashpole/disable_cadvisor_disk_for_cri Automatic merge from submit-queue (batch tested with PRs 55642, 55897, 55835, 55496, 55313). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Disable container disk metrics when using the CRI stats integration Issue: https://github.com/kubernetes/kubernetes/issues/51798 As explained in the issue, runtimes which make use of the CRI Stats API still have the performance overhead of collecting those same stats through cAdvisor. The CRI Stats API has metrics for CPU, Memory, and Disk. This PR significantly reduces the added overhead due to collecting these stats in both cAdvisor and in the runtime. This PR disables container disk metrics, which are very expensive to collect. This PR does not disable node-level disk stats, as the "Raw" container handler does not currently respect ignoring DiskUsageMetrics. This PR factors out the logic for determining whether or not to use the CRI stats provider into a helper function, as cAdvisor is instantiated before it is passed to the kubelet as a dependency. cc @kubernetes/sig-node-pr-reviews @derekwaynecarr /kind feature /sig node /assign @Random-Liu @derekwaynecarr	2017-11-18 10:46:30 -08:00
Michael Taufen	ca8cffef24	seccomp is an alpha feature and not feature gated Move SeccompProfileRoot to KubeletFlags and document flag as alpha	2017-11-17 17:57:53 -08:00
Kubernetes Submit Queue	0eb999c26a	Merge pull request #55562 from mtaufen/eject-non-gated-alpha-fields Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Move 'alpha' KubeletConfiguration fields that aren't feature-gated and self-registration fields to KubeletFlags Some of these fields are marked "alpha" in help text. They cannot be in the KubeletConfiguration object unless they are feature gated or graduated from alpha. Others relate to Kubelet self-registration, and given https://github.com/kubernetes/community/pull/911 I think its prudent to wait and see if these really should be in the KubeletConfiguration type. For now we just leave them all as flags. ```release-note NONE ```	2017-11-16 10:36:10 -08:00
Michael Taufen	523c68ff65	Move ungated 'alpha' KubeletConfiguration fields and self-registration fields to KubeletFlags	2017-11-15 17:47:10 -08:00
Zihong Zheng	0bc2e1f62f	Move DNS related kubelet codes into its own package	2017-11-15 10:56:44 -08:00
David Ashpole	220edbc6e3	disable container disk metrics when using the CRI stats integration	2017-11-14 11:43:08 -08:00
Cao Shufeng	86968e44d0	remove duplicated import	2017-11-14 17:18:17 +08:00
Kubernetes Submit Queue	41fe3ed5bc	Merge pull request #54405 from resouer/clean-docker-dep Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. [Part 1] Remove docker dep in kubelet startup What this PR does / why we need it: Remove dependency of docker during kubelet start up. Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): Part 1 of #54090 Special notes for your reviewer: Changes include: 1. Move docker client initialization into dockershim pkg. 2. Pass a docker `ClientConfig` from kubelet to dockershim 3. Pass parameters needed by `FakeDockerClient` thru `ClientConfig` to dockershim (TODO, the second part) Make dockershim tolerate when dockerd is down, otherwise it will still fail kubelet Please note after this PR, kubelet will still fail if dockerd is down, this will be fixed in the subsequent PR by making dockershim tolerate dockerd failure (initializing docker client in a separate goroutine), and refactoring cgroup and log driver detection. Release note: ```release-note Remove docker dependency during kubelet start up ```	2017-11-13 03:59:53 -08:00
Kubernetes Submit Queue	f14c0382e4	Merge pull request #54460 from yanxuean/cnibindir Automatic merge from submit-queue (batch tested with PRs 54460, 55258, 54858, 55506, 55510). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. redendancy code and error log message in cni What this PR does / why we need it: redendancy code and error log message in cni Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes # Special notes for your reviewer: Release note: ```release-note NONE ``` /sig-node	2017-11-11 10:45:16 -08:00
Zihong Zheng	5915b87f8a	Rearrange kubelet networking codes	2017-11-09 13:43:30 -08:00
Dr. Stefan Schimanski	012b085ac8	pkg/apis/core: mechanical import fixes in dependencies	2017-11-09 12:14:08 +01:00
Shawn Hsiao	5cba1f47c3	Support copying options in resolv.conf into pod sandbox when dnsPolicy is Default	2017-11-07 07:54:52 -05:00
Kubernetes Submit Queue	2084f7f4f3	Merge pull request #54488 from lichuqiang/plugin_base Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add admission handler for device resources allocation What this PR does / why we need it: Add admission handler for device resources allocation to fail fast during pod creation Which issue this PR fixes fixes #51592 Special notes for your reviewer: @jiayingz Sorry, there is something wrong with my branch in #51895. And I think the existing comments in the PR might be too long for others to view. So I closed it and opened the new one, as we have basically reach an agreement on the implement :) I have covered the functionality and unit test part here, and would set about the e2e part ASAP /cc @jiayingz @vishh @RenaudWasTaken Release note: ```release-note NONE ```	2017-11-02 17:24:06 -07:00
Kubernetes Submit Queue	3a15fdbe7e	Merge pull request #54643 from mtaufen/structure-manifest-url-header Automatic merge from submit-queue (batch tested with PRs 52367, 53363, 54989, 54872, 54643). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Lift embedded structure out of ManifestURLHeader field Related: #53833 ```release-note It is now possible to set multiple manifest url headers via the Kubelet's --manifest-url-header flag. Multiple headers for the same key will be added in the order provided. The ManifestURLHeader field in KubeletConfiguration object (kubeletconfig/v1alpha1) is now a map[string][]string, which facilitates writing JSON and YAML files. ```	2017-11-02 12:59:24 -07:00
lichuqiang	ebd445eb8c	add admission handler for device resources allocation	2017-11-02 09:17:48 +08:00
Harry Zhang	de1c305356	Remove docker dep in kubelet startup Update bazel	2017-11-01 10:03:01 +08:00
Kubernetes Submit Queue	94935721d5	Merge pull request #54160 from mtaufen/runtime-config-to-flags Automatic merge from submit-queue (batch tested with PRs 54160, 54016). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Move runtime-related flags from KubeletConfiguration to KubeletFlags With respect to https://github.com/kubernetes/kubernetes/pull/53833#issuecomment-336317287, move runtime-related flags out of KubeletConfiguration. Broader issue: https://github.com/kubernetes/features/issues/281 ```release-note NONE ```	2017-10-31 01:23:15 -07:00
Michael Taufen	7cb21746c0	Lift embedded structure out of ManifestURLHeader field	2017-10-30 15:37:55 -07:00
Kevin	4c8539cece	use core client with explicit version globally	2017-10-27 15:48:32 +08:00
yanxuean	dc0f3ce05c	remove redendancy code for cni Signed-off-by: yanxuean <yan.xuean@zte.com.cn>	2017-10-24 15:21:55 +08:00
Michael Taufen	f90b46c784	Move runtime-related flags from KubeletConfiguration to KubeletFlags	2017-10-23 11:15:48 -07:00
supereagle	0b88971505	kubelet: remove the --network-plugin-dir flag	2017-10-18 09:37:19 +08:00
Kubernetes Submit Queue	e3e2e24cc5	Merge pull request #52503 from joelsmith/journald-log-fallback Automatic merge from submit-queue (batch tested with PRs 54040, 52503). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Get fallback termination msg from docker when using journald log driver What this PR does / why we need it: When using the legacy docker container runtime and when a container has `terminationMessagePolicy=FallbackToLogsOnError` and when docker is configured with a log driver other than `json-log` (such as `journald`), the kubelet should not try to get the container's log from the json log file (since it's not there) but should instead ask docker for the logs. Which issue this PR fixes fixes #52502 Special notes for your reviewer: Release note: ```release-note Fixed log fallback termination messages when using docker with journald log driver ```	2017-10-17 13:18:15 -07:00
Kubernetes Submit Queue	0ba7c52b8c	Merge pull request #53458 from dims/fix-pkg-cmd-dependencies Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix pkg/ depends on cmd/ problems What this PR does / why we need it: Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes # Partial fix for https://github.com/kubernetes/kubernetes/issues/53341 Special notes for your reviewer: No logic changes, Just moving things around Release note: ```release-note NONE ```	2017-10-13 23:56:55 -07:00
Derek Carr	54224600ec	kubelet syncPod throws specific events	2017-10-13 10:24:09 -04:00
Davanum Srinivas	48433c8773	Remove cmd/kubelet dependency from pkg/kubelet	2017-10-13 07:06:21 -04:00
Michael Taufen	8180536bed	Mulligan: Remove deprecated and experimental fields from KubeletConfiguration Revert "Merge pull request #51857 from kubernetes/revert-51307-kc-type-refactor" This reverts commit `9d27d92420`, reversing changes made to `2e69d4e625`. See original: #51307 We punted this from 1.8 so it could go through an API review. The point of this PR is that we are trying to stabilize the kubeletconfig API so that we can move it out of alpha, and unblock features like Dynamic Kubelet Config, Kubelet loading its initial config from a file instead of flags, kubeadm and other install tools having a versioned API to rely on, etc. We shouldn't rev the version without both removing all the deprecated junk from the KubeletConfiguration struct, and without (at least temporarily) removing all of the fields that have "Experimental" in their names. It wouldn't make sense to lock in to deprecated fields. "Experimental" fields can be audited on a 1-by-1 basis after this PR, and if found to be stable (or sufficiently alpha-gated), can be restored to the KubeletConfiguration without the "Experimental" prefix.	2017-10-11 09:52:39 -07:00
Jacob Simpson	415c4d2c3a	Move certificate manager to client.	2017-10-05 12:54:38 -07:00
Kubernetes Submit Queue	93862282a4	Merge pull request #53233 from dashpole/kubelet_gc_faster Automatic merge from submit-queue (batch tested with PRs 53403, 53233). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Remove containers from deleted pods once containers have exited Issue #51899 Since container deletion is currently done through periodic garbage collection every 30 seconds, it takes a long time for pods to be deleted, and causes the kubelet to send all delete pod requests at the same time, which has performance issues. This PR makes the kubelet actively remove containers of deleted pods rather than wait for them to be removed in periodic garbage collection. /release-note-none	2017-10-03 17:21:15 -07:00
Di Xu	32199cb95b	don't recreate static pods when node gets deleted	2017-10-03 10:28:08 +08:00
David Ashpole	1eddab3313	remove containers of deleted pods once all containers have exited	2017-10-02 10:15:21 -07:00
Kubernetes Submit Queue	a0b7d467e2	Merge pull request #53094 from yguo0905/fix Automatic merge from submit-queue (batch tested with PRs 51021, 53225, 53094, 53219). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Change ImageGCManage to consume ImageFS stats from StatsProvider Fixes #53083. Release note: ``` Change ImageGCManage to consume ImageFS stats from StatsProvider ``` /assign @Random-Liu	2017-09-29 12:38:22 -07:00
Kubernetes Submit Queue	69b2e73d5f	Merge pull request #44596 from yanxuean/bugfix Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Caller of HandlePodSyncs should be handler in kubelet syncLoopIteration	2017-09-28 21:15:13 -07:00
Yang Guo	f6c36474f2	Change ImageGCManage to consume ImageFS stats from StatsProvider	2017-09-28 10:27:22 -07:00
Zihong Zheng	69b5e0ab67	Revert "Make kubelet touch iptables lock file during initialization"	2017-09-27 13:34:43 -07:00
Joel Smith	d53d29faf7	Get fallback termination msg from docker when using journald log driver When using the legacy docker container runtime and when a container has terminationMessagePolicy=FallbackToLogsOnError and when docker is configured with a log driver other than json-log (such as journald), the kubelet should not try to get the container's log from the json log file (since it's not there) but should instead ask docker for the logs.	2017-09-26 07:14:15 -06:00
Yu-Ju Hong	3837a016ef	kubelet: remove the --docker-exec-handler flag Stop supporting the "nsenter" exec handler. Only the Docker native exec handler is supported. The flag was deprecated in Kubernetes 1.6 and is safe to remove in Kubernetes 1.9 according to the deprecation policy.	2017-09-22 12:13:31 -07:00
Kubernetes Submit Queue	3277de69b4	Merge pull request #52176 from liggitt/heartbeat-timeout Automatic merge from submit-queue (batch tested with PRs 52176, 43152). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.. Eliminate hangs/throttling of node heartbeat Fixes https://github.com/kubernetes/kubernetes/issues/48638 Fixes #50304 Stops kubelet from wedging when updating node status if unable to establish tcp connection. Notes that this only affects the node status loop. The pod sync loop would still hang until the dead TCP connections timed out, so more work is needed to keep the sync loop responsive in the face of network issues, but this change lets existing pods coast without the node controller trying to evict them ```release-note kubelet to master communication when doing node status updates now has a timeout to prevent indefinite hangs ```	2017-09-16 09:45:29 -07:00
Jordan Liggitt	f8f57d8959	Use separate client for node status loop	2017-09-14 15:56:22 -04:00
Yu-Ju Hong	2c415cc506	kubelet: enable CRI container metrics	2017-09-13 15:09:35 -07:00
yanxuean	799d0e5a6e	correct to handler	2017-09-12 13:47:08 +08:00
Kubernetes Submit Queue	e8d99f5839	Merge pull request #51645 from jingxu97/Aug/nameserver Automatic merge from submit-queue (batch tested with PRs 51186, 50350, 51751, 51645, 51837) Set up DNS server in containerized mounter path During NFS/GlusterFS mount, it requires to have DNS server to be able to resolve service name. This PR gets the DNS server ip from kubelet and add it to the containerized mounter path. So if containerized mounter is used, service name could be resolved during mount Release note: ```release-note Allow DNS resolution of service name for COS using containerized mounter. It fixed the issue with DNS resolution of NFS and Gluster services. ```	2017-09-05 17:30:09 -07:00
Kubernetes Submit Queue	78c820803c	Merge pull request #50350 from dashpole/eviction_container_deletion Automatic merge from submit-queue (batch tested with PRs 51186, 50350, 51751, 51645, 51837) Wait for container cleanup before deletion We should wait to delete pod API objects until the pod's containers have been cleaned up. See issue: #50268 for background. This changes the kubelet container gc, which deletes containers belonging to pods considered "deleted". It adds two conditions under which a pod is considered "deleted", allowing containers to be deleted: Pods where deletionTimestamp is set, and containers are not running Pods that are evicted This PR also changes the function PodResourcesAreReclaimed by making it return false if containers still exist. The eviction manager will wait for containers of previous evicted pod to be deleted before evicting another pod. The status manager will wait for containers to be deleted before removing the pod API object. /assign @vishh	2017-09-05 17:30:03 -07:00
Jing Xu	3d4bc931d3	Set up DNS server in containerized mounter path During NFS/GlusterFS mount, it requires to have DNS server to be able to resolve service name. This PR gets the DNS server ip from kubelet and add it to the containerized mounter path. So if containerized mounter is used, service name could be resolved during mount	2017-09-05 11:40:23 -07:00
David Ashpole	9ac30e2c28	wait for container cleanup before deletion	2017-09-04 17:38:09 -07:00
Connor Doyle	ec706216e6	Un-revert "CPU manager wiring and `none` policy" This reverts commit `8d2832021a`.	2017-09-04 07:24:59 -07:00
Jan Safranek	d9500105d8	Share /var/lib/kubernetes on startup Kubelet makes sure that /var/lib/kubelet is rshared when it starts. If not, it bind-mounts it with rshared propagation to containers that mount volumes to /var/lib/kubelet can benefit from mount propagation.	2017-08-30 16:45:04 +02:00
Kubernetes Submit Queue	7c70decd27	Merge pull request #51312 from andrewsykim/50986 Automatic merge from submit-queue (batch tested with PRs 50932, 49610, 51312, 51415, 50705) Deprecation warnings for auto detecting cloud providers What this PR does / why we need it: Adds deprecation warnings for auto detecting cloud providers. As part of the initiative for out-of-tree cloud providers, this feature is conflicting since we're shifting the dependency of kubernetes core into cAdvisor. In the future kubelets should be using `--cloud-provider=external` or no cloud provider at all. Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes #50986 Special notes for your reviewer: NOTE: I still have to coordinate with sig-node and kubernetes-dev to get approval for this deprecation, I'm only opening this PR since we're close to code freeze and it's something presentable. Release note: ```release-note Deprecate auto detecting cloud providers in kubelet. Auto detecting cloud providers go against the initiative for out-of-tree cloud providers as we'll now depend on cAdvisor integrations with cloud providers instead of the core repo. In the near future, `--cloud-provider` for kubelet will either be an empty string or `external`. ```	2017-08-29 01:17:37 -07:00
Kubernetes Submit Queue	c27cdb11a9	Merge pull request #50932 from yguo0905/stats-cadvisor Automatic merge from submit-queue (batch tested with PRs 50932, 49610, 51312, 51415, 50705) Implement StatsProvider interface using cadvisor Ref: https://github.com/kubernetes/kubernetes/issues/46984 - This PR changes the `StatsProvider` interface in `pkg/kubelet/server/stats` so that it can provide container stats from either cadvisor or CRI, and the summary API can consume the stats without knowing how they are provided. - The `StatsProvider` struct in the newly added package `pkg/kubelet/stats` implements part of the `StatsProvider` interface in `pkg/kubelet/server/stats`. - In `pkg/kubelet/stats`, - `stats_provider.go`: implements the node level stats and provides the entry point for this package. - `cadvisor_stats_provider.go`: implements the container level stats using cadvisor. - `cri_stats_provider.go`: implements the container level stats using CRI. - `helper.go`: utility functions shared by the above three components. - There should be no user visible behaviors change in this PR. - A follow up PR will implement the StatsProvider interface using CRI. Release note: ``` None ``` /assign @yujuhong /assign @WIZARD-CXY	2017-08-29 01:17:29 -07:00
andrewsykim	fd86022714	add deprecation warnings for auto detecting cloud providers	2017-08-25 19:30:52 -04:00
Yang Guo	f9767d2f71	Change StatsProvider interface to provide container stats from either cadvisor or CRI and implement this interface using cadvisor	2017-08-25 13:11:26 -07:00
Cheng Xing	396c3c7c6f	Adding dynamic Flexvolume plugin discovery capability, using filesystem watch.	2017-08-25 11:42:32 -07:00
Michael Taufen	24bab4c20f	move KubeletConfiguration out of componentconfig API group	2017-08-15 08:12:42 -07:00
Pengfei Ni	c242432a3b	Rename runtime/default to docker default	2017-08-13 15:42:15 +08:00
Pengfei Ni	f3150c9c8c	Support seccomp profile from container's security context	2017-08-13 15:42:15 +08:00
Michael Taufen	443d58e40a	Dynamic Kubelet Configuration Alpha implementation of the Dynamic Kubelet Configuration feature. See the proposal doc in #29459.	2017-08-08 12:21:37 -07:00
Kubernetes Submit Queue	b20beaa98a	Merge pull request #49724 from sjenning/skip-sync-mount-terminated-pods Automatic merge from submit-queue (batch tested with PRs 49284, 49555, 47639, 49526, 49724) skip WaitForAttachAndMount for terminated pods in syncPod Fixes https://github.com/kubernetes/kubernetes/issues/49663 I tried to tread lightly with a small localized change because this needs to be picked to 1.7 and 1.6 as well. I suspect this has been as issue since we started unmounting volumes on pod termination https://github.com/kubernetes/kubernetes/pull/37228 xref openshift/origin#14383 @derekwaynecarr @eparis @smarterclayton @saad-ali @jwforres /release-note-none	2017-08-01 01:42:02 -07:00
Kubernetes Submit Queue	72c6251508	Merge pull request #47019 from jessfraz/allowPrivilegeEscalation Automatic merge from submit-queue (batch tested with PRs 49651, 49707, 49662, 47019, 49747) Add support for `no_new_privs` via AllowPrivilegeEscalation What this PR does / why we need it: Implements kubernetes/community#639 Fixes #38417 Adds `AllowPrivilegeEscalation` and `DefaultAllowPrivilegeEscalation` to `PodSecurityPolicy`. Adds `AllowPrivilegeEscalation` to container `SecurityContext`. Adds the proposed behavior to `kuberuntime`, `dockershim`, and `rkt`. Adds a bunch of unit tests to ensure the desired default behavior and that when `DefaultAllowPrivilegeEscalation` is explicitly set. Tests pass locally with docker and rkt runtimes. There are also a few integration tests with a `setuid` binary for sanity. Release note: ```release-note Adds AllowPrivilegeEscalation to control whether a process can gain more privileges than it's parent process ```	2017-07-31 16:56:58 -07:00
Seth Jennings	265db191f1	skip WaitForAttachAndMount for terminated pods in syncPod	2017-07-27 11:25:58 -05:00
Kubernetes Submit Queue	2189314895	Merge pull request #40050 from mtaufen/standalone-mode Automatic merge from submit-queue (batch tested with PRs 48976, 49474, 40050, 49426, 49430) Use presence of kubeconfig file to toggle standalone mode Fixes #40049 ```release-note The deprecated --api-servers flag has been removed. Use --kubeconfig to provide API server connection information instead. The --require-kubeconfig flag is now deprecated. The default kubeconfig path is also deprecated. Both --require-kubeconfig and the default kubeconfig path will be removed in Kubernetes v1.10.0. ``` /cc @kubernetes/sig-cluster-lifecycle-misc @kubernetes/sig-node-misc	2017-07-25 12:14:43 -07:00
Kubernetes Submit Queue	9350afd772	Merge pull request #48976 from supereagle/cleanup-api-package Automatic merge from submit-queue (batch tested with PRs 48976, 49474, 40050, 49426, 49430) Remove duplicated import and wrong alias name of api package What this PR does / why we need it: Which issue this PR fixes: fixes #48975 Special notes for your reviewer: /assign @caesarxuchao Release note: ```release-note NONE ```	2017-07-25 12:14:38 -07:00
Kubernetes Submit Queue	7f1d9382ec	Merge pull request #48846 from dashpole/remove_ood Automatic merge from submit-queue Remove flags low-diskspace-threshold-mb and outofdisk-transition-frequency issue: #48843 This removes two flags replaced by the eviction manager. These have been depreciated for two releases, which I believe correctly follows the kubernetes depreciation guidelines. ```release-note Remove depreciated flags: --low-diskspace-threshold-mb and --outofdisk-transition-frequency, which are replaced by --eviction-hard ``` cc @mtaufen since I am changing kubelet flags cc @vishh @derekwaynecarr /sig node	2017-07-24 23:05:50 -07:00
Kubernetes Submit Queue	e623fed778	Merge pull request #48636 from jingxu97/July/allocatable Automatic merge from submit-queue (batch tested with PRs 48636, 49088, 49251, 49417, 49494) Fix issues for local storage allocatable feature This PR fixes the following issues: 1. Use ResourceStorageScratch instead of ResourceStorage API to represent local storage capacity 2. In eviction manager, use container manager instead of node provider (kubelet) to retrieve the node capacity and reserved resources. Node provider (kubelet) has a feature gate so that storagescratch information may not be exposed if feature gate is not set. On the other hand, container manager has all the capacity and allocatable resource information. This PR fixes issue #47809	2017-07-24 19:30:33 -07:00
supereagle	adc0eef43e	remove duplicated import and wrong alias name of api package	2017-07-25 10:04:25 +08:00
Michael Taufen	38aee0464d	Providing kubeconfig file is now the switch for standalone mode Replaces use of --api-servers with --kubeconfig in Kubelet args across the turnup scripts. In many cases this involves generating a kubeconfig file for the Kubelet and placing it in the correct location on the node.	2017-07-24 11:03:00 -07:00
Jess Frazelle	e1493c9c88	allowPrivilegeEscalation: apply to correct docker api versions Signed-off-by: Jess Frazelle <acidburn@google.com>	2017-07-24 12:52:43 -04:00
David Ashpole	7a23f8b018	remove deprecated flags LowDiskSpaceThresholdMB and OutOfDiskTransitionFrequency	2017-07-20 13:23:13 -07:00
ymqytw	3dfc8bf7f3	update import	2017-07-20 11:03:49 -07:00
Kubernetes Submit Queue	c0287ce420	Merge pull request #47316 from k82cn/k8s_47315 Automatic merge from submit-queue (batch tested with PRs 48981, 47316, 49180) Added golint check for pkg/kubelet. What this PR does / why we need it: Added golint check for pkg/kubelet, and make golint happy. Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes #47315 Release note: ```release-note-none ```	2017-07-19 11:21:25 -07:00
Klaus Ma	63b78a37e0	Added golint check for pkg/kubelet.	2017-07-19 11:33:06 +08:00
Mikhail Mazurskiy	d789615902	Shared Informer Run blocks until all goroutines finish Fixes #45454	2017-07-18 14:05:08 +10:00
Jacob Simpson	29c1b81d4c	Scripted migration from clientset_generated to client-go.	2017-07-17 15:05:37 -07:00
Jing Xu	bb1920edcc	Fix issues for local storage allocatable feature This PR fixes the following issues: 1. Use ResourceStorageScratch instead of ResourceStorage API to represent local storage capacity 2. In eviction manager, use container manager instead of node provider (kubelet) to retrieve the node capacity and reserved resources. Node provider (kubelet) has a feature gate so that storagescratch information may not be exposed if feature gate is not set. On the other hand, container manager has all the capacity and allocatable resource information.	2017-07-13 12:06:19 -07:00
Clayton Coleman	b8e662fcea	Move the kubelet certificate management code into a single package Code is very similar and belongs together.	2017-07-05 18:11:49 -04:00
Dan Williams	5b8ad3f7c5	kubelet: remove unused bandwidth shaping teardown code Since v1.5 and the removal of --configure-cbr0: `0800df74ab` "Remove the legacy networking mode --configure-cbr0" kubelet hasn't done any shaping operations internally. They have all been delegated to network plugins like kubenet or external CNI plugins. But some shaping code was still left in kubelet, so remove it now that it's unused.	2017-06-30 11:51:22 -05:00
Vishnu kannan	82f7820066	Kubelet: Centralize Capacity discovery of standard resources in Container manager. Have storage derive node capacity from container manager. Move certain cAdvisor interfaces to the cAdvisor package in the process. This patch fixes a bug in container manager where it was writing to a map without synchronization. Signed-off-by: Vishnu kannan <vishnuk@google.com>	2017-06-27 18:45:02 -07:00
Kubernetes Submit Queue	d95a8bf66b	Merge pull request #47783 from NickrenREN/containerruntime Automatic merge from submit-queue (batch tested with PRs 47694, 47772, 47783, 47803, 47673) Make different container runtimes constant Make different container runtimes constant to avoid hardcode Release note: ```release-note NONE ```	2017-06-23 08:29:28 -07:00
Kubernetes Submit Queue	dd126ae19c	Merge pull request #38431 from NickrenREN/newVolumeMgr-return Automatic merge from submit-queue Modify NewVolumeManager() function return value	2017-06-22 16:43:29 -07:00
Chao Xu	60604f8818	run hack/update-all	2017-06-22 11:31:03 -07:00
Chao Xu	f2d3220a11	run root-rewrite-import-client-go-api-types	2017-06-22 11:30:59 -07:00
Chao Xu	cde4772928	run ./root-rewrite-all-other-apis.sh, then run make all, pkg/... compiles	2017-06-22 11:30:52 -07:00
Chao Xu	f4989a45a5	run root-rewrite-v1-..., compile	2017-06-22 10:25:57 -07:00
NickrenREN	6de7e3f3dc	Make different container runtimes constant	2017-06-20 19:58:39 +08:00
NickrenREN	312cd1bbe6	Modify NewVolumeManager() function return value Since function NewVolumeManager() will always return vm and nil, we do not need the second return value, it will always be nil.	2017-06-17 23:33:12 +08:00
Jacob Simpson	334de1cbe1	Auto approve kubelet certificate signing requests.	2017-06-16 08:47:12 -07:00
Kubernetes Submit Queue	17244ea5d9	Merge pull request #47124 from andyxning/remove_sync_loop_health_check Automatic merge from submit-queue (batch tested with PRs 47000, 47188, 47094, 47323, 47124) fix sync loop health check This PR will do error logging about the fall behind sync for kubelet instead of sync loop healthz checking. The reason is kubelet can not do sync loop and therefore can not update sync loop time when there is any runtime error, such as docker hung. When there is any runtime error, according to current implementation, kubelet will not do sync operation and thus kubelet's sync loop time will not be updated. This will make when there is any runtime error, kubelet will also return non 200 response status code when accessing healthz endpoint. This is contrary with #37865 which prevents kubelet from being killed when docker hangs. Release note: ```release-note fix sync loop health check with seperating runtime errors ``` /cc @yujuhong @Random-Liu @dchen1107	2017-06-12 18:19:51 -07:00
Andy Xie	96cb43993a	fix sync loop health check	2017-06-10 11:25:59 +08:00
Zihong Zheng	d5c9d27ed7	Make kubelet touch iptables lock file during initialization	2017-06-09 09:34:48 -07:00
David Ashpole	889afa5e2d	trigger aggressive container garbage collection when under disk pressure	2017-06-03 07:52:36 -07:00
Jing Xu	dd67e96c01	Add local storage (scratch space) allocatable support This PR adds the support for allocatable local storage (scratch space). This feature is only for root file system which is shared by kubernetes componenets, users' containers and/or images. User could use --kube-reserved flag to reserve the storage for kube system components. If the allocatable storage for user's pods is used up, some pods will be evicted to free the storage resource.	2017-06-01 15:57:50 -07:00
Shyam Jeedigunta	1cf6b339f6	Use TTL-based caching configmap manager in kubelet	2017-05-31 10:39:40 +02:00
Shyam Jeedigunta	4425864707	Migrate kubelet configmap management logic to an interface	2017-05-31 10:39:36 +02:00
Kubernetes Submit Queue	f2074ba8de	Merge pull request #45059 from jcbsmpsn/rotate-server-certificate Automatic merge from submit-queue (batch tested with PRs 46635, 45619, 46637, 45059, 46415) Certificate rotation for kubelet server certs. Replaces the current kubelet server side self signed certs with certs signed by the Certificate Request Signing API on the API server. Also renews expiring kubelet server certs as expiration approaches. Two Points: 1. With `--feature-gates=RotateKubeletServerCertificate=true` set, the kubelet will request a certificate during the boot cycle and pause waiting for the request to be satisfied. 2. In order to have the kubelet's certificate signing request auto approved, `--insecure-experimental-approve-all-kubelet-csrs-for-group=` must be set on the cluster controller manager. There is an improved mechanism for auto approval [proposed](https://github.com/kubernetes/kubernetes/issues/45030). Release note: ```release-note With `--feature-gates=RotateKubeletServerCertificate=true` set, the kubelet will request a server certificate from the API server during the boot cycle and pause waiting for the request to be satisfied. It will continually refresh the certificate as the certificates expiration approaches. ```	2017-05-30 19:49:02 -07:00
Yu-Ju Hong	c82350214e	Group container-runtime-specific flags/options together Do not store them in kubelet's configuration. Eventually, we would like to deprecate all these flags as they should not be part of kubelet.	2017-05-30 08:10:39 -07:00
Jacob Simpson	4c22e6bc6a	Certificate rotation for kubelet server certs. Replaces the current kubelet server side self signed certs with certs signed by the Certificate Request Signing API on the API server. Also renews expiring kubelet server certs as expiration approaches.	2017-05-29 12:28:01 -07:00
Kubernetes Submit Queue	5e853709a7	Merge pull request #46089 from karataliu/wincri1 Automatic merge from submit-queue (batch tested with PRs 46124, 46434, 46089, 45589, 46045) Support TCP type runtime endpoint for kubelet What this PR does / why we need it: Currently the grpc server for kubelet and dockershim has a hardcoded endpoint: unix socket '/var/run/dockershim.sock', which is not applicable on non-unix OS. This PR is to support TCP endpoint type besides unix socket. Which issue this PR fixes This is a first attempt to address issue https://github.com/kubernetes/kubernetes/issues/45927 Special notes for your reviewer: Before this change, running on Windows node results in: ``` Container Manager is unsupported in this build ``` After adding the cm stub, error becomes: ``` listen unix /var/run/dockershim.sock: socket: An address incompatible with the requested protocol was used. ``` This PR is to fix those two issues. After this change, still meets 'seccomp' related issue when running on Windows node, needs more updates later. Release note:	2017-05-25 21:40:02 -07:00
Dong Liu	fb26c9100a	Support TCP type runtime endpoint for kubelet.	2017-05-25 09:16:11 +08:00
Kubernetes Submit Queue	90250220a9	Merge pull request #44428 from qiujian16/commenttypo Automatic merge from submit-queue Fix some typo of comment in kubelet.go What this PR does / why we need it: The PR is to fix some typo in kubelet.go Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes # N/A Special notes for your reviewer: Release note: ```release-note ```	2017-05-23 18:45:34 -07:00
Kubernetes Submit Queue	99a8f7c303	Merge pull request #43590 from dashpole/eviction_complete_deletion Automatic merge from submit-queue (batch tested with PRs 46022, 46055, 45308, 46209, 43590) Eviction does not evict unless the previous pod has been cleaned up Addresses #43166 This PR makes two main changes: First, it makes the eviction loop re-trigger immediately if there may still be pressure. This way, if we already waited 10 seconds to delete a pod, we dont need to wait another 10 seconds for the next synchronize call. Second, it waits for the pod to be cleaned up (including volumes, cgroups, etc), before moving on to the next synchronize call. It has a timeout for this operation currently set to 30 seconds.	2017-05-22 20:00:03 -07:00
Clayton Coleman	3e095d12b4	Refactor move of client-go/util/clock to apimachinery	2017-05-20 14:19:48 -04:00
David Ashpole	21fb487245	wait for previous evicted pod to be cleaned up	2017-05-16 14:23:42 -07:00
Xing Zhou	a2e68e96cb	Fix typo. Fixed typo.	2017-05-15 14:01:30 +08:00
Kubernetes Submit Queue	3619c33350	Merge pull request #42759 from mtaufen/kubelet-apis-reorg Automatic merge from submit-queue Reorganize kubelet tree so apis can be independently versioned @yujuhong @lavalamp @thockin @bgrant0607 This is an example of how we might reorganize `pkg/kubelet` so the apis it exposes can be independently versioned. This would also provide a logical place to put the `KubeletConfiguration` type, which currently lives in `pkg/apis/componentconfig`; it could live in e.g. `pkg/kubelet/apis/config` instead. Take a look when you have a chance and let me know what you think. The most significant change in this PR is reorganizing `pkg/kubelet/api` to `pkg/kubelet/apis`, the rest is pretty much updating import paths and `BUILD` files.	2017-05-12 17:43:22 -07:00
Kubernetes Submit Queue	9c8287d629	Merge pull request #45624 from dashpole/kubelet_cleanup Automatic merge from submit-queue (batch tested with PRs 45685, 45572, 45624, 45723, 45733) Remove unused fields from Kubelet struct Just a small attempt to clean up some unused fields in the kubelet struct. This doesn't make any actual code changes. /assign @mtaufen	2017-05-12 14:00:57 -07:00
Michael Taufen	cbad320205	Reorganize kubelet tree so apis can be independently versioned	2017-05-12 10:02:33 -07:00
David Ashpole	b69dacbd86	remove unused fields from Kubelet struct	2017-05-10 16:25:09 -07:00
Yu-Ju Hong	daa329c9ae	Remove the deprecated `--enable-cri` flag Except for rkt, CRI is the default and only integration point for container runtimes.	2017-05-10 13:03:41 -07:00
Kubernetes Submit Queue	77b2e6302c	Merge pull request #45236 from verb/sharedpid-2-default Automatic merge from submit-queue Enable shared PID namespace by default for docker pods What this PR does / why we need it: This PR enables PID namespace sharing for docker pods by default, bringing the behavior of docker in line with the other CRI runtimes when used with docker >= 1.13.1. Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): ref #1615 Special notes for your reviewer: cc @dchen1107 @yujuhong Release note: ```release-note Kubernetes now shares a single PID namespace among all containers in a pod when running with docker >= 1.13.1. This means processes can now signal processes in other containers in a pod, but it also means that the `kubectl exec {pod} kill 1` pattern will cause the pod to be restarted rather than a single container. ```	2017-05-10 12:06:01 -07:00
Kubernetes Submit Queue	51a3413371	Merge pull request #45307 from yujuhong/mv-docker-client Automatic merge from submit-queue (batch tested with PRs 45453, 45307, 44987) Migrate the docker client code from dockertools to dockershim Move docker client code from dockertools to dockershim/libdocker. This includes DockerInterface (renamed to Interface), FakeDockerClient, etc. This is part of #43234	2017-05-09 20:23:44 -07:00
Kubernetes Submit Queue	a062782524	Merge pull request #44258 from wlan0/master Automatic merge from submit-queue (batch tested with PRs 45508, 44258, 44126, 45441, 45320) cloud initialize node in external cloud controller @thockin This PR adds support in the `cloud-controller-manager` to initialize nodes (instead of kubelet, which did it previously) This also adds support in the kubelet to skip node cloud initialization when `--cloud-provider=external` Specifically, Kubelet 1. The kubelet has a new flag called `--provider-id` which uniquely identifies a node in an external DB 2. The kubelet sets a node taint - called "ExternalCloudProvider=true:NoSchedule" if cloudprovider == "external" Cloud-Controller-Manager 1. The cloud-controller-manager listens on "AddNode" events, and then processes nodes that starts with that above taint. It performs the cloud node initialization steps that were previously being done by the kubelet. 2. On addition of node, it figures out the zone, region, instance-type, removes the above taint and updates the node. 3. Then periodically queries the cloudprovider for node addresses (which was previously done by the kubelet) and updates the node if there are new addresses ```release-note NONE ```	2017-05-08 16:34:43 -07:00
Kubernetes Submit Queue	f4fc4be805	Merge pull request #44727 from x1957/master Automatic merge from submit-queue adds log when gpuManager.start() failed If gpuManager.start() returns error, there is no log. We confused with scheduler do not schedule any pod(with gpu) to one node. kubectl describe node xxx shows there is no gpu on that node, because the gpu driver do not work on that node, gpuManager.start() failed, but we can not see anything in log.	2017-05-08 14:27:48 -07:00
wlan0	45d2bc06b7	cloud initialize node in external cloud controller	2017-05-05 16:51:45 -07:00
Yu-Ju Hong	389c140eaf	Move docker client code from dockertools to dockershim/dockerlib The code affected include DockerInterface (renamed to Interface), FakeDockerClient, etc.	2017-05-05 11:48:08 -07:00
Kubernetes Submit Queue	f6ec7bade1	Merge pull request #45316 from yujuhong/dockershim-plugin-settings Automatic merge from submit-queue (batch tested with PRs 45316, 45341) Pass NoOpLegacyHost to dockershim in --experimental-dockershim mode This allows dockershim to use network plugins, if needed. /cc @Random-Liu	2017-05-04 05:19:49 -07:00
Yu-Ju Hong	40b0474956	pass noopnetworkhost to dockershim	2017-05-03 16:32:01 -07:00
Yu-Ju Hong	78b2c3b4c2	kuberuntime: remove the unused network plugin Network plugin is completely handled by the container runtimes. Remove this unused field in the kuberuntime manager.	2017-05-03 16:21:46 -07:00
Lee Verberne	b668371a63	Enable shared PID namespace by default for docker	2017-05-03 17:12:08 +00:00
Jian Qiu	b0a415e453	Fix some typo of comment in kubelet.go	2017-05-03 10:40:28 +08:00
Yu-Ju Hong	93ecaf6812	Move exec.go from dockertools to dockershim	2017-05-01 16:00:46 -07:00
Yu-Ju Hong	9f3184c5a4	Remove DockerManager from kubelet This commit deletes code in dockertools that is only used by DockerManager. A follow-up change will rename and clean up the rest of the files in this package. The commit also sets EnableCRI to true if the container runtime is not rkt. A follow-up change will remove the flag/field and all references to it.	2017-05-01 12:14:50 -07:00
Lee Verberne	d22dd0fa35	Implement shared PID namespace in the dockershim	2017-04-27 23:43:53 +00:00
x1957	3db1127e72	adds log when gpuManager.start() failed	2017-04-20 23:09:25 +08:00
Klaus Ma	6d29cfc0cc	Registered node before other initialization.	2017-04-18 10:43:56 +08:00
Yu-Ju Hong	1d3d12dfc2	Don't check runtime condition for rktnetes rktnetes is not a CRI implementation, and does not provide runtime conditions. This change fixes the issue where rkt will never be considered running from kubelet's point of view.	2017-04-17 11:33:58 -07:00
Andy Goldstein	00e11566f2	Make the dockershim root directory configurable Make the dockershim root directory configurable so things like integration tests (e.g. in OpenShift) can run as non-root.	2017-04-12 09:06:21 -04:00
Andy Goldstein	010b71a5f7	kubelet: make dockershim.sock configurable Make the location of dockershim.sock configurable, so downstream projects (such as OpenShift) can place it in a location that does not require root access (e.g. for integration tests). Make the kubelet respect and use the values of --container-runtime-endpoint and --image-service-endpoint, if set. If unset, the default value of /var/run/dockershim.sock is used.	2017-04-06 12:01:21 -04:00
Kubernetes Submit Queue	faf2eca226	Merge pull request #42916 from dashpole/misleading_log Automatic merge from submit-queue Clearer ImageGC failure errors. Fewer events. Addresses #26000. Kubelet often "fails" image garbage collection if cAdvisor has not completed the first round of stats collection. Don't create events for a single failure, and make log messages more specific. @kubernetes/sig-node-bugs	2017-04-04 11:23:32 -07:00
Michael Taufen	f5eed7e91d	Add a separate flags struct for Kubelet flags Kubelet flags are not necessarily appropriate for the KubeletConfiguration object. For example, this PR also removes HostnameOverride and NodeIP from KubeletConfiguration. This is a preleminary step to enabling Nodes to share configurations, as part of the dynamic Kubelet configuration feature (#29459). Fields that must be unique for each node inhibit sharing, because their values, by definition, cannot be shared.	2017-04-03 13:28:29 -07:00
David Ashpole	2cd65ea863	only create event for multiple imagegc failures	2017-03-30 16:19:18 -07:00
NickrenREN	2f89a6bda6	optimize getPullSecretsForPod() and syncPod() Since getPullSecretsForPod() will never return err,we do not need the second return value,and modify syncPod() function.	2017-03-25 11:05:13 +08:00
Vishnu kannan	ff158090b3	use active pods instead of runtime pods in gpu manager Signed-off-by: Vishnu kannan <vishnuk@google.com>	2017-03-13 10:58:26 -07:00
Andy Goldstein	b011529d8a	Add pprof trace support Add pprof trace support and --enable-contention-profiling to those components that don't already have it.	2017-03-07 10:10:42 -05:00
Kubernetes Submit Queue	4bbf98850f	Merge pull request #42500 from vishh/fix-gpu-init Automatic merge from submit-queue [Bug] Fix gpu initialization in Kubelet Kubelet incorrectly fails if `AllAlpha=true` feature gate is enabled with container runtimes that are not `docker`. Replaces #42407	2017-03-04 20:28:08 -08:00
Vishnu kannan	038585626d	fix gpu initialization Signed-off-by: Vishnu kannan <vishnuk@google.com>	2017-03-03 12:13:01 -08:00
David Ashpole	ac612eab8e	eviction manager changes for allocatable	2017-03-02 07:36:24 -08:00
Kubernetes Submit Queue	fa0387c9fe	Merge pull request #42195 from Random-Liu/cri-support-non-json-logging Automatic merge from submit-queue (batch tested with PRs 41931, 39821, 41841, 42197, 42195) Use `docker logs` directly if the docker logging driver is not `json-file` Fixes https://github.com/kubernetes/kubernetes/issues/41996. Post the PR first, I still need to manually test this, because we don't have test coverage for journald logging pluggin. @yujuhong @dchen1107 /cc @kubernetes/sig-node-pr-reviews	2017-03-01 20:08:08 -08:00
Random-Liu	7c261bfed7	Use `docker logs` directly if the docker logging driver is not supported.	2017-03-01 10:50:11 -08:00
vefimova	fc8a37ec86	Added ability for Docker containers to set usage of dns settings along with hostNetwork is true Introduced chages: 1. Re-writing of the resolv.conf file generated by docker. Cluster dns settings aren't passed anymore to docker api in all cases, not only for pods with host network: the resolver conf will be overwritten after infra-container creation to override docker's behaviour. 2. Added new one dnsPolicy - 'ClusterFirstWithHostNet', so now there are: - ClusterFirstWithHostNet - use dns settings in all cases, i.e. with hostNet=true as well - ClusterFirst - use dns settings unless hostNetwork is true - Default Fixes #17406	2017-03-01 17:10:00 +00:00
Kubernetes Submit Queue	ed479163fa	Merge pull request #42116 from vishh/gpu-experimental-support Automatic merge from submit-queue Extend experimental support to multiple Nvidia GPUs Extended from #28216 ```release-note `--experimental-nvidia-gpus` flag is replaced by `Accelerators` alpha feature gate along with support for multiple Nvidia GPUs. To use GPUs, pass `Accelerators=true` as part of `--feature-gates` flag. Works only with Docker runtime. ``` 1. Automated testing for this PR is not possible since creation of clusters with GPUs isn't supported yet in GCP. 1. To test this PR locally, use the node e2e. ```shell TEST_ARGS='--feature-gates=DynamicKubeletConfig=true' FOCUS=GPU SKIP="" make test-e2e-node ``` TODO: - [x] Run manual tests - [x] Add node e2e - [x] Add unit tests for GPU manager (< 100% coverage) - [ ] Add unit tests in kubelet package	2017-03-01 04:52:50 -08:00
Vishnu kannan	2554b95994	Map nvidia devices one to one. Signed-off-by: Vishnu kannan <vishnuk@google.com>	2017-02-28 13:42:08 -08:00
Vishnu kannan	69acb02394	use feature gate instead of flag to control support for GPUs Signed-off-by: Vishnu kannan <vishnuk@google.com>	2017-02-28 13:42:07 -08:00
Vishnu kannan	3b0a408e3b	improve gpu integration Signed-off-by: Vishnu kannan <vishnuk@google.com>	2017-02-28 11:27:53 -08:00
Hui-Zhi	57c77ffbdd	Add support for multiple nvidia gpus	2017-02-28 11:24:48 -08:00
Seth Jennings	b9adb66426	kubelet: cm: refactor QoS logic into seperate interface	2017-02-28 09:19:29 -06:00
Vishnu Kannan	cc5f5474d5	add support for node allocatable phase 2 to kubelet Signed-off-by: Vishnu Kannan <vishnuk@google.com>	2017-02-27 21:24:44 -08:00
Kubernetes Submit Queue	16f87fe7d8	Merge pull request #40952 from dashpole/premption Automatic merge from submit-queue (batch tested with PRs 41994, 41969, 41997, 40952, 40576) Guaranteed admission for Critical Pods This is the first step in implementing node-level preemption for critical pods. It defines the AdmissionFailureHandler interface, which allows callers, like the kubelet, to define how failed predicates are handled, and take steps to correct failures if necessary. In the kubelet's implementation, it triggers preemption if the pod being admitted is critical, and if the only failed predicates are InsufficientResourceErrors, then it prempts (not yet implemented) other other pods to allow admission of the critical pod. cc: @vishh	2017-02-26 12:57:59 -08:00
Kubernetes Submit Queue	067f92e789	Merge pull request #41801 from riverzhang/patch-1 Automatic merge from submit-queue (batch tested with PRs 41854, 41801, 40088, 41590, 41911) Fix some typos Release note: ```release-note ```	2017-02-25 05:02:53 -08:00
David Ashpole	c58970e47c	critical pods can preempt other pods to be admitted	2017-02-23 10:31:20 -08:00
Andy Goldstein	9d8d6ad16c	Switch scheduler to use generated listers/informers Where possible, switch the scheduler to use generated listers and informers. There are still some places where it probably makes more sense to use one-off reflectors/informers (listing/watching just a single node, listing/watching scheduled & unscheduled pods using a field selector).	2017-02-23 09:57:12 -05:00
riverzhang	5156b7f8cf	Fix some typos	2017-02-21 07:15:40 -06:00
Kubernetes Submit Queue	05c05de798	Merge pull request #41569 from yujuhong/add_healthcheck Automatic merge from submit-queue (batch tested with PRs 38101, 41431, 39606, 41569, 41509) Report node not ready on failed PLEG health check Report node not ready if PLEG health check fails.	2017-02-16 15:49:18 -08:00
Kubernetes Submit Queue	6376ad134d	Merge pull request #39606 from NickrenREN/kubelet-pod Automatic merge from submit-queue (batch tested with PRs 38101, 41431, 39606, 41569, 41509) optimize killPod() and syncPod() functions make sure that one of the two arguments must be non-nil: runningPod, status ,just like the function note says and judge the return value in syncPod() function before setting podKilled	2017-02-16 15:49:17 -08:00
Kubernetes Submit Queue	3c606cdd20	Merge pull request #41456 from dashpole/pod_volume_cleanup Automatic merge from submit-queue (batch tested with PRs 41466, 41456, 41550, 41238, 41416) Delay Deletion of a Pod until volumes are cleaned up #41436 fixed the bug that caused #41095 and #40239 to have to be reverted. Now that the bug is fixed, this shouldn't cause problems. @vishh @derekwaynecarr @sjenning @jingxu97 @kubernetes/sig-storage-misc	2017-02-16 10:14:05 -08:00
Yu-Ju Hong	5bb43a3a24	Report node not ready on failed PLEG health check	2017-02-16 09:00:22 -08:00
NickrenREN	b40e575076	optimize killPod() and syncPod() functions make sure that one of the two arguments must be non-nil: runningPod, status ,just like the function note says and judge the return value in syncPod() function before setting podKilled	2017-02-16 09:13:23 +08:00
Kubernetes Submit Queue	3bc575c91f	Merge pull request #33550 from rtreffer/kubelet-allow-multiple-dns-server Automatic merge from submit-queue Allow multipe DNS servers as comma-seperated argument for kubelet --dns This PR explores how kubectls "--dns" could be extended to specify multiple DNS servers for in-cluster PODs. Testing on the local libvirt-coreos cluster shows that multiple DNS server are injected without issues. Specifying multiple DNS servers increases resilience against - Packet drops - Single server failure I am debugging services that do 50+ DNS requests for a single incoming interactive request, thus highly increase the chance of a slowdown (+5s) due to a single packet drop. Switching to two DNS servers will reduce the impact of the issues (roughly +1s on glibc, 0s on musl, error-rate goes down to error-rate^2). Note that there is no need to change any runtime related code as far as I know. In the case of "default" dns the /etc/resolv.conf is parsed and multiple DNS server are send to the backend anyway. This only adds the same capability for the clusterFirst case. I've heard from @thockin that multiple DNS entries are somehow considered. I've no idea what was considered, though. This is what I would like to see for our production use, though. ```release-note NONE ```	2017-02-15 12:45:32 -08:00
David Ashpole	1d38818326	Revert "Merge pull request #41202 from dashpole/revert-41095-deletion_pod_lifecycle" This reverts commit `ff87d13b2c`, reversing changes made to `46becf2c81`.	2017-02-15 08:44:03 -08:00
Kubernetes Submit Queue	dd696683b7	Merge pull request #40647 from NickrenREN/secretManager Automatic merge from submit-queue (batch tested with PRs 41360, 41423, 41430, 40647, 41352) optimize NewSimpleSecretManager and cleanupOrphanedPodCgroups	2017-02-15 05:06:11 -08:00
Yu-Ju Hong	fb94f441ce	Set EnableCRI to true by default This change makes kubelet to use the CRI implementation by default, unless the users opt out explicitly by using --enable-cri=false. For the rkt integration, the --enable-cri flag will have no effect since rktnetes does not use CRI. Also, mark the original --experimental-cri flag hidden and deprecated, so that we can remove it in the next release.	2017-02-14 16:15:51 -08:00
NickrenREN	31bfefca3c	optimize NewSimpleSecretManager and cleanupOrphanedPodCgroups remove NewSimpleSecretManager second return value and cleanupOrphanedPodCgroups's return since they will never return err	2017-02-14 09:47:05 +08:00
Kubernetes Submit Queue	e9de1b0221	Merge pull request #40992 from k82cn/rm_empty_line Automatic merge from submit-queue (batch tested with PRs 41236, 40992) Removed unnecessarly empty line.	2017-02-10 05:38:42 -08:00
Kubernetes Submit Queue	8188c3cca4	Merge pull request #40796 from wojtek-t/use_node_ttl_in_secret_manager Automatic merge from submit-queue (batch tested with PRs 40796, 40878, 36033, 40838, 41210) Implement TTL controller and use the ttl annotation attached to node in secret manager For every secret attached to a pod as volume, Kubelet is trying to refresh it every sync period. Currently Kubelet has a ttl-cache of secrets of its pods and the ttl is set to 1 minute. That means that in large clusters we are targetting (5k nodes, 30pods/node), given that each pod has a secret associated with ServiceAccount from its namespaces, and with large enough number of namespaces (where on each node (almost) every pod is from a different namespace), that resource in ~30 GETs to refresh all secrets every minute from one node, which gives ~2500QPS for GET secrets to apiserver. Apiserver cannot keep up with it very easily. Desired solution would be to watch for secret changes, but because of security we don't want a node watching for all secrets, and it is not possible for now to watch only for secrets attached to pods from my node. So as a temporary solution, we are introducing an annotation that would be a suggestion for kubelet for the TTL of secrets in the cache and a very simple controller that would be setting this annotation based on the cluster size (the large cluster is, the bigger ttl is). That workaround mean that only very local changes are needed in Kubelet, we are creating a well separated very simple controller, and once watching "my secrets" will be possible it will be easy to remove it and switch to that. And it will allow us to reach scalability goals. @dchen1107 @thockin @liggitt	2017-02-10 00:04:44 -08:00
David Ashpole	b224f83c37	Revert "[Kubelet] Delay deletion of pod from the API server until volumes are deleted"	2017-02-09 08:45:18 -08:00
Wojciech Tyczynski	6c0535a939	Use secret TTL annotation in secret manager	2017-02-09 13:53:32 +01:00
Kubernetes Submit Queue	42d8d4ca88	Merge pull request #40948 from freehan/cri-hostport Automatic merge from submit-queue (batch tested with PRs 40873, 40948, 39580, 41065, 40815) [CRI] Enable Hostport Feature for Dockershim Commits: 1. Refactor common hostport util logics and add more tests 2. Add HostportManager which can ADD/DEL hostports instead of a complete sync. 3. Add Interface for retreiving portMappings information of a pod in Network Host interface. Implement GetPodPortMappings interface in dockerService. 4. Teach kubenet to use HostportManager	2017-02-08 14:14:43 -08:00
Minhan Xia	bd05e1af2b	add portmapping getter into network host	2017-02-08 09:35:04 -08:00
David Ashpole	67cb2704c5	delete volumes before pod deletion	2017-02-08 07:34:49 -08:00
Kubernetes Submit Queue	843e6d1cc3	Merge pull request #40770 from apilloud/clientset_interface Automatic merge from submit-queue (batch tested with PRs 41103, 41042, 41097, 40946, 40770) Use Clientset interface in KubeletDeps What this PR does / why we need it: This replaces the Clientset struct with the equivalent interface for the KubeClient injected via KubeletDeps. This is useful for testing and for accessing the Node and Pod status event stream without an API server. Special notes for your reviewer: Follow up to #4907 Release note: `NONE`	2017-02-07 22:12:39 -08:00
Klaus Ma	cc26fe6ee9	Removed unnecessarly empty line.	2017-02-06 11:10:34 +08:00
Kubernetes Submit Queue	a777a8e3ba	Merge pull request #39972 from derekwaynecarr/pod-cgroups-default Automatic merge from submit-queue (batch tested with PRs 40289, 40877, 40879, 39972, 40942) Rename experimental-cgroups-per-pod flag What this PR does / why we need it: 1. Rename `experimental-cgroups-per-qos` to `cgroups-per-qos` 1. Update hack/local-up-cluster to match `CGROUP_DRIVER` with docker runtime if used. Special notes for your reviewer: We plan to roll this feature out in the upcoming release. Previous node e2e runs were running with this feature on by default. We will default this feature on for all e2es next week. Release note: ```release-note Rename --experiemental-cgroups-per-qos to --cgroups-per-qos ```	2017-02-04 04:43:08 -08:00
Kubernetes Submit Queue	f20b4fc67f	Merge pull request #40655 from vishh/flag-gate-critical-pod-annotation Automatic merge from submit-queue Optionally avoid evicting critical pods in kubelet For #40573 ```release-note When feature gate "ExperimentalCriticalPodAnnotation" is set, Kubelet will avoid evicting pods in "kube-system" namespace that contains a special annotation - `scheduler.alpha.kubernetes.io/critical-pod` This feature should be used in conjunction with the rescheduler to guarantee availability for critical system pods - https://kubernetes.io/docs/admin/rescheduler/ ```	2017-02-03 16:22:26 -08:00
Derek Carr	04a909a257	Rename cgroups-per-qos flag to not be experimental	2017-02-03 17:10:53 -05:00
Andrew Pilloud	3f8505022c	Use clientset.Interface for KubeClient	2017-02-03 07:36:16 -08:00
Vishnu Kannan	6ddb528446	Revert "Sort critical pods before admission" This reverts commit `b7409e0038`.	2017-02-02 10:41:24 -08:00
Wojciech Tyczynski	ec6a95a665	Use caching secret manager in kubelet	2017-02-02 15:32:07 +01:00
Rene Treffer	42ff859c27	Allow multipe DNS servers as comma-seperated argument for --dns Depending on an exact cluster setup multiple dns may make sense. Comma-seperated lists of DNS server are quite common as DNS servers are always plain IPs.	2017-02-01 22:38:40 +01:00
Michael Fraenkel	beb53fb71a	Port forward over websockets - split out port forwarding into its own package Allow multiple port forwarding ports - Make it easy to determine which port is tied to which channel - odd channels are for data - even channels are for errors - allow comma separated ports to specify multiple ports Add portfowardtester 1.2 to whitelist	2017-02-01 06:32:04 -07:00
deads2k	a106d9f848	switch kubelet to use external (client-go) object references for events	2017-01-31 19:15:33 -05:00
deads2k	8a12000402	move client/record	2017-01-31 19:14:13 -05:00
Dr. Stefan Schimanski	bc6fdd925d	pkg/api/resource: move to apimachinery	2017-01-29 21:41:44 +01:00
Aleksandra Malinowska	74e1d8078e	Revert "Delay deletion of pod from the API server until volumes are deleted"	2017-01-27 13:31:02 +01:00
Yu-Ju Hong	202488995a	docker-CRI: Remove legacy code for non-grpc integration	2017-01-26 17:23:20 -08:00
David Ashpole	9094b57570	cleanup volumes before deleting from the api server	2017-01-25 10:21:15 -08:00
deads2k	b0b156b381	make tools/cache authoritative	2017-01-25 08:29:45 -05:00
deads2k	c2ae6d5b40	remove api to util dependency hiding types	2017-01-25 08:28:28 -05:00
Dr. Stefan Schimanski	82826ec273	pkg/util/flag: move to k8s.io/apiserver	2017-01-24 20:56:03 +01:00
Dr. Stefan Schimanski	a6b2ebb50c	pkg/flag: make feature gate extensible and split between generic and kube	2017-01-24 20:56:03 +01:00
Dr. Stefan Schimanski	56d60cfae6	pkg/util: move flags from pkg/util/config to pkg/util/flags	2017-01-24 20:56:03 +01:00
deads2k	5a8f075197	move authoritative client-go utils out of pkg	2017-01-24 08:59:18 -05:00
Clayton Coleman	469df12038	refactor: move ListOptions references to metav1	2017-01-23 17:52:46 -05:00
Wojciech Tyczynski	bf7138652f	SecretVolume using secret manager	2017-01-23 16:10:01 +01:00
Kubernetes Submit Queue	470e732d7f	Merge pull request #40235 from deads2k/generic-26-listers Automatic merge from submit-queue (batch tested with PRs 40232, 40235, 40237, 40240) move listers out of cache to reduce import tree Moving the listers from `pkg/client/cache` snips links to all the different API groups from `pkg/storage`, but the dreaded `ListOptions` remains. @sttts	2017-01-20 14:22:51 -08:00
Kubernetes Submit Queue	dcf14add92	Merge pull request #37228 from sjenning/teardown-terminated-volumes Automatic merge from submit-queue (batch tested with PRs 37228, 40146, 40075, 38789, 40189) kubelet: storage: teardown terminated pod volumes This is a continuation of the work done in https://github.com/kubernetes/kubernetes/pull/36779 There really is no reason to keep volumes for terminated pods attached on the node. This PR extends the removal of volumes on the node from memory-backed (the current policy) to all volumes. @pmorie raised a concern an impact debugging volume related issues if terminated pod volumes are removed. To address this issue, the PR adds a `--keep-terminated-pod-volumes` flag the kubelet and sets it for `hack/local-up-cluster.sh`. For consideration in 1.6. Fixes #35406 @derekwaynecarr @vishh @dashpole ```release-note kubelet tears down pod volumes on pod termination rather than pod deletion ```	2017-01-20 12:34:52 -08:00
deads2k	1ce0637b27	move listers out of cache to reduce import tree	2017-01-20 15:01:38 -05:00
Seth Jennings	e2750a305a	reclaim terminated pod volumes	2017-01-20 11:08:35 -06:00
Kubernetes Submit Queue	53b43d6f8f	Merge pull request #40190 from yujuhong/nsenter_exec Automatic merge from submit-queue (batch tested with PRs 40168, 40165, 39158, 39966, 40190) dockershim: add support for the 'nsenter' exec handler This change simply plumbs the kubelet configuration (--docker-exec-handler) to DockerService. This fixes #35747.	2017-01-20 08:28:53 -08:00
Yu-Ju Hong	f9479ed84b	dockershim: add support for the 'nsenter' exec handler This change simply plumbs the kubelet configuration (--docker-exec-handler) to DockerService.	2017-01-19 16:23:48 -08:00
Wojciech Tyczynski	09e4de385c	Enable nontrivial secret manager	2017-01-19 19:47:33 +01:00

... 3 4 5 6 7 ...

1961 Commits