Automatic merge from submit-queue (batch tested with PRs 42942, 42935)
[Bug] Handle container restarts and avoid using runtime pod cache while allocating GPUs
Fixes #42412
**Background**
Support for multiple GPUs is an experimental feature in v1.6.
Container restarts were handled incorrectly, which resulted in GPUs being stranded.
Kubelet incorrectly uses the runtime cache to track running pods, which can lead to race conditions (as it did in other parts of kubelet). This can result in the same GPU being assigned to multiple pods.
**What does this PR do**
This PR tracks assignment of GPUs to containers and returns pre-allocated GPUs instead of (incorrectly) allocating new GPUs.
GPU manager is updated to consume a list of active pods derived from the apiserver cache instead of the runtime cache.
Node e2e has been extended to validate this failure scenario.
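For illustration, a minimal sketch of the assignment-tracking idea; the names (`podGPUs`, `allocate`) are hypothetical stand-ins for the kubelet's GPU manager internals:
```go
package main

import "fmt"

// podGPUs tracks which GPU devices are assigned to which container.
// Structure is illustrative, not the actual kubelet types.
type podGPUs struct {
	// podUID -> containerName -> assigned GPU device paths
	assignments map[string]map[string][]string
}

// allocate returns a container's pre-allocated GPUs on restart instead of
// handing out fresh devices, which is what stranded GPUs before this fix.
func (p *podGPUs) allocate(podUID, container string, n int, free []string) ([]string, error) {
	if prev := p.assignments[podUID][container]; len(prev) > 0 {
		return prev, nil // container restart: reuse the original assignment
	}
	if len(free) < n {
		return nil, fmt.Errorf("requested %d GPUs, %d available", n, len(free))
	}
	devices := free[:n]
	if p.assignments[podUID] == nil {
		p.assignments[podUID] = map[string][]string{}
	}
	p.assignments[podUID][container] = devices
	return devices, nil
}

func main() {
	p := &podGPUs{assignments: map[string]map[string][]string{}}
	g, _ := p.allocate("pod-1", "cuda", 1, []string{"/dev/nvidia0", "/dev/nvidia1"})
	again, _ := p.allocate("pod-1", "cuda", 1, nil) // restart: no free GPUs needed
	fmt.Println(g, again)
}
```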
**Risk**
Minimal/None since support for GPUs is an experimental feature that is turned off by default. The code is also isolated to GPU manager in kubelet.
**Workarounds**
In the absence of this PR, users can mitigate the original issue by setting `RestartPolicyNever` in their pods.
There is no workaround for the race condition caused by using the runtime cache though.
Hence it is worth including this fix in v1.6.0.
cc @jianzhangbjz @seelam @kubernetes/sig-node-pr-reviews
Replaces #42560
Automatic merge from submit-queue
Invalid environment variable names are reported and the pod starts
When processing EnvFrom items, all invalid keys are collected and
reported as a single event.
The Pod is allowed to start.
fixes #42583
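A small standalone sketch of the behavior, assuming a C-identifier rule for variable names (the helper and regex here are illustrative, not the actual kubelet code):
```go
package main

import (
	"fmt"
	"regexp"
)

var envVarName = regexp.MustCompile(`^[A-Za-z_][A-Za-z0-9_]*$`)

// filterEnvKeys collects every invalid key instead of failing on the first,
// so they can be reported together while the pod still starts.
func filterEnvKeys(keys map[string]string) (valid map[string]string, invalid []string) {
	valid = map[string]string{}
	for k, v := range keys {
		if envVarName.MatchString(k) {
			valid[k] = v
		} else {
			invalid = append(invalid, k)
		}
	}
	return valid, invalid
}

func main() {
	valid, invalid := filterEnvKeys(map[string]string{"GOOD_KEY": "1", "bad-key": "2"})
	if len(invalid) > 0 {
		// In kubelet this is a single Event on the pod, not a log line.
		fmt.Printf("keys %v from EnvFrom were skipped since they are invalid\n", invalid)
	}
	fmt.Println(valid) // the pod starts with only the valid variables
}
```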
Automatic merge from submit-queue (batch tested with PRs 42734, 42745, 42758, 42814, 42694)
Dropped docker 1.9.x support. Changed the minimumDockerAPIVersion to 1.22.
cc/ @Random-Liu @yujuhong
We talked about dropping docker 1.9.x support for a while. I just realized that we haven't really done it yet.
```release-note
Dropped support for docker 1.9.x and below.
```
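A hedged sketch of what such a minimum-API-version gate looks like; the comparison helper is illustrative, while `minimumDockerAPIVersion = 1.22` comes from the change itself:
```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

const minimumDockerAPIVersion = "1.22"

// compareVersions compares dotted numeric versions like "1.21" and "1.22".
func compareVersions(a, b string) int {
	as, bs := strings.Split(a, "."), strings.Split(b, ".")
	for i := 0; i < len(as) && i < len(bs); i++ {
		ai, _ := strconv.Atoi(as[i])
		bi, _ := strconv.Atoi(bs[i])
		if ai != bi {
			if ai < bi {
				return -1
			}
			return 1
		}
	}
	return len(as) - len(bs)
}

func validateAPIVersion(v string) error {
	if compareVersions(v, minimumDockerAPIVersion) < 0 {
		return fmt.Errorf("docker API version %s is older than the minimum supported %s", v, minimumDockerAPIVersion)
	}
	return nil
}

func main() {
	// API versions below 1.22 (i.e. docker 1.9.x and earlier) are rejected.
	fmt.Println(validateAPIVersion("1.21"))
}
```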
Automatic merge from submit-queue
dockershim: Fix the race condition in ListPodSandbox
In ListPodSandbox(), we
1. List all sandbox docker containers
2. List all sandbox checkpoints. If the checkpoint does not have a
corresponding container in (1), we return partial result based on
the checkpoint.
The problem is that new PodSandboxes can be created between step (1) and
(2). In those cases, we will see the checkpoints, but not the sandbox
containers. This leads to strange behavior because the partial result
from the checkpoint does not include some critical information. For
example, the creation timestamp would be zero, and that would cause kubelet's
garbage collector to immediately remove the sandbox.
This change fixes that by getting the list of checkpoints before listing
all the containers (since in RunPodSandbox we create them in the reverse
order).
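A simplified sketch of the reordering, with stand-in helpers for the dockershim internals:
```go
package main

import "fmt"

type sandbox struct {
	id      string
	partial bool // built from a checkpoint only, so e.g. creation time is missing
}

// Stand-ins for dockershim internals; the data is illustrative.
func listCheckpoints() []string       { return []string{"sb-a", "sb-b"} }
func listSandboxContainers() []string { return []string{"sb-a", "sb-b", "sb-c"} }

// listPodSandbox lists checkpoints BEFORE containers. As the description
// above notes, RunPodSandbox creates them in the reverse order, so any
// checkpoint visible at step (1) already has its container by step (2); a
// sandbox created mid-call can only surface as an extra container, never as
// a checkpoint-only (partial) entry.
func listPodSandbox() []sandbox {
	checkpoints := listCheckpoints()      // (1)
	containers := listSandboxContainers() // (2)

	seen := map[string]bool{}
	var result []sandbox
	for _, id := range containers {
		seen[id] = true
		result = append(result, sandbox{id: id})
	}
	for _, id := range checkpoints {
		if !seen[id] {
			// Genuinely stale checkpoint: return the partial view.
			result = append(result, sandbox{id: id, partial: true})
		}
	}
	return result
}

func main() { fmt.Println(listPodSandbox()) }
```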
Automatic merge from submit-queue (batch tested with PRs 42211, 38691, 42737, 42757, 42754)
Only create the symlink when container log path exists
When using the `syslog` logging driver instead of `json-file`, there are no container log files such as `<containerID-json.log>`. We should not create the symlink in this case.
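A minimal sketch of the guard, with assumed helper names and paths:
```go
package main

import (
	"fmt"
	"os"
)

// createContainerLogSymlink only creates the symlink when the container log
// file actually exists (i.e. the json-file logging driver is in use).
func createContainerLogSymlink(containerLogPath, symlinkPath string) error {
	if _, err := os.Stat(containerLogPath); os.IsNotExist(err) {
		// syslog and other drivers produce no per-container log file;
		// skip the symlink instead of pointing it at nothing.
		return nil
	} else if err != nil {
		return err
	}
	return os.Symlink(containerLogPath, symlinkPath)
}

func main() {
	err := createContainerLogSymlink("/var/lib/docker/containers/abc/abc-json.log",
		"/var/log/containers/pod_abc.log")
	fmt.Println(err)
}
```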
Automatic merge from submit-queue (batch tested with PRs 42692, 42169, 42173)
Add pprof trace support
Add support for `/debug/pprof/trace`
Can wait for master to reopen for 1.7.
cc @smarterclayton @wojtek-t @gmarek @timothysc @jeremyeder @kubernetes/sig-scalability-pr-reviews
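For reference, a minimal standalone example of serving the endpoint with the standard library's `net/http/pprof.Trace` handler (kubelet wires this into its own mux):
```go
package main

import (
	"log"
	"net/http"
	"net/http/pprof"
)

func main() {
	mux := http.NewServeMux()
	// /debug/pprof/trace streams a runtime execution trace; fetch it with:
	//   curl 'http://localhost:6060/debug/pprof/trace?seconds=5' > trace.out
	//   go tool trace trace.out
	mux.HandleFunc("/debug/pprof/trace", pprof.Trace)
	log.Fatal(http.ListenAndServe("localhost:6060", mux))
}
```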
* Properly return ImageNotFoundError
* Support injecting "Images" or "ImageInspects" and keep both in sync.
* Remove the FakeDockerPuller and let FakeDockerClient subsume its
functionality. This reduces the overhead of maintaining both objects.
* Various small fixes and refactoring of the testing utils.
Automatic merge from submit-queue (batch tested with PRs 41890, 42593, 42633, 42626, 42609)
Pods pending due to insufficient OIR should get scheduled once sufficient OIR becomes available (e2e disabled).
#41870 was reverted because it introduced an e2e test flake. This is the same code with the e2e for OIR disabled again.
We can attempt to enable the e2e test cases one-by-one in follow-up PRs, but it would be preferable to get the main fix merged in time for 1.6 since OIR is broken on master (see #41861).
cc @timothysc
Automatic merge from submit-queue (batch tested with PRs 42506, 42585, 42596, 42584)
provide active pods to cgroup cleanup
**What this PR does / why we need it**:
This PR provides more information for deciding when a pod cgroup is considered orphaned. The running-pods cache is based on the runtime's view of the world; since we create pod cgroups before containers, we should just be looking at activePods.
**Which issue this PR fixes**
Fixes https://github.com/kubernetes/kubernetes/issues/42431
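A sketch of the resulting orphan check, with illustrative names:
```go
package main

import "fmt"

type podUID string

// orphanedCgroups treats a pod cgroup as orphaned only if its pod UID is
// absent from the apiserver-derived active pod set, not from the runtime's
// (possibly lagging) running-pod cache.
func orphanedCgroups(cgroupPodUIDs []podUID, activePods map[podUID]bool) []podUID {
	var orphans []podUID
	for _, uid := range cgroupPodUIDs {
		if !activePods[uid] {
			// No active pod owns this cgroup; safe to clean up. A pod whose
			// cgroup exists but whose containers have not started yet is
			// still in activePods, so it is not swept by mistake.
			orphans = append(orphans, uid)
		}
	}
	return orphans
}

func main() {
	fmt.Println(orphanedCgroups([]podUID{"a", "b"}, map[podUID]bool{"a": true})) // [b]
}
```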
Automatic merge from submit-queue
cgroup names created by kubelet should be lowercased
**What this PR does / why we need it**:
This PR modifies the kubelet to create cgroupfs names that are lowercased. This better aligns us with the naming convention for cgroups v2 and other cgroup managers in the ecosystem (docker, systemd, etc.).
See: https://www.kernel.org/doc/Documentation/cgroup-v2.txt
"2-6-2. Avoid Name Collisions"
**Special notes for your reviewer**:
none
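For illustration, the convention amounts to lowercasing the cgroupfs name; a trivial sketch (the function name is hypothetical):
```go
package main

import (
	"fmt"
	"strings"
)

// toCgroupfsName lowercases a cgroup name before handing it to cgroupfs,
// per the cgroup v2 naming guidance cited above.
func toCgroupfsName(name string) string {
	return strings.ToLower(name)
}

func main() {
	fmt.Println(toCgroupfsName("Burstable/Pod-abc123")) // "burstable/pod-abc123"
}
```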
**Release note**:
```release-note
Cgroups created by kubelet now follow lowercase naming conventions
```
Automatic merge from submit-queue (batch tested with PRs 31783, 41988, 42535, 42572, 41870)
Pods pending due to insufficient OIR should get scheduled once sufficient OIR becomes available.
This appears to be a regression since v1.5.0 in scheduler behavior for opaque integer resources, reported in https://github.com/kubernetes/kubernetes/issues/41861.
- [x] Add failing e2e test to trigger the regression
- [x] Restore previous behavior (pods pending due to insufficient OIR get scheduled once sufficient OIR becomes available).
Automatic merge from submit-queue
[Bug] Fix gpu initialization in Kubelet
Kubelet incorrectly fails if the `AllAlpha=true` feature gate is enabled with container runtimes other than `docker`.
Replaces #42407
- Added schedulercache.Resource.SetOpaque helper.
- Amend kubelet allocatable sync so that when OIRs are removed from capacity
they are also removed from allocatable.
- Fixes #41861.
Automatic merge from submit-queue (batch tested with PRs 42369, 42375, 42397, 42435, 42455)
[Bug Fix]: Avoid evicting more pods than necessary by adding Timestamps for fsstats and ignoring stale stats
Continuation of #33121. Credit for most of this goes to @sjenning. I added volume fs timestamps.
**Why is this a bug**
This PR attempts to fix part of https://github.com/kubernetes/kubernetes/issues/31362 which results in multiple pods getting evicted unnecessarily whenever the node runs into resource pressure. This PR reduces the chances of such disruptions by avoiding reacting to old/stale metrics.
Without this PR, kubernetes nodes under resource pressure will cause unnecessary disruptions to user workloads.
This PR will also help deflake a node e2e test suite.
The eviction manager currently avoids evicting pods if metrics are old. However, timestamp data is not available for filesystem data, and this causes lots of extra evictions.
See the [inode eviction test flakes](https://k8s-testgrid.appspot.com/google-node#kubelet-flaky-gce-e2e) for examples.
This should probably be treated as a bugfix, as it should help mitigate extra evictions.
cc: @kubernetes/sig-storage-pr-reviews @kubernetes/sig-node-pr-reviews @vishh @derekwaynecarr @sjenning
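A hedged sketch of the idea, with assumed names and an illustrative freshness window:
```go
package main

import (
	"fmt"
	"time"
)

// fsStats carries a timestamp so the eviction manager can tell how old the
// observation is; this is the piece that was missing for filesystem data.
type fsStats struct {
	availableBytes uint64
	recordedAt     time.Time
}

const statsFreshness = 2 * time.Minute // illustrative threshold

// shouldActOn rejects stale metrics: they may describe pressure that has
// already been relieved, and reacting to them evicts more pods than necessary.
func shouldActOn(s fsStats, now time.Time) bool {
	return now.Sub(s.recordedAt) <= statsFreshness
}

func main() {
	old := fsStats{availableBytes: 0, recordedAt: time.Now().Add(-10 * time.Minute)}
	fmt.Println(shouldActOn(old, time.Now())) // false: skip the eviction decision
}
```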
Automatic merge from submit-queue (batch tested with PRs 42369, 42375, 42397, 42435, 42455)
Kubelet: return container runtime's version instead of CRI's one
**What this PR does / why we need it**:
With CRI enabled by default, kubelet reports the version of CRI instead of the container runtime's version. This PR fixes that.
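A sketch of the distinction; the struct mirrors the relevant CRI `VersionResponse` fields, and the surrounding code is illustrative:
```go
package main

import "fmt"

// versionResponse mirrors the relevant CRI Version call fields.
type versionResponse struct {
	Version           string // version of the CRI itself -- the buggy value
	RuntimeName       string // e.g. "docker"
	RuntimeVersion    string // e.g. "1.12.6" -- what the node should report
	RuntimeApiVersion string
}

// nodeRuntimeVersion reports the container runtime's own version rather
// than the CRI version.
func nodeRuntimeVersion(resp versionResponse) string {
	return fmt.Sprintf("%s://%s", resp.RuntimeName, resp.RuntimeVersion)
}

func main() {
	resp := versionResponse{Version: "0.1.0", RuntimeName: "docker", RuntimeVersion: "1.12.6"}
	fmt.Println(nodeRuntimeVersion(resp)) // "docker://1.12.6"
}
```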
**Which issue this PR fixes**
Fixes #42396.
**Special notes for your reviewer**:
Should also be cherry-picked to the 1.6 branch.
**Release note**:
```release-note
NONE
```
cc @yujuhong @kubernetes/sig-node-bugs
Automatic merge from submit-queue
Eviction Manager Enforces Allocatable Thresholds
This PR modifies the eviction manager to enforce node allocatable thresholds for memory as described in kubernetes/community#348.
This PR should be merged after #41234.
cc @kubernetes/sig-node-pr-reviews @kubernetes/sig-node-feature-requests @vishh
**Why is this a bug/regression**
Kubelet uses `oom_score_adj` to enforce QoS policies. But the `oom_score_adj` is based on overall memory requested, which means that a Burstable pod that requested a lot of memory can lead to OOM kills for Guaranteed pods, which violates QoS. Even worse, we have observed system daemons like kubelet or kube-proxy being killed by the OOM killer.
Without this PR, v1.6 will have node stability issues and regressions in `out of resource` handling, an existing GA feature.
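For context, a sketch of the QoS-based `oom_score_adj` policy; treat the exact constants and clamping as an assumption about this era of kubelet:
```go
package main

import "fmt"

const (
	guaranteedOOMScoreAdj = -998 // killed last
	besteffortOOMScoreAdj = 1000 // killed first
)

// burstableOOMScoreAdj scales with the fraction of node memory requested, so
// even a Burstable pod requesting most of the node still scores above
// Guaranteed pods and system daemons, preserving the QoS kill ordering.
func burstableOOMScoreAdj(memoryRequest, memoryCapacity int64) int {
	adj := 1000 - (1000*memoryRequest)/memoryCapacity
	if adj < 2 {
		return 2
	}
	if adj > 999 {
		return 999
	}
	return int(adj)
}

func main() {
	// A pod requesting 2Gi on a 16Gi node: 1000 - 125 = 875.
	fmt.Println(burstableOOMScoreAdj(2<<30, 16<<30))
}
```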
Automatic merge from submit-queue (batch tested with PRs 41919, 41149, 42350, 42351, 42285)
kubelet: enable qos-level memory limits
```release-note
Experimental support to reserve a pod's memory request from being utilized by pods in lower QoS tiers.
```
Enables the QoS-level memory cgroup limits described in https://github.com/kubernetes/community/pull/314
**Note: QoS level cgroups have to be enabled for any of this to take effect.**
Adds a new `--experimental-qos-reserved` flag that can be used to set the percentage of a resource to be reserved at the QoS level for pod resource requests.
For example, `--experimental-qos-reserved="memory=50%"` means that if a Guaranteed pod sets a memory request of 2Gi, the Burstable and BestEffort QoS memory cgroups will have their `memory.limit_in_bytes` set to `NodeAllocatable - (2Gi*50%)`, reserving 50% of the Guaranteed pod's request from being used by the lower QoS tiers.
If a Burstable pod sets a request, its reserve will be deducted from the BestEffort memory limit.
The result is that:
- Guaranteed limit matches the root cgroup and is not set by this code
- Burstable limit is `NodeAllocatable - Guaranteed reserve`
- BestEffort limit is `NodeAllocatable - Guaranteed reserve - Burstable reserve`
The only resource currently supported is `memory`; however, the code is generic enough that other resources can be added in the future.
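A worked sketch of the arithmetic above, using a hypothetical helper and memory only:
```go
package main

import "fmt"

// qosMemoryLimits returns memory.limit_in_bytes for the Burstable and
// BestEffort QoS cgroups given the reserve percentage.
func qosMemoryLimits(nodeAllocatable, guaranteedRequests, burstableRequests, reservePercent int64) (burstable, bestEffort int64) {
	guaranteedReserve := guaranteedRequests * reservePercent / 100
	burstableReserve := burstableRequests * reservePercent / 100
	burstable = nodeAllocatable - guaranteedReserve
	bestEffort = nodeAllocatable - guaranteedReserve - burstableReserve
	return
}

func main() {
	const gi = 1 << 30
	// NodeAllocatable 16Gi, Guaranteed requests 2Gi, Burstable requests 4Gi,
	// reserve 50%: Burstable limit = 16Gi - 1Gi = 15Gi,
	// BestEffort limit = 16Gi - 1Gi - 2Gi = 13Gi.
	b, be := qosMemoryLimits(16*gi, 2*gi, 4*gi, 50)
	fmt.Println(b/gi, be/gi) // 15 13
}
```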
@derekwaynecarr @vishh
Automatic merge from submit-queue (batch tested with PRs 41672, 42084, 42233, 42165, 42273)
ExecProbes should be able to do simple env var substitution
For containers that don't have bash, we should support env substitution
like we do on command and args. However, without major refactoring
valueFrom is not supportable from inside the prober. For now, implement
substitution based on hardcoded env and leave TODOs for future work.
Improves the state of #40846; will spawn a follow-up issue for future refactoring after CRI settles down.
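A hedged sketch of `$(VAR_NAME)` substitution against a fixed environment, mirroring command/args expansion; `valueFrom` sources are out of scope, as noted above:
```go
package main

import (
	"fmt"
	"regexp"
)

var varRef = regexp.MustCompile(`\$\(([A-Za-z_][A-Za-z0-9_]*)\)`)

// expand substitutes $(VAR_NAME) references using the given environment.
func expand(input string, env map[string]string) string {
	return varRef.ReplaceAllStringFunc(input, func(m string) string {
		name := varRef.FindStringSubmatch(m)[1]
		if v, ok := env[name]; ok {
			return v
		}
		return m // unknown references are left as-is
	})
}

func main() {
	env := map[string]string{"POD_IP": "10.0.0.5"}
	fmt.Println(expand("curl http://$(POD_IP):8080/healthz", env))
}
```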
Automatic merge from submit-queue (batch tested with PRs 41980, 42192, 42223, 41822, 42048)
CRI: Make dockershim better implement CRI.
When thinking about CRI Validation test, I found that `PodSandboxStatus.Linux.Namespaces.Options.HostPid` and `PodSandboxStatus.Linux.Namespaces.Options.HostIpc` are not populated. Although they are not used by kuberuntime now, we should populate them to conform to CRI.
/cc @yujuhong @feiskyer
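An illustrative sketch of deriving the namespace options from the sandbox container's host config, using plain string checks as stand-ins for the docker client's mode helpers:
```go
package main

import "fmt"

// namespaceOption mirrors the CRI fields named above.
type namespaceOption struct {
	HostNetwork bool
	HostPid     bool
	HostIpc     bool
}

// namespaceOptionsFromModes populates HostPid and HostIpc (along with
// HostNetwork) from the sandbox container's namespace modes.
func namespaceOptionsFromModes(networkMode, pidMode, ipcMode string) namespaceOption {
	return namespaceOption{
		HostNetwork: networkMode == "host",
		HostPid:     pidMode == "host",
		HostIpc:     ipcMode == "host",
	}
}

func main() {
	fmt.Printf("%+v\n", namespaceOptionsFromModes("host", "", "host"))
}
```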
Automatic merge from submit-queue (batch tested with PRs 41931, 39821, 41841, 42197, 42195)
Use `docker logs` directly if the docker logging driver is not `json-file`
Fixes https://github.com/kubernetes/kubernetes/issues/41996.
Posting the PR first; I still need to test this manually, because we don't have test coverage for the journald logging plugin.
@yujuhong @dchen1107
/cc @kubernetes/sig-node-pr-reviews
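A hedged sketch of the fallback, with illustrative helper names: if the container's logging driver is not `json-file`, there is no log file to read, so shell out to `docker logs` instead:
```go
package main

import (
	"fmt"
	"os/exec"
)

// readLogs reads container logs directly from the json log file when the
// json-file driver is in use, and falls back to `docker logs` otherwise
// (e.g. for journald).
func readLogs(containerID, logDriver string, follow bool) error {
	if logDriver == "json-file" {
		return readJSONFileLogs(containerID, follow) // normal path
	}
	args := []string{"logs"}
	if follow {
		args = append(args, "--follow")
	}
	args = append(args, containerID)
	return exec.Command("docker", args...).Run()
}

// readJSONFileLogs is a stub for the existing file-based log reader.
func readJSONFileLogs(containerID string, follow bool) error { return nil }

func main() {
	fmt.Println(readLogs("abc123", "journald", false))
}
```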