kubernetes

Author	SHA1	Message	Date
Kubernetes Submit Queue	c1ebba0ae2	Merge pull request #38925 from xiangpengzhao/fix-volume-panic Automatic merge from submit-queue Fix nil pointer issue when making mounts for container When rebooting one of the nodes in my colleague's cluster, two panics were discovered: ``` E1216 04:07:00.193058 2394 runtime.go:52] Recovered from panic: "invalid memory address or nil pointer dereference" (runtime error: invalid memory address or nil pointer dereference) /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/util/runtime/runtime.go:58 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/util/runtime/runtime.go:51 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/util/runtime/runtime.go:41 /usr/local/go/src/runtime/asm_amd64.s:472 /usr/local/go/src/runtime/panic.go:443 /usr/local/go/src/runtime/panic.go:62 /usr/local/go/src/runtime/sigpanic_unix.go:24 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/kubelet/kubelet.go:1313 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/kubelet/kubelet.go:1473 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/kubelet/dockertools/docker_manager.go:1495 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/kubelet/dockertools/docker_manager.go:2125 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/kubelet/dockertools/docker_manager.go:2093 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/kubelet/kubelet.go:1971 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/kubelet/kubelet.go:530 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/kubelet/pod_workers.go:171 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/kubelet/pod_workers.go:154 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/kubelet/pod_workers.go:215 /usr/local/go/src/runtime/asm_amd64.s:1998 E1216 04:07:00.275030 2394 runtime.go:52] Recovered from panic: "invalid memory address or nil pointer dereference" (runtime error: invalid memory address or nil pointer dereference) /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/util/runtime/runtime.go:58 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/util/runtime/runtime.go:51 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/util/runtime/runtime.go:41 /usr/local/go/src/runtime/asm_amd64.s:472 /usr/local/go/src/runtime/panic.go:443 /usr/local/go/src/runtime/panic.go:62 /usr/local/go/src/runtime/sigpanic_unix.go:24 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/kubelet/server/stats/volume_stat_caculator.go:98 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/kubelet/server/stats/volume_stat_caculator.go:63 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/util/wait/wait.go:86 /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/pkg/util/wait/wait.go:87 /usr/local/go/src/runtime/asm_amd64.s:1998 ``` kubectl version ``` Client Version: version.Info{Major:"1", Minor:"3", GitVersion:"v1.3.8", GitCommit:"693ef591120267007be359f97191a6253e0e4fb5", GitTreeState:"clean", BuildDate:"2016-09-28T03:03:21Z", GoVersion:"go1.6.2", Compiler:"gc", Platform:"linux/amd64"} Server Version: version.Info{Major:"1", Minor:"3", GitVersion:"v1.3.8", GitCommit:"693ef591120267007be359f97191a6253e0e4fb5", GitTreeState:"clean", BuildDate:"2016-09-28T02:52:25Z", GoVersion:"go1.6.2", Compiler:"gc", Platform:"linux/amd64"} ``` The second panic had already been fixed by #33616 and #34251. Not sure what caused the first nil pointer issue and whether it has been fixed yet in the master branch. Just fix it by ignoring the nil pointer when making mounts. cc @jingxu97 @yujuhong	2017-05-01 10:01:16 -07:00
Kubernetes Submit Queue	08606b530b	Merge pull request #45148 from rickypai/rpai/use_host_aliases Automatic merge from submit-queue (batch tested with PRs 45110, 45148) write HostAliases to hosts file What this PR does / why we need it: using the PodSpec's `HostAliases`, we write entries into the Kubernetes-managed hosts file. Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes #43632 Special notes for your reviewer: Previous PRs in this series: - https://github.com/kubernetes/kubernetes/pull/44572 isolates the logic of creating the file and writing the file - https://github.com/kubernetes/kubernetes/pull/44641 introduces the `HostAliases` field in PodSpec along with validations Release note: ```release-note PodSpec's `HostAliases` now write entries into the Kubernetes-managed hosts file. ``` @thockin @yujuhong Thanks for reviewing!	2017-05-01 05:42:16 -07:00
Kubernetes Submit Queue	6480bc70b0	Merge pull request #45110 from smarterclayton/offset_timeouts Automatic merge from submit-queue (batch tested with PRs 45110, 45148) Make timeouts in the Kubelet slightly offset to aid debugging Several of these loops overlap, and when they are the reason a failure is happening it is difficult to sort them out. Slighly misalign these loops to make their impact obvious. We are seeing exactly 2 minute pod worker timeouts in a wide range of test flake scenarios, and I want to be confident we know exactly which one is the culprit.	2017-05-01 05:42:14 -07:00
Ricky Pai	407fe8b356	write HostAliases to hosts file	2017-04-29 11:31:24 -07:00
Kubernetes Submit Queue	e2042bb81b	Merge pull request #41583 from verb/sharedpid Automatic merge from submit-queue (batch tested with PRs 41583, 45117, 45123) Implement shared PID namespace in the dockershim What this PR does / why we need it: Defaults the Docker CRI to using a shared PID namespace for pods. Implements proposal in https://github.com/kubernetes/community/pull/207 tracked by #1615. //cc @dchen1107 @vishh @timstclair Special notes for your reviewer: none Release note: ```release-note Some container runtimes share a process (PID) namespace for all containers in a pod. This will become the default for Docker in a future release of Kubernetes. You can preview this functionality if running with the CRI and Docker 1.13.1 by enabling the --experimental-docker-enable-shared-pid kubelet flag. ```	2017-04-28 20:15:03 -07:00
Kubernetes Submit Queue	e06fc087e0	Merge pull request #44938 from jayunit100/cleanup-orphan-logging Automatic merge from submit-queue (batch tested with PRs 45033, 44961, 45021, 45097, 44938) Cleanup orphan logging that goes on in the sync loop. What this PR does / why we need it: Fixes #44937 Before this PR The older logs were like this: ``` E0426 00:06:33.763347 21247 kubelet_volumes.go:114] Orphaned pod "35c4a858-2a12-11e7-910c-42010af00003" found, but volume paths are still present on disk. E0426 00:06:33.763400 21247 kubelet_volumes.go:114] Orphaned pod "e7676365-1580-11e7-8c27-42010af00003" found, but volume paths are still present on disk. ``` The problem being that, all the volumes were spammed w/ no summary info. After this PR the logs look like this: ``` E0426 01:32:27.295568 22261 kubelet_volumes.go:129] Orphaned pod "408b060e-2a1d-11e7-90e8-42010af00003" found, but volume paths are still present on disk. : There were a total of 2 errors similar to this. Turn up verbosity to see them. E0426 01:32:29.295515 22261 kubelet_volumes.go:129] Orphaned pod "408b060e-2a1d-11e7-90e8-42010af00003" found, but volume paths are still present on disk. : There were a total of 2 errors similar to this. Turn up verbosity to see them. E0426 01:32:31.293180 22261 kubelet_volumes.go:129] Orphaned pod "408b060e-2a1d-11e7-90e8-42010af00003" found, but volume paths are still present on disk. : There were a total of 2 errors similar to this. Turn up verbosity to see them. ``` And with logging turned up, the extra info logs are shown with details: ``` E0426 01:34:21.933983 26010 kubelet_volumes.go:129] Orphaned pod "1c565800-2a20-11e7-bbc2-42010af00003" found, but volume paths are still present on disk. : There were a total of 3 errors similar to this. Turn up verbosity to see them. I0426 01:34:21.934010 26010 kubelet_volumes.go:131] Orphan pod: Orphaned pod "1c565800-2a20-11e7-bbc2-42010af00003" found, but volume paths are still present on disk. I0426 01:34:21.934015 26010 kubelet_volumes.go:131] Orphan pod: Orphaned pod "408b060e-2a1d-11e7-90e8-42010af00003" found, but volume paths are still present on disk. I0426 01:34:21.934019 26010 kubelet_volumes.go:131] Orphan pod: Orphaned pod "e7676365-1580-11e7-8c27-42010af00003" found, but volume paths are still present on disk. ``` Release note ```release-note Roll up volume error messages in the kubelet sync loop. ```	2017-04-28 13:16:47 -07:00
Clayton Coleman	49209b3394	Make timeouts in the Kubelet slightly offset to aid debugging Several of these loops overlap, and when they are the reason a failure is happening it is difficult to sort them out. Slighly misalign these loops to make their impact obvious.	2017-04-28 12:00:28 -04:00
xiangpengzhao	db97cba291	Fix nil pointer issue when making mounts for container	2017-04-28 11:41:39 +08:00
Kubernetes Submit Queue	acca01bcc2	Merge pull request #44939 from sjenning/adjust-logging Automatic merge from submit-queue don't HandleError on container start failure Failing to start containers is a common error case if there is something wrong with the container image or environment like missing mounts/configs/permissions/etc. Not only is it common; it is reoccurring as backoff happens and new attempts to start the container are made. `HandleError` it too verbose for this very common situation. Replace `HandleError` with `glog.V(3).Infof` xref https://github.com/openshift/origin/issues/13889 @smarterclayton @derekwaynecarr @eparis	2017-04-27 19:36:23 -07:00
Kubernetes Submit Queue	8efb5c9957	Merge pull request #44983 from caesarxuchao/easy-remove-client-go-api-scheme Automatic merge from submit-queue (batch tested with PRs 45052, 44983, 41254) Non-controversial part of #44523 For easier review of #44523, i extracted the non-controversial part out to this PR.	2017-04-27 17:14:04 -07:00
Lee Verberne	d22dd0fa35	Implement shared PID namespace in the dockershim	2017-04-27 23:43:53 +00:00
Kubernetes Submit Queue	8b9625d2ea	Merge pull request #41627 from gyliu513/kubelet-types Automatic merge from submit-queue (batch tested with PRs 42740, 44980, 45039, 41627, 45044) Improved code coverage for /pkg/kubelet/types What this PR does / why we need it: The test coverage for /pkg/kubelet/types was increased from 50% to 87.5% Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes # Special notes for your reviewer: Release note: ```release-note ```	2017-04-27 13:27:06 -07:00
Chao Xu	958903509c	bazel	2017-04-27 09:41:53 -07:00
Chao Xu	3fa7b7824a	easy changes	2017-04-27 09:41:53 -07:00
Kubernetes Submit Queue	c3df35df7b	Merge pull request #44970 from Random-Liu/fix-stop-container-timeout Automatic merge from submit-queue (batch tested with PRs 44970, 43618) CRI: Fix StopContainer timeout Fixes https://github.com/kubernetes/kubernetes/issues/44956. I verified this PR with the example provided in https://github.com/kubernetes/kubernetes/issues/44956, and now pod deletion will respect grace period timeout: ``` NAME READY STATUS RESTARTS AGE gracefully-terminating-pod 1/1 Terminating 0 6m ``` @dchen1107 @yujuhong @feiskyer /cc @kubernetes/sig-node-bugs	2017-04-26 22:58:11 -07:00
Seth Jennings	ffb9f5aa4c	don't HandleError on container start failure	2017-04-26 23:00:39 -05:00
David Ashpole	958e290c8d	still consider quantity reclaimed even when errors are returned	2017-04-26 17:40:30 -07:00
Random-Liu	cfd0efff11	Fix StopContainer timeout	2017-04-26 15:48:12 -07:00
Andy Goldstein	715d5d9c91	Add redirect support to SpdyRoundTripper Add support for following redirects to the SpdyRoundTripper. This is necessary for clients using it directly (e.g. the apiserver talking directly to the kubelet) because the CRI streaming server issues a redirect for streaming requests. Also extract common logic for following redirects.	2017-04-26 09:45:19 -04:00
jayunit100	b3c45247bc	Cleanup orphan logging that goes on in the sync loop.	2017-04-25 21:16:22 -04:00
Guangya Liu	593336bd9d	Improved code coverage for /pkg/kubelet/types	2017-04-25 06:25:21 +08:00
Ricky Pai	e21da839e5	extract content-generation concern from `ensureHostsFile` add tests to assert the output of `ensureHostsFile`	2017-04-24 12:33:45 -07:00
Kubernetes Submit Queue	f11f72ece8	Merge pull request #42486 from jcbsmpsn/certificate-manager-bootstrap Automatic merge from submit-queue Add bootstrap support to certificate manager. Adds configuration options to certificate manager for using bootstrap cert/key pairs to handle the scenario where new nodes are initialized using a generic cert/key pair. Bootstrap cert/key pairs are quickly rotated, independent of duration remaining, so that each kubelet has a unique cert/key pair.	2017-04-21 16:37:44 -07:00
Kubernetes Submit Queue	b19589df31	Merge pull request #44642 from supereagle/fix-comment-error Automatic merge from submit-queue (batch tested with PRs 42202, 40784, 44642, 44623, 44761) fix comment error for network plugin What this PR does / why we need it: Which issue this PR fixes : fixes # Special notes for your reviewer: Release note: ```release-note NONE ```	2017-04-21 11:52:07 -07:00
Jacob Simpson	e992eaec8f	Add bootstrap support to certificate manager.	2017-04-20 16:27:32 -07:00
supereagle	343f4baa5a	fix comment error for network plugin	2017-04-19 07:10:41 +08:00
Casey Callendrello	e4eaad3d24	kubelet/networking: add support for cni ConfigLists, pass hostport parameters reason for this change CNI has recently introduced a new configuration list feature. This allows for plugin chaining. It also supports varied plugin versions.	2017-04-18 14:23:57 +02:00
Kubernetes Submit Queue	e91bd12b99	Merge pull request #42033 from NickrenREN/dswp-findAndAddActivePods Automatic merge from submit-queue (batch tested with PRs 41849, 42033) fix TODO: find and add active pods for dswp loops through the list of active pods and ensures that each one exists in the desired state of the world cache Release note: ```release-note NONE ```	2017-04-18 01:26:58 -07:00
Kubernetes Submit Queue	c20e63bfb9	Merge pull request #42939 from k82cn/k8s_42701 Automatic merge from submit-queue Used ObjectReference for events. fixes #42701 ```release-note None ```	2017-04-17 23:26:06 -07:00
NickrenREN	5cafb9042b	find and add active pods for dswp loops through the list of active pods and ensures that each one exists in the desired state of the world cache	2017-04-18 11:21:37 +08:00
Kubernetes Submit Queue	2c774753e1	Merge pull request #44467 from JulienBalestra/fix-rkt-host-path-volume Automatic merge from submit-queue (batch tested with PRs 44469, 44566, 44467, 44526) Kubelet:rkt Fix the hostPath Volume creation What this PR does / why we need it: This PR fix the `hostPath` volume when the path exist and it's not a directory. At the moment, the creation of a `hostPath` volume for an existing file leads to this error: > kubelet[1984]: E0413 07:53:16.480922 1984 pod_workers.go:184] Error syncing pod 38359a57-1fb1-11e7-a484-76870fe7db83, skipping: failed to SyncPod: mkdir /usr/share/coreos/lsb-release: not a directory Special notes for your reviewer: You can have a look to the difference with this [gist](https://gist.github.com/JulienBalestra/28ae15efc8a1393d350300880c07ff4f)	2017-04-17 20:06:59 -07:00
Kubernetes Submit Queue	884124272a	Merge pull request #44469 from siggy/siggy/custom-metrics-comment Automatic merge from submit-queue comment spelling correction in custommetrics What this PR does / why we need it: fix spelling in a comment Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes # Special notes for your reviewer: Release note: ```release-note ```	2017-04-17 19:59:16 -07:00
Klaus Ma	6d29cfc0cc	Registered node before other initialization.	2017-04-18 10:43:56 +08:00
Kubernetes Submit Queue	a1684fea80	Merge pull request #42085 from cblecker/gofmt-fix Automatic merge from submit-queue (batch tested with PRs 40055, 42085, 44509, 44568, 43956) Fix gofmt errors What this PR does / why we need it: There were some gofmt errors on master. Ran the following to fix: ``` hack/verify-gofmt.sh \| grep ^diff \| awk '{ print $2 }' \| xargs gofmt -w -s ``` Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): none Special notes for your reviewer: Release note: ```release-note NONE ```	2017-04-17 15:39:07 -07:00
Kubernetes Submit Queue	73fb978181	Merge pull request #44398 from caesarxuchao/move-v1/refs-and-v1/resource Automatic merge from submit-queue (batch tested with PRs 44569, 44398) Move v1/refs and v1/resource This PR moves pkg/api/v1/ref.go and pkg/api/v1/resource_helper.go to their own sub packages, it's very similar to 44299 and 44302. The PR is mostly mechanical, except that * i moved some utility function from resource.go to pkg/api/v1/pod and pkg/api/v1/node, as they are more appropriate * i updated the staging/copy.sh to copy the new subpackages, so that helper functions are copied. We can get rid of this copy after client-go stops copying API types.	2017-04-17 14:03:57 -07:00
Chao Xu	4f9591b1de	move pkg/api/v1/ref.go and pkg/api/v1/resource.go to subpackages. move some functions in resource.go to pkg/api/v1/node and pkg/api/v1/pod	2017-04-17 11:38:11 -07:00
Yu-Ju Hong	1d3d12dfc2	Don't check runtime condition for rktnetes rktnetes is not a CRI implementation, and does not provide runtime conditions. This change fixes the issue where rkt will never be considered running from kubelet's point of view.	2017-04-17 11:33:58 -07:00
Julien Balestra	edc4ccd660	Kubelet:rkt Fix the hostPath Volume creation Kubelet:rkt Fix the hostPath Volume creation	2017-04-15 15:03:27 +02:00
Kubernetes Submit Queue	4e3bbe3915	Merge pull request #42498 from jcbsmpsn/add-jitter-to-rotation-threshold Automatic merge from submit-queue (batch tested with PRs 44364, 44361, 42498) Fix the certificate rotation threshold and add jitter. Adjusts the certificate rotation threshold to be fixed, with some jitter to spread out the load on the Certificate Signing Request API. The rotation threshold is fixed at 20% now, meaning when 20% of the certificate's total duration is remaining, the certificate manager will attempt to rotate, with jitter +/-10%. For certificates of duration 1 month that means they will rotate after 24 days, +/- 3 days. On a 6000 node cluster, assuming all nodes added at nearly the same time, this should result in 6000 nodes rotating spread over 6 days (total range of the jitter), or ~42 nodes / hour requesting new certificates.	2017-04-14 17:56:01 -07:00
Chao Xu	d4850b6c2b	move pkg/api/v1/helpers.go to subpackage	2017-04-14 14:25:11 -07:00
Mike Danese	a05c3c0efd	autogenerated	2017-04-14 10:40:57 -07:00
Kubernetes Submit Queue	b0a05b4597	Merge pull request #42474 from k82cn/rm_empty_line_kl Automatic merge from submit-queue Removed un-necessary empty line.	2017-04-14 07:23:11 -07:00
Kubernetes Submit Queue	4653a9b280	Merge pull request #41543 from dshulyak/decouple_remotecommand Automatic merge from submit-queue (batch tested with PRs 44406, 41543, 44071, 44374, 44299) Decouple remotecommand Refactored unversioned/remotecommand to decouple it from undesirable dependencies: - term package now is not required, and functionality required to resize terminal size can be plugged in directly in kubectl - in order to remove dependency on kubelet package - constants from kubelet/server/remotecommand were moved to separate util package (pkg/util/remotecommand) - remotecommand_test.go moved to pkg/client/tests module	2017-04-13 19:52:05 -07:00
Kubernetes Submit Queue	1cf6ef08df	Merge pull request #44406 from Random-Liu/stop-following-when-exited Automatic merge from submit-queue CRI: Stop following container log when container exited. Fixes https://github.com/kubernetes/kubernetes/issues/44340. This PR changed kubelet to periodically check whether container is running when following container logs, and stop following when container exited. I've tried this PR in my local cluster: ``` Wed Apr 12 20:23:54 UTC 2017 Wed Apr 12 20:23:58 UTC 2017 Wed Apr 12 20:24:02 UTC 2017 Wed Apr 12 20:24:06 UTC 2017 Wed Apr 12 20:24:10 UTC 2017 Wed Apr 12 20:24:14 UTC 2017 Wed Apr 12 20:24:18 UTC 2017 Wed Apr 12 20:24:22 UTC 2017 Wed Apr 12 20:24:26 UTC 2017 Wed Apr 12 20:24:30 UTC 2017 Wed Apr 12 20:24:34 UTC 2017 Wed Apr 12 20:24:38 UTC 2017 Wed Apr 12 20:24:42 UTC 2017 Wed Apr 12 20:24:46 UTC 2017 failed to wait logs for log file "/var/log/pods/1d54634c7b31346fc3219f5e0b7507cc/nginx_0.log": container "b9a17a2c53550c3703ab350d85911743af8bf164a41813544fd08fb9585f7501" is not running (state="CONTAINER_EXITED") ``` The only difference is that `ReadLogs` will return error when container exits during following. I'm not sure whether we should get rid of it or not. @yujuhong @feiskyer @JorritSalverda /cc @kubernetes/sig-node-bugs Release note: ```release-note `kubectl logs -f` now stops following when container stops. ```	2017-04-13 19:10:28 -07:00
Andrew Seigner	f13563b73a	fix comment in custommetrics	2017-04-13 15:03:36 -07:00
Random-Liu	2fbf34f7c1	Stop following container log when container exited.	2017-04-13 11:25:08 -07:00
Dmitry Shulyak	f50480c714	Decouple remotecommand client from term/kubelet dependencies In order to move client/unversioned/remotecommand to client-go as a followup for this change we have to decouple it from tons of dependencies	2017-04-13 15:56:40 +03:00
Kubernetes Submit Queue	42c0994c34	Merge pull request #43031 from dashpole/eviction_metrics Automatic merge from submit-queue Add prometheus metrics for age of stats used for evictions. Completes #42923 This PR adds metrics for evictions, and records how stale data used for evictions is. cc @vishh @derekwaynecarr @kubernetes/sig-node-pr-reviews	2017-04-12 12:38:58 -07:00
Andy Goldstein	00e11566f2	Make the dockershim root directory configurable Make the dockershim root directory configurable so things like integration tests (e.g. in OpenShift) can run as non-root.	2017-04-12 09:06:21 -04:00
Chao Xu	08aa712a6c	move helpers.go to helper	2017-04-11 15:49:11 -07:00

1 2 3 4 5 ...

4581 Commits