kubernetes

Author	SHA1	Message	Date
Clayton Coleman	d5719800bf	kubelet: Handle UID reuse in pod worker If a pod is killed (no longer wanted) and then a subsequent create/ add/update event is seen in the pod worker, assume that a pod UID was reused (as it could be in static pods) and have the next SyncKnownPods after the pod terminates remove the worker history so that the config loop can restart the static pod, as well as return to the caller the fact that this termination was not final. The housekeeping loop then reconciles the desired state of the Kubelet (pods in pod manager that are not in a terminal state, i.e. admitted pods) with the pod worker by resubmitting those pods. This adds a small amount of latency (2s) when a pod UID is reused and the pod is terminated and restarted.	2021-09-15 14:02:00 -04:00
Kubernetes Prow Robot	047a6b9f86	Merge pull request #104874 from wojtek-t/migrate_clock_1 Unify towards k8s.io/utils/clock - part 1	2021-09-13 19:09:20 -07:00
wojtekt	53ce79a18a	Migrate to k8s.io/utils/clock in pkg/kubelet	2021-09-10 12:20:09 +02:00
Kubernetes Prow Robot	5724484bda	Merge pull request #104069 from pacoxu/fix-data-race-104057 fix data race in kubelet volume test: add lock for ut	2021-09-09 21:09:59 -07:00
paco	ab055e9ba4	fix data race in kubelet volume test: add lock Signed-off-by: Paco Xu <paco.xu@daocloud.io> Co-authored-by: Jian Zeng <zengjian.zj@bytedance.com>	2021-09-01 16:13:55 +08:00
Clayton Coleman	a2ca66d280	kubelet: Admission must exclude completed pods and avoid races Fixes two issues with how the pod worker refactor calculated the pods that admission could see (GetActivePods() and filterOutTerminatedPods()) First, completed pods must be filtered from the "desired" state for admission, which arguably should be happening earlier in config. Exclude the two terminal pods states from GetActivePods() Second, the previous check introduced with the pod worker lifecycle ownership changes was subtly wrong for the admission use case. Admission has to include pods that haven't yet hit the pod worker, which CouldHaveRunningContainers was filtering out (because the pod worker hasn't seen them). Introduce a weaker check - IsPodKnownTerminated() - that returns true only if the pod is in a known terminated state (no running containers AND known to pod worker). This weaker check may only be called from components that need admitted pods, not other kubelet subsystems. This commit does not fix the long standing bug that force deleted pods are omitted from admission checks, which must be fixed by having GetActivePods() also include pods "still terminating".	2021-08-25 13:31:02 -04:00
Clayton Coleman	de9cdab5ae	kubelet: Prevent runtime-only pods from going into terminated phase If a pod is already in terminated and the housekeeping loop sees an out of date cache entry for a running container, the pod worker should ignore that running pod termination request. Once the worker completes, a subsequent housekeeping invocation will then invoke terminating because the worker is no longer processing any pod with that UID. This does leave the possibility of syncTerminatedPod being blocked if a container in the pod is started after killPod successfully completes but before syncTerminatedPod can exit successfully, perhaps because the terminated flow (detach volumes) is blocked on that running container. A future change will address that issue.	2021-07-13 15:41:49 -04:00
Clayton Coleman	3eadd1a9ea	Keep pod worker running until pod is truly complete A number of race conditions exist when pods are terminated early in their lifecycle because components in the kubelet need to know "no running containers" or "containers can't be started from now on" but were relying on outdated state. Only the pod worker knows whether containers are being started for a given pod, which is required to know when a pod is "terminated" (no running containers, none coming). Move that responsibility and podKiller function into the pod workers, and have everything that was killing the pod go into the UpdatePod loop. Split syncPod into three phases - setup, terminate containers, and cleanup pod - and have transitions between those methods be visible to other components. After this change, to kill a pod you tell the pod worker to UpdatePod({UpdateType: SyncPodKill, Pod: pod}). Several places in the kubelet were incorrect about whether they were handling terminating (should stop running, might have containers) or terminated (no running containers) pods. The pod worker exposes methods that allow other loops to know when to set up or tear down resources based on the state of the pod - these methods remove the possibility of race conditions by ensuring a single component is responsible for knowing each pod's allowed state and other components simply delegate to checking whether they are in the window by UID. Removing containers now no longer blocks final pod deletion in the API server and are handled as background cleanup. Node shutdown no longer marks pods as failed as they can be restarted in the next step. See https://docs.google.com/document/d/1Pic5TPntdJnYfIpBeZndDelM-AbS4FN9H2GTLFhoJ04/edit# for details	2021-07-06 15:55:22 -04:00
KeZhang	83ee5da75e	Fix:slow memory leak may be in kubelet podworkers.isWorking	2021-06-15 15:26:30 +08:00
Jordan Liggitt	124a5ddf72	Fix int->string casts	2020-07-24 16:23:12 -04:00
Ted Yu	2242e396d4	Pass desiredPods to CleanupPods	2019-07-03 10:35:13 +08:00
Davanum Srinivas	7b8c9acc09	remove unused code Change-Id: If821920ec8872e326b7d85437ad8d2620807799d	2019-04-19 08:36:31 -04:00
mYmNeo	e74cabe545	Correct TestUpdatePod comment Signed-off-by: mYmNeo <thomassong2012@gmail.com>	2017-10-20 09:41:18 +08:00
Klaus Ma	63b78a37e0	Added golint check for pkg/kubelet.	2017-07-19 11:33:06 +08:00
Chao Xu	60604f8818	run hack/update-all	2017-06-22 11:31:03 -07:00
Chao Xu	f4989a45a5	run root-rewrite-v1-..., compile	2017-06-22 10:25:57 -07:00
Clayton Coleman	3e095d12b4	Refactor move of client-go/util/clock to apimachinery	2017-05-20 14:19:48 -04:00
deads2k	8a12000402	move client/record	2017-01-31 19:14:13 -05:00
deads2k	5a8f075197	move authoritative client-go utils out of pkg	2017-01-24 08:59:18 -05:00
deads2k	c47717134b	move utils used in restclient to client-go	2017-01-19 07:55:14 -05:00
Clayton Coleman	9a2a50cda7	refactor: use metav1.ObjectMeta in other types	2017-01-17 16:17:19 -05:00
deads2k	6a4d5cd7cc	start the apimachinery repo	2017-01-11 09:09:48 -05:00
Chao Xu	5e1adf91df	cmd/kubelet	2016-11-23 15:53:09 -08:00
derekwaynecarr	ff017839c7	Log an event when container runtime exceeds grace-period during eviction	2016-09-07 13:28:08 -04:00
Harry Zhang	cb14b35bde	Refactor util clock into it's own pkg	2016-07-28 02:29:04 -04:00
David McMahon	ef0c9f0c5b	Remove "All rights reserved" from all the headers.	2016-06-29 17:47:36 -07:00
derekwaynecarr	6fefb428c1	Add killPodNow to kubelet	2016-05-12 19:17:08 -04:00
Saad Ali	25f37007aa	Merge pull request #24846 from pmorie/kubelet-test-loc Reduce LOC in kubelet tests	2016-05-12 15:52:27 -07:00
Paul Morie	d1e0e726f2	Reduce LOC in kubelet tests	2016-05-03 22:45:08 -04:00
Random-Liu	4cca5b2290	Use fake clock in TestGetPodsToSync to fix flake.	2016-05-02 16:05:36 -07:00
Tim St. Clair	7b6d843309	Move test-only files to test-only packages	2016-03-01 09:11:32 -08:00
Yu-Ju Hong	ff04de4fc0	Remove RuntimeCache from sync path This change removes RuntimeCache in the pod workers and the syncPod() function. Note that it doesn't deprecate RuntimeCache completely as other components still rely on the cache.	2016-02-01 21:32:41 -08:00
Yu-Ju Hong	cfb5442b2d	Turn on kubecontainer.Cache in kubelet	2016-01-19 18:15:10 -08:00
Piotr Szczesniak	9659057986	Revert "Enable kubecontainer.Cache in kubelet"	2016-01-18 13:35:41 +01:00
Yu-Ju Hong	07cf5cff48	Enable kubecontainer.Cache in kubelet	2016-01-14 09:31:24 -08:00
Lantao Liu	a35220c321	cleanup pod_workers_test.go to use general runtime interface	2015-11-04 16:55:25 -08:00
Yu-Ju Hong	2eb17df46b	kubelet: independent pod syncs and backoff on error Currently kubelet syncs all pods every 10s. This is not preferred because * Some pods may have been sync'd recently. * This may cause all the pods to be sync'd at once, causing undesirable CPU spikes. This PR replaces the global syncs with independent, periodic pod syncs. At the end of syncing, each pod worker will enqueue itslef with a future timestamp ( current time + sync interval), when it will be due for another sync. * If the pod worker encoutners an sync error, it may requeue with a different timestamp to retry sooner. * If a sync is triggered by the update channel (events or spec changes), the pod worker would enqueue a new sync time. This change is necessary for moving to long or no periodic sync period once pod lifecycle event generator is completed. We will still rely on the mechanism to requeue the pod on sync error. This change also makes sure that if a sync does not succeed (either due to real error or the per-container backoff mechanism), an error would be propagated back to the pod worker, which is responsible for requeuing.	2015-11-03 13:29:08 -08:00
eulerzgy	31c09bdcb8	Del capatical local packagename for cadvisorApi	2015-10-16 11:03:50 +08:00
Yu-Ju Hong	a3e60cc32e	Rename imported package local name kubeletTypes to kubetypes According to the naming guidelines, package name should not include mixedCaps.	2015-10-09 10:24:31 -07:00
Yu-Ju Hong	098ab05997	kubelet: move common types to kubelet/types This would faciliate tasks such as moving code in pkg/kubelet to sub packages.	2015-10-08 14:38:01 -07:00
Sam Abed	fdf712cd84	back-off image pull on failure Signed-off-by: Sam Abed <samabed@gmail.com>	2015-10-07 21:12:42 +11:00
Yu-Ju Hong	889e798ddb	kubelet: pipe SyncPodType to pod workers Now that kubelet has switched to incremental updates, it has complete information of the pod update type (create, update, sync). This change pipes this information to pod workers so that they don't have to derive the type again.	2015-10-01 16:29:46 -07:00
Brendan Burns	77fd388485	Increase a bunch of timeouts to reduce flakes	2015-09-23 11:09:03 -07:00
Daniel Smith	15b30b8b09	Move version agnostic parts of client pkg/client/unversioned/cache -> pkg/client/cache pkg/client/unversioned/record -> pkg/client/record	2015-09-10 17:17:59 -07:00
Kris Rousey	ae6c64d9bb	Moving everyone to unversioned client	2015-08-18 10:23:03 -07:00
Yifan Gu	d70a30c069	kubelet: refactor kubelet.Runtimehooks to container.ImagePuller.	2015-08-12 16:28:25 -07:00
Ananya Kumar	6ef3de1d5f	Add QoS support on node	2015-08-07 11:18:16 -07:00
Mike Danese	17defc7383	run gofmt on everything we touched	2015-08-05 17:52:56 -07:00
Mike Danese	8e33cbfa28	rewrite go imports	2015-08-05 17:30:03 -07:00
Prashanth Balasubramanian	b5ed0e9b13	Dont generatePodStatus twice for new pods	2015-06-11 17:18:16 -07:00

1 2

67 Commits