kubernetes

Author	SHA1	Message	Date
Clayton Coleman	6b9a381185	kubelet: Force deleted pods can fail to move out of terminating
If a CRI error occurs during the terminating phase after a pod is force deleted (API or static) then the housekeeping loop will not deliver updates to the pod worker which prevents the pod's state machine from progressing. The pod will remain in the terminating phase but no further attempts to terminate or cleanup will occur until the kubelet is restarted.
The pod worker now maintains a store of the pod's state that it is attempting to reconcile and uses that to resync unknown pods when SyncKnownPods() is invoked, so that failures in sync methods for unknown pods no longer hang forever.
The pod worker's store tracks desired updates and the last update applied on podSyncStatuses. Each goroutine now synchronizes to acquire the next work item, context, and whether the pod can start. This synchronization moves the pending update to the stored last update, which will ensure third parties accessing pod worker state don't see updates before the pod worker begins synchronizing them.
As a consequence, the update channel becomes a simple notifier (struct{}) so that SyncKnownPods can coordinate with the pod worker to create a synthetic pending update for unknown pods (i.e. no one besides the pod worker has data about those pods). Otherwise the pending update info would be hidden inside the channel.
In order to properly track pending updates, we have to be very careful not to mix RunningPods (which are calculated from the container runtime and are missing all spec info) and config-sourced pods. Update the pod worker to avoid using ToAPIPod() and instead require the pod worker to directly use update.Options.Pod or update.Options.RunningPod for the correct methods. Add a new SyncTerminatingRuntimePod to prevent accidental invocations of runtime-only pod data.
Finally, fix SyncKnownPods to replay the last valid update for undesired pods which drives the pod state machine towards termination, and alter HandlePodCleanups to:
- terminate runtime pods that aren't known to the pod worker
- launch admitted pods that aren't known to the pod worker
Any started pods receive a replay until they reach the finished state, and then are removed from the pod worker. When a desired pod is detected as not being in the worker, the usual cause is that the pod was deleted and recreated with the same UID (almost always a static pod since API UID reuse is statistically unlikely). This simplifies the previous restartable pod support. We are careful to filter for active pods (those not already terminal or those which have been previously rejected by admission). We also force a refresh of the runtime cache to ensure we don't see an older version of the state.
Future changes will allow other components that need to view the pod worker's actual state (not the desired state the podManager represents) to retrieve that info from the pod worker.
Several bugs in pod lifecycle have been undetectable at runtime because the kubelet does not clearly describe the number of pods in use. To better report, add the following metrics:
kubelet_desired_pods: Pods the pod manager sees
kubelet_active_pods: "Admitted" pods that gate new pods
kubelet_mirror_pods: Mirror pods the kubelet is tracking
kubelet_working_pods: Breakdown of pods from the last sync in each phase, orphaned state, and static or not
kubelet_restarted_pods_total: A counter for pods that saw a CREATE before the previous pod with the same UID was finished
kubelet_orphaned_runtime_pods_total: A counter for pods detected at runtime that were not known to the kubelet. Will be populated at Kubelet startup and should never be incremented after.
Add a metric check to our e2e tests that verifies the values are captured correctly during a serial test, and then verify them in detail in unit tests.
Adds 23 series to the kubelet /metrics endpoint.	2023-03-08 22:03:51 -06:00
David Ashpole	64af1adace	Second attempt: Plumb context to Kubelet CRI calls (#113591) * plumb context from CRI calls through kubelet * clean up extra timeouts * try fixing incorrectly cancelled context	2022-11-05 06:02:13 -07:00
Antonio Ojea	9c2b333925	Revert "plumb context from CRI calls through kubelet" This reverts commit `f43b4f1b95`.	2022-11-02 13:37:23 +00:00
David Ashpole	f43b4f1b95	plumb context from CRI calls through kubelet	2022-10-28 02:55:28 +00:00
Clayton Coleman	69a3820214	kubelet: Delay writing a terminal phase until the pod is terminated Other components must know when the Kubelet has released critical resources for terminal pods. Do not set the phase in the apiserver to terminal until all containers are stopped and cannot restart. As a consequence of this change, the Kubelet must explicitly transition a terminal pod to the terminating state in the pod worker which is handled by returning a new isTerminal boolean from syncPod. Finally, if a pod with init containers hasn't been initialized yet, don't default container statuses or not yet attempted init containers to the unknown failure state.	2022-03-16 13:15:00 -04:00
Clayton Coleman	3eadd1a9ea	Keep pod worker running until pod is truly complete A number of race conditions exist when pods are terminated early in their lifecycle because components in the kubelet need to know "no running containers" or "containers can't be started from now on" but were relying on outdated state. Only the pod worker knows whether containers are being started for a given pod, which is required to know when a pod is "terminated" (no running containers, none coming). Move that responsibility and podKiller function into the pod workers, and have everything that was killing the pod go into the UpdatePod loop. Split syncPod into three phases - setup, terminate containers, and cleanup pod - and have transitions between those methods be visible to other components. After this change, to kill a pod you tell the pod worker to UpdatePod({UpdateType: SyncPodKill, Pod: pod}). Several places in the kubelet were incorrect about whether they were handling terminating (should stop running, might have containers) or terminated (no running containers) pods. The pod worker exposes methods that allow other loops to know when to set up or tear down resources based on the state of the pod - these methods remove the possibility of race conditions by ensuring a single component is responsible for knowing each pod's allowed state and other components simply delegate to checking whether they are in the window by UID. Removing containers no longer blocks final pod deletion in the API server and is handled as background cleanup. Node shutdown no longer marks pods as failed as they can be restarted in the next step. See https://docs.google.com/document/d/1Pic5TPntdJnYfIpBeZndDelM-AbS4FN9H2GTLFhoJ04/edit# for details	2021-07-06 15:55:22 -04:00
JunYang	01a4e4face	Structured Logging migration: modify volume and container part logs of kubelet. Signed-off-by: JunYang <yang.jun22@zte.com.cn>	2021-03-17 08:59:03 +08:00
Davanum Srinivas	442a69c3bd	switch over k/k to use klog v2 Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2020-05-16 07:54:27 -04:00
Tim Allclair	8a495cb5e4	Clean up error messages (ST1005)	2019-08-21 10:40:21 -07:00
Davanum Srinivas	954996e231	Move from glog to klog - Move from the old github.com/golang/glog to k8s.io/klog - klog has explicit InitFlags() so we add them as necessary - we update the other repositories that we vendor that made a similar change from glog to klog * github.com/kubernetes/repo-infra * k8s.io/gengo/ * k8s.io/kube-openapi/ * github.com/google/cadvisor - Entirely remove all references to glog - Fix some tests by explicit InitFlags in their init() methods Change-Id: I92db545ff36fcec83afe98f550c9e630098b3135	2018-11-10 07:50:31 -05:00
hangaoshuai	1ddfe9c856	fix todo:add function getFailContainer to report which containers failed the pod	2018-03-15 09:38:02 +08:00
Klaus Ma	63b78a37e0	Added golint check for pkg/kubelet.	2017-07-19 11:33:06 +08:00
Chao Xu	f4989a45a5	run root-rewrite-v1-..., compile	2017-06-22 10:25:57 -07:00
Chao Xu	5e1adf91df	cmd/kubelet	2016-11-23 15:53:09 -08:00
Harry Zhang	64c8d3ad3d	Add e2e node test for log path Update to use pod to check log file	2016-11-08 13:01:25 -05:00
Ke Zhang	e48f995987	pods that cannot be admitted should return directly	2016-07-30 11:47:50 +08:00
Kevin Wang	09344c1ffc	Optimizing the processing flow of HandlePodAdditions and canAdmitPod methods. Signed-off-by: Kevin Wang <wang.kanghua@zte.com.cn> change the note for the canAdmitPod method. Signed-off-by: Kevin Wang <wang.kanghua@zte.com.cn> gofmt kubelet.go Signed-off-by: Kevin Wang <wang.kanghua@zte.com.cn>	2016-07-11 10:34:51 +08:00
David McMahon	ef0c9f0c5b	Remove "All rights reserved" from all the headers.	2016-06-29 17:47:36 -07:00
Lucas Käldström	2022c44409	kubelet: Use MkdirAll instead of Mkdir	2016-05-22 00:23:18 +03:00
derekwaynecarr	6fefb428c1	Add killPodNow to kubelet	2016-05-12 19:17:08 -04:00
Random-Liu	41b12a18d9	Remove GetAPIPodStatus usage	2016-02-02 23:41:55 +00:00
Yu-Ju Hong	ff04de4fc0	Remove RuntimeCache from sync path This change removes RuntimeCache in the pod workers and the syncPod() function. Note that it doesn't deprecate RuntimeCache completely as other components still rely on the cache.	2016-02-01 21:32:41 -08:00
Random-Liu	b2a72ca384	Change my TODO to be the same with others	2015-12-31 00:41:05 -08:00
Random-Liu	6e92ddf9e1	Cleanup pod print in dockertools/manager.go, prober/prober.go and runonce.go	2015-12-28 14:07:37 -08:00
k8s-merge-robot	cb42bd47bb	Merge pull request #18027 from roboll/runonce-datadirs Auto commit by PR queue bot	2015-12-19 19:14:29 -08:00
Yu-Ju Hong	c646255579	Replace podFullName with format.Pod() in logging messages	2015-12-07 13:41:52 -08:00
Random-Liu	3cbdf79f8c	Change original PodStatus to APIPodStatus, and start using kubelet internal PodStatus in dockertools	2015-12-04 17:37:39 -08:00
rob boll	81b9097a80	kubelet runonce: create data dirs	2015-12-01 14:03:02 -05:00
Abhi Shah	8c7c5ec117	Merge pull request #17307 from zhengguoyong/set_no_public_runonce Use small letter var definition	2015-11-18 14:50:04 -08:00
zhengguoyong	b18a9baacc	Use small letter var definition	2015-11-16 12:12:21 +08:00
Alexander Hersh	0584f9ba7a	Create mirrorPod in runOnce to update API before syncPod + Fix #14992 + "When deploying a pod using an on-disk kubelet manifest (a la /etc/kubernetes/manifests), it appears that the network plugin setUpPod is notified of the new pod before the apiserver."	2015-11-12 15:35:45 -08:00
Yu-Ju Hong	a3e60cc32e	Rename imported package local name kubeletTypes to kubetypes According to the naming guidelines, package name should not include mixedCaps.	2015-10-09 10:24:31 -07:00
Yu-Ju Hong	098ab05997	kubelet: move common types to kubelet/types This would facilitate tasks such as moving code in pkg/kubelet to sub packages.	2015-10-08 14:38:01 -07:00
Yu-Ju Hong	b906e34576	kubelet: trigger pod workers independently Currently, whenever there is any update, kubelet would force all pod workers to sync again, causing resource contention and hence performance degradation. This commit flips kubelet to use incremental updates (as opposed to snapshots). This allows us to know what pods have changed and send updates to those pod workers only. The `SyncPods` function has been replaced with individual handlers, each handling an operation (ADD, REMOVE, UPDATE). Pod workers are still triggered periodically, and kubelet performs periodic cleanup as well. This commit also spawns a new goroutine solely responsible for killing pods. This is necessary because pod killing could hold up the sync loop for an indefinitely long amount of time now that users can define the graceful termination period in the container spec.	2015-08-25 17:52:01 -07:00
Mike Danese	17defc7383	run gofmt on everything we touched	2015-08-05 17:52:56 -07:00
Mike Danese	8e33cbfa28	rewrite go imports	2015-08-05 17:30:03 -07:00
Prashanth Balasubramanian	b5ed0e9b13	Don't generatePodStatus twice for new pods	2015-06-11 17:18:16 -07:00
Yu-Ju Hong	1ad4dd7803	Kubelet: replace DockerManager with the Runtime interface This change instructs kubelet to switch to using the Runtime interface. In order to do it, the change moves the Prober instantiation to DockerManager. Note that most of the tests in kubelet_test.go need to be migrated to dockertools. For now, we use type assertion to convert the Runtime interface to DockerManager in most tests.	2015-05-04 10:19:46 -07:00
Eric Paris	6b3a6e6b98	Make copyright ownership statement generic Instead of saying "Google Inc." (which is not always correct) say "The Kubernetes Authors", which is generic.	2015-05-01 17:49:56 -04:00
Yifan Gu	c848fa447d	kubelet: Refactor isPodRunning() in runonce.go Replace InspectContainer() with generic GetPodStatus().	2015-04-28 17:44:13 -07:00
Kris Rousey	81497f3ed2	Changing the scheduler package to use *api.Pod instead of api.Pod to avoid unnecessary shallow copies. The change rippled through a lot of code.	2015-04-17 13:34:31 -07:00
Yifan Gu	dda600e45c	kubelet/dockertools: Add puller interfaces in the containerManager.	2015-04-13 15:34:22 -07:00
Yifan Gu	a3675e08f2	kubelet/dockertool: Move Getpods() to DockerManager.	2015-04-13 14:05:22 -07:00
Yu-Ju Hong	b4b0bc75c4	Kubelet: pass the actual pod for status update Pod status update should include the ObjectMeta of the pod. This change is required for #5738 to merge.	2015-03-25 09:58:46 -07:00
Yifan Gu	13250c904f	kubelet: Replace GetKubeletDockerContainers with GetPods in syncPod/SyncPods.	2015-03-24 16:01:38 -07:00
Jerzy Szczepkowski	34a8a3a844	Running node selector predicate on kubelet. Added checking on kubelet if scheduled pods have matching node selector. This is the last step to fix #5207.	2015-03-23 08:21:58 +01:00
Jerzy Szczepkowski	5845f6ad48	Running resource predicate on kubelet. Added checking on kubelet if scheduled pods do not exceed resources. Related to #5207.	2015-03-19 10:40:10 +01:00
Yu-Ju Hong	929fb63b33	Sync static pods from Kubelet to the API server Currently, API server is not aware of the static pods (manifests from sources other than the API server, e.g. file and http) at all. This is inconvenient since users cannot check the static pods through kubectl. It is also sub-optimal because scheduler is unaware of the resource consumption by these static pods on the node. This change syncs the information back to the API server by creating a mirror pod via API server for each static pod. - Kubelet creates containers for the static pod, as it would do normally. - If a mirror pod gets deleted, Kubelet will re-create one. The containers are sync'd to the static pods, so they will not be affected. - If a static pod gets removed from the source (e.g. manifest file removed from the directory), the orphaned mirror pod will be deleted. Note that because events are associated with UID, and the mirror pod has a different UID than the original static pod, the events will not be shown for the mirror pod when running `kubectl describe pod <mirror_pod>`.	2015-03-17 08:45:56 -07:00
Wojciech Tyczynski	5d95e9e671	Remove BoundPods from Kubelet	2015-03-16 19:17:21 +01:00
Victor Marmol	2939abb6cb	Merge pull request #5383 from wojtek-t/kubelet_test Speedup pkg/kubelet/runonce_test.go	2015-03-12 10:22:03 -07:00

64 Commits