kubernetes

Author	SHA1	Message	Date
Michal Wozniak	3d68f362c3	Give terminal phase correctly to all pods that will not be restarted	2023-03-16 21:25:29 +01:00
Clayton Coleman	6b9a381185	kubelet: Force deleted pods can fail to move out of terminating If a CRI error occurs during the terminating phase after a pod is force deleted (API or static) then the housekeeping loop will not deliver updates to the pod worker which prevents the pod's state machine from progressing. The pod will remain in the terminating phase but no further attempts to terminate or cleanup will occur until the kubelet is restarted. The pod worker now maintains a store of the pods state that it is attempting to reconcile and uses that to resync unknown pods when SyncKnownPods() is invoked, so that failures in sync methods for unknown pods no longer hang forever. The pod worker's store tracks desired updates and the last update applied on podSyncStatuses. Each goroutine now synchronizes to acquire the next work item, context, and whether the pod can start. This synchronization moves the pending update to the stored last update, which will ensure third parties accessing pod worker state don't see updates before the pod worker begins synchronizing them. As a consequence, the update channel becomes a simple notifier (struct{}) so that SyncKnownPods can coordinate with the pod worker to create a synthetic pending update for unknown pods (i.e. no one besides the pod worker has data about those pods). Otherwise the pending update info would be hidden inside the channel. In order to properly track pending updates, we have to be very careful not to mix RunningPods (which are calculated from the container runtime and are missing all spec info) and config- sourced pods. Update the pod worker to avoid using ToAPIPod() and instead require the pod worker to directly use update.Options.Pod or update.Options.RunningPod for the correct methods. Add a new SyncTerminatingRuntimePod to prevent accidental invocations of runtime only pod data. Finally, fix SyncKnownPods to replay the last valid update for undesired pods which drives the pod state machine towards termination, and alter HandlePodCleanups to: - terminate runtime pods that aren't known to the pod worker - launch admitted pods that aren't known to the pod worker Any started pods receive a replay until they reach the finished state, and then are removed from the pod worker. When a desired pod is detected as not being in the worker, the usual cause is that the pod was deleted and recreated with the same UID (almost always a static pod since API UID reuse is statistically unlikely). This simplifies the previous restartable pod support. We are careful to filter for active pods (those not already terminal or those which have been previously rejected by admission). We also force a refresh of the runtime cache to ensure we don't see an older version of the state. Future changes will allow other components that need to view the pod worker's actual state (not the desired state the podManager represents) to retrieve that info from the pod worker. Several bugs in pod lifecycle have been undetectable at runtime because the kubelet does not clearly describe the number of pods in use. To better report, add the following metrics: kubelet_desired_pods: Pods the pod manager sees kubelet_active_pods: "Admitted" pods that gate new pods kubelet_mirror_pods: Mirror pods the kubelet is tracking kubelet_working_pods: Breakdown of pods from the last sync in each phase, orphaned state, and static or not kubelet_restarted_pods_total: A counter for pods that saw a CREATE before the previous pod with the same UID was finished kubelet_orphaned_runtime_pods_total: A counter for pods detected at runtime that were not known to the kubelet. Will be populated at Kubelet startup and should never be incremented after. Add a metric check to our e2e tests that verifies the values are captured correctly during a serial test, and then verify them in detail in unit tests. Adds 23 series to the kubelet /metrics endpoint.	2023-03-08 22:03:51 -06:00
Oscar Utbult	e4f776f230	grammar: replace all occurrences of "the the" with "the"	2022-10-14 09:03:14 +02:00
Davanum Srinivas	a9593d634c	Generate and format files - Run hack/update-codegen.sh - Run hack/update-generated-device-plugin.sh - Run hack/update-generated-protobuf.sh - Run hack/update-generated-runtime.sh - Run hack/update-generated-swagger-docs.sh - Run hack/update-openapi-spec.sh - Run hack/update-gofmt.sh Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2022-07-26 13:14:05 -04:00
Clayton Coleman	69a3820214	kubelet: Delay writing a terminal phase until the pod is terminated Other components must know when the Kubelet has released critical resources for terminal pods. Do not set the phase in the apiserver to terminal until all containers are stopped and cannot restart. As a consequence of this change, the Kubelet must explicitly transition a terminal pod to the terminating state in the pod worker which is handled by returning a new isTerminal boolean from syncPod. Finally, if a pod with init containers hasn't been initialized yet, don't default container statuses or not yet attempted init containers to the unknown failure state.	2022-03-16 13:15:00 -04:00
Gunju Kim	3ce5c944a8	kubelet: Clean up a static pod that has been terminated before starting - Allow a podWorker to start if it is blocked by a pod that has been terminated before starting - When a pod can't start AND has already been terminated, exit cleanly - Add a unit test that exercises race conditions in pod workers	2022-02-02 16:05:32 -05:00
Ryan Phillips	25f95f2bde	kubelet: fix podstatus not containing pod full name	2022-01-25 13:21:04 -06:00
Sascha Grunert	de37b9d293	Make CRI `v1` the default and allow a fallback to `v1alpha2` This patch makes the CRI `v1` API the new project-wide default version. To allow backwards compatibility, a fallback to `v1alpha2` has been added as well. This fallback can either used by automatically determined by the kubelet. Signed-off-by: Sascha Grunert <sgrunert@redhat.com>	2021-11-17 11:05:05 -08:00
Gunju Kim	3bce245279	Ensure there is one running static pod with the same full name	2021-10-19 16:30:18 +09:00
Kubernetes Prow Robot	e595d79dfc	Merge pull request #104574 from 249043822/br-repeat-package fix duplicate package import in pod_worker	2021-10-18 15:49:46 -07:00
Ryan Phillips	0166d446b9	kubelet: set terminated podWorker status for terminated pods	2021-10-07 16:18:59 -05:00
Clayton Coleman	d5719800bf	kubelet: Handle UID reuse in pod worker If a pod is killed (no longer wanted) and then a subsequent create/ add/update event is seen in the pod worker, assume that a pod UID was reused (as it could be in static pods) and have the next SyncKnownPods after the pod terminates remove the worker history so that the config loop can restart the static pod, as well as return to the caller the fact that this termination was not final. The housekeeping loop then reconciles the desired state of the Kubelet (pods in pod manager that are not in a terminal state, i.e. admitted pods) with the pod worker by resubmitting those pods. This adds a small amount of latency (2s) when a pod UID is reused and the pod is terminated and restarted.	2021-09-15 14:02:00 -04:00
Clayton Coleman	a2ca66d280	kubelet: Admission must exclude completed pods and avoid races Fixes two issues with how the pod worker refactor calculated the pods that admission could see (GetActivePods() and filterOutTerminatedPods()) First, completed pods must be filtered from the "desired" state for admission, which arguably should be happening earlier in config. Exclude the two terminal pods states from GetActivePods() Second, the previous check introduced with the pod worker lifecycle ownership changes was subtly wrong for the admission use case. Admission has to include pods that haven't yet hit the pod worker, which CouldHaveRunningContainers was filtering out (because the pod worker hasn't seen them). Introduce a weaker check - IsPodKnownTerminated() - that returns true only if the pod is in a known terminated state (no running containers AND known to pod worker). This weaker check may only be called from components that need admitted pods, not other kubelet subsystems. This commit does not fix the long standing bug that force deleted pods are omitted from admission checks, which must be fixed by having GetActivePods() also include pods "still terminating".	2021-08-25 13:31:02 -04:00
KeZhang	dd4fd54427	fix duplicate package import in pod_worker	2021-08-25 21:16:38 +08:00
Clayton Coleman	de9cdab5ae	kubelet: Prevent runtime-only pods from going into terminated phase If a pod is already in terminated and the housekeeping loop sees an out of date cache entry for a running container, the pod worker should ignore that running pod termination request. Once the worker completes, a subsequent housekeeping invocation will then invoke terminating because the worker is no longer processing any pod with that UID. This does leave the possibility of syncTerminatedPod being blocked if a container in the pod is started after killPod successfully completes but before syncTerminatedPod can exit successfully, perhaps because the terminated flow (detach volumes) is blocked on that running container. A future change will address that issue.	2021-07-13 15:41:49 -04:00
Clayton Coleman	3eadd1a9ea	Keep pod worker running until pod is truly complete A number of race conditions exist when pods are terminated early in their lifecycle because components in the kubelet need to know "no running containers" or "containers can't be started from now on" but were relying on outdated state. Only the pod worker knows whether containers are being started for a given pod, which is required to know when a pod is "terminated" (no running containers, none coming). Move that responsibility and podKiller function into the pod workers, and have everything that was killing the pod go into the UpdatePod loop. Split syncPod into three phases - setup, terminate containers, and cleanup pod - and have transitions between those methods be visible to other components. After this change, to kill a pod you tell the pod worker to UpdatePod({UpdateType: SyncPodKill, Pod: pod}). Several places in the kubelet were incorrect about whether they were handling terminating (should stop running, might have containers) or terminated (no running containers) pods. The pod worker exposes methods that allow other loops to know when to set up or tear down resources based on the state of the pod - these methods remove the possibility of race conditions by ensuring a single component is responsible for knowing each pod's allowed state and other components simply delegate to checking whether they are in the window by UID. Removing containers now no longer blocks final pod deletion in the API server and are handled as background cleanup. Node shutdown no longer marks pods as failed as they can be restarted in the next step. See https://docs.google.com/document/d/1Pic5TPntdJnYfIpBeZndDelM-AbS4FN9H2GTLFhoJ04/edit# for details	2021-07-06 15:55:22 -04:00
KeZhang	83ee5da75e	Fix:slow memory leak may be in kubelet podworkers.isWorking	2021-06-15 15:26:30 +08:00
yuzhiquan	bebca30309	comment should have function name as prefix	2021-04-28 15:26:46 +08:00
JunYang	01a4e4face	Structured Logging migration: modify volume and container part logs of kubelet. Signed-off-by: JunYang <yang.jun22@zte.com.cn>	2021-03-17 08:59:03 +08:00
Davanum Srinivas	442a69c3bd	switch over k/k to use klog v2 Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2020-05-16 07:54:27 -04:00
Tim Allclair	a2c51674cf	Cleanup more static check issues (S1,ST)	2019-08-21 10:40:21 -07:00
Ted Yu	2242e396d4	Pass desiredPods to CleanupPods	2019-07-03 10:35:13 +08:00
Davanum Srinivas	954996e231	Move from glog to klog - Move from the old github.com/golang/glog to k8s.io/klog - klog as explicit InitFlags() so we add them as necessary - we update the other repositories that we vendor that made a similar change from glog to klog * github.com/kubernetes/repo-infra * k8s.io/gengo/ * k8s.io/kube-openapi/ * github.com/google/cadvisor - Entirely remove all references to glog - Fix some tests by explicit InitFlags in their init() methods Change-Id: I92db545ff36fcec83afe98f550c9e630098b3135	2018-11-10 07:50:31 -05:00
Krzysztof Jastrzebski	ad330f7dbe	Start synchronizing pods after network is ready.	2018-09-21 10:12:49 +02:00
Lantao Liu	0f4c739b2c	Fix pod worker deadlock. Signed-off-by: Lantao Liu <lantaol@google.com>	2018-07-09 11:45:26 -07:00
yue9944882	9ecc0b2bd2	fixes document grammar	2018-02-20 10:38:41 -05:00
Derek Carr	54224600ec	kubelet syncPod throws specific events	2017-10-13 10:24:09 -04:00
Chao Xu	60604f8818	run hack/update-all	2017-06-22 11:31:03 -07:00
Chao Xu	f4989a45a5	run root-rewrite-v1-..., compile	2017-06-22 10:25:57 -07:00
Derek Carr	a02f10fa3a	Strip containerID from events to reduce spam	2017-06-13 23:31:56 -04:00
David Ashpole	6daa2f2ef3	increase timeout	2017-02-27 10:59:24 -08:00
deads2k	8a12000402	move client/record	2017-01-31 19:14:13 -05:00
Michal Fojtik	34db84efe2	Report the Pod name and namespace when kubelet fails to sync the container	2017-01-16 16:44:56 +01:00
deads2k	6a4d5cd7cc	start the apimachinery repo	2017-01-11 09:09:48 -05:00
Kubernetes Submit Queue	fe391d7bb0	Merge pull request #37350 from chentao1596/delete-meaningless-judgments Automatic merge from submit-queue (batch tested with PRs 39280, 37350, 39389, 39390, 39313) delete meaningless judgments What this PR does / why we need it： Whether "err" is nil or not, "err" can be return, so the judgment "err !=nil " is unnecessary	2017-01-03 18:25:10 -08:00
Chao Xu	5e1adf91df	cmd/kubelet	2016-11-23 15:53:09 -08:00
chentao1596	6d4341b3ff	delete meaningless judgments	2016-11-23 15:20:11 +08:00
derekwaynecarr	ff017839c7	Log an event when container runtime exceeds grace-period during eviction	2016-09-07 13:28:08 -04:00
Andrey Kurilin	9f1c3a4c56	Fix various typos in kubelet	2016-08-03 01:14:44 +03:00
Ron Lai	e6b745122d	Moving event.go from kubelet/container to kubelet/events	2016-07-12 17:42:29 -07:00
David McMahon	ef0c9f0c5b	Remove "All rights reserved" from all the headers.	2016-06-29 17:47:36 -07:00
derekwaynecarr	6fefb428c1	Add killPodNow to kubelet	2016-05-12 19:17:08 -04:00
Jan Chaloupka	392fc6668f	Jittering periods of some kubelet's sync loops: - pod_workers: pod syncing - prober workers: container syncing In order to synchronize the current state of Kubernetes's objects (e.g. pods, containers, etc.), periodic synch loops are run. When there is a lot of objects to synchronize with, loops increase communication traffic. At some point when all the traffic interfere cpu usage curve hits the roof causing 100% cpu utilization. To distribute the traffic in time, some sync loops can jitter their period in each loop and help to flatten the curve.	2016-02-09 15:22:22 +01:00
Yu-Ju Hong	ff04de4fc0	Remove RuntimeCache from sync path This change removes RuntimeCache in the pod workers and the syncPod() function. Note that it doesn't deprecate RuntimeCache completely as other components still rely on the cache.	2016-02-01 21:32:41 -08:00
harry	1032067ff9	Replace runtime reference by pkg	2016-02-01 21:06:44 +08:00
Yu-Ju Hong	cfb5442b2d	Turn on kubecontainer.Cache in kubelet	2016-01-19 18:15:10 -08:00
Piotr Szczesniak	9659057986	Revert "Enable kubecontainer.Cache in kubelet"	2016-01-18 13:35:41 +01:00
Yu-Ju Hong	07cf5cff48	Enable kubecontainer.Cache in kubelet	2016-01-14 09:31:24 -08:00
Avesh Agarwal	3d5207fd73	This commit adds type information to events. This addresses issue #15624.	2015-11-24 09:59:54 -05:00
jiangyaoguo	b0f0c294d9	make reasons constants with clarity	2015-11-09 23:42:14 +08:00

1 2

83 Commits