kubernetes

Author	SHA1	Message	Date
Michal Wozniak	17013d3960	Review remarks to improve HandlePodCleanups in kubelet	2023-06-22 10:55:39 +02:00
Michal Wozniak	e3ee9b9adc	Fix the deletion of rejected pods	2023-06-22 09:18:34 +02:00
Kubernetes Prow Robot	18d05b646d	Merge pull request #117702 from kannon92/pod-ready-to-start-rename feat: rename PodHasNetwork to PodReadyToStartContainers	2023-06-11 18:59:48 -07:00
Sascha Grunert	db9fcfeed2	Move cri/streaming to k8s.io/kubelet staging repository Container runtimes like CRI-O and containerd reuse the code by copying it from Kubernetes. To have a single source of truth for the streaming server we now move the already isolated implementation to the k8s.io/kubelet staging repository. This way runtimes can re-use the code without copying the parts. Signed-off-by: Sascha Grunert <sgrunert@redhat.com>	2023-06-05 08:08:18 +02:00
Kubernetes Prow Robot	bdbfbffef3	Merge pull request #117371 from smarterclayton/minimal_podmanager kubelet: Don't reference the pod manager interface directly from components	2023-05-16 14:34:33 -07:00
Kubernetes Prow Robot	03b2e84183	Merge pull request #113209 from luozhiwenn/personal/etc-host ensure etc-host file permission is 644 whatever umask is	2023-05-16 01:41:35 -07:00
Clayton Coleman	bb568844b6	kubelet: Separate the MirrorClient from the PodManager The two are not coupled except accidentally. Separate them and update callsites. This will reduce the scope of PodManager interface to make exposing the pod worker cleaner.	2023-05-12 12:57:26 -04:00
Clayton Coleman	80b1aca580	kubelet: Remove dispatchWork and inline calls to UpdatePod The HandlePod* methods are all structurally similar, but accrued subtle differences. In general the only point for Handle is to process admission and to update the pod worker with the desired state of the kubelet's config (so that pod worker can make it the actual state). Add a new GetPodAndMirrorPod() method that handles when the config pod is ambiguous (pod or mirror pod) and inline the structure. Add comments on questionable additions in the config methods for future improvement. Move the metric observation of container count closer to where pods are actually started (in the pod worker). A future change can likely move it to syncPod.	2023-05-12 12:57:26 -04:00
Clayton Coleman	e7207c8546	kubelet: Merge orphaned mirror pod names into GetPodsAndMirrorPods There is only one caller and both sets of data are part of the resync operation between kubelet's desired state and the actual state of the pod workers. Reduces the size of the interface so that it is easier to create another pod manager.	2023-05-12 12:57:26 -04:00
kannon92	5f489a3327	feat: rename PodHasNetwork to PodReadyToStartContainers	2023-05-02 19:52:23 +00:00
Tim Hockin	dd7af241c1	Replace diff.ObjectDiff with cmp.Equal More obvious and cheaper, and ObjectDiff is already written in terms of cmp.	2023-04-12 08:45:32 -07:00
Kubernetes Prow Robot	74ad7c397d	Merge pull request #116723 from SergeyKanzhelev/ExperimentalHostUserNamespaceDefaulting deprecate ExperimentalHostUserNamespaceDefaulting	2023-04-11 21:16:57 -07:00
Kubernetes Prow Robot	d48c883372	Merge pull request #116690 from smarterclayton/handle_twice kubelet: HandlePodCleanups takes an extra sync to restart pods	2023-04-11 18:19:23 -07:00
Sergey Kanzhelev	eb60dce33b	deprecate ExperimentalHostUserNamespaceDefaulting	2023-03-17 22:07:25 +00:00
Clayton Coleman	d25572c389	kubelet: HandlePodCleanups takes an extra sync to restart pods HandlePodCleanups is responsible for restarting pods that are no longer running (usually due to delete and recreation with the same UID in quick succession). We have to filter the list of pods to restart from podManager to get the list of admitted pods, which uses filterOutInactivePods on the kubelet. That method excludes pods the pod worker has already terminated. Since a restarted pod will be in the terminated state before HandlePodCleanups calls SyncKnownPods, we have to call filterOutInactivePods after SyncKnownPods, otherwise the to-be-restarted pod is ignored and we have to wait for the next houskeeping cycle to restart it. Since static pods are often critical system components, this extra 2s wait is undesirable and we should restart as soon as we can. Add a failing test that passes after we move the filter call after SyncKnownPods.	2023-03-16 15:18:44 -06:00
Michal Wozniak	3d68f362c3	Give terminal phase correctly to all pods that will not be restarted	2023-03-16 21:25:29 +01:00
Rodrigo Campos	8af3cce7fe	kubelet: remove GetHostIDsForPod() Now KEP-127 relies on idmap mounts to do the ID translation and we won't do any chowns in the kubelet. This patch just removes the usage of GetHostIDsForPod() in operationexecutor to do the chown, and also removes the GetHostIDsForPod() method from the kubelet volume interface. Signed-off-by: Rodrigo Campos <rodrigoca@microsoft.com>	2023-03-13 22:28:03 +01:00
vinay kulkarni	01b96e7704	Rename ContainerStatus.ResourcesAllocated to ContainerStatus.AllocatedResources	2023-03-10 14:49:26 +00:00
Kubernetes Prow Robot	e57d968323	Merge pull request #116015 from SataQiu/clean-kubelet-20230223 kubelet: remove the deprecated --master-service-namespace flag	2023-03-09 22:43:34 -08:00
Kubernetes Prow Robot	45b96eae98	Merge pull request #113145 from smarterclayton/zombie_terminating_pods kubelet: Force deleted pods can fail to move out of terminating	2023-03-09 15:32:30 -08:00
Clayton Coleman	6b9a381185	kubelet: Force deleted pods can fail to move out of terminating If a CRI error occurs during the terminating phase after a pod is force deleted (API or static) then the housekeeping loop will not deliver updates to the pod worker which prevents the pod's state machine from progressing. The pod will remain in the terminating phase but no further attempts to terminate or cleanup will occur until the kubelet is restarted. The pod worker now maintains a store of the pods state that it is attempting to reconcile and uses that to resync unknown pods when SyncKnownPods() is invoked, so that failures in sync methods for unknown pods no longer hang forever. The pod worker's store tracks desired updates and the last update applied on podSyncStatuses. Each goroutine now synchronizes to acquire the next work item, context, and whether the pod can start. This synchronization moves the pending update to the stored last update, which will ensure third parties accessing pod worker state don't see updates before the pod worker begins synchronizing them. As a consequence, the update channel becomes a simple notifier (struct{}) so that SyncKnownPods can coordinate with the pod worker to create a synthetic pending update for unknown pods (i.e. no one besides the pod worker has data about those pods). Otherwise the pending update info would be hidden inside the channel. In order to properly track pending updates, we have to be very careful not to mix RunningPods (which are calculated from the container runtime and are missing all spec info) and config- sourced pods. Update the pod worker to avoid using ToAPIPod() and instead require the pod worker to directly use update.Options.Pod or update.Options.RunningPod for the correct methods. Add a new SyncTerminatingRuntimePod to prevent accidental invocations of runtime only pod data. Finally, fix SyncKnownPods to replay the last valid update for undesired pods which drives the pod state machine towards termination, and alter HandlePodCleanups to: - terminate runtime pods that aren't known to the pod worker - launch admitted pods that aren't known to the pod worker Any started pods receive a replay until they reach the finished state, and then are removed from the pod worker. When a desired pod is detected as not being in the worker, the usual cause is that the pod was deleted and recreated with the same UID (almost always a static pod since API UID reuse is statistically unlikely). This simplifies the previous restartable pod support. We are careful to filter for active pods (those not already terminal or those which have been previously rejected by admission). We also force a refresh of the runtime cache to ensure we don't see an older version of the state. Future changes will allow other components that need to view the pod worker's actual state (not the desired state the podManager represents) to retrieve that info from the pod worker. Several bugs in pod lifecycle have been undetectable at runtime because the kubelet does not clearly describe the number of pods in use. To better report, add the following metrics: kubelet_desired_pods: Pods the pod manager sees kubelet_active_pods: "Admitted" pods that gate new pods kubelet_mirror_pods: Mirror pods the kubelet is tracking kubelet_working_pods: Breakdown of pods from the last sync in each phase, orphaned state, and static or not kubelet_restarted_pods_total: A counter for pods that saw a CREATE before the previous pod with the same UID was finished kubelet_orphaned_runtime_pods_total: A counter for pods detected at runtime that were not known to the kubelet. Will be populated at Kubelet startup and should never be incremented after. Add a metric check to our e2e tests that verifies the values are captured correctly during a serial test, and then verify them in detail in unit tests. Adds 23 series to the kubelet /metrics endpoint.	2023-03-08 22:03:51 -06:00
vinay kulkarni	b0dce923f1	Add Get interfaces for container's checkpointed ResourcesAllocated and Resize values, remove error logging for valid standalone kubelet scenario	2023-03-06 09:50:12 +00:00
vinay kulkarni	12435b26fc	Fix nil pointer access panic in kubelet from uninitialized pod allocation checkpoint manager in standalone kubelet scenario	2023-03-04 08:07:40 +00:00
SataQiu	91089ce65b	kubelet: remove the deprecated --master-service-namespace flag	2023-03-01 18:44:59 +08:00
Vinay Kulkarni	f2bd94a0de	In-place Pod Vertical Scaling - core implementation 1. Core Kubelet changes to implement In-place Pod Vertical Scaling. 2. E2E tests for In-place Pod Vertical Scaling. 3. Refactor kubelet code and add missing tests (Derek's kubelet review) 4. Add a new hash over container fields without Resources field to allow feature gate toggling without restarting containers not using the feature. 5. Fix corner-case where resize A->B->A gets ignored 6. Add cgroup v2 support to pod resize E2E test. KEP: /enhancements/keps/sig-node/1287-in-place-update-pod-resources Co-authored-by: Chen Wang <Chen.Wang1@ibm.com>	2023-02-24 18:21:21 +00:00
Kubernetes Prow Robot	a668924cb6	Merge pull request #113255 from claudiubelu/path-filepath-update-kubelet Replaces path.Operation with filepath.Operation (kubelet)	2022-12-09 22:27:41 -08:00
Ed Bartosh	abcb56defb	kubelet: do not enter termination status if pod might need to unprepare resources	2022-11-11 21:58:03 +01:00
Michal Wozniak	c803892bd8	Enable the feature into beta	2022-11-09 09:02:40 +01:00
Claudiu Belu	b9bf3e5c49	Replaces path.Operation with filepath.Operation (kubelet) The path module has a few different functions: Clean, Split, Join, Ext, Dir, Base, IsAbs. These functions do not take into account the OS-specific path separator, meaning that they won't behave as intended on Windows. For example, Dir is supposed to return all but the last element of the path. For the path "C:\some\dir\somewhere", it is supposed to return "C:\some\dir\", however, it returns ".". Instead of these functions, the ones in filepath should be used instead.	2022-11-08 16:05:48 +00:00
Michal Wozniak	4e732e20d0	Do not revert the pod condition if there might be running containers, skip condition update instead.	2022-11-07 16:22:29 +01:00
Michal Wozniak	52cd6755eb	Add pod disruption conditions for kubelet initiated failures	2022-11-07 11:23:22 +01:00
David Ashpole	64af1adace	Second attempt: Plumb context to Kubelet CRI calls (#113591 ) * plumb context from CRI calls through kubelet * clean up extra timeouts * try fixing incorrectly cancelled context	2022-11-05 06:02:13 -07:00
Antonio Ojea	9c2b333925	Revert "plumb context from CRI calls through kubelet" This reverts commit `f43b4f1b95`.	2022-11-02 13:37:23 +00:00
Kubernetes Prow Robot	9bbd0fbdb2	Merge pull request #113476 from marosset/hpc-to-stable Promoting WindowsHostProcessContainers to stable	2022-11-01 19:59:43 -07:00
Mark Rossetti	498d065cc5	Promoting WindowsHostProcessContainers to stable Signed-off-by: Mark Rossetti <marosset@microsoft.com>	2022-11-01 14:06:25 -07:00
David Ashpole	f43b4f1b95	plumb context from CRI calls through kubelet	2022-10-28 02:55:28 +00:00
luozhiwenn	76c8765bda	ensure etc-host file permission is 644 whatever umask is	2022-10-20 20:57:39 +08:00
Kubernetes Prow Robot	be22f605cf	Merge pull request #112097 from wongearl/cleanup_loop use copy() instead of a loop	2022-09-30 18:04:12 -07:00
Monis Khan	b738be9b46	Use https links for k8s KEPs, issues, PRs, etc Signed-off-by: Monis Khan <mok@microsoft.com>	2022-09-23 23:36:24 +00:00
Kubernetes Prow Robot	127f33f63d	Merge pull request #111221 from inosato/remove-ioutil-from-kubelet Remove ioutil in kubelet/kubeadm and its tests	2022-09-17 21:56:28 -07:00
wongearl	47bd712b81	use copy() instead of a loop	2022-08-29 17:55:16 +08:00
Rodrigo Campos	d07c2688fe	kubelet: add GetHostIDsForPod() In future commits we will need this to set the user/group of supported volumes of KEP 127 - Phase 1. Signed-off-by: Rodrigo Campos <rodrigoca@microsoft.com>	2022-08-03 19:53:22 +02:00
Giuseppe Scrivano	9b2fc639a0	kubelet: add GetUserNamespaceMappings to RuntimeHelper Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2022-08-03 19:53:22 +02:00
Giuseppe Scrivano	63462285d5	kubelet: add userns manager it is used to allocate and keep track of the unique users ranges assigned to each pod that runs in a user namespace. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> Signed-off-by: Rodrigo Campos <rodrigoca@microsoft.com> Co-authored-by: Rodrigo Campos <rodrigoca@microsoft.com>	2022-08-03 19:53:22 +02:00
Kubernetes Prow Robot	2e1a4da8df	Merge pull request #111358 from ddebroy/hasnet1 Introduce PodHasNetwork condition for pods	2022-08-01 15:04:52 -07:00
Deep Debroy	dfdf8245bb	Introduce PodHasNetwork condition for pods Signed-off-by: Deep Debroy <ddebroy@gmail.com>	2022-08-01 09:51:43 -07:00
inosato	3b95d3b076	Remove ioutil in kubelet and its tests Signed-off-by: inosato <si17_21@yahoo.co.jp>	2022-07-30 12:35:26 +09:00
Lee Verberne	d238e67ba6	Remove EphemeralContainers feature-gate checks	2022-07-26 02:55:30 +02:00
Ryan Phillips	f25ca15e1c	kubelet: only shutdown probes for pods that are terminated This fixes a bug where terminating pods would not run their readiness probes. Terminating pods are found within the possiblyRunningPods map.	2022-06-06 17:00:54 -05:00
Clayton Coleman	1d518adb76	kubelet: Pod probes should be handled by pod worker The pod worker is the owner of when a container is running or not, and the start and stop of the probes for a given pod should be handled during the pod sync loop. This ensures that probes do not continue running even after eviction. Because the pod semantics allow lifecycle probes to shorten grace period, the probe is removed after the containers in a pod are terminated successfully. As an optimization, if the pod will have a very short grace period (0 or 1 seconds) we stop the probes immediately to reduce resource usage during eviction slightly. After this change, the probe manager is only called by the pod worker or by the reconcile loop.	2022-06-06 17:00:54 -05:00

1 2 3 4 5 ...

367 Commits