kubernetes

Author	SHA1	Message	Date
Kubernetes Prow Robot	b4d793c450	Merge pull request #118865 from iholder101/kubelet/add-swap-to-summary-stats Add swap to stats to Summary API and Prometheus endpoints (`/stats/summary` and `/metrics/resource`)	2023-07-17 19:49:18 -07:00
Kubernetes Prow Robot	da2fdf8cc3	Merge pull request #118764 from iholder101/Swap/burstableQoS-impl Add full cgroup v2 swap support with automatically calculated swap limit for LimitedSwap and Burstable QoS Pods	2023-07-17 19:49:07 -07:00
Kubernetes Prow Robot	d17f3ba2cf	Merge pull request #119168 from gjkim42/sidecar-allow-probes-and-lifecycle-hooks Allow all probes and lifecycle for restartable init containers	2023-07-17 18:11:07 -07:00
Itamar Holder	e429793db1	Unit tests: node swap usage resource metric Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-18 02:55:56 +03:00
Itamar Holder	1d368420b2	Add a node swap usage resource metric (/metrics/resource) Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-18 02:55:56 +03:00
Itamar Holder	7d187f967b	Unit tests: CRI swap stats Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-18 02:55:56 +03:00
Itamar Holder	59e3e3897e	Add SwapStats to summary API through CRI Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-18 02:55:56 +03:00
Itamar Holder	87ff9c4525	Add swap statistics to CRI-API Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-18 02:55:53 +03:00
Itamar Holder	053d7ac61f	Unit tests: cadvisor swap stats Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-18 02:40:02 +03:00
Itamar Holder	c74ee8045d	Add SwapStats to summary API through cadvisor Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-18 02:40:02 +03:00
Gunju Kim	9d6c1030db	Generate containers ready condition including restartable init containers	2023-07-18 08:12:24 +09:00
Gunju Kim	3bf282652f	Allow restartable init containers to have lifecycle	2023-07-18 08:12:24 +09:00
Gunju Kim	7ef2d674e2	Allow restartable init containers to have livenessProbe	2023-07-18 07:54:33 +09:00
Gunju Kim	2c8b37498e	Allow restartable init containers to have readinessProbe	2023-07-18 07:54:33 +09:00
Kubernetes Prow Robot	ff90c1cc73	Merge pull request #119374 from danwinship/kep-3178-ga move KEP-3178 IPTablesOwnershipCleanup to GA	2023-07-17 15:53:47 -07:00
Kubernetes Prow Robot	1fef8fd51d	Merge pull request #118770 from marquiz/devel/cgroup-driver-autoconfig kubelet: get cgroup driver config from CRI	2023-07-17 15:53:18 -07:00
Kubernetes Prow Robot	a776bf0462	Merge pull request #116335 from gnufied/update-api-recovery-apis Update api recovery apis	2023-07-17 14:52:35 -07:00
Dan Winship	f1e7386fbc	Deprecate now-unused kubelet iptables flags	2023-07-17 16:51:47 -04:00
Dan Winship	d486736dd3	Remove IPTablesOwnershipCleanup checks and dead code	2023-07-17 16:51:47 -04:00
Kubernetes Prow Robot	92856db662	Merge pull request #118973 from ffromani/kubelet-podresources-getallocatable-ga node: podresources: getallocatable: move to GA	2023-07-17 13:47:33 -07:00
Kubernetes Prow Robot	42141aaf93	Merge pull request #118578 from bart0sh/PR117-DRA-log-manager-errors DRA: report NodePrepareResource errors	2023-07-17 13:47:13 -07:00
Hemant Kumar	e011187114	Update code to use new generic allocatedResourceStatus field	2023-07-17 15:30:35 -04:00
Markus Lehtonen	d2d5e2e27d	Add CRI fake runtimes for RuntimeConfig rpc Also update the CRI RuntimeService inteface to include the new RuntimeConfig rpc.	2023-07-17 12:27:04 -04:00
Kubernetes Prow Robot	bdcf812c95	Merge pull request #118254 from elezar/4009/add-cdi-devices-to-device-plugin Add CDI devices to device plugin API	2023-07-17 05:21:08 -07:00
Ed Bartosh	229eb93a83	DRA: report NodePrepareResource errors Log an error and submit an event when NodePrepareResource fails.	2023-07-17 12:56:28 +03:00
Evan Lezar	b57c7e2fe4	Add CDI devices to device plugin API This change adds CDI device IDs to the ContainerAllocateResponse in the device plugin API. This allows a device plugin to specify CDI devices by their unique fully-qualified CDI device names using the related field in the CRI specification. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-07-17 11:53:09 +02:00
Kubernetes Prow Robot	71f8a2405d	Merge pull request #119333 from liggitt/flushfrequencystring Conditionally serialize flushFrequency as int	2023-07-16 07:09:06 -07:00
Jordan Liggitt	6c0ea702d4	Conditionally serialize flushFrequency as int	2023-07-16 08:37:37 -04:00
Kubernetes Prow Robot	900237fada	Merge pull request #118635 from ffromani/devmgr-check-pod-running kubelet: devices: skip allocation for running pods	2023-07-15 05:43:16 -07:00
Kubernetes Prow Robot	8a0ea1bd58	Merge pull request #109616 from wzshiming/feat/pod-host-ips Field `status.hostIPs` added for Pod	2023-07-15 00:31:04 -07:00
Kubernetes Prow Robot	cab65e2008	Merge pull request #118816 from PiotrProkop/topo-opts-to-beta topologymanager: Promote support for improved multi-numa alignment in Topology Manager to beta	2023-07-14 16:55:08 -07:00
Itamar Holder	4b6314f815	Unit test: Swap - Limited/Unlimited Swap, cgroups v1/v2, etc Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-14 14:52:28 +03:00
Itamar Holder	a30410d9ce	LimitedSwap: Automatically configure swap limit for Burstable QoS Pods After this commit, when LimitedSwap is enabled, containers would get swap acess limited with respect the container memory request, total physical memory on the node, and the swap size on the node. Pods of Best-Effort / Guaranteed QoS classes don't get to swap. In addition, container with memory requests that are equal to their memory limits also don't get to swap. The swap limitation is calculated in the following way: 1. Calculate the container's memory proportionate to the node's memory: - Divide the container's memory request by the total node's physical memory. Let's call this value ContainerMemoryProportion. 2. Multiply the container memory proportion by the available swap memory for Pods: Meaning: ContainerMemoryProportion * TotalPodsSwapAvailable. Fore more information: https://github.com/kubernetes/enhancements/blob/master/keps/sig-node/2400-node-swap/README.md Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-14 14:52:28 +03:00
Itamar Holder	e4da568f33	Make kuberuntime unit tests environment independent + support cgroup v2 Before this commit, to find out the current node's cgroup version, a libcontainers function was used directly. This way, cgroup version is hard to mock and is dependant on the environment in which the unit tests are being run. After this commit, libcontainer's function is wrapped within a variable function that can be re-assigned by the tests. This way the test can easily mock the cgroup version and become environment independant. After this commit both cgroup versions v1 and v2 are being tested, no matter in which environment it runs. Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-14 14:52:27 +03:00
Shiming Zhang	335d905ce9	Downward API support for status.hostIPs	2023-07-14 09:35:30 +08:00
Shiming Zhang	e6bdd224c1	Add HostIPs for kubelet	2023-07-14 09:35:30 +08:00
Kubernetes Prow Robot	d37c62dcbf	Merge pull request #117800 from cyclinder/loggin_format Add '--logging-format' flag to kube-proxy	2023-07-13 08:40:37 -07:00
cyclinder	c550c17f7f	accept int or string flush frequency	2023-07-13 14:33:33 +08:00
Kubernetes Prow Robot	70370d0210	Merge pull request #117731 from jongwooo/refactor/use-early-return-pattern Use early return pattern to avoid nested conditions	2023-07-12 17:59:41 -07:00
Kubernetes Prow Robot	0086712926	Merge pull request #116922 from sourcelliu/checkpoint Improve the performance of map usage	2023-07-12 17:59:30 -07:00
Kubernetes Prow Robot	047d040ce7	Merge pull request #119012 from pohly/dra-batch-node-prepare kubelet: support batched prepare/unprepare in v1alpha3 DRA plugin API	2023-07-12 10:57:37 -07:00
Kubernetes Prow Robot	ac07b4612e	Merge pull request #117804 from jsafrane/fix-csi-attachable-reconstruction Fix reconstruction of CSI volumes	2023-07-12 10:57:15 -07:00
Kubernetes Prow Robot	be222f38f0	Merge pull request #119058 from TommyStarK/dra-state-checkpoint-unit-test dynamic resource allocation: Improve code coverage of state checkpoint	2023-07-12 07:49:14 -07:00
Patrick Ohly	d743c50bb9	kubelet: support batched prepare/unprepare in v1alpha3 DRA plugin API Combining all prepare/unprepare operations for a pod enables plugins to optimize the execution. Plugins can continue to use the v1beta2 API for now, but should switch. The new API is designed so that plugins which want to work on each claim one-by-one can do so and then report errors for each claim separately, i.e. partial success is supported.	2023-07-12 14:50:30 +02:00
Francesco Romani	01c3a51a78	node: podresources: getallocatable: move to GA lock the feature gate to GA, and remove the now-redundant code. Signed-off-by: Francesco Romani <fromani@redhat.com>	2023-07-12 14:11:22 +02:00
TommyStarK	f924bf95df	dynamic resource allocation: Improve code coverage of state checkpoint Signed-off-by: TommyStarK <thomasmilox@gmail.com>	2023-07-12 13:27:18 +02:00
Francesco Romani	c635a7e7d8	node: devicemgr: topomgr: add logs One of the contributing factors of issues #118559 and #109595 hard to debug and fix is that the devicemanager has very few logs in important flow, so it's unnecessarily hard to reconstruct the state from logs. We add minimal logs to be able to improve troubleshooting. We add minimal logs to be backport-friendly, deferring a more comprehensive review of logging to later PRs. Signed-off-by: Francesco Romani <fromani@redhat.com>	2023-07-12 13:25:36 +02:00
Francesco Romani	3bcf4220ec	kubelet: devices: skip allocation for running pods When kubelet initializes, runs admission for pods and possibly allocated requested resources. We need to distinguish between node reboot (no containers running) versus kubelet restart (containers potentially running). Running pods should always survive kubelet restart. This means that device allocation on admission should not be attempted, because if a container requires devices and is still running when kubelet is restarting, that container already has devices allocated and working. Thus, we need to properly detect this scenario in the allocation step and handle it explicitely. We need to inform the devicemanager about which pods are already running. Note that if container runtime is down when kubelet restarts, the approach implemented here won't work. In this scenario, so on kubelet restart containers will again fail admission, hitting https://github.com/kubernetes/kubernetes/issues/118559 again. This scenario should however be pretty rare. Signed-off-by: Francesco Romani <fromani@redhat.com>	2023-07-12 13:25:36 +02:00
Kubernetes Prow Robot	e0dafe57a3	Merge pull request #117351 from pohly/dra-generated-resource-claim-names DRA: generated resource claim names	2023-07-11 10:33:11 -07:00
PiotrProkop	f855a23b45	topologymanager: promote TopologyManagerPolicyOptions feature to beta * Promote TopologyManagerPolicyOptions feature to beta * Promote PreferClosestNUMANodes TopologyManagerPolicyOption to beta Signed-off-by: PiotrProkop <pprokop@nvidia.com>	2023-07-11 15:06:57 +02:00

1 2 3 4 5 ...

10851 Commits