kubernetes

Author	SHA1	Message	Date
Kubernetes Prow Robot	f6564d33ba	Merge pull request #114357 from dengyufeng2206/1208pull Log spelling formatting	2023-03-09 21:33:22 -08:00
Swati Sehgal	937d330393	node: topologymgr: Remove ResourceAllocator as TM is always enabled With Topology Manager enabled by default, we no longer need `resourceAllocator` as Topology Manager serves as the main PodAdmitHandler completely responsible for admission check based on hints received from the hintProviders and the subsequent allocation of the corresponding resources to a pod as can be seen here: https://github.com/kubernetes/kubernetes/blob/v1.26.0/pkg/kubelet/cm/topologymanager/scope.go#L150 With regard to DRA, the passing of `cm.draManager` into resourceAllocator seems redundant as no admission checks (and allocation of resources handled by DRA) is taking place in `Admit` method of resourceAllocator. DRA has a completely different model to the rest of the resource managers where pod is only scheduled on a node once resources are reserved for it. Because of this, admission checks or waiting for resources to be provisioned after the pod has been scheduled on the node is not required. Before making the above change, it was verified that DRA Manager is instantiated in `NewContainerManager`: https://github.com/kubernetes/kubernetes/blob/v1.26.0/pkg/kubelet/cm/container_manager_linux.go#L318 Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2023-03-06 12:51:11 +00:00
Swati Sehgal	6a62f0236a	node: topologymgr: trivial internal variable renaming Since Topology manager is graduating to GA, we remove internal configuration variable names with `Experimental` prefix. There is no expected change in behavior, only trival variable renaming. Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2023-03-06 12:51:11 +00:00
Swati Sehgal	d536a342b4	node: topologymgr: GA graduation implies Feature Gate is ON by default Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2023-03-06 12:51:05 +00:00
Sergey Kanzhelev	04189b1fc4	rename ExperimentalPodPidsLimit to PodPidsLimit	2023-03-04 01:48:16 +00:00
Ed Bartosh	5a86895070	DRA: pass CDI devices through CRI CDIDevice field	2023-02-28 19:21:20 +02:00
Ed Bartosh	4f88332ab4	kubelet: prepare DRA resources before CNI setup	2023-02-06 20:40:11 +02:00
Ian K. Coolidge	e5143d16c2	cpuset: Make 'ToSlice*' methods look like 'set' methods In 'set', conversions to slice are done also, but with different names: ToSliceNoSort() -> UnsortedList() ToSlice() -> List() Reimplement List() in terms of UnsortedList to save some duplication.	2023-01-06 23:32:51 +00:00
Ian K. Coolidge	a0c989b99a	cpuset: Remove *Int64 methods These are rarely used and can be accommodated with a trivial helper.	2023-01-06 23:32:51 +00:00
dengyufeng2206	8525cfab02	Log spelling formatting	2022-12-08 15:02:19 +08:00
Ed Bartosh	abcb56defb	kubelet: do not enter termination status if pod might need to unprepare resources	2022-11-11 21:58:03 +01:00
Ed Bartosh	ae0f38437c	kubelet: add support for dynamic resource allocation Dependencies need to be updated to use github.com/container-orchestrated-devices/container-device-interface. It's not decided yet whether we will implement Topology support for DRA or not. Not having any toppology-related code will help to avoid wrong impression that DRA is used as a hint provider for the Topology Manager.	2022-11-11 21:58:03 +01:00
Kubernetes Prow Robot	243ba086e7	Merge pull request #112914 from PiotrProkop/topology-manager-policies-flag node: topologymanager: Improved multi-numa alignment in Topology Manager	2022-11-07 16:00:51 -08:00
David Ashpole	64af1adace	Second attempt: Plumb context to Kubelet CRI calls (#113591 ) * plumb context from CRI calls through kubelet * clean up extra timeouts * try fixing incorrectly cancelled context	2022-11-05 06:02:13 -07:00
PiotrProkop	75bb437a6b	Improved multi-numa alignment in Topology Manager: implement closest numa policy Signed-off-by: PiotrProkop <pprokop@nvidia.com>	2022-11-03 10:45:25 +01:00
Kubernetes Prow Robot	433787d25b	Merge pull request #113018 from fromanirh/cpumanager-ga-features node: kubelet: cpumgr: CPU Manager to GA	2022-11-02 14:41:01 -07:00
Francesco Romani	a6b928d90c	kubelet: cpumgr: internal variable trivial rename CPUManager is going GA, thus it makes little sense to keep the names of the internal configuration variables `Experimental*`. Trivial rename only. Signed-off-by: Francesco Romani <fromani@redhat.com>	2022-11-02 18:41:42 +01:00
Francesco Romani	ff44dc1932	cpumanager: the FG is locked to default (ON) hence we can remove the if() guards, the feature is always available. Signed-off-by: Francesco Romani <fromani@redhat.com>	2022-11-02 18:41:41 +01:00
Antonio Ojea	9c2b333925	Revert "plumb context from CRI calls through kubelet" This reverts commit `f43b4f1b95`.	2022-11-02 13:37:23 +00:00
Swati Sehgal	8b29eded52	node: devicemgr: Remove `devicePluginEnabled` field from container mgr With graduation of device plugins to GA in 1.26, the feature gate is enabled by default so `devicePluginEnabled` field no longer needs to be passed at the time of Container Manager creation. In addition to that, we remove the `ManagerStub` as it is no longer needed. Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2022-11-02 11:05:20 +00:00
David Ashpole	f43b4f1b95	plumb context from CRI calls through kubelet	2022-10-28 02:55:28 +00:00
Kubernetes Prow Robot	127f33f63d	Merge pull request #111221 from inosato/remove-ioutil-from-kubelet Remove ioutil in kubelet/kubeadm and its tests	2022-09-17 21:56:28 -07:00
Dmitry Verkhoturov	d0f9e6dc36	clarify CPUCFSQuotaPeriod values, set the minimum to 1ms cpu.cfs_period_us is measured in microseconds in the kernel but provided in time.Duration by the user, that change clarifies the code to make this evident to the reader. Also, the minimum value for that feature is 1ms and not 1μs, and this change alters the validation to reject values smaller than 1ms.	2022-09-08 23:29:13 +02:00
jinxu	0064010cdd	Promote Local storage capacity isolation feature to GA This change is to promote local storage capacity isolation feature to GA At the same time, to allow rootless system disable this feature due to unable to get root fs, this change introduced a new kubelet config "localStorageCapacityIsolation". By default it is set to true. For rootless systems, they can set this configuration to false to disable the feature. Once it is set, user cannot set ephemeral-storage request/limit because capacity and allocatable will not be set. Change-Id: I48a52e737c6a09e9131454db6ad31247b56c000a	2022-08-02 23:45:48 -07:00
inosato	3b95d3b076	Remove ioutil in kubelet and its tests Signed-off-by: inosato <si17_21@yahoo.co.jp>	2022-07-30 12:35:26 +09:00
Davanum Srinivas	a9593d634c	Generate and format files - Run hack/update-codegen.sh - Run hack/update-generated-device-plugin.sh - Run hack/update-generated-protobuf.sh - Run hack/update-generated-runtime.sh - Run hack/update-generated-swagger-docs.sh - Run hack/update-openapi-spec.sh - Run hack/update-gofmt.sh Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2022-07-26 13:14:05 -04:00
Jack Francis	ab14cba2cf	kubelet: more resilient node allocatable ephemeral-storage data getter	2022-03-29 18:13:57 -07:00
Kir Kolyshkin	37761a329e	pkg/kubelet: changes to update runc to 1.1.0 The changes (mostly in pkg/kubelet/cm) are there to adopt changed runc 1.1 API, and simplify things a bit. In particular: 1. simplify cgroup manager instantiation, using a new, easier way of libcontainers/cgroups/manager.New; 2. replace libcontainerAdapter with a boolean variable (all it did was passing on whether systemd manager should be used); 3. trivial change due to removed cgroupfs.HugePageSizes and added cgroups.HugePageSizes(); 4. do not calculate cgroup paths in update / destroy, since libcontainer cgroup managers now calculate the paths upon creation (previously, they were doing that only in Apply, so using e.g. Set or Destroy right after creation was impossible without specifying paths). We currently still calculate cgroup paths in Exists -- this is to be addressed separately. Co-Authored-By: Elana Hashman <ehashman@redhat.com>	2022-03-28 16:23:20 -07:00
Steve Kuznetsov	8f2bc39f72	kubelet: cgroups: be verbose about validation Previously, callers of `Exists()` would not know why the cGroup was or was not existing. In one call-site in particular, the `kubelet` would entirely fail to start if the cGroup validation did not succeed. In these cases we MUST explain what went wrong and pass that information clearly to the caller. Previously, some but not all of the reasons for invalidation were logged at a low log-level instead. This led to poor UX. The original method was retained on the interface so as to make this diff small. Signed-off-by: Steve Kuznetsov <skuznets@redhat.com>	2022-03-10 07:25:33 -08:00
chenyw1990	e26df3594c	do not return err when PodSandbox not exist Co-authored-by: pacoxu <paco.xu@daocloud.io>	2022-02-24 14:58:39 +08:00
Matthias Bertschy	9500ee9d9c	container_manager: use oomScoreAdj instead of default when set	2022-02-12 15:23:13 +01:00
Danielle Lancashire	c198062da4	cm: Remove legacy docker references Dockershim and built-in Docker support are gone. Cleans up dead code references to them.	2022-02-08 16:25:04 +01:00
yxxhero	a90b149be0	add more message for no PodSandbox container Signed-off-by: yxxhero <aiopsclub@163.com>	2021-12-18 09:52:03 +08:00
Neha Lohia	fa1b6765d5	move pkg/util/node to component-helpers/node/util (#105347 ) Signed-off-by: Neha Lohia <nehapithadiya444@gmail.com>	2021-11-12 07:52:27 -08:00
Alexey Perevalov	5d9032007a	Return only isolated cpus in podresources interface Co-Authored-by: Swati Sehgal <swsehgal@redhat.com> Signed-off-by: Alexey Perevalov <alexey.perevalov@huawei.com>	2021-10-07 15:34:08 +01:00
Ricardo Pchevuzinske Katz	37d11bcdaf	Move node and networking related helpers from pkg/util to component helpers Signed-off-by: Ricardo Katz <rkatz@vmware.com>	2021-09-16 17:00:19 -03:00
Stephen Augustus	481cf6fbe7	generated: Run hack/update-gofmt.sh Signed-off-by: Stephen Augustus <foo@auggie.dev>	2021-08-24 15:47:49 -04:00
Kir Kolyshkin	e5b434e990	kubelet/cm: don't set Devices Since runc 1.0.0 it is now sufficient to have SkipDevices: true. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2021-07-16 12:45:35 -07:00
Francesco Romani	6dcec345df	smtalign: cm: factor out admission response Introduce a new `admission` subpackage to factor out the responsability to create `PodAdmitResult` objects. This enables resource manager to report specific errors in Allocate() and to bubble up them in the relevant fields of the `PodAdmitResult`. To demonstrate the approach we refactor TopologyAffinityError as a proper error. Co-authored-by: Kevin Klues <kklues@nvidia.com> Co-authored-by: Swati Sehgal <swsehgal@redhat.com> Signed-off-by: Francesco Romani <fromani@redhat.com>	2021-07-08 23:15:37 +02:00
Francesco Romani	c5cb263dcf	smtalign: propagate policy options to cpumanager The CPUManagerPolicyOptions received from the kubelet config/command line args is propogated to the Container Manager. We defer the consumption of the options to a later patch(set). Co-authored-by: Swati Sehgal <swsehgal@redhat.com> Signed-off-by: Francesco Romani <fromani@redhat.com>	2021-07-08 23:15:35 +02:00
Li Bo	c3d9b10ca8	feature: support Memory QoS for cgroups v2	2021-07-08 09:26:46 +08:00
Akihiro Suda	dbe0155139	kubelet/cm: ignore sysctl error when running in userns Errors during setting the following sysctl values are ignored: - vm.overcommit_memory - vm.panic_on_oom - kernel.panic - kernel.panic_on_oops - kernel.keys.root_maxkeys - kernel.keys.root_maxbytes Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-07-07 14:23:29 +09:00
Odin Ugedal	61d88af9e4	Revert "Update runc to 1.0.0"	2021-07-05 14:03:04 +02:00
Kir Kolyshkin	ab5b77944e	kubelet/cm: don't set Devices Since runc 1.0.0 it is now sufficient to have SkipDevices: true. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2021-06-30 16:17:35 -07:00
Artyom Lukianov	03830db82d	Implement all necessary methods to provide memory manager data under pod resources metrics Signed-off-by: Artyom Lukianov <alukiano@redhat.com>	2021-06-22 13:06:32 +03:00
Francesco Romani	369416b763	cm: handle nil cpumanager avoiding segfault If the cpumanager feature gate is disabled, the corresponsing field of the containerManager will be nil. A couple functions don't check for this occurrence and happily deference the pointer unconditionally, leading to possible segfaults. The relevant functions were introduced to support the podresources API, so to trigger this segfault all the following are needed: - cpumanager feature gate has to be disabled explicitely - any podresources API must be called Worth pointing out that when the new functions were introduced (around kubernetes 1.20) the default feature gate for cpumanager was already set to true, hence this bug is expected to be triggered rarely. Signed-off-by: Francesco Romani <fromani@redhat.com>	2021-06-10 16:22:43 +02:00
Giuseppe Scrivano	12abc3b7c9	kubelet: reuse manager Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2021-05-19 23:51:49 -07:00
Jordan Liggitt	4b45d0d921	Revert "Merge pull request 101888 from kolyshkin/update-runc-rc94" This reverts commit `b1b06fe0a4`, reversing changes made to `382a33986b`.	2021-05-18 09:13:47 -04:00
Giuseppe Scrivano	fd7ecd3915	kubelet: reuse manager Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2021-05-10 17:34:53 -07:00
Ryan Phillips	4488162bd9	kubelet: change cgroup move message to log level 3	2021-04-28 14:54:54 -05:00

1 2 3 4 5

244 Commits