kubernetes

Author	SHA1	Message	Date
inosato	3b95d3b076	Remove ioutil in kubelet and its tests Signed-off-by: inosato <si17_21@yahoo.co.jp>	2022-07-30 12:35:26 +09:00
Sascha Grunert	de37b9d293	Make CRI `v1` the default and allow a fallback to `v1alpha2` This patch makes the CRI `v1` API the new project-wide default version. To allow backwards compatibility, a fallback to `v1alpha2` has been added as well. This fallback can either used by automatically determined by the kubelet. Signed-off-by: Sascha Grunert <sgrunert@redhat.com>	2021-11-17 11:05:05 -08:00
nolancon	6bbb36df10	Additional cases for reconcileState testing	2021-10-11 16:17:21 +00:00
Artyom Lukianov	66babd1a90	cpu manager: do not clean admitted pods from the state Signed-off-by: Artyom Lukianov <alukiano@redhat.com>	2021-08-08 16:46:06 +03:00
Francesco Romani	23abdab2b7	smtalign: propagate policy options to policies Consume in the static policy the cpu manager policy options from the cpumanager instance. Validate in the none policy if any option is given, and fail if so - this is almost surely a configuration mistake. Add new cpumanager.Options type to hold the options and translate from user arguments to flags. Co-authored-by: Swati Sehgal <swsehgal@redhat.com> Signed-off-by: Francesco Romani <fromani@redhat.com>	2021-07-08 23:15:37 +02:00
Francesco Romani	c5cb263dcf	smtalign: propagate policy options to cpumanager The CPUManagerPolicyOptions received from the kubelet config/command line args is propogated to the Container Manager. We defer the consumption of the options to a later patch(set). Co-authored-by: Swati Sehgal <swsehgal@redhat.com> Signed-off-by: Francesco Romani <fromani@redhat.com>	2021-07-08 23:15:35 +02:00
Kevin Klues	6646039481	Add logic to only call Update() if state different than last Update() Signed-off-by: Kevin Klues <kklues@nvidia.com>	2021-05-06 23:38:08 +02:00
Francesco Romani	6d33354e4c	node: podresources: implement GetAllocatableResources API Extend the podresources API implementing the GetAllocatableResources endpoint, as specified in the KEPs: https://github.com/kubernetes/enhancements/tree/master/keps/sig-node/2043-pod-resource-concrete-assigments https://github.com/kubernetes/enhancements/pull/2404 Signed-off-by: Francesco Romani <fromani@redhat.com>	2021-03-09 13:13:36 +01:00
Kubernetes Prow Robot	e6e079aac3	Merge pull request #97748 from heqg/collides-state Fix variable 'state' collides with imported package name	2021-01-28 17:51:40 -08:00
Artyom Lukianov	38dc7509f8	cpu manager: specify the container CPU set during the creation We can set the container cpuset.cpus diring the creation and it will not need to call to update resources after the container creation. Additional side effect of the change, that the runc process that responsible to create the container will run with the same CPU affinity because the runc runs on the cpuset provided in the config.json arg. It will allow to prevent undesirable interupts on isolated CPUs. Signed-off-by: Artyom Lukianov <alukiano@redhat.com>	2021-01-20 17:53:33 +02:00
he.qingguo	8249cd611d	Fix variable 'state' collides with imported package name Signed-off-by: he.qingguo <he.qingguo@zte.com.cn>	2021-01-06 10:31:47 +08:00
sw.han	f5997fe537	Add GetPodTopologyHints() interface to Topology/CPU/Device Manager Signed-off-by: Krzysztof Wiatrzyk <k.wiatrzyk@samsung.com>	2020-11-12 12:25:54 +01:00
Alexey Perevalov	a047e8aa1b	move to cadvisor.MachineInfo This patch removes GetNUMANodeInfo, cadvisor.MachineInfo will be used instead of it. GetNUMANodeInfo was introduced due to difference of meaning of MachineInfo.Topology. On the arm it was NUMA nodes, but on the x86 it represents sockets (since reading from /proc/cpuinfo). Now it unified and MachineInfo.Topology represents NUMA node. Signed-off-by: Alexey Perevalov <alexey.perevalov@huawei.com>	2020-07-24 09:29:41 -04:00
Kubernetes Prow Robot	7fdc1275d9	Merge pull request #90377 from cbf123/container_cpuset_fixup_2 Fix exclusive CPU allocations being deleted at container restart	2020-04-27 13:40:04 -07:00
Chris Friesen	ab5870d808	Fix exclusive CPU allocations being deleted at container restart The expectation is that exclusive CPU allocations happen at pod creation time. When a container restarts, it should not have its exclusive CPU allocations removed, and it should not need to re-allocate CPUs. There are a few places in the current code that look for containers that have exited and call CpuManager.RemoveContainer() to clean up the container. This will end up deleting any exclusive CPU allocations for that container, and if the container restarts within the same pod it will end up using the default cpuset rather than what should be exclusive CPUs. Removing those calls and adding resource cleanup at allocation time should get rid of the problem. Signed-off-by: Chris Friesen <chris.friesen@windriver.com>	2020-04-27 11:36:54 -06:00
Kevin Klues	751b9f3e13	Update strategy used to reuse CPUs from init containers in CPUManager With the old strategy, it was possible for an init container to end up running without some of its CPUs being exclusive if it requested more guaranteed CPUs than the sum of all guaranteed CPUs requested by app containers. Unfortunately, this case was not caught by our unit tests because they didn't validate the state of the defaultCPUSet to ensure there was no overlap with CPUs assigned to containers. This patch updates the strategy to reuse the CPUs assigned to init containers across into app containers, while avoiding this edge case. It also updates the unit tests to now catch this type of error in the future.	2020-04-23 20:27:43 +00:00
Byonggon Chun	a3047672d0	move pkg/kubelet/cm/cpumanager/containermap to pkg/kubelet/cm/containermap for reusing containerMap is used in CPU Manager to store all containers information in the node. containerMap provides a mapping from (pod, container) -> containerID for all containers a pod It is reusable in another component in pkg/kubelet/cm which needs to track changes of all containers in the node. Signed-off-by: Byonggon Chun <bg.chun@samsung.com>	2020-03-14 02:38:51 +09:00
nolancon	0a9bd0334d	CPU Manager - Updates to unit tests: - Where previously we called manager.AddContainer(), we now call both manager.Allocate() and manager.AddContainer(). - Some test cases now have two expected errors. One each from Allocate() and AddContainer(). Existing outcomes are unchanged.	2020-02-27 07:24:34 +00:00
nolancon	709989efa2	CPU Manager - Rename policy.AddContainer() to policy.Allocate()	2020-02-27 07:24:33 +00:00
Kevin Klues	bc686ea27b	Update TopologyManager.GetTopologyHints() to take pointers Previously, this function was taking full Pod and Container objects unnecessarily. This commit updates this so that they will take pointers instead.	2020-02-03 17:13:28 +00:00
Kubernetes Prow Robot	e6b5194ec1	Merge pull request #84300 from klueska/upstream-cpu-manager-reconcile-on-container-state Update logic in `CPUManager` `reconcileState()`	2020-01-20 12:27:37 -08:00
Kevin Klues	f2acbf6607	Base CPUManager state reconciliation on container state, not pod state	2020-01-20 13:57:30 +00:00
Kevin Klues	f6cf9b8ce9	Move CPUManager Pod Status logic before container loop	2020-01-20 13:57:30 +00:00
whypro	f4bd4e2e96	Return error instead of panic when cpu manager starts failed.	2019-12-19 21:56:23 +08:00
Kevin Klues	69f8053850	Update top-level CPUManager to adhere to new state semantics For now, we just pass 'nil' as the set of 'initialContainers' for migrating from old state semantics to new ones. In a subsequent commit will we pull this information from higher layers so that we can pass it down at this stage properly.	2019-12-11 23:02:51 +01:00
Kevin Klues	9191a949ae	Extend makePod() helper in CPUManager to take PodUID and ContainerName	2019-12-11 23:02:51 +01:00
Kevin Klues	7a15d3a4d7	Fix bug in parsing int to string in CPUManager tests	2019-12-11 23:02:51 +01:00
Kevin Klues	765aae93f8	Move containerMap out of static policy and into top-level CPUManager	2019-12-11 23:02:51 +01:00
Jianzhu Zhang	89dfd24483	added --reserved-cpus kubelet command option	2019-11-06 07:33:52 -05:00
Kubernetes Prow Robot	17a57f99d5	Merge pull request #81344 from zouyee/cpm fix cpumanager reconcileState without sourceready	2019-10-30 23:33:36 -07:00
Connor Doyle	389853894d	Delegate topology hint gen to CPU manager policy - The previous implementation depended on a fixed set of policies.	2019-09-27 22:29:02 -07:00
zouyee	594fc0f4b9	fix cpumanager reconcileState without sourceready Signed-off-by: Zou Nengren <zouyee1989@gmail.com>	2019-09-25 10:39:06 +08:00
Kevin Klues	ddfd9ac0ca	Fix bug in CPUManager with setting topology for policies Also add a check in the unit tests to avoid regressions	2019-08-28 17:32:25 -05:00
Kevin Klues	ecc14fe661	Update CPUManager to include NUMANodeID in CPUTopology Unfortunately, the NUMA information is not readily available from cadvisor, so we have to roll the logic to discover it by hand. In the future, we should remove this custiom code to use the information provided by cadvisor once it is made available.	2019-08-27 16:51:05 -05:00
Conor Nolan	e33af11add	Add stub support for TopologyManager to CPUManager Co-Authored-By: Louise Daly <louise.m.daly@intel.com>	2019-08-07 15:56:05 +02:00
Kevin Klues	9f36f1a173	Add tests for proactive init Container removal in the CPUManager static policy	2019-07-26 14:34:51 +02:00
Kevin Klues	5dc5f1de06	Update the cpumanager to error out if an invalid policy is given Previously, the cpumanager would simply fall back to the None() policy if an invalid policy was specified. This patch updates this to return an error when an invalid policy is passed, forcing the kubelet to fail fast when this occurs. These semantics should be preferable because an invalid policy likely indicates operator error in setting the policy flag on the kubelet correctly (e.g. misspelling 'static' as 'statiic'). In this case it is better to fail fast so the operator can detect this and correct the mistake, than to mask the error and essentially disable the cpumanager unexpectedly.	2019-07-18 13:24:09 +02:00
Kevin Klues	ef27f5f1a5	Add ability to find init Container IDs in cpumanager reconcileState() The cpumanager loops through all init Containers and app Containers when reconciling its state. However, the current implementation of findContainerIDByName(), which is call by the reconciler, does not resolve for init Containers. This patch updates findContainerIDByName() to account for init Containers and adds a regression test that fails before the change and succeeds after.	2019-04-27 06:18:55 -07:00
Davanum Srinivas	33081c1f07	New staging repository for cri-api Change-Id: I2160b0b0ec4b9870a2d4452b428e395bbe12afbb	2019-03-26 18:21:04 -04:00
choury	36b92b9b29	cpumanager: rollback state if updateContainerCPUSet failed	2018-08-17 18:08:58 +08:00
Lin Yang	b7e1f0bf17	kubelet/cm/cpumanager: Fix unused variable "skipIfPermissionsError" The variable "skipIfPermissionsError" is not needed even when permission error happened.	2018-08-02 17:24:33 -07:00
Lee Verberne	e10042d22f	Increment CRI version from v1alpha1 to v1alpha2 This also incorporates the version string into the package name so that incompatibile versions will fail to connect. Arbitrary choices: - The proto3 package name is runtime.v1alpha2. The proto compiler normally translates this to a go package of "runtime_v1alpha2", but I renamed it to "v1alpha2" for consistency with existing packages. - kubelet/apis/cri is used as "internalapi". I left it alone and put the public "runtimeapi" in kubelet/apis/cri/runtime.	2018-02-07 09:06:26 +01:00
linweibin	fa8afc1d39	Remove unused code in UT files in pkg/	2018-01-15 16:02:35 +08:00
Di Xu	d474b86e05	Propagate error up instead panic	2017-12-18 14:05:06 +08:00
Michał Stachowski	809ac834a0	Cpu manager file state tests	2017-11-14 18:26:41 +01:00
Szymon Scharmach	4ee0adc77a	Added Cpu Manager file state	2017-10-26 20:03:17 +02:00
Harry Zhang	282973d87d	Elimenate extra CRI call	2017-09-30 16:51:32 +08:00
Connor Doyle	ec706216e6	Un-revert "CPU manager wiring and `none` policy" This reverts commit `8d2832021a`.	2017-09-04 07:24:59 -07:00
Shyam JVS	8d2832021a	Revert "CPU manager wiring and `none` policy"	2017-09-01 18:17:36 +02:00
Balaji Subramaniam	7567f1765f	Added CPU manager unit tests (none policy)	2017-08-30 08:26:22 -07:00

50 Commits