kubernetes

Author	SHA1	Message	Date
Kevin Klues	7069b1d6e8	Update TopologyManager single-numa-node logic to handle "don't cares" The logic has been updated to match the logic of the best-effort policy except in two places: 1) The hint filtering frunction has been updated to allow "don't care" hints encoded with a `nil` affinity mask, to pass through the filter in addition to hints that have just a single NUMA bit set. 2) After calculating the `bestHint` we transform "don't care" affinities encoded as having all NUMA bits set in their affinity masks into "don't care" affinities encoded as `nil`.	2020-01-16 08:50:35 +00:00
Kevin Klues	2905ffffa7	Rename TopologyManager test TestPolicyBestEffortMerge for consistency	2020-01-16 08:50:21 +00:00
Kevin Klues	94489c137c	Cleanup use of defaultAffinity in mergePermutation of TopologyManager	2020-01-16 08:50:12 +00:00
nolancon	5e23517ebf	Use reflect.DeepEqual check in policy_test.go	2020-01-16 08:13:07 +00:00
nolancon	92eb7cd601	Update "Single NUMA hint generation" expected affinity to nil	2020-01-16 08:13:07 +00:00
nolancon	8b3f6e61a2	Move test case "Two providers, 1 with 2 hints, 1 with single non-preferred hint matching" into specific policy tests	2020-01-16 08:13:07 +00:00
nolancon	681c42bfc2	Move test case "Two providers, 1 hint each, same mask, 1 preferred, 1 not 2/2" into specific policy tests	2020-01-16 08:13:07 +00:00
nolancon	a38a2562b2	Move test case "Two providers, 1 hint each, same mask, 1 preferred, 1 not 1/2" into specific policy test.	2020-01-16 08:13:07 +00:00
nolancon	f639da7637	Move test case "Two providers, 1 hint each, no common mask" into specific policy tests.	2020-01-16 08:13:07 +00:00
nolancon	401a2bb285	Move test case "Single TopologyHint with Preferred as false and NUMANodeAffinity as nil" into specific policy tests.	2020-01-16 08:13:06 +00:00
nolancon	6460ef6392	Move test case "Single TopologyHint with Preferred as true and NUMANodeAffinity as nil" into specific policy tests.	2020-01-16 08:13:06 +00:00
nolancon	baeff9ec5d	Move test case "HintProvider returns empty non-nil map[string][]TopologyHint from provider" into specific policy tests.	2020-01-16 08:13:06 +00:00
nolancon	599217d482	Move test case "HintProvider returns -nil map[string][]TopologyHint from provider" into specific policy tests	2020-01-16 08:13:06 +00:00
nolancon	57661ee946	Move test case 'HintProvider returns empty non-nil map[string][]TopologyHint' into specific policy tests.	2020-01-16 08:13:06 +00:00
nolancon	51f1af0395	Move test case 'TopologyHint not set' into individual policy tests	2020-01-16 08:13:06 +00:00
nolancon	8466a5852a	Restore policy_test.go to upstream Following commits will contain incremental changes to this file to ease review process and ensure all tests are accounted for.	2020-01-16 08:13:06 +00:00
nolancon	59bb6c4d6f	Update checks in mergeProvidersHints: - Initialize best Hint to TopologyHint{} - Update checks. - Move generic unit test case into policy specific tests and updated expected outcome to reflect changes.	2020-01-16 08:13:06 +00:00
nolancon	6758f95117	Restore original policy none test cases: Mistakenly overwritten in earlier commit	2020-01-16 08:13:06 +00:00
nolancon	2d1a535a35	Make mergePermutation generic: - Remove policy parameters to make function generic - Move function into top level policy.go	2020-01-16 08:13:06 +00:00
nolancon	5487941485	Refactor filterHints: - Restructure function - Remove bug fix for catching {nil true} - To be fixed in later commit - Restore unit tests to original state for testing filterHints	2020-01-16 08:13:06 +00:00
nolancon	adfd11f38f	Make iterateAllProviderTopologyHints generic: - Remove policy parameters to make this function generic. - Move function out of individual policies and into policy.go	2020-01-16 08:13:06 +00:00
nolancon	e43f0a5293	Reinstate canAdmitPodResult in policy_none: This is to keep consistency with the other policies. This change may be made across all policies in a future PR, but removing it from the scope of this PR for now.	2020-01-16 08:13:05 +00:00
nolancon	4cc5b9e46c	Edit hints returned from policies and unit tests: - Best Effort Policy: Return hint with nil affinity as opposed to defaultAffinity when provider has no preference for NUMA affinty or no possible NUMA affinities. - Single NUMA Node Policy: Remove defaultHint from mergeProvidersHints. Instead return appropriate TopologyHint where required. - Update unit tests to reflect changes. Some test cases moved into individual policy test functions due to differing returned affinties per policy.	2020-01-16 08:13:05 +00:00
nolancon	e3d0c9397f	Updates to single-numa-node policy: - Remove getHintMatch method. - Replace with simplified versions of mergePermutation and iterateAllProviderTopologyHints methods - as used in best-effort. - Remove getHintMatch unit tests.	2020-01-16 08:13:05 +00:00
nolancon	b5ca4989e3	Update unit tests: - Update filterHints test to reflect changes in previous commit. - Some common test cases achieve differing expected results based on policy due to independent merge strategies. These cases are moved into individual policy based test functions.	2020-01-16 08:13:05 +00:00
nolancon	17d615bca2	Update filterHints: - Only append valid preferred-true hints to filtered - Return true if allResourceHints only consist of nil-affinity/preferred-true hints: {nil true}, update defaultHint preference accordingly.	2020-01-16 08:13:05 +00:00
Adrian Chiris	9f21f49493	Additional unit tests for Topology Manager methods	2020-01-16 08:13:05 +00:00
Adrian Chiris	f886d2a832	Update single-numa-node policy unit tests	2020-01-16 08:13:05 +00:00
Adrian Chiris	2825a7be1a	Add new functionality for single-numa-node policy: Explanation taken from original commit: - Change the current method of finding the best hint. Instead of going over all permutations, sort the hints and find the narrowest hint common to all resources. - Break out early when merging to a preferred hint is not possible	2020-01-16 08:13:05 +00:00
Adrian Chiris	5ce2ea2773	Return defaultAffinity from PolicyBestEffort: Now that PolicySingleNUMANode is not considered here, return defaultAffinity as was the original case before previous bug fix	2020-01-16 08:13:05 +00:00
Adrian Chiris	eda1521562	Make mergeProviderHints policy-specific: - Remove need to pass policy and numaNodes as arguments - Remove PolicySingleNUMANode special case check in policy_best_effort - Add mergeProviderHints base to policy_single_numa_node for upcoming commit	2020-01-16 08:13:05 +00:00
Adrian Chiris	dc36924c37	Update policy_none removing canAdmitPodResult Update unit tests for none_policy Add Name test for policy_restricted	2020-01-16 08:13:05 +00:00
Adrian Chiris	cf8b098dda	Refactor policy-best-effort - Modularize code with mergePermutation method	2020-01-16 08:13:05 +00:00
Sascha Grunert	278717bc57	Fix ineffectual assignment to CPUSets Signed-off-by: Sascha Grunert <sgrunert@suse.com>	2020-01-16 08:57:42 +01:00
Kevin Klues	34b942a41d	Remove check for empty activePods list in CPUManager removeStaleState This check is redundant since we protect this call with a call to `m.sourcesReady.AllReady()` earlier on. Moreover, having this check in place means that we will leave some stale state around in cases where there are actually no active pods in the system and this loop hasn't cleaned them up yet. This can happen, for example, if a pod exits while the kubelet is down for some reason. We see this exact case being triggered in our e2e tests, where a test has been failing since October when this change was first introduced.	2020-01-15 20:09:24 +01:00
Kevin Klues	5802f3a910	Add proper activePods list in TestGetTopologyHints for CPUManager	2020-01-15 20:08:41 +01:00
danielqsj	1a9b121764	remove deprecated metrics of kubelet	2020-01-10 16:46:52 +08:00
Kubernetes Prow Robot	fd0358fd21	Merge pull request #86689 from klueska/upstream-fix-cpumanager-v1-state-checksum Lock checksum calculation for v1 CPUManager state to pre 1.18 logic	2020-01-08 02:57:40 -08:00
Kubernetes Prow Robot	d6412b856f	Merge pull request #84345 from danielqsj/withdialer replace grpc.WithDialer which is deprecated	2020-01-06 15:56:17 -08:00
Kubernetes Prow Robot	9acf7d11fe	Merge pull request #86344 from klueska/upstream-cm-approver Add klueska as an approver in pkg/kubelet/cm/OWNERS	2020-01-06 09:54:16 -08:00
Kevin Klues	b373121a14	Make CPUManagerCheckpointV2 type an alias of CPUManagerCheckpoint This change is to prevent problems when we remove the V1->V2 migration code in the future. Without this, the checksums of all checkpoints would be hashed with the name CPUManagerCheckpointV2 embedded inside of them, which is undesirable. We want the checkpoints to be hashed with the name CPUManagerCheckpoint instead.	2019-12-28 19:29:13 +01:00
Kevin Klues	5faf8f4c52	Lock checksum calculation for v1 CPUManager state to pre 1.18 logic The updated CPUManager from PR #84462 implements logic to migrate the CPUManager checkpoint file from an old format to a new one. To do so, it defines the following types: ``` type CPUManagerCheckpoint = CPUManagerCheckpointV2 type CPUManagerCheckpointV1 struct { ... } type CPUManagerCheckpointV2 struct { ... } ``` This replaces the old definition of just: ``` type CPUManagerCheckpoint struct { ... } ``` Code was put in place to ensure proper migration from checkpoints in V1 format to checkpoints in V2 format. However (and this is a big however), all of the unit tests were performed on V1 checkpoints that were generated using the type name `CPUManagerCheckpointV1` and not the original type name of `CPUManagerCheckpoint`. As such, the checksum in the checkpoint file uses the `CPUManagerCheckpointV1` type to calculate its checksum and not the original type name of `CPUManagerCheckpoint`. This causes problems in the real world since all pre-1.18 checkpoint files will have been generated with the original type name of `CPUManagerCheckpoint`. When verifying the checksum of the checkpoint file across an upgrade to 1.18, the checksum is calculated assuming a type name of `CPUManagerCheckpointV1` (which is incorrect) and the file is seen to be corrupt. This patch ensures that all V1 checksums are verified against a type name of `CPUManagerCheckpoint` instead of ``CPUManagerCheckpointV1`. It also locks the algorithm used to calculate the checksum in place, since it wil never change in the future (for pre-1.18 checkpoint files at least).	2019-12-28 14:17:55 +01:00
danielqsj	19fe9f8d94	replace grpc.WithDialer which is deprecated	2019-12-26 17:46:59 +08:00
whypro	f4bd4e2e96	Return error instead of panic when cpu manager starts failed.	2019-12-19 21:56:23 +08:00
Kevin Klues	9818b4522e	Add klueska as an approver in pkg/kubelet/cm/OWNERS	2019-12-17 10:40:23 +01:00
Kevin Klues	f553286156	Pass initial set of runtime containers to the CPUManager at startup These information associatedd with these containers is used to migrate the CPUManager state from it's old format to its new (i.e. keyed off of podUID and containerName instead of containerID).	2019-12-11 23:02:51 +01:00
Kevin Klues	6441e1ef43	Move CPUManager Checkpoint restoration to Start() instead of New()	2019-12-11 23:02:51 +01:00
Kevin Klues	69f8053850	Update top-level CPUManager to adhere to new state semantics For now, we just pass 'nil' as the set of 'initialContainers' for migrating from old state semantics to new ones. In a subsequent commit will we pull this information from higher layers so that we can pass it down at this stage properly.	2019-12-11 23:02:51 +01:00
Kevin Klues	185e790f71	Update CPUManager policies to adhere to new state semantics	2019-12-11 23:02:51 +01:00
Kevin Klues	7c760fea38	Change CPUManager state to key off of podUID and containerName Previously, the state was keyed off of containerID intead of podUID and containerName. Unfortunately, this is no longer possible as we move to a to model where we we allocate CPUs to containers at pod adit time rather than container start time. This patch is the first step towards full migration to the new semantics. Only the unit tests in cpumanager/state are passing. In subsequent commits we will update the CPUManager itself to use these new semantics. This patch also includes code to do migration from the old checkpoint format to the new one, assuming the existence of a ContainerMap with the proper mapping of (containerID)->(podUID, containerName). A subsequent commit will update code in higher layers to make sure that this ContainerMap is made available to this state logic.	2019-12-11 23:02:51 +01:00
Kevin Klues	9191a949ae	Extend makePod() helper in CPUManager to take PodUID and ContainerName	2019-12-11 23:02:51 +01:00
Kevin Klues	7a15d3a4d7	Fix bug in parsing int to string in CPUManager tests	2019-12-11 23:02:51 +01:00
Kevin Klues	765aae93f8	Move containerMap out of static policy and into top-level CPUManager	2019-12-11 23:02:51 +01:00
Kevin Klues	1d995c98ef	Update CPUmanager containerMap to allow removal by containerRef	2019-12-11 23:02:47 +01:00
Kevin Klues	0639bd0942	Change CPUManager containerMap to key off of (podUID, containerName) Previously it keyed off of a pointer to the actual pod / container, which was unnecessary, and hard to work with (especially on the retrieval side).	2019-12-11 23:02:11 +01:00
Kevin Klues	3881e50cce	Update CPUmanager containerMap to also return a containerRef	2019-12-11 23:01:01 +01:00
Kevin Klues	347d5f57ac	Move CPUManager ContainerMap to its own package	2019-12-11 22:59:00 +01:00
Kubernetes Prow Robot	ad5d4c4705	Merge pull request #85706 from yutedz/per-node-dev Remove nodes slice in loop of takeByTopology	2019-12-05 13:50:30 -08:00
Kubernetes Prow Robot	57b6b287d4	Merge pull request #85688 from yutedz/pods-to-rm Reduce unnecessary Set in updateAllocatedDevices	2019-12-02 17:07:26 -08:00
Kubernetes Prow Robot	833f585104	Merge pull request #85760 from yutedz/chkpt-write-err Log error when writing checkpoint fails	2019-12-02 10:27:06 -08:00
Ted Yu	84a9803741	Log error when writing checkpoint fails	2019-11-29 19:47:17 -08:00
Ted Yu	6415fa765e	Remove nodes slice in loop of takeByTopology	2019-11-29 12:12:22 -08:00
Kubernetes Prow Robot	80eed952f0	Merge pull request #84854 from BSWANG/fix-hugetlb-cgroup fix kubelet failed to start on setting hugetlb limits	2019-11-27 12:29:03 -08:00
Ted Yu	86f3bc25e1	Reduce unnecessary Set in updateAllocatedDevices	2019-11-27 08:48:06 -08:00
Travis Rhoden	0c5c3d8bb9	Remove pkg/util/mount (moved out of tree) This patch removes pkg/util/mount completely, and replaces it with the mount package now located at k8s.io/utils/mount. The code found at k8s.io/utils/mount was moved there from pkg/util/mount, so the code is identical, just no longer in-tree to k/k.	2019-11-15 08:29:12 -07:00
Kubernetes Prow Robot	30e6238795	Merge pull request #85147 from yutedz/devmgr-rm-contents Continue removing file in ManagerImpl#removeContents	2019-11-14 16:38:28 -08:00
Ted Yu	fb046f7787	Continue removing file in ManagerImpl#removeContents	2019-11-13 06:00:34 -08:00
Kubernetes Prow Robot	ed10b5b17f	Merge pull request #85047 from yutedz/dev-mgr-err-handling Handle error return from allocatePodResources	2019-11-12 11:51:27 -08:00
Kubernetes Prow Robot	897ce3073c	Merge pull request #84533 from davidz627/fix/deprecatedPath Remove plugin watching of deprecated directory and CSI v0 support in accordance with deprecation policy	2019-11-12 04:48:20 -08:00
David Zhu	802fe12803	Remove plugin watching of deprecated directory {kubelet_root_dir}/plugins and support for CSI V0 in accordance with deprecation announcement in https://v1-13.docs.kubernetes.io/docs/setup/release/notes/	2019-11-11 11:42:58 -08:00
Ted Yu	db0f616974	Handle error return from allocatePodResources	2019-11-09 16:25:15 -08:00
Travis Rhoden	1fd8921546	Move mount/fake.go to mount/fake_mount.go This patch moves fake.go to mount_fake.go, and follows to principle of always returning a discrete type rather than an Interface. All callers of "FakeMounter" are changed to instead use "NewFakeMounter()". The FakeMounter "Log" struct member is changed to not be exported, and instead only access through a new "GetLog()" method.	2019-11-08 08:07:41 -07:00
mrobson	e401ee9158	Errors from cgroup destroy and pid kills are swallowed. Log a warning when that happens.	2019-11-07 07:47:57 -05:00
Kubernetes Prow Robot	73b2c82b28	Merge pull request #83592 from jianzzha/opt-reserved-cpus added --reserved-cpus kubelet command option	2019-11-06 22:14:42 -08:00
Kubernetes Prow Robot	695c3061dd	Merge pull request #82809 from liggitt/go-1.13-no-modules update to use go1.13.4	2019-11-06 17:02:43 -08:00
Kubernetes Prow Robot	08e5781b41	Merge pull request #84525 from klueska/upstream-fix-hint-generation-after-kubelet-restart Fix bug in TopologyManager hint generation after kubelet restart	2019-11-06 15:33:50 -08:00
Jordan Liggitt	297570e06a	hack/update-vendor.sh	2019-11-06 17:42:34 -05:00
Kubernetes Prow Robot	46472773cb	Merge pull request #84836 from yuxiaobo96/k8s-checks Correct spelling mistakes	2019-11-06 12:21:11 -08:00
Kevin Klues	4d4d4bdd61	Ensure devicemanager TopologyHints are regenerated after kubelet restart This patch also includes test to make sure the newly added logic works as expected.	2019-11-06 15:01:34 +00:00
Jianzhu Zhang	89dfd24483	added --reserved-cpus kubelet command option	2019-11-06 07:33:52 -05:00
yuxiaobo	81e9f21f83	Correct spelling mistakes Signed-off-by: yuxiaobo <yuxiaobogo@163.com>	2019-11-06 20:25:19 +08:00
bingshen.wbs	47642a0bad	fix kubelet failed to start on setting hugetlb limits in non-exist cgroup dir cause by kubelet startup be interrupted on setting list of cgroups In the 'cgroupManagerImpl.Exists' not check&recreate the hugetlb cgroup dir. Then setting the limits in non-exist cgroup dir will cause kubelet start failed. Signed-off-by: bingshen.wbs <bingshen.wbs@alibaba-inc.com>	2019-11-06 16:39:55 +08:00
Kubernetes Prow Robot	0c0408c790	Merge pull request #76407 from yanghaichao12/dev0411 change directory permissions from 0755 to 0750	2019-11-05 19:30:59 -08:00
Kevin Klues	9dc116eb08	Ensure CPUManager TopologyHints are regenerated after kubelet restart This patch also includes test to make sure the newly added logic works as expected.	2019-11-05 15:48:51 +00:00
Kevin Klues	a338c8f7fd	Add some more comments to GetTopologyHints() in the devicemanager	2019-11-05 13:06:23 +00:00
Kevin Klues	58f3554ebe	Sync all CPU and device state before generating TopologyHints for them This ensures that we have the most up-to-date state when generating topology hints for a container. Without this, it's possible that some resources will be seen as allocated, when they are actually free.	2019-11-05 13:00:20 +00:00
Kevin Klues	d9adf20360	Abstract removeStaleState from reconcileState in CPUManager This will become especially important as we move to a model where exclusive CPUs are assigned at pod admission time rather than at pod creation time. Having this function will allow us to do garbage collection on these CPUs anytime we are about to allocate CPUs to a new set of containers, in addition to reclaiming state periodically in the reconcileState() loop.	2019-11-05 12:45:11 +00:00
Kevin Klues	b5f52e6072	Modularize TopologyManager policy Merge() tests These changes make it so that a set of common test cases can be used for all merge strategies, with specific test cases being able to be specified on a policy-by-policy basis.	2019-11-04 18:43:07 +01:00
Kevin Klues	7ea1fc9be4	Move TopologyManager TestPolicyMerge() to shared test file	2019-11-04 18:43:07 +01:00
Kevin Klues	d7d7bfcda0	Abstract TopologyManager Policy Merge() tests into their own function	2019-11-04 18:43:07 +01:00
Adrian Chiris	dee22d1fbc	Fix comments in TopologyManager	2019-11-04 18:43:07 +01:00
Adrian Chiris	5f7db54d3c	Move function from top-level TopologyManager to best-effort policy This is in preparation for removing the special-case of the SingleNumaNode policy in mergeProvidersHints() in favor of a custom merging strategy with much less overhead.	2019-11-04 18:43:07 +01:00
Adrian Chiris	d95464645c	Add Merge() API to TopologyManager Policy abstraction This abstraction moves the responsibility of merging topology hints to the individual policies themselves. As part of this, it removes the CanAdmitPodResult() API from the policy abstraction, and rolls it into a second return value from Merge()	2019-11-04 18:43:07 +01:00
Adrian Chiris	78d7856288	Globalize a few TopologyManager functions This is in preparation for a larger refactoring effort that will add a 'Merge()' API to the TopologyManager policy API.	2019-11-04 18:43:07 +01:00
Adrian Chiris	e72847676f	Pass a list of NUMA nodes to the various TopologyManager policies This is in preparation for a larger refactoring effort that will add a 'Merge()' API to the TopologyManager policy API.	2019-11-04 18:43:07 +01:00
Adrian Chiris	6fd8a6eb69	Make restricted TopologyManager policy inherit from best-effort policy These policies only differ on whether they admit the pod or not when a TopologyHint is preferred or not. As such, the restricted policy should simply inherit whatever it can from the best effort policy and only overwrite what is necessary. This does not matter for now, but will become important when we add a new 'Merge()' abstraction to a Policy later on.	2019-11-04 18:43:07 +01:00
Adrian Chiris	3391daeb00	Break TopologyManager.calculateAffinity() into more modular functions This modularization is in preparation for a larger refactoring effort that will add a 'Merge()' API to the TopologyManager policy API.	2019-11-04 18:43:07 +01:00
Adrian Chiris	b17706b149	Added LessThan() and IsEqual() methods for TopologyHints	2019-11-04 18:43:07 +01:00
yanghaichao12	5cbafba457	change directory permissions from 0755 to 0750	2019-11-04 17:04:37 +08:00
Kubernetes Prow Robot	002dbf6a4c	Merge pull request #83777 from lmdaly/fix-single-numa-node-with-best-effort-pods Fixed bug in TopologyManager with SingleNUMANode Policy	2019-11-01 04:53:23 -07:00
Kubernetes Prow Robot	17a57f99d5	Merge pull request #81344 from zouyee/cpm fix cpumanager reconcileState without sourceready	2019-10-30 23:33:36 -07:00
nolancon	b0a85177d2	Clean-up and additional test cases for socket-mask unit test.	2019-10-18 04:16:06 +01:00
Kubernetes Prow Robot	017842d49d	Merge pull request #83492 from ConnorDoyle/topo-align-all-qos Topology manager aligns pods of all QoS classes.	2019-10-11 03:03:40 -07:00
Louise Daly	a353247d44	Fixed bug in TopologyManager with SingleNUMANode Policy This patch fixes an issue where best-effort pods were not admitted to the node if the single-numa-node policy was set. This was because the Admit policy in single-numa-node policy does not admit any pod where the hint is anything but single NUMA node. The 'best hint' in this case is {<set bits for num. Numa Nodes on machine>, true} So on a machine with 2 NUMA nodes the best hint for a best-effort pod is {11,true} as best-effort pods have no Topology preferences. The single-numa-node policy fails any pod with a not preferred hint OR a hint where > 1 bits are set, thus the above example resulting in termintaed pods with a Topology Affinity Error. This is a short term fix for the single-numa-node policy, as there will be code refactoring for the 1.17 release.	2019-10-11 07:00:37 +01:00
Kubernetes Prow Robot	4561b67971	Merge pull request #83697 from klueska/fix-single-numa-with-one-provider Fixed bug in TopologyManager with SingleNUMANode Policy	2019-10-10 19:00:33 -07:00
Kubernetes Prow Robot	3db6d3abcf	Merge pull request #83551 from dims/move-external-facing-kubelet-apis-to-staging Move external facing kubelet apis to staging	2019-10-10 13:41:36 -07:00
Connor Doyle	a598369e3c	Gofmt.	2019-10-10 12:16:21 -07:00
Connor Doyle	a9203ebdcf	Topology manager aligns pods of all QoS classes.	2019-10-10 12:16:21 -07:00
Kevin Klues	5501f542cd	Fixed bug in TopologyManager with SingleNUMANode Policy This patch fixes an issue in the TopologyManager that wouldn't allow pods to be admitted if pods were launched with the SingleNUMANode policy and any of the hint providers had no NUMA preferences. This is due to 2 factors: 1) Any hint provider that passes back a `nil` as its hints, has its hint automatically transformed into a single {11 true} hint before merging 2) We added a special casing for the SingleNumaNodePolicy() in the TopologyManager that essentially turns these hints into a {11 false} anytime a {11 true} is seen. The current patch reworks this logic so the that TopologyManager can tell the difference between a "don't care" hint and a true "{11 true}" hint returned by the hint provider. Only true "{11 true}" hints will be converted by the special casing for the SingleNumaNodePolicy(), while "don't care" hints will not. This is a short term fix for this issue until we do a larger refactoring of this code for the 1.17 release.	2019-10-09 17:41:08 -07:00
mrobson	ad3dcb9fa0	Add podCgroup to process kill events to allow for correlation	2019-10-08 13:12:48 -04:00
Kubernetes Prow Robot	d70b2db1f2	Merge pull request #83296 from yutedz/kill-cgrp-proc Only kill process where killing failed during previous iterations	2019-10-08 07:19:13 -07:00
Kubernetes Prow Robot	3f8f0a32fa	Merge pull request #83527 from odinuge/runc-rc9 Bump dependency opencontainers/runc@v1.0.0-rc9	2019-10-08 03:45:44 -07:00
Davanum Srinivas	f29d2272c8	fix gofmt and golint failures Change-Id: I6535b506f50558b31663a13cd270b15023afa2c6	2019-10-06 18:43:17 -04:00
Kubernetes Prow Robot	48b90db9c3	Merge pull request #83495 from tanjunchen/fix-typo remove the repeat word in documents	2019-10-06 15:05:08 -07:00
Davanum Srinivas	6ecc0f83af	update bazel BUILD files Change-Id: Ia3917cec1453c0b22a958faf8c22bccd79242d14	2019-10-06 15:29:23 -04:00
Davanum Srinivas	d30c489c54	Move pkg/kubelet/pluginregistration and deviceplugin Change-Id: I06adcb43bd278b430ffad2010869e1524c8cc4ff	2019-10-06 15:28:38 -04:00
tanjunchen	de3cf23414	remove the repeat word in documents	2019-10-06 23:32:01 +08:00
Odin Ugedal	b9cfb19321	Rename cgroupsystemd.Manager to LegacyManager	2019-10-05 14:22:35 +02:00
Kubernetes Prow Robot	d60bda1971	Merge pull request #83043 from ConnorDoyle/cleanup-cpumanger-topo-hints Delegate topology hint gen to CPU manager policy	2019-10-05 00:59:39 -07:00
Kevin Klues	d2b53af7d7	Add klueska as reviewer for CPUManager and devicemanager	2019-10-03 13:01:41 -07:00
Ted Yu	6dbb533e3c	Only kill process where killing failed during previous iterations	2019-09-29 19:53:43 -07:00
Connor Doyle	389853894d	Delegate topology hint gen to CPU manager policy - The previous implementation depended on a fixed set of policies.	2019-09-27 22:29:02 -07:00
zouyee	b1f6974f7b	using online instead to fix kubelet service failed with wrong number of possible NUMA nodes Signed-off-by: Zou Nengren <zouyee1989@gmail.com>	2019-09-26 21:48:50 +08:00
zouyee	594fc0f4b9	fix cpumanager reconcileState without sourceready Signed-off-by: Zou Nengren <zouyee1989@gmail.com>	2019-09-25 10:39:06 +08:00
Connor Doyle	e35301c19f	Rename package socketmask to bitmask. - As discussed in reviews and other public channels, this abstraction is used to represent numa nodes, not sockets. - There is nothing inherently related to sockets in this package anyway.	2019-09-23 17:08:45 -07:00
Kubernetes Prow Robot	07cc813956	Merge pull request #81793 from lmdaly/topology-manager-owners Added OWNERS file for Topology Manager	2019-09-11 18:26:52 -07:00
Louise Daly	fbccf25e29	Added OWNERS file for Topology Manager	2019-09-11 06:40:24 +01:00
Kubernetes Prow Robot	887edd2273	Merge pull request #82099 from lmdaly/single-numa-node-policy Topology Manager Policy: single-numa-node	2019-08-30 11:21:26 -07:00
Kubernetes Prow Robot	9165f7bf56	Merge pull request #82104 from klueska/upstream-fix-cpu-manager-topology-bug Fix bug in CPUManager with setting topology for policies	2019-08-30 08:00:44 -07:00
Louise Daly	8ad1b5ba3b	Single-numa-node Topology Manager bug fix Added one off fix for single-numa-node policy to correctly reject pod admission on a resource allocation that spans NUMA nodes Co-authored-by: Kevin Klues <kklues@nvidia.com>	2019-08-30 07:17:56 +01:00
Louise Daly	f6c085f60e	Added Single NUMA Node Policy which ensure resource are aligned on a single NUMA node Co-authored-by: Kevin Klues <kklues@nvidia.com>	2019-08-30 07:17:17 +01:00
Kevin Klues	5ed80dadcf	Update CanAdmitPodResult() in TopologyManager to take a TopologyHint Previously it only took a bool, which limited the logic it could perform to determine if a pod should be admitted or not based on the merged hint from the policy.	2019-08-30 07:17:17 +01:00
Kevin Klues	eb0216e54e	Update semantics to set Preferred field in TopologyHint generation We now only set Preferred to true if resources can be allocated with a size equal to the minimimum _possible_ mask when all resources are available.	2019-08-29 14:32:10 -05:00
Kevin Klues	e0e8b3e4fd	Update CPUManager topology helpers to accept multiple ids	2019-08-29 13:22:54 -05:00
Kevin Klues	dcc9f66311	Add devicemanager tests for TopologyHint consumption	2019-08-29 08:22:50 -05:00
Kevin Klues	cc567afaf0	Consume TopologyHints in the devicemanager	2019-08-29 08:22:50 -05:00
Kevin Klues	a3320f80d9	Add devicemanager tests for TopologyHint generation	2019-08-29 07:45:43 -05:00
Kevin Klues	d3d7a8f5d4	Generate TopologyHints from the devicemanager	2019-08-29 07:45:43 -05:00
Louise Daly	9a118ceac4	Added stub support for Topology Manager to Device Manager Co-authored-by: Conor Nolan <conor.nolan@intel.com> Co-authored-by: Sreemanti Ghosh <sreemanti.ghosh@intel.com> Co-authored-by: Kevin Klues <kklues@nvidia.com>	2019-08-29 07:45:43 -05:00
Kevin Klues	ddfd9ac0ca	Fix bug in CPUManager with setting topology for policies Also add a check in the unit tests to avoid regressions	2019-08-28 17:32:25 -05:00
Kevin Klues	df1b54fc09	Fail fast with TopologyManager on machines with more than 8 NUMA Nodes	2019-08-28 11:04:52 -05:00
Kevin Klues	5660cd3cfb	Add NUMA Node awareness to the TopologyManager	2019-08-28 11:04:52 -05:00
Kubernetes Prow Robot	35867b160a	Merge pull request #81951 from klueska/upstream-update-cpu-amanger-numa-mapping Update the CPUManager to include NUMANodeID in its topology information	2019-08-28 08:55:40 -07:00
Kubernetes Prow Robot	de1cfa9bc1	Merge pull request #81787 from lmdaly/topology-manager-rename-strict-policy Renaming strict policy to restricted policy	2019-08-28 01:38:04 -07:00
Kevin Klues	f4dbd29cdb	Rename TopologyHint.SocketAffinity to TopologyHint.NUMANodeAffinity As part of this, update the logic to use the NUMA information instead of the Socket information when generating and consuming TopologyHints in the CPUManager.	2019-08-27 16:51:05 -05:00
Kevin Klues	ecc14fe661	Update CPUManager to include NUMANodeID in CPUTopology Unfortunately, the NUMA information is not readily available from cadvisor, so we have to roll the logic to discover it by hand. In the future, we should remove this custiom code to use the information provided by cadvisor once it is made available.	2019-08-27 16:51:05 -05:00
Kevin Klues	869962fa48	Cache the discovered topology in the CPUManager instead of MachineInfo	2019-08-27 16:23:07 -05:00
Kubernetes Prow Robot	a3488b4cee	Merge pull request #81206 from tallclair/staticcheck-kubelet-push Cleanup Kubelet static analysis issues	2019-08-22 15:09:43 -07:00
Kubernetes Prow Robot	6b47754740	Merge pull request #81627 from tallclair/copy Delete duplicate resource.Quantity.Copy()	2019-08-22 11:13:13 -07:00
Louise Daly	2fb94231d0	Renaming strict policy to restricted policy Restricted policy will fail admission of guaranteed pods where all requested resources are not available on a single NUMA Node	2019-08-22 07:57:55 +01:00
Tim Allclair	a2c51674cf	Cleanup more static check issues (S1,ST)	2019-08-21 10:40:21 -07:00
Tim Allclair	8a495cb5e4	Clean up error messages (ST1005)	2019-08-21 10:40:21 -07:00
Tim Allclair	6510d26b6a	Fix misc static check issues	2019-08-21 10:40:21 -07:00
Tim Allclair	3f510c69f6	Remove dead code from pkg/kubelet/...	2019-08-21 10:40:21 -07:00
Tim Allclair	49f50484b8	Delete duplicate resource.Quantity.Copy()	2019-08-19 17:23:14 -07:00
Kevin Klues	4fdd52b058	Update GetTopologyHints() API to return a map At present, there is no way for a hint provider to return distinct hints for different resource types via a call to GetTopologyHints(). This means that hint providers that govern multiple resource types (e.g. the devicemanager) must do some sort of "pre-merge" on the hints it generates for each resource type before passing them back to the TopologyManager. This patch changes the GetTopologyHints() interface to allow a hint provider to pass back raw hints for each resource type, and allow the TopologyManager to merge them using a single unified strategy. This change also allows the TopologyManager to recognize which resource type a set of hints originated from, should this information become useful in the future.	2019-08-16 08:06:12 +02:00
Kubernetes Prow Robot	f2dd24820a	Merge pull request #73920 from nolancon/topology-manager-cpu-manager Changes to make CPU Manager a Hint Provider for Topology Manager	2019-08-15 05:44:33 -07:00
Kevin Klues	b3f4bed97f	Add CPUManager tests for TopologyHint consumption	2019-08-14 06:22:56 +02:00
Kevin Klues	8278d1134c	Consume TopologyHints in the CPUManager Co-Authored-By: Conor Nolan <conor.nolan@intel.com>	2019-08-14 06:22:56 +02:00
Sreemanti Ghosh	7c626a2a00	Add CPUManager tests for TopologyHint generation Co-Authored-By: Conor Nolan <conor.nolan@intel.com> Co-Authored-By: Kevin Klues <kklues@nvidia.com>	2019-08-14 06:22:56 +02:00
Kevin Klues	156b3f6af8	Generate TopologyHints from the CPUManager	2019-08-14 06:22:56 +02:00
Kevin Klues	9a6788cb13	Add IterateSocketMasks() function to socketmask abstraction	2019-08-14 06:22:56 +02:00
Kubernetes Prow Robot	ac2295a24d	Merge pull request #78587 from kad/socketmask-string Use go standard library for common bit operations	2019-08-13 00:03:39 -07:00
Kubernetes Prow Robot	d47f9ff132	Merge pull request #81086 from dims/fix-incorrect-readlink-check-for-checking-kernel-pids [TOB-K8S-027] Fix Incorrect isKernelPid check	2019-08-08 17:58:04 -07:00
Davanum Srinivas	bd925d6611	[TOB-K8S-027] Fix Incorrect isKernelPid check isKernelPid should explicitly check the error returned from os.Readlink and return true only if the error value is ENOENT. Without this fix, if Readlink returned say ENAMETOOLONG or EACESS, we would still count the process as a kernel process (which is not true).	2019-08-07 11:19:19 -04:00
Davanum Srinivas	bc71c23bee	[TOB-K8S-025] Incorrect docker daemon process name in container manager The container manager used in kubelet checks for docker daemon process either via pidfile or process name. While the pidfile points to the docker daemon process PID, the dockerProcessName constant stores a docker cli name ( docker ) instead of docker daemon name ( dockerd ).	2019-08-07 10:59:37 -04:00
Conor Nolan	e33af11add	Add stub support for TopologyManager to CPUManager Co-Authored-By: Louise Daly <louise.m.daly@intel.com>	2019-08-07 15:56:05 +02:00
Jianfei Bai	5726b22fbc	Move docker specific const to dockershim.	2019-08-05 10:28:08 +08:00
Kubernetes Prow Robot	c63000ef81	Merge pull request #78793 from mattjmcnaughton/mattjmcnaughton/78629-fix-reserved-cgroup-systemd Fix reserved cgroup systemd	2019-08-02 17:23:52 -07:00
Kubernetes Prow Robot	93e6fb30f0	Merge pull request #74357 from lmdaly/topology-manager-container-manager Updates to container manager and internal container lifecycle to accommodate TopologyManager	2019-08-01 11:52:17 -07:00
Kubernetes Prow Robot	1a8844cd03	Merge pull request #80683 from moshe010/rename_files TopologyManager: Fix rename best-effort policy files	2019-07-31 00:25:00 -07:00
Kubernetes Prow Robot	320bc21dbe	Merge pull request #78762 from klueska/upstream-inherit-cpus-from-init-containers Proactively remove init Containers in CPUManager static policy	2019-07-30 03:35:18 -07:00
Moshe Levi	3b83c5c7c6	TopologyManager: Fix rename best-effort policy files PR https://github.com/kubernetes/kubernetes/pull/80301 rename the preferred policy to best-effort, but the files names are still policy_preferred.go and policy_preferred_test.go. This PR fix that.	2019-07-28 19:35:16 +03:00
Kevin Klues	9f36f1a173	Add tests for proactive init Container removal in the CPUManager static policy	2019-07-26 14:34:51 +02:00
Kevin Klues	6a7db380de	Add tests for new containertMap type in the CPUManager	2019-07-26 14:34:51 +02:00
Kevin Klues	c6d9bbcb74	Proactively remove init Containers in CPUManager static policy This patch fixes a bug in the CPUManager, whereby it doesn't honor the "effective requests/limits" of a Pod as defined by: https://kubernetes.io/docs/concepts/workloads/pods/init-containers/#resources The rule states that a Pod’s "effective request/limit" for a resource should be the larger of: * The highest of any particular resource request or limit defined on all init Containers * The sum of all app Containers request/limit for a resource Moreover, the rule states that: * The effective QoS tier is the same for init Containers and app containers alike This means that the resource requests of init Containers and app Containers should be able to overlap, such that the larger of the two becomes the "effective resource request/limit" for the Pod. Likewise, if a QoS tier of "Guaranteed" is determined for the Pod, then both init Containers and app Containers should run in this tier. In its current implementation, the CPU manager honors the effective QoS tier for both init and app containers, but doesn't honor the "effective request/limit" correctly. Instead, it treats the "effective request/limit" as: * The sum of all init Containers plus the sum of all app Containers request/limit for a resource It does this by not proactively removing the CPUs given to previous init containers when new containers are being created. In the worst case, this causes the CPUManager to give non-overlapping CPUs to all containers (whether init or app) in the "Guaranteed" QoS tier before any of the containers in the Pod actually start. This effectively blocks these Pods from running if the total number of CPUs being requested across init and app Containers goes beyond the limits of the system. This patch fixes this problem by updating the CPUManager static policy so that it proactively removes any guaranteed CPUs it has granted to init Containers before allocating CPUs to app containers. Since all init container are run sequentially, it also makes sure this proactive removal happens for previous init containers when allocating CPUs to later ones.	2019-07-26 14:34:51 +02:00
Kevin Klues	7eccc71c9e	Rename 'preferred' TopologyManager policy to 'best-effort'	2019-07-25 10:44:36 +02:00
Louise Daly	9f0081cc36	Updates to container manager and internal container lifecycle to accommodate Topology Manager Co-authored-by: Conor Nolan <conor.nolan@intel.com>	2019-07-24 08:09:38 +01:00
Kubernetes Prow Robot	5b496fe8f5	Merge pull request #80315 from klueska/upstream-cleanup-socketmask Cleanup the TopologyManager socketmask abstraction	2019-07-23 11:40:28 -07:00
Kevin Klues	65b07312b0	Cleanup comments in TopologyManager socketmask abstraction	2019-07-18 18:52:19 -07:00
Kevin Klues	0edfd4be35	Add package level And/Or calls to TopologyManager socketmask abstraction	2019-07-18 09:06:51 -07:00
Kevin Klues	434fd34e0b	Add NewEmtpySocketMask() call to TopologyManager socketmask abstraction	2019-07-18 09:05:55 -07:00
Kevin Klues	4ee5d5409e	Update the topologymanager to error out if an invalid policy is given Previously, the topologymanager would simply fall back to the None() policy if an invalid policy was specified. This patch updates this to return an error when an invalid policy is passed, forcing the kubelet to fail fast when this occurs. These semantics should be preferable because an invalid policy likely indicates operator error in setting the policy flag on the kubelet correctly (e.g. misspelling 'strict' as 'striict'). In this case it is better to fail fast so the operator can detect this and correct the mistake, than to mask the error and essentially disable the topologymanager unexpectedly.	2019-07-18 13:24:09 +02:00
Kevin Klues	5dc5f1de06	Update the cpumanager to error out if an invalid policy is given Previously, the cpumanager would simply fall back to the None() policy if an invalid policy was specified. This patch updates this to return an error when an invalid policy is passed, forcing the kubelet to fail fast when this occurs. These semantics should be preferable because an invalid policy likely indicates operator error in setting the policy flag on the kubelet correctly (e.g. misspelling 'static' as 'statiic'). In this case it is better to fail fast so the operator can detect this and correct the mistake, than to mask the error and essentially disable the cpumanager unexpectedly.	2019-07-18 13:24:09 +02:00
Kubernetes Prow Robot	1125054612	Merge pull request #80235 from moshe010/remove_string Remove unnecessary string() from policy_none	2019-07-17 19:34:49 -07:00
Louise Daly	9d7e31e66e	Topology Manager Implementation based on Interfaces Co-authored-by: Kevin Klues <kklues@nvidia.com> Co-authored-by: Conor Nolan <conor.nolan@intel.com> Co-authored-by: Sreemanti Ghosh <sreemanti.ghosh@intel.com>	2019-07-17 02:30:21 +01:00
Moshe Levi	d52985d3a0	Remove unnecessary string() from policy_none Signed-off-by: Moshe Levi <moshele@mellanox.com>	2019-07-17 01:58:43 +03:00
Kubernetes Prow Robot	4197adaf2d	Merge pull request #79343 from nolancon/topology-manager-none Add Policy None for Topology Manager	2019-07-16 13:22:47 -07:00
Kubernetes Prow Robot	80537a9c5f	Merge pull request #77323 from tedyu/cgroup-mgr-linux Check error return from Update	2019-07-15 14:53:24 -07:00
Kubernetes Prow Robot	923f08e29b	Merge pull request #79900 from mikebrow/todo-cleanup-container-manager-linux update code documentation to reflect change in status	2019-07-11 18:33:35 -07:00
Kubernetes Prow Robot	920ac08361	Merge pull request #76518 from haiyanmeng/limit Limit the read length of ioutil.ReadAll in `pkg/kubelet` and `pkg/probe`	2019-07-11 17:01:07 -07:00
Kubernetes Prow Robot	f0d1b10092	Merge pull request #77429 from tedyu/container-linux-err Avoid unnecessary concatenation of errors	2019-07-11 14:33:08 -07:00
Haiyan Meng	1f270ef4e2	Limit the read length of ioutil.ReadAll in `pkg/kubelet` and `pkg/probe` Signed-off-by: Haiyan Meng <haiyanmeng@google.com>	2019-07-11 13:18:06 -07:00
Kubernetes Prow Robot	d4d8daea73	Merge pull request #78558 from tedyu/policy-str Remove unnecessary string()	2019-07-11 13:13:06 -07:00
Kubernetes Prow Robot	858fce1634	Merge pull request #79531 from odinuge/kubelet-dead-code Remove unnecessary variable declaration	2019-07-08 14:28:01 -07:00
Mike Brown	6da266784a	update code documentation to reflect change in status Signed-off-by: Mike Brown <brownwm@us.ibm.com>	2019-07-08 16:15:59 -05:00
Odin Ugedal	4ee5fe23e8	Fix cgroup hugetlb size prefix for kB Use the exported list from runc that uses "KB" and not "kB". This issue breaks kubelet on AArch64 (arm 64). var HugePageSizeUnitList = []string{"B", "KB", "MB", "GB", "TB", "PB"} The hugetlb cgroup control files (introduced here in 2012: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=abb8206cb0773) use "KB" and not "kB" (https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/mm/hugetlb_cgroup.c?h=v5.0#n349). The behavior in the kernel has not changed since the introduction, and the current code using "kB" will therefore fail on devices with huge pages smaller than 1MiB. This is the case for AArch64. As seen from the code in "mem_fmt" inside hugetlb_cgroup.c, only "KB", "MB" and "GB" are used, so the others may be removed as well. Here is a real world example of the files inside the "/sys/kernel/mm/hugepages/" directory: - "hugepages-64kB" - "hugepages-2048kB" - "hugepages-32768kB" - "hugepages-1048576kB" And the corresponding cgroup files: - "hugetlb.64KB._____" - "hugetlb.2MB._____" - "hugetlb.32MB._____" - "hugetlb.1GB._____" Signed-off-by: Odin Ugedal <odin@ugedal.com>	2019-06-28 21:28:26 +02:00
Odin Ugedal	2bcdb944f0	Update dependency opencontainer/runc	2019-06-28 21:23:05 +02:00
Odin Ugedal	9c2aa843bd	Remove unnecessary variable declaration	2019-06-28 18:03:23 +02:00
Kubernetes Prow Robot	c64f81d082	Merge pull request #78653 from sjenning/add-sjenning-owners kubelet: add sjenning to kubelet subdirectory owners files	2019-06-25 14:47:15 -07:00

... 2 3 4 5 6 ...

840 Commits