This makes the API nicer:
resourceClaims:
- name: with-template
  resourceClaimTemplateName: test-inline-claim-template
- name: with-claim
  resourceClaimName: test-shared-claim
Previously, this was:
resourceClaims:
- name: with-template
  source:
    resourceClaimTemplateName: test-inline-claim-template
- name: with-claim
  source:
    resourceClaimName: test-shared-claim
A more long-term benefit is that other, future alternatives
might not make sense under the "source" umbrella.
This is a breaking change. It's justified because DRA is still
alpha and will have several other API breaks in 1.31.
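In terms of the Go API types, this corresponds roughly to dropping the "source"
wrapper struct and inlining its fields. A sketch of the new shape, inferred
from the YAML above rather than copied from the actual types:

type PodResourceClaim struct {
	// Name uniquely identifies this resource claim inside the pod.
	Name string `json:"name"`

	// Exactly one of the following is set; the "source" wrapper is gone.
	ResourceClaimName         *string `json:"resourceClaimName,omitempty"`
	ResourceClaimTemplateName *string `json:"resourceClaimTemplateName,omitempty"`
}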
The claim parameter key didn't include the namespace of the claim. In the case
where two namespaces used the exact same parameter reference, the "too many
generated parameters" case got triggered incorrectly and lookup could have
returned an object from the wrong namespace.
Found while running the E2E tests in parallel:
message: 'running PreFilter plugin "DynamicResources": multiple generated claim
parameters for ConfigMap. dra-8794/parameters-3 found: [dra-4729/parameters-4
dra-7328/parameters-4 dra-8794/parameters-4 dra-3402/parameters-4 dra-6156/parameters-4
dra-1839/parameters-4 dra-7434/parameters-4 dra-6504/parameters-4]'
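A minimal sketch of the fix, with illustrative names (not the actual code):
the lookup key for generated parameter objects has to include the claim's
namespace.

// Illustrative only: keying generated parameters by kind/name alone made
// identical references in different namespaces collide; including the
// namespace keeps them apart.
type generatedParametersKey struct {
	namespace string
	kind      string // e.g. "ConfigMap"
	name      string
}

func keyForReference(namespace, kind, name string) generatedParametersKey {
	return generatedParametersKey{namespace: namespace, kind: kind, name: name}
}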
Clearing some irrelevant fields in objects caused a flaky data race alert
because in some cases, the objects were pointers into a shared cache. A better
solution is to treat the objects as read-only and ignore the irrelevant fields.
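A minimal illustration of the pattern with a placeholder type (the real
objects are API objects coming from an informer cache):

// Placeholder for an object that may be a pointer into a shared cache.
type cachedObject struct {
	ResourceVersion string // irrelevant for the comparison
	Spec            string
}

// Racy: clearing fields mutates an object that other goroutines may read.
func equalAfterClearing(a, b *cachedObject) bool {
	a.ResourceVersion = ""
	b.ResourceVersion = ""
	return *a == *b
}

// Race-free: treat the objects as read-only and simply ignore the
// irrelevant fields during the comparison.
func equalIgnoringResourceVersion(a, b *cachedObject) bool {
	return a.Spec == b.Spec
}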
Coverage was checked with a cover profile. The biggest remaining gaps are
isSchedulableAfterClaimParametersChange and
isSchedulableAfterClassParametersChange, which will get handled when
refactoring foreachPodResourceClaim
(https://github.com/kubernetes/kubernetes/issues/123697).
When a claim uses structured parameters, as indicated by the resource class
flag, the scheduler is responsible for allocating it. To do this it needs to
gather information about available node resources by watching
NodeResourceSlices and then match the in-tree claim parameters against those
resources.
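Very roughly, and with heavily simplified, hypothetical types (the real
NodeResourceSlice and structured parameter types carry much more
information), the matching boils down to something like:

// Hypothetical, simplified stand-in for a NodeResourceSlice: the named
// resource instances that one driver publishes for one node.
type nodeSlice struct {
	NodeName  string
	Instances []string
}

// nodeCanFit checks whether a node still has enough unused instances for
// the number of requests in the claim parameters.
func nodeCanFit(numRequests int, slices []nodeSlice, nodeName string, inUse map[string]bool) bool {
	free := 0
	for _, slice := range slices {
		if slice.NodeName != nodeName {
			continue
		}
		for _, instance := range slice.Instances {
			if !inUse[instance] {
				free++
			}
		}
	}
	return free >= numRequests
}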
Blocking API calls during a scheduling cycle, as the DRA plugin has been
doing, slow down overall scheduling, i.e. they also affect pods which don't
use DRA.
It is easy to move the blocking calls into a goroutine while the scheduling
cycle ends with "pod unschedulable". The hard part is handling an error when
those API calls then fail in the background. There is a solution for that
(see https://github.com/kubernetes/kubernetes/pull/120963), but it's complex.
Instead, publishing the modified PodSchedulingContext can also be done
later. In the more common case of a pod which is ready for binding except for
its claims, that'll be in PreBind, which runs in a separate goroutine already.
In the less common case that a pod cannot be scheduled, that'll be in
Unreserve, which is still blocking.
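A sketch of the PreBind part, with hypothetical state accessors and publish
helper (the real plugin differs in the details); the point is only that the
API write happens in a phase that already runs in its own goroutine:

import (
	"context"

	v1 "k8s.io/api/core/v1"
	resourcev1alpha2 "k8s.io/api/resource/v1alpha2"
	"k8s.io/kubernetes/pkg/scheduler/framework"
)

// Hypothetical per-pod state: the modified PodSchedulingContext is only
// recorded during the scheduling cycle, not written to the API server.
type schedulingCtxState struct {
	pending *resourcev1alpha2.PodSchedulingContext
}

func (pl *dynamicResources) PreBind(ctx context.Context, cs *framework.CycleState, pod *v1.Pod, nodeName string) *framework.Status {
	state, err := getSchedulingCtxState(cs) // hypothetical accessor
	if err != nil {
		return framework.AsStatus(err)
	}
	if state.pending != nil {
		// Blocking API call, but PreBind already runs outside the main
		// scheduling cycle, so other pods are not delayed.
		if err := pl.publishSchedulingContext(ctx, state.pending); err != nil { // hypothetical helper
			return framework.AsStatus(err)
		}
	}
	return nil
}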
This moves adding a pod to ReservedFor out of the main scheduling cycle into
PreBind. There it is done concurrently in different goroutines. For claims
which were specifically allocated for a pod (the most common case), that
usually makes no difference because the claim is already reserved.
It starts to matter when that pod then cannot be scheduled for other reasons,
because then the claim gets unreserved to allow deallocating it. It also
matters for claims that are created separately and then get used multiple times
by different pods.
Because multiple pods might get added to the same claim in rapid succession
and independently of each other, it makes sense to do all claim status updates via patching:
then it is no longer necessary to have an up-to-date copy of the claim because
the patch operation will succeed if (and only if) the patched claim is valid.
Server-side-apply cannot be used for this because a client always has to send
the full list of all entries that it wants to be set, i.e. it cannot add one
entry unless it knows the full list.
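A sketch of such a patch against the status subresource (close to, but not
necessarily identical with, the actual code). The claim UID in the patch
guards against the claim having been deleted and recreated, and the
reservedFor list is assumed to merge by entry:

import (
	"context"
	"fmt"

	v1 "k8s.io/api/core/v1"
	resourcev1alpha2 "k8s.io/api/resource/v1alpha2"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/types"
	"k8s.io/client-go/kubernetes"
)

// reserveForPod adds the pod to the claim's reservedFor list without
// needing an up-to-date copy of the claim.
func reserveForPod(ctx context.Context, clientset kubernetes.Interface, claim *resourcev1alpha2.ResourceClaim, pod *v1.Pod) error {
	patch := fmt.Sprintf(`{"metadata": {"uid": %q}, "status": {"reservedFor": [{"resource": "pods", "name": %q, "uid": %q}]}}`,
		claim.UID, pod.Name, pod.UID)
	_, err := clientset.ResourceV1alpha2().ResourceClaims(claim.Namespace).Patch(ctx, claim.Name,
		types.StrategicMergePatchType, []byte(patch), metav1.PatchOptions{}, "status")
	return err
}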
When filtering fails because a ResourceClass is missing, we can treat the pod
as "unschedulable" as long as we then also register a cluster event that wakes
up the pod. This is more efficient than periodically retrying.
This is a combination of two related enhancements:
- By implementing a PreEnqueue check, the initial pod scheduling
attempt for a pod with a claim template gets avoided when the claim
does not exist yet.
- By implementing cluster event checks, only those pods get
scheduled for which something changed, and they get scheduled
immediately without delay.
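A sketch of both pieces; the exact event constants, hint functions, and
signatures in the scheduler framework have changed over releases, so take
the names here as assumptions. foreachPodResourceClaim is the helper
mentioned above.

import (
	"context"

	v1 "k8s.io/api/core/v1"
	"k8s.io/kubernetes/pkg/scheduler/framework"
)

// PreEnqueue keeps pods with a claim template out of the active queue until
// their generated ResourceClaims exist.
func (pl *dynamicResources) PreEnqueue(ctx context.Context, pod *v1.Pod) *framework.Status {
	if err := pl.foreachPodResourceClaim(pod, nil); err != nil {
		return framework.NewStatus(framework.UnschedulableAndUnresolvable, err.Error())
	}
	return nil
}

// EventsToRegister declares which cluster events can make a previously
// unschedulable pod schedulable again, e.g. creation or update of a
// ResourceClaim or ResourceClass.
func (pl *dynamicResources) EventsToRegister() []framework.ClusterEventWithHint {
	return []framework.ClusterEventWithHint{
		{Event: framework.ClusterEvent{Resource: "ResourceClaim", ActionType: framework.Add | framework.Update}},
		{Event: framework.ClusterEvent{Resource: "ResourceClass", ActionType: framework.Add | framework.Update}},
	}
}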
Generating the name avoids all potential name collisions. It's not clear how
much of a problem collisions were in practice: users can avoid them, and the
deterministic names for generic ephemeral volumes have not led to reports from
users. But using generated names is not too hard either.
What makes it relatively easy is that the new pod.status.resourceClaimStatuses
map stores the generated name for kubelet and node authorizer, i.e. the
information in the pod is sufficient to determine the name of the
ResourceClaim.
The resource claim controller becomes a bit more complex and now needs
permission to modify the pod status. The new failure scenario of "ResourceClaim
created, updating the pod status fails" is handled with the help of a new special
"resource.kubernetes.io/pod-claim-name" annotation which, together with the owner
reference, identifies exactly which pod claim a ResourceClaim was generated for,
so updating the pod status can be retried for existing ResourceClaims.
The transition from deterministic names is handled with a special case for that
recovery code path: a ResourceClaim with no annotation and a name that follows
the Kubernetes <= 1.27 naming pattern is assumed to be generated for that pod
claim and gets added to the pod status.
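A sketch of that recovery lookup (illustrative only, with the old
deterministic pattern assumed here to be "<pod name>-<pod claim name>"):

import (
	v1 "k8s.io/api/core/v1"
	resourcev1alpha2 "k8s.io/api/resource/v1alpha2"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

const podClaimNameAnnotation = "resource.kubernetes.io/pod-claim-name"

// findGeneratedClaim returns the already existing ResourceClaim that was
// generated for the given pod claim, if any.
func findGeneratedClaim(pod *v1.Pod, podClaimName string, claims []*resourcev1alpha2.ResourceClaim) *resourcev1alpha2.ResourceClaim {
	for _, claim := range claims {
		if !metav1.IsControlledBy(claim, pod) {
			continue
		}
		if name, ok := claim.Annotations[podClaimNameAnnotation]; ok {
			if name == podClaimName {
				return claim
			}
			continue
		}
		// Transition from Kubernetes <= 1.27: no annotation, but the old
		// deterministic name matches, so assume it was generated for this
		// pod claim.
		if claim.Name == pod.Name+"-"+podClaimName {
			return claim
		}
	}
	return nil
}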
There's no immediate need for it, but in case it becomes relevant,
the name of the generated ResourceClaim may also be left unset to record that
no claim was needed. Components processing such a pod can skip whatever they
normally would do for the claim. To ensure that they do and also cover other
cases properly ("no known field is set", "must check ownership"),
resourceclaim.Name gets extended.
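A hedged sketch of a consumer of the extended helper, assuming a signature
along the lines of Name(pod, podClaim) (*string, bool, error) and an
IsForPod-style ownership check:

import (
	v1 "k8s.io/api/core/v1"
	resourcev1alpha2 "k8s.io/api/resource/v1alpha2"
	resourcev1alpha2listers "k8s.io/client-go/listers/resource/v1alpha2"
	"k8s.io/dynamic-resource-allocation/resourceclaim"
)

// claimForPodClaim resolves the ResourceClaim for one pod claim. A nil
// result without error means that no claim was needed for this pod claim.
func claimForPodClaim(pod *v1.Pod, podClaim v1.PodResourceClaim,
	claimLister resourcev1alpha2listers.ResourceClaimLister) (*resourcev1alpha2.ResourceClaim, error) {
	claimName, mustCheckOwner, err := resourceclaim.Name(pod, &podClaim)
	if err != nil {
		return nil, err // not created yet, or no known field is set
	}
	if claimName == nil {
		// The pod status records that no claim was generated on purpose.
		return nil, nil
	}
	claim, err := claimLister.ResourceClaims(pod.Namespace).Get(*claimName)
	if err != nil {
		return nil, err
	}
	if mustCheckOwner {
		// Name came from the old deterministic pattern: verify ownership.
		if err := resourceclaim.IsForPod(pod, claim); err != nil {
			return nil, err
		}
	}
	return claim, nil
}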
The `listAll` function returned a slice where all pointers referred to the same
instance. That instance had the value of the last list entry. As a result, unit
tests only compared that element.
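This is the classic Go loop-variable aliasing mistake (with the pre-Go-1.22
loop semantics); a minimal illustration, not the actual listAll code:

type item struct{ Name string } // placeholder element type

// Buggy (before Go 1.22): &obj always points at the single loop variable,
// so every returned pointer refers to the same instance, which ends up
// holding the value of the last list entry.
func listAllBuggy(items []item) []*item {
	var result []*item
	for _, obj := range items {
		result = append(result, &obj)
	}
	return result
}

// Fixed: take the address of a per-iteration copy (or of the slice element).
func listAllFixed(items []item) []*item {
	var result []*item
	for i := range items {
		obj := items[i]
		result = append(result, &obj)
	}
	return result
}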
During the reserve phase, the first claim gets reserved in two test
cases. Those two tests must expect that change. That hadn't been noticed before
because that first claim didn't get compared.
The name "PodScheduling" was unusual because in contrast to most other names,
it was impossible to put an article in front of it. Now PodSchedulingContext is
used instead.