* feature(scheduling_queue): track events per Pod
* fix typos
* record events in one slice and make each in-flight Pod refer to it
* fix: use Pop() in test before AddUnschedulableIfNotPresent to register in-flight Pods
* eliminate MakeNextPodFuncs
* call Done inside the scheduling queue
* fix comment
* implement done() so that it does not require a lock
* fix UTs
* improve the receivedEvents implementation based on suggestions
* call DonePod when we don't call AddUnschedulableIfNotPresent
* fix UT
* use queuehint to filter out events for in-flight Pods
* fix based on suggestion from aldo
* fix based on suggestion from Wei
* rename lastEventBefore → previousEvent
* fix based on suggestion
* address comments from aldo
* fix based on the suggestion from Abdullah
* gate in-flight Pods logic by the SchedulingQueueHints feature gate
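The sketch below illustrates the in-flight Pod tracking these commits describe; the type and field names are invented for illustration and do not match the actual scheduling queue code.

```go
package queuesketch

// clusterEvent is a stand-in for the scheduler's cluster event type.
type clusterEvent struct {
	resource string
	action   string
}

// inFlightTracker records all events in one shared slice; each in-flight Pod
// only stores the index it was popped at, so the events that happened while it
// was being scheduled can be checked against the queueing hints later.
type inFlightTracker struct {
	events   []clusterEvent
	inFlight map[string]int // pod UID -> len(events) at Pop() time
}

// pop registers a Pod as in flight.
func (t *inFlightTracker) pop(podUID string) {
	t.inFlight[podUID] = len(t.events)
}

// done unregisters a Pod; called by the queue itself, or by the caller when
// AddUnschedulableIfNotPresent is never invoked for that Pod.
func (t *inFlightTracker) done(podUID string) {
	delete(t.inFlight, podUID)
}

// eventsSince returns the events recorded while the Pod was in flight.
func (t *inFlightTracker) eventsSince(podUID string) []clusterEvent {
	return t.events[t.inFlight[podUID]:]
}
```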
When someone decides that a Pod should definitely run on a specific node, they
can create the Pod with spec.nodeName already set. Some custom scheduler might
do that. The kubelet then checks the pod and (if DRA is enabled) refuses to
run it, either because the claims are still waiting for their first consumer
or because the pod wasn't added to reservedFor. Both are steps that the
scheduler normally handles.
A pod can also reach the same state if it got scheduled while the DRA feature
was off in the kube-scheduler.
The resource claim controller can handle these two cases by taking over for the
kube-scheduler when nodeName is set. Triggering an allocation is simpler than
in the scheduler because all it takes is creating the right
PodSchedulingContext with spec.selectedNode set. There's no need to list nodes
because that choice was already made, permanently. Adding the pod to
reservedFor also isn't hard.
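A minimal sketch of that controller-side handling, assuming the resource.k8s.io/v1alpha2 API; the helper name is hypothetical and this is not the controller's actual code:

```go
package resourceclaim

import (
	v1 "k8s.io/api/core/v1"
	resourcev1alpha2 "k8s.io/api/resource/v1alpha2"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// schedulingContextForPod builds the PodSchedulingContext that triggers
// allocation for a Pod whose spec.nodeName is already set: selectedNode is
// simply the chosen node, and no potential nodes need to be listed.
func schedulingContextForPod(pod *v1.Pod) *resourcev1alpha2.PodSchedulingContext {
	return &resourcev1alpha2.PodSchedulingContext{
		ObjectMeta: metav1.ObjectMeta{
			Name:      pod.Name,
			Namespace: pod.Namespace,
		},
		Spec: resourcev1alpha2.PodSchedulingContextSpec{
			SelectedNode: pod.Spec.NodeName,
		},
	}
}
```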
What's currently missing is triggering de-allocation of claims to re-allocate
them for the desired node. This is not important for claims that get created
for the pod from a template and then only get used once, but it might be
worthwhile to add de-allocation in the future.
Merely deleting the namespace is not enough:
- Workloads might rely on the garbage collector to get rid of obsolete objects,
so we should run it to be on the safe side.
- Pods must be force-deleted because the kubelet is not running (see the
  sketch after this list).
- Finally, the namespace controller is needed to get rid of
deleted namespaces.
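The force-delete mentioned above is just a delete with a zero grace period. A minimal sketch, with hypothetical helper and parameter names:

```go
package framework

import (
	"context"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// forceDeletePod removes a Pod immediately. Without a running kubelet the Pod
// would otherwise stay in Terminating forever.
func forceDeletePod(ctx context.Context, client kubernetes.Interface, namespace, name string) error {
	gracePeriod := int64(0)
	return client.CoreV1().Pods(namespace).Delete(ctx, name, metav1.DeleteOptions{
		GracePeriodSeconds: &gracePeriod,
	})
}
```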
* Skip terminal Pods with a deletion timestamp from the DaemonSet sync
Change-Id: I64a347a87c02ee2bd48be10e6fff380c8c81f742
* Review comments and fix integration test
Change-Id: I3eb5ec62bce8b4b150726a1e9b2b517c4e993713
* Include deleted terminal pods in history
Change-Id: I8b921157e6be1c809dd59f8035ec259ea4d96301
The default scheduler configuration must be based on the v1 API where the
plugin is enabled by default. Then if (and only if) the
DynamicResourceAllocation feature gate for a test is set, the corresponding
API group also gets enabled.
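A sketch of how a test can opt in, using the feature gate test helper from k8s.io/component-base/featuregate/testing; the test name is hypothetical and the exact helper signature should be treated as an assumption:

```go
package benchmark

import (
	"testing"

	utilfeature "k8s.io/apiserver/pkg/util/feature"
	featuregatetesting "k8s.io/component-base/featuregate/testing"
	"k8s.io/kubernetes/pkg/features"
)

func TestSchedulingWithDRA(t *testing.T) {
	// Enable the gate for this test only; the returned function restores the
	// previous value when the test ends.
	defer featuregatetesting.SetFeatureGateDuringTest(t, utilfeature.DefaultFeatureGate, features.DynamicResourceAllocation, true)()

	// ... set up the apiserver with the resource.k8s.io API group enabled and
	// run the scheduler_perf test cases ...
}
```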
The normal dynamic resource claim controller is started if needed to create
ResourceClaims from ResourceClaimTemplates.
Without the upcoming optimizations in the scheduler, scheduling with dynamic
resources is fairly slow. The new test cases take around 15 minutes wall clock
time on my desktop.
This will change when adding dynamic resource allocation test cases. Instead of
changing mustSetupScheduler and StartScheduler for that, let's return the
informer factory and create informers as needed in the test.
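An illustrative sketch, with hypothetical function names, of what a test can do with the returned factory:

```go
package benchmark

import (
	"k8s.io/client-go/informers"
)

// newTestInformers shows the idea: instead of mustSetupScheduler creating every
// informer up front, a test asks the shared factory for the ones it uses.
func newTestInformers(informerFactory informers.SharedInformerFactory) {
	podInformer := informerFactory.Core().V1().Pods()
	_ = podInformer.Lister()
	// A dynamic resource allocation test would additionally request the
	// ResourceClaim/ResourceClaimTemplate informers here.
}
```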
PVCs using the ReadWriteOncePod access mode can only be referenced by a
single pod. When a pod is scheduled that uses a ReadWriteOncePod PVC,
return "Unschedulable" if the PVC is already in-use in the cluster.
To support preemption, the "VolumeRestrictions" scheduler plugin
computes cycle state during the PreFilter phase. This cycle state
contains the number of references to the ReadWriteOncePod PVCs used by
the pod-to-be-scheduled.
During scheduler simulation (AddPod and RemovePod), we add and remove
reference counts from the cycle state if they use any of these
ReadWriteOncePod PVCs.
In the Filter phase, the scheduler checks if there are any PVC reference
conflicts, and returns "Unschedulable" if there is a conflict.
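The sketch below shows only the reference-counting idea; the type and method names are illustrative, not the plugin's real code:

```go
package volumerestrictions

// preFilterState is the cycle-state payload: for every ReadWriteOncePod PVC
// the incoming pod wants, it counts how many existing pods already reference it.
type preFilterState struct {
	conflictRefs map[string]int // PVC name -> number of referencing pods
}

// addPod / removePod mirror the scheduler's AddPod/RemovePod simulation hooks.
func (s *preFilterState) addPod(pvcs []string) {
	for _, pvc := range pvcs {
		if _, tracked := s.conflictRefs[pvc]; tracked {
			s.conflictRefs[pvc]++
		}
	}
}

func (s *preFilterState) removePod(pvcs []string) {
	for _, pvc := range pvcs {
		if refs, tracked := s.conflictRefs[pvc]; tracked && refs > 0 {
			s.conflictRefs[pvc]--
		}
	}
}

// hasConflicts is what Filter consults: any non-zero count means the PVC is
// still in use and the pod is Unschedulable.
func (s *preFilterState) hasConflicts() bool {
	for _, refs := range s.conflictRefs {
		if refs > 0 {
			return true
		}
	}
	return false
}
```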
This is a required feature for the ReadWriteOncePod beta. For more context, see:
https://github.com/kubernetes/enhancements/tree/master/keps/sig-storage/2485-read-write-once-pod-pv-access-mode#beta
Tests scheduler enforcement of the ReadWriteOncePod PVC access mode.
- Creates a pod using a PVC with ReadWriteOncePod
- Creates a second pod using the same PVC
- Observes that the second pod fails to schedule because the PVC is in use
- Deletes the first pod
- Observes that the second pod schedules successfully
This patch aims to simplify decoupling "pkg/scheduler/framework/plugins"
from internal "k8s.io/kubernetes" packages. More details are in
issue #89930 and PR #102953.
Some helpers moved from the "k8s.io/kubernetes/pkg/controller/volume/persistentvolume"
package to the "k8s.io/component-helpers/storage/volume" package:
- IsDelayBindingMode
- GetBindVolumeToClaim
- IsVolumeBoundToClaim
- FindMatchingVolume
- CheckVolumeModeMismatches
- CheckAccessModes
- GetVolumeNodeAffinity
Also "CheckNodeAffinity" from "k8s.io/kubernetes/pkg/volume/util"
package moved to "k8s.io/component-helpers/storage/volume" package
to prevent diamond dependency conflict.
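For callers the change is only the import path; a small illustrative example (the wrapper function is hypothetical):

```go
package example

import (
	v1 "k8s.io/api/core/v1"

	// was: pvutil "k8s.io/kubernetes/pkg/controller/volume/persistentvolume"
	storagehelpers "k8s.io/component-helpers/storage/volume"
)

// isBound simply forwards to the relocated helper; the function names did not
// change, only the package they live in.
func isBound(pv *v1.PersistentVolume, pvc *v1.PersistentVolumeClaim) bool {
	return storagehelpers.IsVolumeBoundToClaim(pv, pvc)
}
```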
Signed-off-by: Konstantin Misyutin <konstantin.misyutin@huawei.com>