kubernetes/pkg/controller
Christopher Broglie 3c88de52c8 controller/nodelifecycle: Make monitorNodeHealth process nodes concurrently
Marking the pods not ready on a node requires looping over them and
updating each pod's status one at a time. This is performed serially,
and can take a while if we're processing each node serially as well.

Since the time is spent waiting on io, there's an opportunity to go
faster by processing multiple nodes concurrently. This change modifies
the loop to process nodes in parallel, using the same number of workers
as doNodeProcessingPassWorker.

This change also introduces histogram metrics to better observe
monitorNodeHealth.
2023-01-11 12:34:39 -08:00
..
apis/config refactor: remove deprecated flags 2022-04-22 20:28:12 +08:00
bootstrap remove rate limiter metric as it is not in use 2022-10-13 13:07:11 -07:00
certificates remove rate limiter metric as it is not in use 2022-10-13 13:07:11 -07:00
clusterroleaggregation Lock ServerSideApply feature to true 2022-09-27 13:48:28 +02:00
cronjob Merge pull request #113008 from soltysh/promote_cronjob_metrics 2022-11-04 09:36:27 -07:00
daemon Revert "Update daemonSet status even if syncDaemonSet fails" 2022-11-08 15:01:09 +01:00
deployment Merge pull request #113054 from logicalhan/proxy-metric 2022-10-17 11:09:18 -07:00
disruption Enable the feature into beta 2022-11-09 09:02:40 +01:00
endpoint Merge pull request #112546 from oscr/the-the 2022-10-19 10:03:02 -07:00
endpointslice Added events publishing for topologyHints changes 2022-11-07 19:45:40 +00:00
endpointslicemirroring remove rate limiter metric as it is not in use 2022-10-13 13:07:11 -07:00
garbagecollector Reimplement GC dot graph dumping without gonum library 2022-09-23 13:12:33 -04:00
history convert int32 to pointer using library function 2022-07-01 14:58:26 +08:00
job Wait for Pods to finish before considering Failed in Job (#113860) 2022-11-15 09:44:53 -08:00
namespace remove rate limiter metric as it is not in use 2022-10-13 13:07:11 -07:00
nodeipam Merge pull request #110485 from Octopusjust/k8s-pr 2022-11-08 13:51:00 -08:00
nodelifecycle controller/nodelifecycle: Make monitorNodeHealth process nodes concurrently 2023-01-11 12:34:39 -08:00
podautoscaler Merge pull request #112011 from pbeschetnov/ambiguous-selectors 2022-11-06 21:08:16 -08:00
podgc Enable the feature into beta 2022-11-09 09:02:40 +01:00
replicaset Merge pull request #110747 from harshanarayana/cleanup/GIT-110737/logging-improvements 2022-11-03 00:49:34 -07:00
replication switch listeners to use a map, adapt tests 2022-08-08 14:01:10 -07:00
resourceclaim kube-controller-manager: add ResourceClaim controller 2022-11-10 20:23:50 +01:00
resourcequota quota: add an update filter 2022-07-08 18:39:55 -04:00
serviceaccount remove rate limiter metric as it is not in use 2022-10-13 13:07:11 -07:00
statefulset Fix typo in function emptyInvariants() 2022-11-08 07:48:10 -08:00
storageversiongc pkg/controller/storageversiongc: add constructor function newKubeApiserverLease 2022-11-09 15:52:47 -05:00
testutil Wait for Pods to finish before considering Failed in Job (#113860) 2022-11-15 09:44:53 -08:00
ttl Reduce number of buckets in ttl controller for 2k+ nodes clusters 2022-05-05 12:26:36 +00:00
ttlafterfinished remove rate limiter metric as it is not in use 2022-10-13 13:07:11 -07:00
util Merge pull request #112011 from pbeschetnov/ambiguous-selectors 2022-11-06 21:08:16 -08:00
volume Merge pull request #111577 from brianpursley/troubleshoot-unit-test-flake 2022-11-11 00:44:09 -08:00
controller_ref_manager_test.go Merge pull request #101250 from evertrain/master 2021-11-10 09:19:26 -08:00
controller_ref_manager.go Generate and format files 2022-07-26 13:14:05 -04:00
controller_utils_test.go NodeLifecycleController: Remove race condition 2022-10-24 19:36:58 +00:00
controller_utils.go NodeLifecycleController: Remove race condition 2022-10-24 19:36:58 +00:00
doc.go
lookup_cache.go
OWNERS add myself as approver to pkg/controller 2022-01-12 19:33:02 -05:00