Claudiu Belu
5ba74c81ca
unit tests: Skip flaky tests on Windows
...
Some of the unit tests are currently flaky on Windows. This commit
skips them until they are resolved.
2023-03-06 20:46:05 +00:00
Kubernetes Prow Robot
1b8692ce46
Merge pull request #114296 from cbroglie/concurrent-monitor-node-health
...
controller/nodelifecycle: Make monitorNodeHealth process nodes concurrently
2023-01-12 12:42:54 -08:00
Christopher Broglie
3c88de52c8
controller/nodelifecycle: Make monitorNodeHealth process nodes concurrently
...
Marking the pods not ready on a node requires looping over them and
updating each pod's status one at a time. This is performed serially,
and can take a while if we're processing each node serially as well.
Since the time is spent waiting on I/O, there's an opportunity to go
faster by processing multiple nodes concurrently. This change modifies
the loop to process nodes in parallel, using the same number of workers
as doNodeProcessingPassWorker.
This change also introduces histogram metrics to better observe
monitorNodeHealth.
2023-01-11 12:34:39 -08:00
ialidzhikov
aede3fbf40
pkg/controller: Replace deprecated func usage from the k8s.io/utils/pointer pkg
2022-11-23 17:40:23 +02:00
Jakub Przychodzeń
de25c5fdcf
NodeLifecycleController: Remove race condition
...
A PATCH request does not carry a resourceVersion (RV) by default, so it must be included explicitly. In addition, patching a list overwrites the whole field. Together these create a race condition in which changes to taints made between the GET and PATCH requests can be overwritten.
2022-10-24 19:36:58 +00:00
Neha Lohia
fa1b6765d5
move pkg/util/node to component-helpers/node/util (#105347)
...
Signed-off-by: Neha Lohia <nehapithadiya444@gmail.com>
2021-11-12 07:52:27 -08:00
Mike Dame
4960d0976a
Wire contexts to Core controllers
2021-11-01 10:29:00 -04:00
Michael Beaumont
a5a6762d33
Move pkg/kubelet/apis to k8s.io/kubelet/pkg/apis
2021-02-09 21:37:39 +01:00
Kubernetes Prow Robot
1209c59612
Merge pull request #96876 from howieyuen/no-execute-taint-missing
...
Fix bug where the nodelifecycle controller does not add the NoExecute taint
2021-01-13 14:17:03 -08:00
pacoxu
441985afb6
set LegacyNodeRoleBehavior to false and mv ServiceNodeExclusion to GA
...
Signed-off-by: pacoxu <paco.xu@daocloud.io>
2021-01-05 22:34:18 +08:00
Hao Yuan
5569db4902
Fix bug where the nodelifecycle controller does not add the NoExecute taint
2020-12-14 09:34:57 +08:00
Tim Hockin
819ff9b087
Use topology labels instead of old beta names (#96033)
...
* Rename const for topology.../zone
* Rename const for topology.../region
* Rename const for failure-domain.../zone
* Rename const for failure-domain.../region
* Restore old names for compat
2020-11-05 20:26:50 -08:00
wawa0210
54c0f8b677
Remove the 'beta' version of the node label (os and arch types)
2020-05-13 22:51:52 +08:00
skilxn-go
6b8fc8dc5e
Move TaintBasedEvictions feature gates to GA
2020-03-09 10:49:00 +08:00
taesun_lee
79680b5d9b
Fix pkg/controller typos in some error messages, comments etc
...
- applied review results by LuisSanchez
- Co-Authored-By: Luis Sanchez <sanchezl@redhat.com>
genernal -> general
iniital -> initial
initalObjects -> initialObjects
intentionaly -> intentionally
inforer -> informer
anotother -> another
triger -> trigger
mutli -> multi
Verifyies -> Verifies
valume -> volume
unexpect -> unexpected
unfulfiled -> unfulfilled
implenets -> implements
assignement -> assignment
expectataions -> expectations
nexpected -> unexpected
boundSatsified -> boundSatisfied
externel -> external
calcuates -> calculates
workes -> workers
unitialized -> uninitialized
afater -> after
Espected -> Expected
nodeMontiorGracePeriod -> NodeMonitorGracePeriod
estimateGrracefulTermination -> estimateGracefulTermination
secondrary -> secondary
ShouldRunDaemonPodOnUnscheduableNode -> ShouldRunDaemonPodOnUnschedulableNode
rrror -> error
expectatitons -> expectations
foud -> found
epackage -> package
succesfulJobs -> successfulJobs
namesapce -> namespace
ConfigMapResynce -> ConfigMapResync
2020-02-27 00:15:33 +09:00
Mike Danese
bfc75d9a5c
manual fixes
2020-02-08 12:32:33 -05:00
Mike Danese
2637772298
some manual fixes
2020-02-07 18:17:40 -08:00
Mike Danese
3aa59f7f30
generated: run refactor
2020-02-07 18:16:47 -08:00
Kubernetes Prow Robot
ae15368355
Merge pull request #84351 from wojtek-t/promote_node_lease_to_GA
...
Promote node lease to GA
2019-11-08 09:00:15 -08:00
Andrew Sy Kim
07229d6c51
pkg/util/node: update GetZoneKey to check both beta and GA labels
...
Signed-off-by: Andrew Sy Kim <kiman@vmware.com>
2019-11-07 21:22:04 -05:00
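Checking both the beta and GA labels, as commit 07229d6c51 describes for GetZoneKey, amounts to a lookup with a fallback. The sketch below is not the actual GetZoneKey implementation: `zoneAndRegion` and `lookup` are hypothetical helpers, and preferring the GA label over the beta one is an assumption of this sketch (the real precedence may differ). The four label keys are the well-known topology and failure-domain names.

```go
package main

import "fmt"

const (
	labelZoneGA     = "topology.kubernetes.io/zone"
	labelRegionGA   = "topology.kubernetes.io/region"
	labelZoneBeta   = "failure-domain.beta.kubernetes.io/zone"
	labelRegionBeta = "failure-domain.beta.kubernetes.io/region"
)

// lookup returns the value of the first key present in labels, or "".
func lookup(labels map[string]string, keys ...string) string {
	for _, k := range keys {
		if v, ok := labels[k]; ok {
			return v
		}
	}
	return ""
}

// zoneAndRegion reads zone and region from node labels, trying the GA label
// first and falling back to the beta label (ordering is an assumption).
func zoneAndRegion(labels map[string]string) (zone, region string) {
	zone = lookup(labels, labelZoneGA, labelZoneBeta)
	region = lookup(labels, labelRegionGA, labelRegionBeta)
	return zone, region
}

func main() {
	// A node that still carries only the old beta labels.
	oldNode := map[string]string{
		labelZoneBeta:   "zone-1",
		labelRegionBeta: "region-1",
	}
	zone, region := zoneAndRegion(oldNode)
	fmt.Println(zone, region)
}
```

Nodes labeled by older components keep working, while nodes with the GA labels are handled without any extra configuration.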
Kubernetes Prow Robot
8841624982
Merge pull request #81167 from krzysied/node_controller_list
...
Avoid listing pods in NodeLifecycleController
2019-11-07 07:53:10 -08:00
Krzysztof Siedlecki
9406e5bf2a
adding pods lister
2019-11-07 12:02:21 +01:00
Wei Huang
dd74205bcf
Move out const strings in pkg/scheduler/api/well_known_labels.go
2019-11-05 20:56:21 -08:00
wojtekt
ffad401b4e
Promote NodeLease feature to GA
2019-11-05 09:01:12 +01:00
Kubernetes Prow Robot
5e33f3db5f
Merge pull request #84445 from krzysied/node_controller_retry_fix
...
NodeLifecycleController - MarkPodsNotReady retry fix
2019-11-03 20:15:40 -08:00
Krzysztof Siedlecki
3a82f50b2e
adding test for retrying MarkPodsNotReady
2019-10-31 16:56:54 +01:00
draveness
1163a1d51e
feat: update taint nodes by condition to GA
2019-10-19 09:17:41 +08:00
wojtekt
cf9203501e
Switch nodelifecyclecontroller to coordination/v1
2019-10-16 10:59:02 +02:00
Krzysztof Siedlecki
b1dfa83be6
using pod pointers in node lifecycle controller
2019-10-14 12:44:43 +02:00
Krzysztof Siedlecki
8f48896709
adding pods to DeletePods parameters
2019-10-02 13:11:23 +02:00
Krzysztof Siedlecki
99eeab35a3
adding fakeGetPodsAssignedToNode
2019-09-30 11:03:36 +02:00
Krzysztof Siedlecki
029b72b553
adding lock to node data map
2019-09-12 10:23:24 +02:00
Clayton Coleman
2888e6e923
Node lifecycle controller should use a label for excluding nodes
...
The current mechanism for excluding "master" nodes based on node names
is fragile and should be fixed by using a label exclusion similar to
service load balancers. The legacy code path is preserved behind a
defaulted-on gate and will be removed in the future.
2019-08-28 10:29:08 -04:00
Krzysztof Siedlecki
6842e11f7e
removing redundant code
2019-08-21 17:22:06 +02:00
Andrew Kim
c919139245
update import of generic featuregate code from k8s.io/apiserver/pkg/util/feature -> k8s.io/component-base/featuregate
2019-05-08 10:01:50 -04:00
Yu-Ju Hong
bd2301a628
nodelifecycle controller: reconcile node OS/arch labels
2019-03-06 17:26:23 -08:00
Davanum Srinivas
b975573385
move pkg/kubelet/apis/well_known_labels.go to staging/src/k8s.io/api/core/v1/
...
Co-Authored-By: Weibin Lin <linweibin1@huawei.com>
Change-Id: I163b2f2833e6b8767f72e2c815dcacd0f4e504ea
2019-02-05 13:39:07 -05:00
Shiv Nagarajan
36ee154243
remove deprecated taints from 1.9
2019-01-16 21:20:57 -05:00
Kubernetes Prow Robot
0d63cf9caa
Merge pull request #67037 from Huang-Wei/cleanup-ood
...
Clean up logic related to OutOfDisk
2018-12-20 17:30:27 -08:00
Weibin Lin
842bd1e1ec
update deployment, daemonset, replicaset, statefulset to apps/v1
2018-12-19 10:46:45 -05:00
Wei Huang
8f87e71e0c
Clean up logic related to OutOfDisk
...
- clean up OOD logic in scheduling and node controller
- update comments and test cases
2018-12-18 11:28:02 -08:00
andrewsykim
5329f09663
consolidate node deletion logic between node lifecycle and cloud node controller
2018-12-03 13:33:53 -05:00
k8s-ci-robot
d425258532
Merge pull request #69788 from ravisantoshgudimetla/taint-based-eviction
...
Add test cases for taintbasedevictions
2018-10-18 06:34:31 -07:00
k8s-ci-robot
aad6437aa9
Merge pull request #64061 from wgliang/master.remove-unused-code-pkg-controller
...
remove unused code of (pkg/controller)
2018-10-17 19:54:05 -07:00
ravisantoshgudimetla
d281d566b3
Add test cases for taintbasedevictions
2018-10-16 18:56:45 -04:00
tanshanshan
b7c7966b9f
Move pkg/scheduler/algorithm/well_known_labels.go out
2018-10-13 09:10:00 +08:00
Zhen Wang
e35d808aa2
NodeLifecycleController treats node lease renewal as a heartbeat signal
2018-10-11 16:07:15 -07:00
Guoliang Wang
b1ac6df4dc
remove unused code of (pkg/controller)
2018-10-09 08:15:30 +08:00
Walter Fender
f3f46d5f5a
Moving the cloudprovider interface to staging.
...
Individual implementations are not yet being moved.
Fixed all dependencies which call the interface.
Fixed golint exceptions to reflect the move.
Added project info as per @dims and
https://github.com/kubernetes/kubernetes-template-project .
Added dims to the security contacts.
Fixed minor issues.
Added missing template files.
Copied ControllerClientBuilder interface to cp.
This allows us to break the only dependency on K8s/K8s.
Added TODO to ControllerClientBuilder.
Fixed GoDeps.
Factored in feedback from JustinSB.
2018-10-04 14:41:20 -07:00
Zhen Wang
88e7e186f0
Rename node status to node health in NodeLifecycleController
...
Since we are going to treat both node status and node lease as node
heartbeat/health signals, this PR makes the rename changes, so that the
follow-up PRs are easier to review.
2018-10-01 23:19:50 -07:00