kubernetes

Author	SHA1	Message	Date
Guoliang Wang	9d173852c1	Add a metric to track number of scheduler prioritizing goroutines	2019-10-21 16:38:14 +08:00
Guoliang Wang	08f7b22025	Add a metric to track number of scheduler binding goroutines	2019-10-21 16:38:12 +08:00
Kubernetes Prow Robot	ac6c77c440	Merge pull request #84121 from zouyee/renamefit rename FilterPlugin NodeResources	2019-10-20 20:45:37 -07:00
caiweidong	909be01bd6	Add an event to pvc when node expand successfully	2019-10-21 11:33:46 +08:00
Clayton Coleman	bd9260711f	deployment: Ignore namespace termination errors when creating replicasets Instead of reporting an event or displaying an error, simply exit when the namespace is being terminated. This reduces the amount of controller churn on namespace shutdown. Unlike other controllers, we drop the replica set create error very late (in the queue handleErr) in order to avoid changing the structure of the controller substantially.	2019-10-20 18:39:01 -04:00
Clayton Coleman	c6e34e58c5	job: Ignore namespace termination errors when creating pods or jobs Instead of reporting an event or displaying an error, simply exit when the namespace is being terminated. This reduces the amount of controller churn on namespace shutdown. While we could technically exit the entire processing loop early for very large jobs, we should wait for more evidence that is an issue before changing that logic substantially.	2019-10-20 18:39:01 -04:00
Clayton Coleman	8f74c8970b	daemonset: Ignore namespace termination errors when creating pods Instead of reporting an event or displaying an error, simply exit when the namespace is being terminated. This reduces the amount of controller churn on namespace shutdown. While we could technically exit the entire processing loop early for very large daemon sets, we should wait for more evidence that is an issue before changing that logic substantially.	2019-10-20 18:39:00 -04:00
Clayton Coleman	2e8ace82eb	replicaset: Ignore namespace termination errors when creating pods Instead of reporting an event or displaying an error, simply exit when the namespace is being terminated. This reduces the amount of controller churn on namespace shutdown. While we could technically exit the entire processing loop early for very large replica sets, we should wait for more evidence that is an issue before changing that logic substantially.	2019-10-20 18:39:00 -04:00
Clayton Coleman	dc0c21c7d7	serviceaccount: If namespace is terminating, ignore create errors In some scenarios the service account and token controllers can race with namespace deletion, causing a burst of errors as they attempt to recreate secrets being deleted. Instead, detect these errors and do not retry.	2019-10-20 18:39:00 -04:00
Clayton Coleman	937ef77257	endpoints: If namespace is terminating, drop item immediately Avoid sending an event to the namespace that is being terminated, since it will be rejected.	2019-10-20 18:38:59 -04:00
walter	6991069e31	Push context up to cloud node controller. This adds context to the cloud node controller. It continues the propogation started in 59287. Fixes 815. Fixed test code calls.	2019-10-20 11:20:49 -07:00
Kubernetes Prow Robot	019b662ff5	Merge pull request #84017 from ahg-g/ahg-csi Remove CSINode from scheduler cache.	2019-10-20 03:17:37 -07:00
zouyee	04340eaa34	rename FilterPlugin NodeResources Signed-off-by: Zou Nengren <zouyee1989@gmail.com>	2019-10-20 12:51:16 +08:00
Clayton Coleman	2ddeb94b56	storage: Deleting a namespace while spec.finalizers pending should not error All objects with graceful deletion allow multiple DELETE calls in the pending state. Namespace is the one outlier, and the error here predates graceful deletion and finalizers. We should make this behavior consistent with other calls and merely indicate success and return the state of the object, the same as if there were pending metadata finalizers. Clients that previously checked for a conflict error during delete to know that the server is already deleting will now no longer receive an error (as if the object were rapidly deleted). There is a small chance that some clients have error checking for this state, but a much larger chance that clients that want to trigger a delete of the namespace do not handle this error case. Discovered in an e2e test that used the framework namespace and triggered deletion of that ns itself, and then the AfterEach step in e2e failed because the namespace was already pending deletion.	2019-10-19 23:08:17 -04:00
Ted Yu	0d704f1ce2	Traverse OwnerReference maps more efficiently	2019-10-19 17:56:11 -07:00
Kubernetes Prow Robot	e1685b5b59	Merge pull request #84074 from zouyee/proirity LeastRequestedPriority/MostRequestedPriority/BalancedResourceAllocation as Score plugins	2019-10-19 17:21:37 -07:00
Clayton Coleman	3c44e11cfa	kubelet: Record preemptions similarly to evictions A preemption is a disruption event that should have a metric so that the rate of preemption can be assessed. Nodes that are under heavy preemption may have conflicting workloads or otherwise need attention. A sudden burst of preemption on a cluster in steady state could indicate pathological conditions within the scheduler or workload controllers.	2019-10-19 19:07:37 -04:00
Kubernetes Prow Robot	9c25b16d88	Merge pull request #84099 from draveness/feature/remove-lister-wrapper feat(scheduler): replace several algorithm listers with client listers	2019-10-19 08:05:37 -07:00
zouyee	bd4167d149	remove unused meta and rename lablance_allocated Signed-off-by: Zou Nengren <zouyee1989@gmail.com>	2019-10-19 22:43:15 +08:00
Kubernetes Prow Robot	7e53c9d808	Merge pull request #83756 from hex108/permit Refactor scheduler's framework permit API	2019-10-19 06:47:37 -07:00
zouyee	408c9da2a6	LeastRequestedPriority/MostRequestedPriority/BalancedResourceAllocation as Score plugins Signed-off-by: Zou Nengren <zouyee1989@gmail.com>	2019-10-19 20:49:05 +08:00
draveness	ce33fcc311	feat: remove FakePDBLister	2019-10-19 19:10:42 +08:00
draveness	00a12c787c	feat: implement node unschedulable as a filter plugin	2019-10-19 17:29:25 +08:00
draveness	e1f86e3460	feat(scheduler): replace several algorithm listers with client listers	2019-10-19 17:26:32 +08:00
Jun Gong	38b7668bb3	Refactor scheduler's framework permit API	2019-10-19 16:22:39 +08:00
Kubernetes Prow Robot	2b79d284fd	Merge pull request #84066 from Huang-Wei/eps-migration Migrate EvenPodsSpread Predicate to Filter plugin	2019-10-18 21:54:12 -07:00
Abdullah Gharaibeh	a772722660	Remove CSINode from scheduler cache.	2019-10-19 03:52:22 +00:00
Kubernetes Prow Robot	aab740ffc2	Merge pull request #82703 from draveness/feature/graduate-taint-nodes-by-condition-to-ga feat: update taint nodes by condition to GA	2019-10-18 20:01:37 -07:00
draveness	1163a1d51e	feat: update taint nodes by condition to GA	2019-10-19 09:17:41 +08:00
Kubernetes Prow Robot	98fcf2e6c7	Merge pull request #84034 from mrkm4ntr/use-informer-factory Use frameworkHandle to get listers	2019-10-18 15:21:57 -07:00
Kubernetes Prow Robot	70f68062ad	Merge pull request #84014 from ahg-g/ahg-tree Make node tree order part of the snapshot	2019-10-18 15:21:37 -07:00
Wei Huang	64ff958e69	migrate EvenPodsSpread Predicate to Filter plugin	2019-10-18 14:18:02 -07:00
Kubernetes Prow Robot	de9a7d863d	Merge pull request #83934 from wccsama/wcc-service-dev Convert error messages to use event recorder	2019-10-18 12:37:46 -07:00
Abdullah Gharaibeh	63d7733e98	create an ordered list of nodes instead of iterating over the tree	2019-10-18 12:51:46 -04:00
Shintaro Murakami	bf256bcb00	Use frameworkHandle to get listers	2019-10-19 01:29:05 +09:00
Kubernetes Prow Robot	27c679baca	Merge pull request #83982 from lichuqiang/frame_mig1 [migration phase 1] MatchInterPodAffinity as filter plugin	2019-10-18 09:23:58 -07:00
Kubernetes Prow Robot	422256110e	Merge pull request #84073 from draveness/feature/cleanup-framework-plugins feat: several cleanups in the scheduling package	2019-10-18 04:43:57 -07:00
lichuqiang	671f7690fe	[migration phase 1] MatchInterPodAffinity as filter plugin	2019-10-18 16:26:34 +08:00
preisinger	d6431fbdfa	Bugfix kube-proxy README file to list ipvs modules	2019-10-18 09:25:28 +01:00
wccsama	18cf49e3df	Convert error messages to use event recorder remove mix protocol validation remove check nil	2019-10-18 13:30:00 +08:00
Kubernetes Prow Robot	d1a79f136b	Merge pull request #84054 from ahg-g/ahg-gp GeneralPredicate as framework plugin config	2019-10-17 21:19:58 -07:00
nolancon	b0a85177d2	Clean-up and additional test cases for socket-mask unit test.	2019-10-18 04:16:06 +01:00
draveness	39af760930	feat: several cleanups in the scheduling package + Remove unused variable in tests. + Use more common statement for interface conformance check. + Fix several comments in the framework plugins.	2019-10-18 11:14:05 +08:00
Kubernetes Prow Robot	9bf2ba7369	Merge pull request #84015 from ahg-g/ahg-filters cleanup unnecessary func parameters in genericScheduler methods	2019-10-17 19:50:56 -07:00
Kubernetes Prow Robot	91050062f9	Merge pull request #83894 from notpad/feature/migration_nodevolumelimit [migration phase 1] CSIMaxVolumeLimitChecker as filter plugin	2019-10-17 19:50:41 -07:00
Kubernetes Prow Robot	e129a6bc3f	Merge pull request #80004 from Miciah/prefer-to-delete-doubled-up-pods-of-a-replicaset Prefer to delete doubled-up pods of a ReplicaSet	2019-10-17 15:09:58 -07:00
Sean Sullivan	37fca51d74	Relocate tableprinter tests (#84027 ) * Moves TestPrintUnstructuredObject to tableprinter_test.go * Move TestUnknownTypePrinting to correct location of tableprinter_test.go * Removes NewTablePrinter from TestCustomTypePrinting	2019-10-17 09:52:53 -07:00
Miciah Masters	980b6406b2	Prefer to delete doubled-up pods of a ReplicaSet When scaling down a ReplicaSet, delete doubled up replicas first, where a "doubled up replica" is defined as one that is on the same node as an active replica belonging to a related ReplicaSet. ReplicaSets are considered "related" if they have a common controller (typically a Deployment). The intention of this change is to make a rolling update of a Deployment scale down the old ReplicaSet as it scales up the new ReplicaSet by deleting pods from the old ReplicaSet that are colocated with ready pods of the new ReplicaSet. This change in the behavior of rolling updates can be combined with pod affinity rules to preserve the locality of a Deployment's pods over rollout. A specific scenario that benefits from this change is when a Deployment's pods are exposed by a Service that has type "LoadBalancer" and external traffic policy "Local". In this scenario, the load balancer uses health checks to determine whether it should forward traffic for the Service to a particular node. If the node has no local endpoints for the Service, the health check will fail for that node. Eventually, the load balancer will stop forwarding traffic to that node. In the meantime, the service proxy drops traffic for that Service. Thus, in order to reduce risk of dropping traffic during a rolling update, it is desirable preserve node locality of endpoints. * pkg/controller/controller_utils.go (ActivePodsWithRanks): New type to sort pods using a given ranking. * pkg/controller/controller_utils_test.go (TestSortingActivePodsWithRanks): New test for ActivePodsWithRanks. * pkg/controller/replicaset/replica_set.go (getReplicaSetsWithSameController): New method. Given a ReplicaSet, return all ReplicaSets that have the same owner. (manageReplicas): Call getIndirectlyRelatedPods, and pass its result to getPodsToDelete. (getIndirectlyRelatedPods): New method. Given a ReplicaSet, return all pods that are owned by any ReplicaSet with the same owner. (getPodsToDelete): Add an argument for related pods. Use related pods and the new getPodsRankedByRelatedPodsOnSameNode function to take into account whether a pod is doubled up when sorting pods for deletion. (getPodsRankedByRelatedPodsOnSameNode): New function. Return an ActivePodsWithRanks value that wraps the given slice of pods and computes ranks where each pod's rank is equal to the number of active related pods that are colocated on the same node. * pkg/controller/replicaset/replica_set_test.go (newReplicaSet): Set OwnerReferences on the ReplicaSet. (newPod): Set a unique UID on the pod. (byName): New type to sort pods by name. (TestGetReplicaSetsWithSameController): New test for getReplicaSetsWithSameController. (TestRelatedPodsLookup): New test for getIndirectlyRelatedPods. (TestGetPodsToDelete): Augment the "various pod phases and conditions, diff = len(pods)" test case to ensure that scale-down still selects doubled-up pods if there are not enough other pods to scale down. Add a "various pod phases and conditions, diff = len(pods), relatedPods empty" test case to verify that getPodsToDelete works even if related pods could not be determined. Add a "ready and colocated with another ready pod vs not colocated, diff < len(pods)" test case to verify that a doubled-up pod gets preferred for deletion. Augment the "various pod phases and conditions, diff < len(pods)" test case to ensure that not-ready pods are preferred over ready but doubled-up pods. * pkg/controller/replicaset/BUILD: Regenerate. * test/e2e/apps/deployment.go (testRollingUpdateDeploymentWithLocalTrafficLoadBalancer): New end-to-end test. Create a deployment with a rolling update strategy and affinity rules and a load balancer with "Local" external traffic policy, and verify that set of nodes with local endponts for the service remains unchanged during rollouts. (setAffinity): New helper, used by testRollingUpdateDeploymentWithLocalTrafficLoadBalancer. * test/e2e/framework/service/jig.go (GetEndpointNodes): Factor building the set of node names out... (GetEndpointNodeNames): ...into this new method.	2019-10-17 11:52:32 -04:00
Miciah Masters	865c3c5670	TestGetPodsToDelete: Use field names in test cases * pkg/controller/replicaset/replica_set_test.go (TestGetPodsToDelete): Use explicit field names in declarations of test cases.	2019-10-17 11:50:09 -04:00
Kubernetes Prow Robot	fef819254a	Merge pull request #83998 from draveness/feature/node-affinity-score-plugin feat(scheduler): implement node affinity as score plugin	2019-10-17 08:24:38 -07:00

... 113 114 115 116 117 ...

42458 Commits