kubernetes

Author	SHA1	Message	Date
Jordan Liggitt	b8d7ecf73b	Make node removal conditional in processGraphChanges	2020-11-17 10:49:04 -05:00
Jordan Liggitt	ac8d419b4c	Enqueue dependents for deletion when their ownerReference does not match observed parent coordinates When adding a dependent to the graph, we ensure there is a node representing each owner reference, and add the dependent to each parent node. If the parent node already exists, and the dependent's ownerReference coordinates disagree with the verified coordinates, add the dependent to the attemptToDelete queue. This queue will check the dependent's ownerReferences using the coordinates specified by the dependent. If all of the owners can be verified absent, the dependent will be deleted. If some are still present, or if there are errors looking them up, the dependent will not be deleted. If the parent node has been observed via informer event (so we know the coordinates are accurate), and the verified owner is namespaced, and the dependent is not in the same namespace, an event will be recorded for user visibility, since cross-namespace ownerReferences are not supported.	2020-11-17 10:47:39 -05:00
Jordan Liggitt	78317edb8b	Short-circuit attemptToDelete loop for virtual nodes that are removed or observed Virtual nodes are added to the attemptToDelete queue, and continue getting requeued until they are successfully verified absent or are observed via informer. In the meantime, if the real object associated with that UID is observed via informer, or is observed to be deleted via informer, the graph node for that UID can be removed or marked as observed. In that case, we should stop retrying to get the virtual node coordinates.	2020-11-17 10:46:00 -05:00
Jordan Liggitt	cae56bea0a	Replace virtual node with observed node if identity differs If the graph contains a virtual node (because some child object referenced it in an OwnerRef), and a real informer event is observed for that uid at different coordinates, we want to fix the coordinates of the node in the graph to match the actual coordinates. The safe way to do this is to clone the node, replace the identity in the clone, then replace the node with the clone. Modifying the identity directly is not safe because it is accessed lock-free from many code paths. Replacing the node in the graph from processGraphChanges is safe because it is the only graph writer.	2020-11-17 10:42:48 -05:00
Jordan Liggitt	cb7b9ed532	Refactor identityFromEvent	2020-11-17 10:42:48 -05:00
Jordan Liggitt	30eb6683e6	Avoid marking virtual nodes as observed when they haven't been Virtual nodes can be added to the GC graph in order to represent objects which have not been observed via an informer, but are referenced via ownerReferences. These virtual nodes are requeued into attemptToDelete until they are observed via an informer, or successfully verified absent via a live lookup. Previously, both of those code paths called markObserved() to stop requeuing into attemptToDelete. Because it is useful to know whether a particular node has been observed via a real informer event, this commit does the following: * adds a `virtual bool` attribute to graph events so we know which ones came from a real informer * limits the markObserved() call to the code path where a real informer event is observed * uses an alternative mechanism to stop requeueing into attemptToDelete when a virtual node is verified absent via a live lookup	2020-11-17 10:42:48 -05:00
Jordan Liggitt	445f20dbdb	Switch GC absentOwnerCache to full reference Before deleting an object based on absent owners, GC verifies absence of those owners with a live lookup. The coordinates used to perform that live lookup are the ones specified in the ownerReference of the child. In order to performantly delete multiple children from the same parent (e.g. 1000 pods from a replicaset), a 404 response to a lookup is cached in absentOwnerCache. Previously, the cache was a simple uid set. However, since children can disagree on the coordinates that should be used to look up a given uid, the cache should record the exact coordinates verified absent. This is a [apiVersion, kind, namespace, name, uid] tuple.	2020-11-17 10:42:48 -05:00
Jordan Liggitt	09bdf76b8a	Plumb event recorder to garbage collector controller	2020-11-17 10:42:45 -05:00
Kubernetes Prow Robot	da75c26648	Merge pull request #95978 from roycaihw/storage-version/gc Storage version garbage collector	2020-11-12 18:36:37 -08:00
Haowei Cai	1d5a8f8f24	fixup! add storage version garbage collector	2020-11-12 16:34:27 -08:00
Haowei Cai	f675dac440	generated	2020-11-12 16:25:22 -08:00
Haowei Cai	ee9ace14c2	add storage version garbage collector	2020-11-12 16:21:00 -08:00
Kubernetes Prow Robot	765d949bfc	Merge pull request #96440 from robscott/endpointslice-pre-ga Adding NodeName to EndpointSlice API, deprecation updates	2020-11-12 16:03:13 -08:00
Kubernetes Prow Robot	798eb07720	Merge pull request #96443 from alaypatel07/cronjob-controller-2-follow-up handle slow cronjob lister in cronjob controller v2 and improve memory footprint	2020-11-12 13:16:52 -08:00
Kubernetes Prow Robot	4b46d44e0c	Merge pull request #96327 from robscott/app-protocol-ga Graduating AppProtocol to GA	2020-11-12 13:16:39 -08:00
Kubernetes Prow Robot	55856ed727	Merge pull request #93130 from zshihang/master plumb service account token down to csi driver	2020-11-12 13:16:25 -08:00
Rob Scott	84e4b30a3e	Updates related to PR feedback - Remove feature gate consideration from EndpointSlice validation - Deprecate topology field, note that it will be removed in future release - Update kube-proxy to check for NodeName if feature gate is enabled - Add comments indicating the feature gates that can be used to enable alpha API fields - Add comments explaining use of deprecated address type in tests	2020-11-12 12:30:50 -08:00
Kubernetes Prow Robot	e38b1b94f8	Merge pull request #96399 from andrewsykim/service-config move service controller config to k8s.io/cloud-provider/controllers/service/config	2020-11-12 11:21:57 -08:00
Shihang Zhang	d2859cd89b	plumb service account token down to csi driver	2020-11-12 09:26:43 -08:00
Rob Scott	d985438772	Updating EndpointSlice controllers to support NodeName field	2020-11-11 16:50:36 -08:00
Alay Patel	15089aab94	update bazel	2020-11-11 19:47:11 -05:00
Alay Patel	d6ca5b8d14	handle the case for slow cronjob lister, add unit tests	2020-11-11 18:48:57 -05:00
Alay Patel	41c82e69ed	convert to standard lister, use []*batchv1.Job instead of []batchv1.Job	2020-11-11 18:48:57 -05:00
Kubernetes Prow Robot	667d1c2c3f	Merge pull request #93370 from alaypatel07/add-new-cronjob-controller Add cronjob controller v2	2020-11-11 15:42:50 -08:00
Kubernetes Prow Robot	423f8731ef	Merge pull request #95719 from tsmetana/add-pv_collector-provisioner-metric PV Controller: Add plugin name and volume mode to PV metrics	2020-11-11 01:49:49 -08:00
Alay Patel	38bb53555e	update violation_exceptions.list and make generated	2020-11-10 17:32:06 -05:00
Alay Patel	8d7dd4415e	add cronjob_controllerv2.go	2020-11-10 17:32:06 -05:00
Andrew Sy Kim	b1e0decce1	move service controller config to k8s.io/cloud-provider/controllers/service/config Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2020-11-10 14:59:44 -05:00
Rob Scott	b044fadf66	Graduating AppProtocol to GA	2020-11-09 11:08:19 -08:00
Tim Hockin	819ff9b087	Use topology labels instead of old beta names (#96033) * Rename const for topology.../zone * Rename const for topology.../region * Rename const for failure-domain.../zone * Rename const for failure-domain.../region * Restore old names for compat	2020-11-05 20:26:50 -08:00
Andrew Sy Kim	7cf19e5fb7	endpointslice API: rename 'accepting' condition to 'serving' condition Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2020-11-05 19:18:45 -05:00
Andrew Sy Kim	17cf1b4415	endpointslice controller: add test cases to TestSyncServiceFull for terminating endpoints Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2020-11-05 19:18:45 -05:00
Andrew Sy Kim	2947f5ce4f	endpointslice controller: refactor TestSyncServiceFull to use test tables Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2020-11-05 19:18:45 -05:00
Andrew Sy Kim	1c603e90ef	endpointslice controller: set new conditions 'accepting' and 'terminating' Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2020-11-05 19:18:45 -05:00
Kubernetes Prow Robot	2d6cd683bd	Merge pull request #95541 from cofyc/fix95538 volume binding: report UnschedulableAndUnresolvable status instead of an error when bound PVs not found	2020-11-05 15:16:51 -08:00
Shihang Zhang	2c378beb64	abort if namespace doesn't exist or terminating	2020-11-05 11:12:15 -08:00
Yecheng Fu	0961891a7a	report UnschedulableAndUnresolvable status instead of an error when PVCs can't find bound persistent volumes This is a user error. We shouldn't report an error.	2020-11-05 10:28:40 +08:00
Shihang Zhang	d40f0c43c4	separate RootCAConfigMap from BoundServiceAccountTokenVolume	2020-11-04 17:10:39 -08:00
Kubernetes Prow Robot	096819c963	Merge pull request #95909 from pohly/pv-controller-delete-pv-fix PV controller: don't delete PVs when PVC is not known yet	2020-11-02 02:00:52 -08:00
Kubernetes Prow Robot	4b65f70652	Merge pull request #95740 from cici37/moveCCM Move cloud-controller-manager to staging k8s.io/cloud-provider	2020-10-30 13:48:51 -07:00
cici37	9465d95ea6	Move CCM to staging k8s.io/cloud-provider	2020-10-29 20:50:23 -07:00
cici37	a91a2cdad6	Move informer_factory to staging	2020-10-29 12:20:33 -07:00
Patrick Ohly	24f5764787	pv controller test: more test cases The main goal was to cover retrieval of a PVC from the apiserver when it isn't known yet. This is achieved by adding PVCs and (for the sake of completeness) PVs to the reactor, but not the controller, when a special annotation is set. The approach with a special annotation was chosen because it doesn't affect other tests. The other test cases were added while checking the existing tests because (at least at first glance) the situations seemed to be not covered.	2020-10-28 10:52:11 +01:00
Patrick Ohly	22f81e9e0b	pv controller test: use sub tests This makes it possible to run individual tests.	2020-10-28 10:39:59 +01:00
Patrick Ohly	06f934ea1f	pv controller test: enable klog output This makes it possible to run tests with -v=5 and thus actually get some output.	2020-10-28 10:39:10 +01:00
Kubernetes Prow Robot	554319cce8	Merge pull request #95410 from benhxy/staticcheck Fix static check for pkg/controller/podautoscaler	2020-10-27 10:36:14 -07:00
Patrick Ohly	5686664a1d	PV controller: don't delete PVs when PVC is not known yet Normally, the PV controller knows about the PVC that triggers the creation of a PV before it sees the PV, because the PV controller must set the volume.beta.kubernetes.io/storage-provisioner annotation that tells an external provisioner to create the PV. When restarting, the PV controller first syncs its caches, so that case is also covered. However, the creator of a PVC might decide to set that annotation itself to speed up volume creation. While unusual, it's not forbidden and thus part of the external Kubernetes API. Whether it makes sense depends on the intentions of the user. When that is done and there is heavy load, an external provisioner might see the PVC and create a PV before the PV controller sees the PVC. If the PV controller then encounters the PV before the PVC, it incorrectly concludes that the PV needs to be deleted instead of being bound. The same issue occurred earlier for external binding, and the existing code for looking up a PVC in the cache or in the apiserver also solves the issue for volume provisioning; it just needs to be enabled for PVs without the pv.kubernetes.io/bound-by-controller annotation as well.	2020-10-27 11:26:58 +01:00
Khaled Henidak (Kal)	6675eba3ef	dual stack services (#91824) * api: structure change * api: defaulting, conversion, and validation * [FIX] validation: auto remove second ip/family when service changes to SingleStack * [FIX] api: defaulting, conversion, and validation * api-server: clusterIPs alloc, printers, storage and strategy * [FIX] clusterIPs default on read * alloc: auto remove second ip/family when service changes to SingleStack * api-server: repair loop handling for clusterIPs * api-server: force kubernetes default service into single stack * api-server: tie dualstack feature flag with endpoint feature flag * controller-manager: feature flag, endpoint, and endpointSlice controllers handling multi family service * [FIX] controller-manager: feature flag, endpoint, and endpointSlice controllers handling multi family service * kube-proxy: feature-flag, utils, proxier, and meta proxier * [FIX] kubeproxy: call both proxier at the same time * kubenet: remove forced pod IP sorting * kubectl: modify describe to include ClusterIPs, IPFamilies, and IPFamilyPolicy * e2e: fix tests that depends on IPFamily field AND add dual stack tests * e2e: fix expected error message for ClusterIP immutability * add integration tests for dualstack the third phase of dual stack is a very complex change in the API, basically it introduces Dual Stack services. Main changes are: - It pluralizes the Service IPFamily field to IPFamilies, and removes the singular field. - It introduces a new field IPFamilyPolicyType that can take 3 values to express the "dual-stack(mad)ness" of the cluster: SingleStack, PreferDualStack and RequireDualStack - It pluralizes ClusterIP to ClusterIPs. The goal is to add coverage to the services API operations, taking into account the 6 different modes a cluster can have: - single stack: IPv4 or IPv6 (as of today) - dual stack: IPv4 only, IPv6 only, IPv4 - IPv6, IPv6 - IPv4 * [FIX] add integration tests for dualstack * generated data * generated files Co-authored-by: Antonio Ojea <aojea@redhat.com>	2020-10-26 13:15:59 -07:00
Ben Hu	4e62298c1b	Fix static checks for pkg/controller/podautoscaler	2020-10-23 18:53:07 +00:00
Kubernetes Prow Robot	ec453ffb1a	Merge pull request #90691 from arjunrn/container-resource-hpa Add container based scaling to HPA	2020-10-23 05:51:51 -07:00
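
Commit 445f20dbdb above describes changing the GC absentOwnerCache from a set of UIDs to a set of full [apiVersion, kind, namespace, name, uid] tuples. The sketch below illustrates that idea only; it is not the code from the commit. The names `ownerCoordinates` and `absentOwners` are hypothetical, and a plain unbounded map is assumed here in place of whatever cache type the controller actually uses.

```go
package main

import "fmt"

// ownerCoordinates is a hypothetical key type holding the full tuple named in
// the commit message: apiVersion, kind, namespace, name, and uid.
type ownerCoordinates struct {
	APIVersion string
	Kind       string
	Namespace  string
	Name       string
	UID        string
}

// absentOwners is a hypothetical stand-in for the cache: a set of coordinates
// that a live lookup has verified absent. Structs with only comparable fields
// can be used directly as map keys in Go.
type absentOwners map[ownerCoordinates]struct{}

// Add records that the exact coordinates were verified absent.
func (c absentOwners) Add(ref ownerCoordinates) { c[ref] = struct{}{} }

// Has reports whether these exact coordinates were verified absent.
func (c absentOwners) Has(ref ownerCoordinates) bool {
	_, ok := c[ref]
	return ok
}

func main() {
	cache := absentOwners{}

	// One child references the owner at these coordinates, and a live lookup
	// (not shown) is assumed to have returned 404 for them.
	verified := ownerCoordinates{
		APIVersion: "apps/v1",
		Kind:       "ReplicaSet",
		Namespace:  "ns1",
		Name:       "web",
		UID:        "uid-1",
	}
	cache.Add(verified)

	// Another child names the same UID but a different namespace.
	disagreeing := verified
	disagreeing.Namespace = "ns2"

	fmt.Println(cache.Has(verified))    // same coordinates: cached as absent
	fmt.Println(cache.Has(disagreeing)) // same UID, other coordinates: not cached
}
```

With a UID-only key, the second lookup would also report the owner as absent even though those coordinates were never checked; keying on the full tuple keeps the two cases separate.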

5144 Commits