kubernetes

Author	SHA1	Message	Date
Jean-Francois Remy	e83184568d	Add unit tests - actual_state_of_world_test.go: test the new method GetVolumesToReportAttachedForNode for an existing node and a non-existing node - node_status_updater_test.go: test UpdateNodeStatuses and UpdateNodeStatuses in nominal case with 2 nodes getting one volume each. Test UpdateNodeStatuses with the first call to node.patch failing but the following one succeeding - add comment in node_status_updater.go - fix log line in reconciler.go - rename variable in actual_state_of_world.go	2022-02-22 12:21:58 -08:00
Jean-Francois Remy	f1717baaaa	Fix nodes volumesAttached status not updated The UpdateNodeStatuses code stops too early in case there is an error when calling updateNodeStatus. It will return immediately which means any remaining node won't have its update status put back to true. Looking at the call sites for UpdateNodeStatuses, it appears this is not the only issue. If the lister call fails with anything but a Not Found error, it's silently ignored which is wrong in the detach path. Also the reconciler detach path calls UpdateNodeStatuses but the real intent is to only update the node currently processed in the loop and not proceed with the detach call if there is an error updating that specifi node volumesAttached property. With the current implementation, it will not proceed if there is an error updating another node (which is not completely bad but not ideal) and worse it will proceed if there is a lister error on that node which means the node volumesAttached property won't have been updated. To fix those issues, introduce the following changes: - [node_status_updater] introduce UpdateNodeStatusForNode which does what UpdateNodeStatuses does but only for the provided node - [node_status_updater] if the node lister call fails for anything but a Not Found error, we will return an error, not ignore it - [node_status_updater] if the update of a node volumesAttached properties fails we continue processing the other nodes - [actual_state_of_world] introduce GetVolumesToReportAttachedForNode which does what GetVolumesToReportAttached but for the node whose name is provided it returns a bool which indicates if the node in question needs an update as well as the volumesAttached list. It is used by UpdateNodeStatusForNode - [actual_state_of_world] use write lock in updateNodeStatusUpdateNeeded, we're modifying the map content - [reconciler] use UpdateNodeStatusForNode in the detach loop	2022-02-22 12:20:53 -08:00
Neha Lohia	fa1b6765d5	move pkg/util/node to component-helpers/node/util (#105347 ) Signed-off-by: Neha Lohia <nehapithadiya444@gmail.com>	2021-11-12 07:52:27 -08:00
Davanum Srinivas	442a69c3bd	switch over k/k to use klog v2 Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2020-05-16 07:54:27 -04:00
Davanum Srinivas	954996e231	Move from glog to klog - Move from the old github.com/golang/glog to k8s.io/klog - klog as explicit InitFlags() so we add them as necessary - we update the other repositories that we vendor that made a similar change from glog to klog * github.com/kubernetes/repo-infra * k8s.io/gengo/ * k8s.io/kube-openapi/ * github.com/google/cadvisor - Entirely remove all references to glog - Fix some tests by explicit InitFlags in their init() methods Change-Id: I92db545ff36fcec83afe98f550c9e630098b3135	2018-11-10 07:50:31 -05:00
Di Xu	13a355c837	refactor method to pkg/util/node	2017-11-06 09:51:09 +08:00
Kevin	4c8539cece	use core client with explicit version globally	2017-10-27 15:48:32 +08:00
Dr. Stefan Schimanski	1d053c4f7c	controllers: simplify deepcopy calls	2017-08-29 19:21:24 +02:00
Cheng Xing	1234d2f500	On AttachDetachController node status update, do not retry when node doesn't exist but keep the node entry in cache	2017-08-16 15:42:15 -07:00
Jacob Simpson	a765b8cfca	Migrate api.Scheme to scheme.Scheme	2017-07-17 15:05:38 -07:00
Jacob Simpson	29c1b81d4c	Scripted migration from clientset_generated to client-go.	2017-07-17 15:05:37 -07:00
Kubernetes Submit Queue	18362beb0d	Merge pull request #42254 from justinsb/volumes_dont_leak_nodestatusupdateneeded Automatic merge from submit-queue volumes: SetNodeStatusUpdateNeeded on error If an error happened during the UpdateNodeStatuses loop, there were some code paths where we would not call SetNodeStatusUpdateNeeded, leaking the state. Add it to all paths by adding a function. Part of #40583 ```release-note NONE ```	2017-06-22 21:43:04 -07:00
Chao Xu	60604f8818	run hack/update-all	2017-06-22 11:31:03 -07:00
Chao Xu	f4989a45a5	run root-rewrite-v1-..., compile	2017-06-22 10:25:57 -07:00
Justin Santa Barbara	d420531f95	volumes: SetNodeStatusUpdateNeeded on error If an error happened during the UpdateNodeStatuses loop, there were some code paths where we would not call SetNodeStatusUpdateNeeded, leaking the state. Add it to all paths by adding a function. Part of #40583	2017-06-01 00:32:20 -04:00
Cheng Xing	f9dc2d5ca3	Node status updater now deletes the node entry in attach updates when node is missing in NodeInformer cache. Fixes #42438 . - Added RemoveNodeFromAttachUpdates as part of node status updater operations.	2017-05-24 18:31:47 -07:00
Andy Goldstein	70c6087600	Replace hand-written informers with generated ones Replace existing uses of hand-written informers with generated ones. Follow-up commits will switch the use of one-off informers to shared informers.	2017-02-06 13:49:27 -05:00
Dr. Stefan Schimanski	d7eb3b6870	pkg/util: move uuid and strategicpatch into k8s.io/apimachinery	2017-01-25 19:45:09 +01:00
deads2k	b0b156b381	make tools/cache authoritative	2017-01-25 08:29:45 -05:00
Wojciech Tyczynski	d08abdb187	Allow for returning map[string]interface{} from patch.	2017-01-18 11:53:30 +01:00
NickrenREN	639572ac68	fix redundant alias and remove unused function	2017-01-09 17:13:09 +08:00
Chao Xu	03d8820edc	rename /release_1_5 to /clientset	2016-12-14 12:39:48 -08:00
Kubernetes Submit Queue	43233caaf0	Merge pull request #37871 from Random-Liu/use-patch-in-kubelet Automatic merge from submit-queue (batch tested with PRs 36692, 37871) Use PatchStatus to update node status in kubelet. Fixes https://github.com/kubernetes/kubernetes/issues/37771. This PR changes kubelet to update node status with `PatchStatus`. @caesarxuchao @ymqytw told me that there is a limitation of current `CreateTwoWayMergePatch`, it doesn't support primitive type slice which uses strategic merge. * I checked the node status, the only primitive type slices in NodeStatus are as follows, they are not using strategic merge: * [`ContainerImage.Names`](https://github.com/kubernetes/kubernetes/blob/master/pkg/api/v1/types.go#L2963) * [`VolumesInUse`](https://github.com/kubernetes/kubernetes/blob/master/pkg/api/v1/types.go#L2909) * Volume package is already [using `CreateStrategicMergePath` to generate node status update patch](https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/volume/attachdetach/statusupdater/node_status_updater.go#L111), and till now everything is fine. @yujuhong @dchen1107 /cc @kubernetes/sig-node	2016-12-09 11:29:11 -08:00
Random-Liu	beba1ebbf8	Use PatchStatus to update node status in kubelet.	2016-12-08 17:13:59 -08:00
Jordan Liggitt	6819706adf	Pass addressable values to DeepCopy	2016-12-08 14:16:01 -05:00
Kubernetes Submit Queue	fb7e9d901d	Merge pull request #37939 from yarntime/fix_typo_in_node_status_updater Automatic merge from submit-queue (batch tested with PRs 37997, 37939, 37990, 36700, 37258) fix typo in node_status_updater fix typo.	2016-12-02 19:26:47 -08:00
yarntime@163.com	df6e9db9d9	fix typo	2016-12-02 17:33:45 +08:00
rkouj	638ef1b977	SetNodeUpdateStatusNeeded whenever nodeAdd event is received	2016-11-30 21:12:34 -08:00
Chao Xu	7eeb71f698	cmd/kube-controller-manager	2016-11-23 15:53:09 -08:00
ymqytw	3cc294b1e0	Revert "support patch list of primitives" This reverts commit `34891ad9f6`.	2016-11-22 21:06:36 -08:00
ymqytw	d248843b65	Revert "try old patch after new patch fails" This reverts commit `f32696e734`.	2016-11-22 21:02:30 -08:00
ymqytw	f32696e734	try old patch after new patch fails	2016-11-17 14:28:09 -08:00
ymqytw	34891ad9f6	support patch list of primitives	2016-11-09 11:46:59 -08:00
Kubernetes Submit Queue	453bfa1f0f	Merge pull request #34368 from jingxu97/Oct/statusupdate-10-7 Automatic merge from submit-queue Node status updater should SetNodeStatusUpdateNeeded if it fails to update status When volume controller tries to update the node status, if it fails to update the nodes status, it should call SetNodeStatusUpdateNeeded so that the volume list could be updated next time.	2016-10-26 11:09:16 -07:00
Jan Safranek	ad946f4fcc	Fixed mutation warning in Attach/Detach controller Objects from shared informer must not be changed, they are shared among all controllers. This fixes CacheMutationDetector panic with this output: CACHE *api.Node[5] ALTERED! {"metadata":{"name":"ip-172-18-8-71.ec2.internal","selfLink":"/api/v1/nodes/ip-172-18-8-71.ec2.internal","uid":"73d07d16-976e-11e6-8225-0e2f14b56070","resourceVersion":"136","creationTimestamp":"2016-10-21T09:12:12Z","labels":{"beta.kubernetes.io/arch":"amd64","beta.kubernetes.io/instance-type":"t2.medium","beta.kubernetes.io/os":"linux","failure-domain.beta.kubernetes.io/region":"us-east-1","failure-domain.beta.kubernetes.io/zone":"us-east-1d","kubernetes.io/hostname":"ip-172-18-8-71.ec2.internal"},"annotations":{"volumes.kubernetes.io/controller-managed-attach-detach":"true"}},"spec":{"externalID":"i-9cb6180f","providerID":"aws:///us-east-1d/i-9cb6180f"},"status":{"capacity":{"alpha.kubernetes.io/nvidia-gpu":"0","cpu":"2","memory":"4045568Ki","pods":"110"},"allocatable":{"alpha.kubernetes.io/nvidia-gpu":"0","cpu":"2","memory":"4045568Ki","pods":"110"},"conditions":[{"type":"OutOfDisk","status":"False","lastHeartbeatTime":"2016-10-21T09:12:52Z","lastTransitionTime":"2016-10-21T09:12:12Z","reason":"KubeletHasSufficientDisk","message":"kubelet has sufficient disk space available"},{"type":"MemoryPressure","status":"False","lastHeartbeatTime":"2016-10-21T09:12:52Z","lastTransitionTime":"2016-10-21T09:12:12Z","reason":"KubeletHasSufficientMemory","message":"kubelet has sufficient memory available"},{"type":"DiskPressure","status":"False","lastHeartbeatTime":"2016-10-21T09:12:52Z","lastTransitionTime":"2016-10-21T09:12:12Z","reason":"KubeletHasNoDiskPressure","message":"kubelet has no disk pressure"},{"type":"InodePressure","status":"False","lastHeartbeatTime":"2016-10-21T09:12:52Z","lastTransitionTime":"2016-10-21T09:12:12Z","reason":"KubeletHasNoInodePressure","message":"kubelet has no inode pressure"},{"type":"Ready","status":"True","lastHeartbeatTime":"2016-10-21T09:12:52Z","lastTransitionTime":"2016-10-21T09:12:22Z","reason":"KubeletReady","message":"kubelet is posting ready status"}],"addresses":[{"type":"InternalIP","address":"172.18.8.71"},{"type":"LegacyHostIP","address":"172.18.8.71"},{"type":"ExternalIP","address":"54.85.104.236"}],"daemonEndpoints":{"kubeletEndpoint":{"Port":10250}},"nodeInfo":{"machineID":"78a79498db8e4fdc9ac24b5e436a982c","systemUUID":"EC2BB406-5467-4ABE-B54D-D9993C45714F","bootID":"2553d6b8-1ddb-4ef0-902a-d09a807b89ba","kernelVersion":"4.6.7-300.fc24.x86_64","osImage":"Fedora 24 (Cloud Edition)","containerRuntimeVersion":"docker://1.10.3","kubeletVersion":"v1.5.0-alpha.1.726+5aac5eddb809e4","kubeProxyVersion":"v1.5.0-alpha.1.726+5aac5eddb809e4","operatingSystem":"linux","architecture":"amd64"},"images":[{"names":["openshift/origin-release:latest"],"sizeBytes":714569002},{"names":["openshift/origin-haproxy-router-base:latest"],"sizeBytes":294417608},{"names":["openshift/origin-base:latest"],"sizeBytes":275310761},{"names":["docker.io/centos@sha256:2ae0d2c881c7123870114fb9cc7afabd1e31f9888dac8286884f6cf59373ed9b","docker.io/centos:centos7"],"sizeBytes":196744353},{"names":["gcr.io/google_containers/busybox@sha256:4bdd623e848417d96127e16037743f0cd8b528c026e9175e22a84f639eca58ff","gcr.io/google_containers/busybox:1.24"],"sizeBytes":1113554},{"names":["gcr.io/google_containers/pause-amd64@sha256:163ac025575b775d1c0f9bf0bdd0f086883171eb475b5068e7defa4ca9e76516","gcr.io/google_containers/pause-amd64:3.0"],"sizeBytes":746888}],"volumesInUse":["kubernetes.io/aws-ebs/aws://us-east-1d/vol-f4bd0352"] A: ,"volumesAttached":[{"name":"kubernetes.io/aws-ebs/aws://us-east-1d/vol-f4bd0352","devicePath":"/dev/xvdba"}]}} B: }}	2016-10-25 14:28:10 +02:00
Jing Xu	70efadc2f4	Node status updater should SetNodeStatusUpdateNeeded if it fails to update status When volume controller tries to update the node status, if it fails to update the nodes status, it should call SetNodeStatusUpdateNeeded so that the volume list could be updated next time.	2016-10-24 13:59:39 -07:00
Justin Santa Barbara	54195d590f	Use strongly-typed types.NodeName for a node name We had another bug where we confused the hostname with the NodeName. To avoid this happening again, and to make the code more self-documenting, we use types.NodeName (a typedef alias for string) whenever we are referring to the Node.Name. A tedious but mechanical commit therefore, to change all uses of the node name to use types.NodeName Also clean up some of the (many) places where the NodeName is referred to as a hostname (not true on AWS), or an instanceID (not true on GCE), etc.	2016-09-27 10:47:31 -04:00
Kubernetes Submit Queue	0a4316f11e	Merge pull request #32807 from jingxu97/stateupdateNeeded-9-15 Automatic merge from submit-queue Fix race condition in setting node statusUpdateNeeded flag This PR fixes the race condition in setting node statusUpdateNeeded flag in master's attachdetach controller. This flag is used to indicate whether a node status has been updated by the node_status_updater or not. When updater finishes update a node status, it is set to false. When the node status is changed such as volume is detached or new volume is attached to the node, the flag is set to true so that updater can update the status again. The previous workflow has a race condition as follows 1. updater gets the currently attached volume list from the node which needs to be updated. 2. A new volume A is attached to the same node right after 1 and set the flag to TRUE 3. updater updates the node attached volume list (which does not include volume A) and then set the flag to FALSE. The result is that volume A will be never added to the attached volume list so at node side, this volume is never attached. So in this PR, the flag is set to FALSE when updater tries to get the attached volume list (as in an atomic operation). So in the above example, after step 2, the flag will be TRUE again, in step 3, updater does not set the flag if updates is sucessful. So after that, flag is still TRUE and in next round of update, the node status will be updated.	2016-09-23 11:25:16 -07:00
Jing Xu	14cad206f5	Fix race conditino in setting node statusUpdateNeeded flag This PR fixes the race condition in setting node statusUpdateNeeded flag in master's attachdetach controller. This flag is used to indicate whether a node status has been updated by the node_status_updater or not. When updater finishes update a node status, it is set to false. When the node status is changed such as volume is detached or new volume is attached to the node, the flag is set to true so that updater can update the status again. The previous workflow has a race condition as follows 1. updater gets the currently attached volume list from the node which needs to be updated. 2. A new volume A is attached to the same node right after 1 and set the flag to TRUE 3. updater updates the node attached volume list (which does not include volume A) and then set the flag to FALSE. The result is that volume A will be never added to the attached volume list so at node side, this volume is never attached. So in this PR, the flag is set to FALSE when updater tries to get the attached volume list (as in an atomic operation). So in the above example, after step 2, the flag will be TRUE again, in step 3, updater does not set the flag if updates is sucessful. So after that, flag is still TRUE and in next round of update, the node status will be updated. This PR also changes a unit test due to the workflow changes	2016-09-22 14:02:30 -07:00
Mike Danese	a765d59932	move informer and controller to pkg/client/cache Signed-off-by: Mike Danese <mikedanese@google.com>	2016-09-15 12:50:08 -07:00
Jing Xu	70deeb0ae4	node not exist during node status update should not block others When node is deleted, attach-detach controller cache may contain stale information of this node, and update node status fails in reconciler loop. But one node update failure should not block updating other nodes. Also the warning message easily flush the log file. This PR is just a quick fix of this issue. More complete fix including make sure controller cache up to date will be addressed in another PR.	2016-08-18 13:51:30 -07:00
saadali	0c72568247	Skip safe to detach if node api obj doesn't exist	2016-08-16 21:30:51 -07:00
saadali	89fd358c52	Assume volume detached if node doesn't exist Fixes #29358	2016-07-22 22:07:32 -07:00
saadali	0dd17fff22	Reorganize volume controllers and manager	2016-07-01 18:50:25 -07:00

44 Commits