kubernetes

Author	SHA1	Message	Date
Jing Xu	69b9f9b1f0	Fix issue in node status updating VolumeAttached list. During volume detach, the following might happen in the reconciler: 1. Pod is deleting. 2. The volume is removed from reportedAsAttached, so the node status updater will update the volumeAttached list. 3. Detach fails due to some issue. 4. The volume is added back to reportedAsAttached. 5. The reconciler loops over the volume again and removes it from reportedAsAttached. 6. Detach will not be triggered because of exponential backoff; the detach call will fail with an exponential backoff error. 7. Another pod is added which uses the same volume on the same node. 8. The reconciler loops and will NOT try to trigger detach anymore. At this point, the volume is still attached and in the actual state, but the volumeAttached list in node status no longer has this volume, which will block volume mount from kubelet. The fix in the first round was to add the volume back into the list of volumes that need to be reported as attached at step 6, when the detach call failed with an error (exponential backoff). However, this might cause a performance issue if detach fails for a while. During this time, the volume will be repeatedly removed from and added back to node status, which will cause a surge of API calls. So we changed the logic to first check whether the operation is safe to retry, which means there is no pending operation and it is not in the exponential backoff time period, before calling detach. This way we can avoid repeatedly removing/adding the volume from node status. Change-Id: I5d4e760c880d72937d34b9d3e904ecad125f802e	2021-10-05 09:44:35 -07:00
Benjamin Elder	56e092e382	hack/update-bazel.sh	2021-02-28 15:17:29 -08:00
Jayasekhar Konduru	9b2b73600d	Recover CSI volumes from dangling attachments Change-Id: I72105d67d8a4069ab19bfa4638a7ac365cf4194c	2020-12-11 18:31:53 -08:00
Cheng Xing	d9a629fe3a	IsVolumeAttachedToNode() renamed to GetAttachState(), and returns 3 states instead of combining "uncertain" and "detached" into "false"	2020-10-29 13:24:51 -07:00
Cheng Xing	a61743b125	Fixes Attach Detach Controller reconciler race reading ActualStateOfWorld and operation pending states; fixes reconciler_test mock detach to account for multiple attaches on a node	2020-10-27 23:51:55 -07:00
Davanum Srinivas	07d88617e5	Run hack/update-vendor.sh Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2020-05-16 07:54:33 -04:00
Davanum Srinivas	442a69c3bd	switch over k/k to use klog v2 Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2020-05-16 07:54:27 -04:00
taesun_lee	79680b5d9b	Fix pkg/controller typos in some error messages, comments etc - applied review results by LuisSanchez - Co-Authored-By: Luis Sanchez <sanchezl@redhat.com> genernal -> general iniital -> initial initalObjects -> initialObjects intentionaly -> intentionally inforer -> informer anotother -> another triger -> trigger mutli -> multi Verifyies -> Verifies valume -> volume unexpect -> unexpected unfulfiled -> unfulfilled implenets -> implements assignement -> assignment expectataions -> expectations nexpected -> unexpected boundSatsified -> boundSatisfied externel -> external calcuates -> calculates workes -> workers unitialized -> uninitialized afater -> after Espected -> Expected nodeMontiorGracePeriod -> NodeMonitorGracePeriod estimateGrracefulTermination -> estimateGracefulTermination secondrary -> secondary ShouldRunDaemonPodOnUnscheduableNode -> ShouldRunDaemonPodOnUnschedulableNode rrror -> error expectatitons -> expectations foud -> found epackage -> package succesfulJobs -> successfulJobs namesapce -> namespace ConfigMapResynce -> ConfigMapResync	2020-02-27 00:15:33 +09:00
sunxiaofei03	45d41ed9e5	replace iteration with hashmap in *state_of_world	2019-08-29 19:22:25 +08:00
David Xia	fabfd950b1	cleanup: fix some log and error capitalizations Part of https://github.com/kubernetes/kubernetes/issues/15863	2019-07-20 18:26:16 -04:00
Kubernetes Prow Robot	5777fdfe31	Merge pull request #78105 from cwdsuzhou/narrow_down_lock Narrow down the lock	2019-06-14 04:08:23 -07:00
caiweidong	e39ec09975	Add error info for plugins that do not support attachment	2019-05-23 18:32:49 +08:00
caiweidong	5a0e7f19b6	Narrow down the lock	2019-05-20 19:12:27 +08:00
Jing Xu	7bac6ca73a	Address comments This commit addresses the comments and also adds a unit test.	2019-01-11 10:57:37 -08:00
Jing Xu	562d0fea53	Handle failed attach operation leave uncertain volume attach state This commit adds the unit tests for the PR. It also includes some files that are affected by the function name changes.	2018-11-19 17:21:49 -08:00
Jing Xu	47331cf0a2	WIP: Handle failed attach operation leave uncertain volume attach state This PR fixes issue #32727. When an attach operation fails, it is still possible that the volume will be attached to the node later. This PR adds the logic to record the volume to node with attached state no matter whether the operation succeeded or not. If the operation fails, mark the attached state as false. If the operation succeeds, mark the attached state as true. The reconciler will still issue attach operations until one returns successfully. If the pod is removed in the meantime, the reconciler will issue detach operations for all the volumes no matter what the attached state is.	2018-11-19 17:19:10 -08:00
Davanum Srinivas	954996e231	Move from glog to klog - Move from the old github.com/golang/glog to k8s.io/klog - klog as explicit InitFlags() so we add them as necessary - we update the other repositories that we vendor that made a similar change from glog to klog * github.com/kubernetes/repo-infra * k8s.io/gengo/ * k8s.io/kube-openapi/ * github.com/google/cadvisor - Entirely remove all references to glog - Fix some tests by explicit InitFlags in their init() methods Change-Id: I92db545ff36fcec83afe98f550c9e630098b3135	2018-11-10 07:50:31 -05:00
houjun	9a84e413fc	Fix missing 'break'	2018-10-23 17:14:05 +08:00
Jeff Grafton	23ceebac22	Run hack/update-bazel.sh	2018-06-22 16:22:57 -07:00
wackxu	b3ba80b223	update bazel	2018-02-27 20:23:36 +08:00
wackxu	f737ad62ed	update import	2018-02-27 20:23:35 +08:00
Lei Wang	2e0abfa29f	Fix grammar and log issue in volume cache code	2018-02-23 17:46:53 +08:00
Jeff Grafton	ef56a8d6bb	Autogenerated: hack/update-bazel.sh	2018-02-16 13:43:01 -08:00
Di Xu	48388fec7e	fix all the typos across the project	2018-02-11 11:04:14 +08:00
Kubernetes Submit Queue	7de1a8e0f5	Merge pull request #56288 from jsafrane/multiattach-pods Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here (https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md). Add list of pods that use a volume to multiattach events so users know what pods are blocking a volume and can realize their error. Release note: ```release-note NONE ``` UX: * User can get one of the following events, depending on what other pod(s) are already using a volume and in which namespace they are: ``` Multi-Attach error for volume"volume-name" Volume is already exclusively attached to one node and can't be attached to another Multi-Attach error for volume "volume-name" Volume is already used by pod(s) pod3 and 1 pod(s) in different namespaces ``` * controller-manager always gets full logs: * When the node where the volume is attached is known: ``` Multi-Attach error for volume "volume-name" (UniqueName: "fake-plugin/volume-name") from node "node1" Volume is already used by pods ns2/pod2, ns1/pod3 on node node2, node3 ``` * When the node where the volume is attached is not known: ``` Multi-Attach error for volume "volume-name" (UniqueName: "fake-plugin/volume-name") from node "node1" Volume is already exclusively attached to node node2 and can't be attached to another ``` /kind bug /sig storage /assign @gnufied	2018-01-25 05:31:34 -08:00
Jan Safranek	e46c886bf3	Add list of pods that use a volume to multiattach events so users know what pods are blocking a volume and can realize their error.	2018-01-24 13:22:03 +01:00
Kubernetes Submit Queue	6f3e1dabe4	Merge pull request #57501 from linyouchong/linyouchong-20171221 Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here (https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md). fix incorrect comment	2018-01-09 11:34:27 -08:00
Jeff Grafton	efee0704c6	Autogenerate BUILD files	2017-12-23 13:12:11 -08:00
linyouchong	4acc23b409	fix incorrect comment	2017-12-21 23:36:20 +08:00
guangxuli	cb73ab2b07	The log level for node update failure info should be WARNING; just use Warning instead of Warningf	2017-11-01 16:54:49 +08:00
Hemant Kumar	e3f0c8bb2d	Fixes spam from node status updates The same error is logged in 2 places, which is unnecessary.	2017-10-19 09:37:07 -04:00
Kubernetes Submit Queue	1d8f1e268f	Merge pull request #47699 from supereagle/fix-typos Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here (https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md). fix typos: remove duplicated word in comments What this PR does / why we need it: Remove the duplicated word `the` in comments Which issue this PR fixes : fixes # Special notes for your reviewer: ```release-note NONE ```	2017-10-17 02:35:52 -07:00
Jeff Grafton	aee5f457db	update BUILD files	2017-10-15 18:18:13 -07:00
Hemant Kumar	414c3104ca	Make sure we use rwlocks not just RLock	2017-10-10 17:52:55 -04:00
Hemant Kumar	67d4c40849	Fix spam of multiattach errors in event logs We should be careful while generating multiattach errors. We seem to be generating too many of them because the old code had a minor bug.	2017-10-03 15:45:06 -04:00
supereagle	87c29a08e1	fix typos: remove duplicated word in comments	2017-09-16 14:38:10 +08:00
Hemant Kumar	8edae9b3fc	Always populate volume status from node	2017-09-12 09:03:42 -04:00
Cheng Xing	1234d2f500	On AttachDetachController node status update, do not retry when node doesn't exist but keep the node entry in cache	2017-08-16 15:42:15 -07:00
Jeff Grafton	a7f49c906d	Use buildozer to delete licenses() rules except under third_party/	2017-08-11 09:32:39 -07:00
Jeff Grafton	33276f06be	Use buildozer to remove deprecated automanaged tags	2017-08-11 09:31:50 -07:00
Chao Xu	60604f8818	run hack/update-all	2017-06-22 11:31:03 -07:00
Chao Xu	f4989a45a5	run root-rewrite-v1-..., compile	2017-06-22 10:25:57 -07:00
Kubernetes Submit Queue	b795ec7de0	Merge pull request #42251 from justinsb/simplify_append Automatic merge from submit-queue (batch tested with PRs 42252, 42251, 42249, 47512, 47887) volumes: simplify append-to-slice code Minor simplification - can append to empty/nil slice. Part of #40583 ```release-note NONE ```	2017-06-21 22:13:27 -07:00
Kubernetes Submit Queue	2df2247a82	Merge pull request #42250 from justinsb/volumes_getnodeandvolume_comment Automatic merge from submit-queue volumes: add comment on getNodeAndVolume Add comments on getNodeAndVolume to explain the code - it is a little subtle, and it confused me on first reading. Part of #40583 ```release-note NONE ```	2017-06-20 15:07:47 -07:00
Kubernetes Submit Queue	c34b359bd7	Merge pull request #45923 from verult/cxing/NodeStatusUpdaterFix Automatic merge from submit-queue (batch tested with PRs 46383, 45645, 45923, 44884, 46294) Node status updater now deletes the node entry in attach updates... … when node is missing in NodeInformer cache. - Added RemoveNodeFromAttachUpdates as part of node status updater operations. What this PR does / why we need it: Fixes issue of unnecessary node status updates when node is deleted. Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes #42438 Special notes for your reviewer: Unit tested added, but a more comprehensive test involving the attach detach controller requires certain testing functionality that is currently absent, and will require larger effort. Will be added at a later time. There is an edge case caused by the following steps: 1) A node is deleted and restarted. The node exists, but is not yet recognized by Kubernetes. 2) A pod requiring a volume attach with nodeName specifically set to this node. This would make the pod stuck in ContainerCreating state. This is low-pri since it's a specific edge case that can be avoided. Release note: ```release-note NONE ```	2017-05-26 12:58:02 -07:00
Cheng Xing	f9dc2d5ca3	Node status updater now deletes the node entry in attach updates when node is missing in NodeInformer cache. Fixes #42438 . - Added RemoveNodeFromAttachUpdates as part of node status updater operations.	2017-05-24 18:31:47 -07:00
NickrenREN	add091b1fb	fix regression in UX for double attach volume: send event when volume is not allowed to multi-attach	2017-05-25 09:27:24 +08:00
Alexander Block	06baeb33b2	Don't try to attach volumes which are already attached to other nodes	2017-05-18 06:56:30 +02:00
Hemant Kumar	951a36aac7	Add Keepterminatedpodvolumes as an annotation on node and make sure that the controller respects it and doesn't detach mounted volumes.	2017-05-11 22:31:14 -04:00
NickrenREN	0861688237	add and clear err message in RemoveVolumeFromReportAsAttached	2017-05-08 09:37:21 +08:00

71 Commits