kubernetes

Author	SHA1	Message	Date
Kubernetes Prow Robot	31a482a149	Merge pull request #120344 from rohitssingh/disable_force_detach Add a flag to disable force detach behavior in kube-controller-manager	2024-02-22 13:02:38 -08:00
Rohit Singh	13dddca6a2	Add "disable-force-detach-on-timeout" flag to kube-controller-manager	2024-02-22 18:31:52 +00:00
carlory	4a4940694f	remove stale comments	2023-11-09 11:58:50 +08:00
Jan Safranek	7fc11f47ff	Mark a volume as uncertain-attached after detach error Volume that failed Detach() should not be marked as attached, CSI external-attacher is probably still trying to detach it. Mark it uncertain instead and wait for Detach() to succeed.	2023-09-13 10:03:28 +02:00
carlory	f443c458af	move non-graceful node shutdown to GA	2023-07-11 13:51:51 +08:00
xing-yang	cca6601106	Add reason to force detach metric	2023-07-10 06:30:05 +00:00
guangli.bao	931cc96b8d	remove windows condition when #116693 is merged Signed-off-by: guangli.bao <guangli.bao@daocloud.io>	2023-05-24 14:54:22 +08:00
Kubernetes Prow Robot	484645e817	Merge pull request #116659 from claudiubelu/skip-flaky-tests-2 unit tests: Skip flaky tests on Windows (part 2)	2023-05-23 20:04:48 -07:00
SataQiu	3fa55d469c	fix a bug where the AttachedVolume is not printed correctly in the log	2023-05-11 22:04:30 +08:00
Claudiu Belu	0979d55443	unit tests: Skip flaky tests on Windows (part 2) Some of the unit tests are currently flaky on Windows. This commit skips them until they are resolved.	2023-04-13 12:07:18 +00:00
Kubernetes Prow Robot	8cdc7fa542	Merge pull request #116675 from pacoxu/volume-flake deflake: Add retry with timeout to wait for final conditions	2023-04-11 18:19:09 -07:00
Harshal Patil	1972dd1005	Do not log entire pod struct while attaching the volume Signed-off-by: Harshal Patil <harpatil@redhat.com>	2023-04-05 20:24:12 -04:00
Paco Xu	8e36e948ce	verifyVolumeNoStatusUpdateNeeded may cause flake and so only keep the last ones	2023-03-30 10:44:45 +08:00
Paco Xu	c14068c202	deflake: Add retry with timeout to wait for final conditions	2023-03-22 11:24:09 +08:00
杨军10092085	361e4ff0fa	volume: use contextual logging	2023-03-14 08:37:30 +08:00
mowangdk	bf244d3046	Lower volume attached touch log level	2022-11-16 16:49:07 +08:00
Kubernetes Prow Robot	d7bff1c809	Merge pull request #111577 from brianpursley/troubleshoot-unit-test-flake Add logging for reconciler unit test	2022-11-11 00:44:09 -08:00
Kubernetes Prow Robot	98742f9d77	Merge pull request #110747 from harshanarayana/cleanup/GIT-110737/logging-improvements structured-logging: replace KObjs with KObjSlice for logging	2022-11-03 00:49:34 -07:00
Humble Chirammal	4bafd53a02	Correct typos in pkg/controller/volume Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-09-16 16:50:20 +05:30
ZhangKe10140699	186ddce07b	Fix problem in updating VolumeAttached in node status	2022-08-02 19:01:57 +08:00
Brian Pursley	a29fb9abae	Add logging for reconciler unit test	2022-07-30 10:33:27 -04:00
Harsha Narayana	c3cbc443ef	structured-logging: replace KObjs with KObjSlice for logging	2022-07-01 09:52:07 +05:30
Jan Safranek	3b94ac228a	Don't force detach volume from healthy nodes 6 minute force-detach timeout should be used only for nodes that are not healthy. In case a CSI driver is being upgraded or it's simply slow, NodeUnstage can take more than 6 minutes. In that case, Pod is already deleted from the API server and thus A/D controller will force-detach a mounted volume, possibly corrupting the volume and breaking CSI - a CSI driver expects NodeUnstage to succeed before Kubernetes can call ControllerUnpublish.	2022-06-24 12:51:41 +02:00
Ashutosh Kumar	c00975370a	Handle Non-graceful Node Shutdown (#108486) Signed-off-by: Ashutosh Kumar <sonasingh46@gmail.com> Co-authored-by: Ashutosh Kumar <sonasingh46@gmail.com> Co-authored-by: xing-yang <xingyang105@gmail.com>	2022-03-26 09:23:21 -07:00
Kubernetes Prow Robot	66daef4aa7	Merge pull request #108167 from jfremy/fix-107973 Fix nodes volumesAttached status not being updated	2022-03-01 12:49:54 -08:00
Kubernetes Prow Robot	06e107081e	Merge pull request #104732 from mengjiao-liu/remove-flag-experimental-check-node-capabilities-before-mount kubelet: Remove the deprecated flag `--experimental-check-node-capabilities-before-mount`	2022-02-24 07:56:30 -08:00
Jean-Francois Remy	e83184568d	Add unit tests - actual_state_of_world_test.go: test the new method GetVolumesToReportAttachedForNode for an existing node and a non-existing node - node_status_updater_test.go: test UpdateNodeStatuses and UpdateNodeStatuses in nominal case with 2 nodes getting one volume each. Test UpdateNodeStatuses with the first call to node.patch failing but the following one succeeding - add comment in node_status_updater.go - fix log line in reconciler.go - rename variable in actual_state_of_world.go	2022-02-22 12:21:58 -08:00
Jean-Francois Remy	f1717baaaa	Fix nodes volumesAttached status not updated The UpdateNodeStatuses code stops too early in case there is an error when calling updateNodeStatus. It will return immediately which means any remaining node won't have its update status put back to true. Looking at the call sites for UpdateNodeStatuses, it appears this is not the only issue. If the lister call fails with anything but a Not Found error, it's silently ignored which is wrong in the detach path. Also the reconciler detach path calls UpdateNodeStatuses but the real intent is to only update the node currently processed in the loop and not proceed with the detach call if there is an error updating that specific node volumesAttached property. With the current implementation, it will not proceed if there is an error updating another node (which is not completely bad but not ideal) and worse it will proceed if there is a lister error on that node which means the node volumesAttached property won't have been updated. To fix those issues, introduce the following changes: - [node_status_updater] introduce UpdateNodeStatusForNode which does what UpdateNodeStatuses does but only for the provided node - [node_status_updater] if the node lister call fails for anything but a Not Found error, we will return an error, not ignore it - [node_status_updater] if the update of a node volumesAttached properties fails we continue processing the other nodes - [actual_state_of_world] introduce GetVolumesToReportAttachedForNode which does what GetVolumesToReportAttached does, but only for the node whose name is provided; it returns a bool which indicates if the node in question needs an update as well as the volumesAttached list. It is used by UpdateNodeStatusForNode - [actual_state_of_world] use write lock in updateNodeStatusUpdateNeeded, we're modifying the map content - [reconciler] use UpdateNodeStatusForNode in the detach loop	2022-02-22 12:20:53 -08:00
Patrick Ohly	9eaa2dc554	avoid klog Info calls without verbosity In the following code pattern, the log message will get logged with v=0 in JSON output although conceptually it has a higher verbosity: if klog.V(5).Enabled() { klog.Info("hello world") } Having the actual verbosity in the JSON output is relevant, for example for filtering out only the important info messages. The solution is to use klog.V(5).Info or something similar. Whether the outer if is necessary at all depends on how complex the parameters are. The return value of klog.V can be captured in a variable and be used multiple times to avoid the overhead for that function call and to avoid repeating the verbosity level.	2022-01-12 07:48:36 +01:00
Mengjiao Liu	beda4cafb6	kubelet: Remove the deprecated flag `--experimental-check-node-capabilities-before-mount`	2022-01-06 11:47:11 +08:00
Jing Xu	69b9f9b1f0	Fix issue in node status updating VolumeAttached list During volume detach, the following might happen in reconciler 1. Pod is deleting 2. remove volume from reportedAsAttached, so node status updater will update volumeAttached list 3. detach failed due to some issue 4. volume is added back in reportedAsAttached 5. reconciler loops again over the volume, removes volume from reportedAsAttached 6. detach will not be triggered because of exponential backoff, detach call will fail with exponential backoff error 7. another pod is added which uses the same volume on the same node 8. reconciler loops and it will NOT try to trigger detach anymore At this point, volume is still attached and in actual state, but volumeAttached list in node status does not have this volume anymore, and will block volume mount from kubelet. The fix in the first round was to add the volume back into the list of volumes that need to be reported as attached at step 6, when the detach call failed with an error (exponential backoff). However this might have some performance issues if detach fails for a while. During this time, the volume will keep being removed from and added back to node status, which will cause a surge of API calls. So we changed the logic to check first whether the operation is safe to retry, which means there is no pending operation and it is not in the exponential backoff time period, before calling detach. This way we can avoid repeatedly removing/adding the volume from node status. Change-Id: I5d4e760c880d72937d34b9d3e904ecad125f802e	2021-10-05 09:44:35 -07:00
Benjamin Elder	56e092e382	hack/update-bazel.sh	2021-02-28 15:17:29 -08:00
caodonghui	f435e24403	Remove deadcode	2021-02-23 17:58:47 +08:00
Kubernetes Prow Robot	8bf42039e6	Merge pull request #96552 from pandaamanda/klog_fmt use klog.Info and klog.Warning when had no format	2021-01-15 17:57:43 -08:00
xiongzhongliang	90f4aeeea4	use klog.Info and klog.Warning when had no format	2020-11-14 00:55:06 +08:00
Cheng Xing	d9a629fe3a	IsVolumeAttachedToNode() renamed to GetAttachState(), and returns 3 states instead of combining "uncertain" and "detached" into "false"	2020-10-29 13:24:51 -07:00
Cheng Xing	a61743b125	Fixes Attach Detach Controller reconciler race reading ActualStateOfWorld and operation pending states; fixes reconciler_test mock detach to account for multiple attaches on a node	2020-10-27 23:51:55 -07:00
Davanum Srinivas	07d88617e5	Run hack/update-vendor.sh Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2020-05-16 07:54:33 -04:00
Davanum Srinivas	442a69c3bd	switch over k/k to use klog v2 Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2020-05-16 07:54:27 -04:00
Kubernetes Prow Robot	ef672c1c2d	Merge pull request #88678 from verult/slow-rxm-attach Parallelize attach operations across different nodes for volumes that allow multi-attach	2020-03-06 13:17:21 -08:00
Cheng Xing	ef3d66b98b	Parallelize attach operations across different nodes for volumes that allow multi-attach	2020-03-05 22:22:05 -08:00
taesun_lee	79680b5d9b	Fix pkg/controller typos in some error messages, comments etc - applied review results by LuisSanchez - Co-Authored-By: Luis Sanchez <sanchezl@redhat.com> genernal -> general iniital -> initial initalObjects -> initialObjects intentionaly -> intentionally inforer -> informer anotother -> another triger -> trigger mutli -> multi Verifyies -> Verifies valume -> volume unexpect -> unexpected unfulfiled -> unfulfilled implenets -> implements assignement -> assignment expectataions -> expectations nexpected -> unexpected boundSatsified -> boundSatisfied externel -> external calcuates -> calculates workes -> workers unitialized -> uninitialized afater -> after Espected -> Expected nodeMontiorGracePeriod -> NodeMonitorGracePeriod estimateGrracefulTermination -> estimateGracefulTermination secondrary -> secondary ShouldRunDaemonPodOnUnscheduableNode -> ShouldRunDaemonPodOnUnschedulableNode rrror -> error expectatitons -> expectations foud -> found epackage -> package succesfulJobs -> successfulJobs namesapce -> namespace ConfigMapResynce -> ConfigMapResync	2020-02-27 00:15:33 +09:00
Jordan Liggitt	cd1059e3c4	Revert "Merge pull request #87258 from verult/slow-rxm-attach" This reverts commit `15c3f1b119`, reversing changes made to `52d7614a8c`.	2020-01-29 14:58:32 -05:00
Cheng Xing	c6a03fa5be	Parallelize attach operations across different nodes for volumes that allow multi-attach	2020-01-27 15:02:25 -08:00
yuxiaobo	81e9f21f83	Correct spelling mistakes Signed-off-by: yuxiaobo <yuxiaobogo@163.com>	2019-11-06 20:25:19 +08:00
danielqsj	657a1a1a34	change import alias of utils/strings	2019-01-30 10:44:09 +08:00
danielqsj	093328e57f	migrate to k8s.io/utils/strings	2019-01-30 10:24:00 +08:00
Jing Xu	7bac6ca73a	Address comments This commit addressed the comment and also add a unit test.	2019-01-11 10:57:37 -08:00
Jing Xu	562d0fea53	Handle failed attach operation leave uncertain volume attach state This commit adds the unit tests for the PR. It also includes some files that are affected by the function name changes.	2018-11-19 17:21:49 -08:00
Davanum Srinivas	954996e231	Move from glog to klog - Move from the old github.com/golang/glog to k8s.io/klog - klog as explicit InitFlags() so we add them as necessary - we update the other repositories that we vendor that made a similar change from glog to klog * github.com/kubernetes/repo-infra * k8s.io/gengo/ * k8s.io/kube-openapi/ * github.com/google/cadvisor - Entirely remove all references to glog - Fix some tests by explicit InitFlags in their init() methods Change-Id: I92db545ff36fcec83afe98f550c9e630098b3135	2018-11-10 07:50:31 -05:00


112 Commits