kubernetes

Author	SHA1	Message	Date
Kubernetes Prow Robot	66daef4aa7	Merge pull request #108167 from jfremy/fix-107973 Fix nodes volumesAttached status not being updated	2022-03-01 12:49:54 -08:00
Kubernetes Prow Robot	06e107081e	Merge pull request #104732 from mengjiao-liu/remove-flag-experimental-check-node-capabilities-before-mount kubelet: Remove the deprecated flag `--experimental-check-node-capabilities-before-mount`	2022-02-24 07:56:30 -08:00
Jean-Francois Remy	e83184568d	Add unit tests - actual_state_of_world_test.go: test the new method GetVolumesToReportAttachedForNode for an existing node and a non-existing node - node_status_updater_test.go: test UpdateNodeStatuses and UpdateNodeStatuses in nominal case with 2 nodes getting one volume each. Test UpdateNodeStatuses with the first call to node.patch failing but the following one succeeding - add comment in node_status_updater.go - fix log line in reconciler.go - rename variable in actual_state_of_world.go	2022-02-22 12:21:58 -08:00
Jean-Francois Remy	f1717baaaa	Fix nodes volumesAttached status not updated The UpdateNodeStatuses code stops too early in case there is an error when calling updateNodeStatus. It will return immediately which means any remaining node won't have its update status put back to true. Looking at the call sites for UpdateNodeStatuses, it appears this is not the only issue. If the lister call fails with anything but a Not Found error, it's silently ignored which is wrong in the detach path. Also the reconciler detach path calls UpdateNodeStatuses but the real intent is to only update the node currently processed in the loop and not proceed with the detach call if there is an error updating that specifi node volumesAttached property. With the current implementation, it will not proceed if there is an error updating another node (which is not completely bad but not ideal) and worse it will proceed if there is a lister error on that node which means the node volumesAttached property won't have been updated. To fix those issues, introduce the following changes: - [node_status_updater] introduce UpdateNodeStatusForNode which does what UpdateNodeStatuses does but only for the provided node - [node_status_updater] if the node lister call fails for anything but a Not Found error, we will return an error, not ignore it - [node_status_updater] if the update of a node volumesAttached properties fails we continue processing the other nodes - [actual_state_of_world] introduce GetVolumesToReportAttachedForNode which does what GetVolumesToReportAttached but for the node whose name is provided it returns a bool which indicates if the node in question needs an update as well as the volumesAttached list. It is used by UpdateNodeStatusForNode - [actual_state_of_world] use write lock in updateNodeStatusUpdateNeeded, we're modifying the map content - [reconciler] use UpdateNodeStatusForNode in the detach loop	2022-02-22 12:20:53 -08:00
yujunwang	8f96600907	perf:logic-optimiz-for-DetermineVolumeAction	2022-01-22 23:45:29 +08:00
Kubernetes Prow Robot	184daed0db	Merge pull request #107559 from liggitt/invalid-selectors Handle invalid selectors properly	2022-01-19 14:49:31 -08:00
Jordan Liggitt	c0af728f43	Handle invalid selectors properly	2022-01-14 12:11:02 -05:00
Wojciech Tyczyński	551790729f	Remove selflink references in different testing-related files	2022-01-14 12:58:05 +01:00
Patrick Ohly	9eaa2dc554	avoid klog Info calls without verbosity In the following code pattern, the log message will get logged with v=0 in JSON output although conceptually it has a higher verbosity: if klog.V(5).Enabled() { klog.Info("hello world") } Having the actual verbosity in the JSON output is relevant, for example for filtering out only the important info messages. The solution is to use klog.V(5).Info or something similar. Whether the outer if is necessary at all depends on how complex the parameters are. The return value of klog.V can be captured in a variable and be used multiple times to avoid the overhead for that function call and to avoid repeating the verbosity level.	2022-01-12 07:48:36 +01:00
Mengjiao Liu	beda4cafb6	kubelet: Remove the deprecated flag `--experimental-check-node-capabilities-before-mount`	2022-01-06 11:47:11 +08:00
Kubernetes Prow Robot	abfe397ceb	Merge pull request #107166 from jsafrane/fix-pv-controller-tests Fix PV controller unit test 5-7	2022-01-04 23:03:27 -08:00
Jan Safranek	0f9832d095	Fix PV name in unit test Test 5-5 should use PV with "5-5"i in the name. It makes log analysys much easier.	2022-01-03 15:20:12 +01:00
Kubernetes Prow Robot	f0dbc32ed9	Merge pull request #106853 from gnufied/disable-exp-backoff-volume-not-inuse When volume is not marked in-use, do not backoff	2021-12-22 19:46:37 -08:00
Jan Safranek	045ca75c03	Deflake PV metrics unit test Test 5-7 tries to delete a PVC at the very same time when it detects that the PV controller started processing the PVC. The controller then sometimes can't update the PVC and generate an event for it that the test expects. From PV controller logs (not shown in CI): > I1221 14:36:34.548160 104481 pv_controller.go:815] updating PersistentVolumeClaim[default/claim5-7] status: set phase Lost failed: cannot update claim claim5-7: claim not found Typical error in CI: > FAIL: TestControllerSync (83.22s) > framework_test.go:202: Event "Warning ClaimLost" not emitted Therefore wait for the PVC to be fully processed before deleting the PVC to avoid races.	2021-12-21 15:36:44 +01:00
Jan Safranek	e323306ce0	Add new watchers to PV controller tests Add fake Pod and Node watchers to the tests. It only reduces test noise: Failed to watch *v1.Pod: unhandled watch: testing.WatchActionImpl{ActionImpl:testing.ActionImpl{Namespace:"", Verb:"watch", Resource:schema.GroupVersionResource{Group:"", Version:"v1", Resource:"pods"}, Subresource:""}, WatchRestrictions:testing.WatchRestrictions{Labels:labels.internalSelector(nil), Fields:fields.andTerm{}, ResourceVersion:""}}	2021-12-21 15:36:34 +01:00
Hemant Kumar	7989f27044	use node informer to check volumes attachment status before backoff fix unit tests	2021-12-20 11:57:05 -05:00
Davanum Srinivas	9405e9b55e	Check in OWNERS modified by update-yamlfmt.sh Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2021-12-09 21:31:26 -05:00
Kubernetes Prow Robot	12901b95c9	Merge pull request #106344 from ikeeip/fix_import_formatting Fix golang imports in k8s.io/pkg/controller/volume/persistentvolume package	2021-12-07 17:26:40 -08:00
Kubernetes Prow Robot	6805e6ee41	Merge pull request #104722 from leiyiz/migration turning on the CSIMigrationGCE feature flag	2021-11-16 15:28:32 -08:00
Kubernetes Prow Robot	f151a40d8d	Merge pull request #106154 from gnufied/recover-expansion-failure-123 Recover expansion failure	2021-11-16 13:21:34 -08:00
Léiyì Zhang	275fdf0884	fixing unit test failures induced by turning on CSIMigrationGCE disable CSIMigrationGCE in some unit tests	2021-11-16 19:26:30 +00:00
Hemant Kumar	1ddd598d31	Implement controller and kubelet changes for recovery from resize failures	2021-11-16 11:06:46 -05:00
Kubernetes Prow Robot	ce98eda406	Merge pull request #106376 from jsafrane/stabilize-unit-test Fix deletion protection unit test	2021-11-15 13:04:48 -08:00
Neha Lohia	fa1b6765d5	move pkg/util/node to component-helpers/node/util (#105347 ) Signed-off-by: Neha Lohia <nehapithadiya444@gmail.com>	2021-11-12 07:52:27 -08:00
Jan Safranek	bb8157d780	Fix deletion protection unit test The test should not depend on current set of default feature gates, it should always ensure the ones necessary for the tests are set.	2021-11-12 10:47:15 +01:00
Konstantin Misyutin	7434fdf1d4	fix import formatting in k8s.io/pkg/controller/volume/persistentvolume package Signed-off-by: Konstantin Misyutin <konstantin.misyutin@huawei.com>	2021-11-11 16:31:38 +08:00
Deepak Kinni	bfd5f23a0b	PV controller changes to support PV Deletion protection finalizer Signed-off-by: Deepak Kinni <dkinni@vmware.com>	2021-11-08 10:35:58 -08:00
Konstantin Misyutin	808c8f42d5	Remove StorageObjectInUseProtection feature gate logic This feature has graduated to GA in v1.11 and will always be enabled. So no longe need to check if enabled. Signed-off-by: Konstantin Misyutin <konstantin.misyutin@huawei.com>	2021-11-03 00:13:50 +03:00
Mike Dame	4960d0976a	Wire contexts to Core controllers	2021-11-01 10:29:00 -04:00
Kubernetes Prow Robot	c592bd40f2	Merge pull request #105609 from pohly/generic-ephemeral-volume-ga generic ephemeral volume GA	2021-10-28 17:36:50 -07:00
Konstantin Misyutin	dbc9d7b71a	Remove tests when StorageObjectInUseProtection feature is disabled As well as feature gate are locked, the tests when this feature is disabled will crash. So we should remove them together with locking the feature. Signed-off-by: Konstantin Misyutin <konstantin.misyutin@huawei.com>	2021-10-15 19:39:37 +08:00
Kubernetes Prow Robot	baaa53db64	Merge pull request #105211 from xiaopingrubyist/fix-pv-controller-claim-cache-issue fix:claim cached in pvcontroller is not the newest may cause unexpected issue	2021-10-14 05:47:18 -07:00
torubylist	f28a8d7f2b	fix:cached claim is not the newest will cause unexpected issue	2021-10-13 20:03:00 +08:00
Patrick Ohly	a8c930ef46	generic ephemeral volume: graduation to GA The feature gate gets locked to "true", with the goal to remove it in two releases. All code now can assume that the feature is enabled. Tests for "feature disabled" are no longer needed and get removed. Some code wasn't using the new helper functions yet. That gets changed while touching those lines.	2021-10-11 20:54:20 +02:00
Kubernetes Prow Robot	b0eac84937	Merge pull request #105345 from pohly/generic-ephemeral-volume-util generic ephemeral volume util, base code and controller	2021-10-07 08:19:47 -07:00
Patrick Ohly	4ae0eecb34	controller: use generic ephemeral volume helper functions The name concatenation and ownership check were originally considered small enough to not warrant dedicated functions, but the intent of the code is more readable with them. There also was a missing owner check in the attach controller.	2021-10-06 14:01:44 +02:00
Kubernetes Prow Robot	debd6c1e9e	Merge pull request #104526 from jingxu97/aug/volumeattach Fix issue in node status updating VolumeAttached list	2021-10-05 17:30:32 -07:00
Jing Xu	69b9f9b1f0	Fix issue in node status updating VolumeAttached list During volume detach, the following might happen in reconciler 1. Pod is deleting 2. remove volume from reportedAsAttached, so node status updater will update volumeAttached list 3. detach failed due to some issue 4. volume is added back in reportedAsAttached 5. reconciler loops again the volume, remove volume from reportedAsAttached 6. detach will not be trigged because exponential back off, detach call will fail with exponential backoff error 7. another pod is added which using the same volume on the same node 8. reconciler loops and it will NOT try to tigger detach anymore At this point, volume is still attached and in actual state, but volumeAttached list in node status does not has this volume anymore, and will block volume mount from kubelet. The fix in first round is to add volume back into the volume list that need to reported as attached at step 6 when detach call failed with error (exponentical backoff). However this might has some performance issue if detach fail for a while. During this time, volume will be keep removing/adding back to node status which will cause a surge of API calls. So we changed to logic to check first whether operation is safe to retry which means no pending operation or it is not in exponentical backoff time period before calling detach. This way we can avoid keep removing/adding volume from node status. Change-Id: I5d4e760c880d72937d34b9d3e904ecad125f802e	2021-10-05 09:44:35 -07:00
Kubernetes Prow Robot	b6924839ca	Merge pull request #101987 from sky-philipalmeida/patch-1 Log if PV is still in use trying to delete it	2021-09-23 14:30:54 -07:00
Phil	f1a9402082	Log if PV is still in use trying to delete it Similar to what we have in: https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/volume/pvcprotection/pvc_protection_controller.go#L181 The objective is to have a easy way to monitor if a PV will enter in Terminating state due to a failed removal when still in use. This way we can capture the PV log and alert according. The code is not tested. Update pv_protection_controller.go Change call to Infof	2021-09-21 18:05:16 +01:00
Shivanshu Raj Shrivastava	bbd809cbd0	Fixing incorrectly migrated structured logs (#105122 ) * added keys for structured logging * used KObj	2021-09-19 12:28:08 -07:00
Kubernetes Prow Robot	bcd2ffbdc1	Merge pull request #104590 from Jiawei0227/anno Add GA AnnStorageProvisioner annotation to PVC	2021-09-03 06:09:49 -07:00
Kubernetes Prow Robot	fca3175df7	Merge pull request #104231 from astraw99/fix_unified_workers Unify controller worker num param `threadiness` to `workers`	2021-08-27 09:34:05 -07:00
Jiawei Wang	8de0f11946	Add GA AnnStorageProvisioner annotation to PVC This PR adds GA AnnStorageProvisioner annotation to a PVC if the PVC requires dynamic provisioning. This also deprecates the beta AnnStorageProvisioner annotation and it will be removed in a later release.	2021-08-26 12:46:47 -07:00
Stephen Augustus	481cf6fbe7	generated: Run hack/update-gofmt.sh Signed-off-by: Stephen Augustus <foo@auggie.dev>	2021-08-24 15:47:49 -04:00
Konstantin Misyutin	29bd66d018	Remove "pkg/controller/volume/scheduling" dependency from "pkg/scheduler/framework/plugins" All dependencies of VolumeBinding plugin from "k8s.io/kubernetes/pkg/controller/volume/scheduling" package moved to "k8s.io/kubernetes/pkg/scheduler/framework/plugins/volumebinding" package: - whole file pkg/controller/volume/scheduling/scheduler_assume_cache.go - whole file pkg/controller/volume/scheduling/scheduler_assume_cache_test.go - whole file pkg/controller/volume/scheduling/scheduler_binder.go - whole file pkg/controller/volume/scheduling/scheduler_binder_fake.go - whole file pkg/controller/volume/scheduling/scheduler_binder_test.go Package "k8s.io/kubernetes/pkg/controller/volume/scheduling/metrics" moved to "k8s.io/kubernetes/pkg/scheduler/framework/plugins/volumebinding/metrics" because it only used in VolumeBinding plugin and (e2e) tests. More described in issue #89930 and PR #102953. Signed-off-by: Konstantin Misyutin <konstantin.misyutin@huawei.com>	2021-08-13 19:08:45 +08:00
astraw99	e6df935fd3	unify worker num to workers	2021-08-09 15:46:04 +08:00
SataQiu	7fa0b9b6c1	add --concurrent-ephemeralvolume-syncs flag for kube-controller-manager	2021-07-25 21:36:57 +08:00
Cheng Xing	0e315355df	Pass FsGroup to MountDevice	2021-07-03 16:29:42 -07:00
Chris Henzie	83e3ee780a	Rename access mode contains helper method So it is consistent with other methods performing the same check (one for internal and external types)	2021-06-28 21:24:56 -07:00
Jordan Liggitt	ca279bbcc1	Fix race in attachdetach tests	2021-06-04 01:59:32 -04:00
yuzhiquan	0b8dc56408	fix volume failing test	2021-06-04 09:45:21 +08:00
Tim Ebert	cd3709232f	Fix VolumeAttachment garbage collection for migrated PVs	2021-05-28 08:35:05 +02:00
Kubernetes Prow Robot	894803ab2e	Merge pull request #98199 from yangjunmyfm192085/run-test3 fix mistake about [avaliable] for index_test.go	2021-05-25 02:46:22 -07:00
Kubernetes Prow Robot	838a967be5	Merge pull request #101175 from lojies/cleanupforpvcontroller code cleanup:remove redundant return statement in pv_controller.go	2021-05-24 21:48:49 -07:00
Jiawei Wang	be583070d2	Use CSI driver to determine unique name for migrated in-tree plugins	2021-05-06 10:31:30 -07:00
Kubernetes Prow Robot	fe88bdc1ab	Merge pull request #101304 from wangyx1992/capatial-log-controller cleanup: fix errors in wrapped format and log capitalization in controller	2021-04-22 15:55:52 -07:00
wangyx1992	fd51e654af	cleanup: fix errors in wrapped format and log capitalization in controller Signed-off-by: wangyx1992 <wang.yixiang@zte.com.cn>	2021-04-22 15:40:54 +08:00
andyzhangx	e10d3948f5	fix: azure file namespace issue in csi translation fix build failure fix comments	2021-04-20 07:23:09 +00:00
Kubernetes Prow Robot	df9ad4d7d2	Merge pull request #96094 from Hellcatlk/m Some comments' typos	2021-04-16 11:54:22 -07:00
卢振兴10069964	8009823867	code cleanup:remove redundant return statement in pv_controller.go	2021-04-16 09:02:21 +08:00
Kubernetes Prow Robot	410d092d8a	Merge pull request #99643 from pohly/generic-ephemeral-volume-beta generic ephemeral volume beta	2021-03-09 17:39:26 -08:00
Kubernetes Prow Robot	5155865ae2	Merge pull request #99326 from sunpa93/fs_resize_fix fix: use pv annotation to trigger filesystem resize when necessary	2021-03-09 11:05:18 -08:00
Kubernetes Prow Robot	dc74b9d0c7	Merge pull request #98753 from Jiawei0227/length Relax csiNodeIDMaxLength to longer limit	2021-03-09 09:19:00 -08:00
Kubernetes Prow Robot	a56fa34d6b	Merge pull request #99942 from jsafrane/refactor-migration-featuregates Refactor CSI migration plugin manager to get featureGates as a parameter	2021-03-09 04:27:46 -08:00
Sung Jun Park	5f69cf74d8	fix: when newly binding pvc to a pv, adjust pvc.status.capacity to pv's annotation that denotes the pre-resize capacity of the original pvc that pv was bound to if it has one test: confirm that pvc's status capacity is adjusted if pv has a pre-resize capacity annotation	2021-03-09 07:55:10 +00:00
Patrick Ohly	555d4a12bf	generic ephemeral volumes: drop ReadOnly field As discussed during the alpha review, the ReadOnly field is not really needed because volume mounts can also be read-only. It's a historical oddity that can be avoided for generic ephemeral volumes as part of the promotion to beta.	2021-03-09 08:22:48 +01:00
Jiawei Wang	1e16615fb0	Relax csiNodeIDMaxLength to longer limit Update csiNodeIDMaxLength to 192 bytes	2021-03-08 13:52:43 -08:00
Patrick Ohly	3fa43540b6	CSIStorageCapacity: check MaximumVolumeSize during scheduling If available, then the MaximumVolumeSize is a better indicator whether creating a volume has a chance to succeed than the total (?) Capacity, which is potentially larger and less well-defined.	2021-03-08 20:52:51 +01:00
Patrick Ohly	5ca0814165	CSIStorageCapacity: use beta API	2021-03-08 20:52:50 +01:00
Jan Safranek	219cbc818a	Refactor CSI migration plugin manager to get featureGates as a parameter This allows caller to provide fake ones for testing of various corner cases (migration on A/D controller disabled while enabled on kubelet).	2021-03-08 13:50:01 +01:00
Yecheng Fu	d791f7feef	Prioritizing nodes based on volume capacity: unit tests	2021-03-05 23:59:25 +08:00
Yecheng Fu	21a43586e7	Prioritizing nodes based on volume capacity	2021-03-05 23:59:25 +08:00
Kubernetes Prow Robot	7afa538f18	Merge pull request #99626 from pohly/generic-ephemeral-volume-protection-controller-cleanup PVC protection controller: clarify pod shutdown	2021-03-04 11:00:58 -08:00
Kubernetes Prow Robot	a238698ea0	Merge pull request #99446 from pohly/generic-ephemeral-enablement Generic ephemeral volume enablement	2021-03-04 11:00:30 -08:00
Kubernetes Prow Robot	180d9cfa8b	Merge pull request #99632 from pohly/storage-capacity-enablement volume binder: storage capacity enablement	2021-03-04 02:08:20 -08:00
Patrick Ohly	512401a8a2	scheduler: tests for generic ephemeral volumes This covers some failure scenarios and feature gate enablement.	2021-03-03 10:13:05 +01:00
Patrick Ohly	d2cc70ee2c	scheduler: fail when a pod uses disabled generic ephemeral volumes Without this error, kube-scheduler was simply ignoring the special volume source and scheduled the pod. This was unlikely to work in practice because the volume might have needed binding or the feature is also disabled on kubelet which then doesn't know what to do with the volume.	2021-03-03 10:13:05 +01:00
Patrick Ohly	98f75290ba	generic ephemeral volume: simpler metrics A CounterVector with status as label may create unnecessary overhead and using the success case with the empty label value wasn't easy. It's better to have two seperate counters, one for total number of calls and one for failed calls.	2021-03-02 12:01:37 +01:00
Patrick Ohly	6cb28fd1b4	generic ephemeral volume: add metrics As discussed during the production readiness review, a metric for the PVC create operations is useful. The "ephemeral_volume" workqueue metrics were already added in the initial implementation. The new code follows the example set by the endpoints controller.	2021-03-02 12:01:37 +01:00
Patrick Ohly	e98c40a6f9	volume binder: test different CSIStorageCapacity/CSIDriver combinations When the feature is disabled either in the scheduler or the CSIDriver, the scheduler is expected to schedule pods without considering whether storage capacity is available.	2021-03-02 11:08:57 +01:00
Patrick Ohly	1f3ede50f7	PVC protection controller: clarify pod shutdown The code was correct and now the comment references the code in kubelet to illustrate how pod shutdown works.	2021-03-02 08:31:12 +01:00
Kubernetes Prow Robot	1b88c2ee47	Merge pull request #98912 from wzshiming/ut/speed-up-volume-scheduling Speed up pkg/controller/volume/scheduling unit tests	2021-03-01 13:58:16 -08:00
Kubernetes Prow Robot	5498ee641b	Merge pull request #99561 from BenTheElder/remove-bazel Remove Bazel	2021-03-01 09:55:27 -08:00
Kubernetes Prow Robot	f6152d1521	Merge pull request #97086 from xing-yang/check_datasource Only CSI plugin can have a DataSource	2021-03-01 06:53:26 -08:00
wzshiming	67e4ba0797	Speed up pkg/controller/volume/scheduling unit tests	2021-03-01 11:53:45 +08:00
Benjamin Elder	56e092e382	hack/update-bazel.sh	2021-02-28 15:17:29 -08:00
Kubernetes Prow Robot	c200a8f9b7	Merge pull request #98433 from damemi/remove-helper-from-volume-zone Move GetPersistentVolumeClaimClass to component-helpers	2021-02-26 12:38:15 -08:00
xing-yang	676a3a7012	Only CSI plugin can have a DataSource	2021-02-25 15:27:26 +00:00
Kubernetes Prow Robot	8feec9bf94	Merge pull request #99351 from CaoDonghui123/fixissues3 Remove deadcode	2021-02-24 15:29:34 -08:00
caodonghui	f435e24403	Remove deadcode	2021-02-23 17:58:47 +08:00
Marek Siarkowicz	7b1f3584f5	Fix usage of klog.InfoS	2021-02-20 19:22:16 +01:00
Jiawei Wang	3d61b56bcd	update bazel	2021-02-12 17:50:40 -08:00
Jiawei Wang	6a7222cf4e	Add migrated field to storage_operation_duration_seconds metric	2021-02-12 17:35:01 -08:00
Kubernetes Prow Robot	18605d8814	Merge pull request #98792 from wzshiming/ut/speed-up-persistentvolume Speed up pkg/controller/volume/persistentvolume unit tests	2021-02-10 01:06:59 -08:00
wzshiming	fb518af0fc	Speed up pkg/controller/volume/persistentvolume unit tests	2021-02-05 15:09:36 +08:00
wangkai1994	7edf9e0155	change to kref and kobj	2021-02-03 17:45:38 +08:00
wangkai1994	ab11816570	migrate pkg/controller/volume/pvcprotection.go to structured logs	2021-02-02 17:42:20 +08:00
Mike Dame	ba72411aa2	Move GetPersistentVolumeClaimClass to component-helpers The goal of this move is related to issue 89930, to break the dependence of scheduling plugins on internal helpers. This function can easily move to component-helpers where it will be used by other components as well.	2021-02-01 10:48:38 -05:00
JunYang	f241036d2b	fix mistake about [avaliable] for index_test.go Signed-off-by: JunYang <yang.jun22@zte.com.cn>	2021-01-20 08:37:50 +08:00

1 2 3 4 5 ...

909 Commits