Commit Graph

255 Commits

Author SHA1 Message Date
Jing Xu
9588d2098a Redesign and implement volume reconstruction work
This PR is the first part of the redesign of the volume reconstruction work. The
changes include:
1. Remove dependency on volume spec stored in actual state for volume
cleanup process (UnmountVolume and UnmountDevice)

Modify the AttachedVolume struct to add DeviceMountPath so that the volume
unmount operation can use this information instead of reconstructing it from
the volume spec.

2. Modify the reconciler's volume reconstruction process (syncState). Currently the workflow
is: when kubelet restarts, syncState() is only called once before the
reconciler starts its loop.
a. If volume plugin supports reconstruction, it will use the
reconstructed volume spec information to update actual state as before.
b. If volume plugin cannot support reconstruction, it will use the
scanned mount path information to clean up the mounts.

In this PR, all the plugins still support reconstruction (except
glusterfs), so reconstruction of some plugins will still have issues.
The next PR will modify those plugins that cannot support reconstruction
well.

This PR addresses issues #52683 and #54108 (this PR includes the changes to
update devicePath after local attach finishes).
2018-02-05 13:14:09 -08:00
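
The commit above stores the device mount path in the actual state so that UnmountVolume/UnmountDevice no longer need a reconstructed volume spec. A minimal sketch of the idea, using simplified stand-in types rather than the real Kubernetes structs:

```go
package main

import (
	"fmt"
	"os"
)

// AttachedVolume is a simplified stand-in for the actual-state entry.
// The key point of the commit above: DeviceMountPath is recorded here,
// so cleanup no longer has to rebuild it from the volume spec.
type AttachedVolume struct {
	VolumeName      string
	DevicePath      string
	DeviceMountPath string // recorded at mount time, reused at unmount time
}

// unmountDevice uses the recorded DeviceMountPath directly instead of
// asking the plugin to derive it from a (possibly unavailable) spec.
func unmountDevice(av AttachedVolume) error {
	if av.DeviceMountPath == "" {
		return fmt.Errorf("no device mount path recorded for %s", av.VolumeName)
	}
	// Placeholder for the real unmount syscall / plugin call.
	if _, err := os.Stat(av.DeviceMountPath); err != nil && os.IsNotExist(err) {
		return nil // nothing mounted there, nothing to do
	}
	fmt.Printf("unmounting device of %s at %s\n", av.VolumeName, av.DeviceMountPath)
	return nil
}

func main() {
	_ = unmountDevice(AttachedVolume{
		VolumeName:      "pv-1",
		DevicePath:      "/dev/sdb",
		DeviceMountPath: "/var/lib/kubelet/plugins/example/mounts/pv-1",
	})
}
```
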
Hemant Kumar
afeb53e5ee Perform resize of mounted volume if necessary
Add e2e test for mounted volume resize
2018-01-29 17:49:50 -05:00
Hemant Kumar
1fa8cbc5e4 Improve messaging on resize
- we now provide a clear message to the user about what to do when cloud-provider resizing is finished
  and file system resizing is needed.
- add an event when resizing is successful.
- Use Patch for updating PVCs in both kubelet and controller-manager.
- Extract the PVC-updating util function into one place.
- Only update resize conditions on progress
2018-01-29 15:07:51 -05:00
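
The "Use Patch for updating PVCs" point above boils down to sending a merge patch for only the fields that changed, rather than updating the whole object. A rough, self-contained sketch of building such a patch body (the real code goes through client-go's PersistentVolumeClaims(...).Patch, whose signature varies by version; the field layout here is illustrative):

```go
package main

import (
	"encoding/json"
	"fmt"
)

// buildCapacityPatch returns a strategic-merge-style patch body that only
// touches status.capacity.storage, leaving the rest of the PVC untouched.
func buildCapacityPatch(newSize string) ([]byte, error) {
	patch := map[string]interface{}{
		"status": map[string]interface{}{
			"capacity": map[string]string{
				"storage": newSize,
			},
		},
	}
	return json.Marshal(patch)
}

func main() {
	body, err := buildCapacityPatch("10Gi")
	if err != nil {
		panic(err)
	}
	// In the real controller/kubelet this body would be sent via something like
	// clientset.CoreV1().PersistentVolumeClaims(ns).Patch(name, types.StrategicMergePatchType, body)
	// (hypothetical call shape; check the client-go version in use).
	fmt.Println(string(body))
}
```
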
Kubernetes Submit Queue
6dfc4b38fb
Merge pull request #57702 from mlmhl/volume_resize_event
Automatic merge from submit-queue (batch tested with PRs 57702, 57128). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md

format error message and remove duplicated event for resize volume failure

**What this PR does / why we need it**:

1. The `operationGenerator.resizeFileSystem` method returns errors generated by `volumeToMount.GenerateErrorDetailed`, and the outside code (`operationGenerator.GenerateMountVolumeFunc`) uses `volumeToMount.GenerateError` to generate a new error again, which makes the event message redundant and confusing. We should use `volumeToMount.GenerateError` only inside `operationGenerator.resizeFileSystem`; wrapping again in the outside code is not necessary.

2. The `eventRecorderFunc` will record an event if `resizeFileSystem` returns an error, so we don't need to record an event inside `resizeFileSystem` itself.

**Release note**:

```release-note
NONE
```

/sig storage
/kind enhancement
2018-01-03 08:30:30 -08:00
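
The fix described above is the usual "wrap an error at one layer only" rule: if the inner function already attaches the volume context, the caller should propagate the error as-is instead of wrapping it again. A tiny illustration with plain Go errors (function and variable names here are hypothetical, not the ones in the tree):

```go
package main

import (
	"errors"
	"fmt"
)

var errResizeFailed = errors.New("resize2fs exited with status 1")

// resizeFileSystem wraps the low-level error with volume context exactly once.
func resizeFileSystem(volumeName string) error {
	if err := runResize(); err != nil {
		return fmt.Errorf("MountVolume.resizeFileSystem failed for volume %q: %w", volumeName, err)
	}
	return nil
}

func runResize() error { return errResizeFailed }

func main() {
	// The caller just propagates; wrapping again here would duplicate the
	// volume context in the event message, which is what the PR removed.
	if err := resizeFileSystem("pvc-1234"); err != nil {
		fmt.Println(err)
	}
}
```
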
NickrenREN
74b197e7fe fix expand panic 2018-01-03 10:31:57 +08:00
mlmhl
2bf6b54f05 format error message and remove duplicated event for resize volume
failure
2018-01-03 10:28:54 +08:00
Jeff Grafton
efee0704c6 Autogenerate BUILD files 2017-12-23 13:12:11 -08:00
Tim Hockin
e9dd8a68f6 Revert k8s.gcr.io vanity domain
This reverts commit eba5b6092a.

Fixes https://github.com/kubernetes/kubernetes/issues/57526
2017-12-22 14:36:16 -08:00
Kubernetes Submit Queue
5b55f614d0
Merge pull request #57260 from davidz627/attachMountLogFix
Automatic merge from submit-queue (batch tested with PRs 55475, 57155, 57260, 57222). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md

Improved mount/attach error logging and added attach event.

Fixed kubelet error message to be more descriptive. Added Attach success event for help in debugging.

The attach event is helpful when the node may not have the correct information about attachment status; it allows the user to see whether Attach was run at all. If there is no attach success/failure message, we can infer that no attach was started.

Fixes #57217
2017-12-18 19:45:43 -08:00
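
The "Attach success event" mentioned above is just a call to the event recorder with a Normal event type once Attach returns without error. A minimal sketch against a hypothetical recorder interface (the real code uses client-go's record.EventRecorder; the names below are stand-ins):

```go
package main

import "fmt"

// EventRecorder is a cut-down stand-in for client-go's record.EventRecorder.
type EventRecorder interface {
	Eventf(objectName, eventType, reason, messageFmt string, args ...interface{})
}

type stdoutRecorder struct{}

func (stdoutRecorder) Eventf(objectName, eventType, reason, messageFmt string, args ...interface{}) {
	fmt.Printf("%s %s %s: %s\n", objectName, eventType, reason, fmt.Sprintf(messageFmt, args...))
}

func attachVolume(rec EventRecorder, nodeName, volumeName string) error {
	// ... call the plugin's Attach() here ...
	// On success, emit an event so users can see the attach actually ran,
	// even if the node's own view of attachment status is stale.
	rec.Eventf(nodeName, "Normal", "SuccessfulAttachVolume",
		"AttachVolume.Attach succeeded for volume %q", volumeName)
	return nil
}

func main() {
	_ = attachVolume(stdoutRecorder{}, "node-1", "pv-1")
}
```
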
Tim Hockin
eba5b6092a Use k8s.gcr.io vanity domain for container images 2017-12-18 09:18:34 -08:00
David Zhu
fffd152e0d Fixed kubelet error message to be more descriptive. Added Attach success event for help in debugging. 2017-12-15 15:36:59 -08:00
David Zhu
e3f8f64c17 refactored mount, attach, and resize operations so that all failures generate events and event generation is more consistent.
refactored operation generator and operation executor to use more general generated functions for operations, completions, and events.
2017-12-14 11:09:12 -08:00
Hemant Kumar
c82d412993 Do not resize file system on a read-only mount 2017-11-29 11:56:30 -05:00
Hemant Kumar
c0353ca20c Remove conditions from PVC after successful resize 2017-11-29 10:10:32 -05:00
Hemant Kumar
7be94c4b06 Implement resizing support for GCE
Fix GCE attacher test
Update bazel files
2017-11-22 16:24:58 -05:00
Hemant Kumar
2f2a643684 Implement file system resizing support on kubelet start
Update bazel files
Fix operation executor tests
2017-11-22 16:06:10 -05:00
mtanino
8903e8cd85 BlockVolumesSupport: CRI, VolumeManager and OperationExecutor changes
This patch contains the following changes.
- container runtime changes for adding block devices
- volumemanager changes
- operationexecutor changes
2017-11-20 14:10:26 -05:00
Hemant Kumar
5297c146c1 Fix dangling attach errors
Detach volumes from shutdown nodes and ensure that
dangling volumes are handled correctly in AWS
2017-11-16 08:43:36 -05:00
Kevin
4c8539cece use core client with explicit version globally 2017-10-27 15:48:32 +08:00
Jeff Grafton
aee5f457db update BUILD files 2017-10-15 18:18:13 -07:00
Hemant Kumar
0ad846cb18 Add documentation comments for volume expand controller
These comments help clarify some of the design choices made
in the code.
2017-09-26 22:25:23 -04:00
Hemant Kumar
cd2a68473a Implement controller for resizing volumes 2017-09-04 09:02:34 +02:00
Kubernetes Submit Queue
2a2f499455 Merge pull request #50036 from wongma7/metrics
Automatic merge from submit-queue

Add volume operation metrics to operation executor and PV controller

This PR implements the proposal for high level volume metrics https://github.com/kubernetes/community/pull/809

**Special notes for your reviewer**:

~Differences from proposal:~ all resolved

~"verify_volume" is now "verify_volumes_are_attached" + "verify_volumes_are_attached_per_node" + "verify_controller_attached_volume." Which of them do we want?~

~There is no "mount_device" metric because the MountVolume operation combines MountDevice and mount (plugin.Setup). Do we want to extract the mount_device metric or is it okay to keep mountvolume as one? For attachable volumes, MountDevice is the actual mount and Setup is a bindmount + setvolumeownership. For unattachable, mountDevice does not occur and Setup is an actual mount + setvolumeownership.~

~PV controller metrics I did not implement following the proposal at all. I did not change goroutinemap nor scheduleOperation. Because provisionClaimOperation does not return an error, so it's impossible for the caller to know if there is actually a failure worth reporting. So I manually create a new metric inside the function according to some conditions.~

@gnufied 

I have tested the operationexecutor metrics but not provision & delete. Sample: 
![screen shot 2017-08-02 at 15 01 08](https://user-images.githubusercontent.com/13111288/28889980-a7093526-7793-11e7-9aa9-ad7158be76fa.png)


**Release note**:

```release-note
Add error count and time-taken metrics for storage operations such as mount and attach, per-volume-plugin.
```
2017-08-28 04:20:49 -07:00
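
The metrics added by the PR above are a duration histogram plus an error counter, labeled by volume plugin and operation name. A rough sketch with Prometheus client_golang (the metric names follow the release note's wording but should be treated as illustrative, not the exact names Kubernetes registers):

```go
package main

import (
	"time"

	"github.com/prometheus/client_golang/prometheus"
)

var (
	operationDuration = prometheus.NewHistogramVec(
		prometheus.HistogramOpts{
			Name: "storage_operation_duration_seconds",
			Help: "Time taken by storage operations",
		},
		[]string{"volume_plugin", "operation_name"},
	)
	operationErrors = prometheus.NewCounterVec(
		prometheus.CounterOpts{
			Name: "storage_operation_errors_total",
			Help: "Number of failed storage operations",
		},
		[]string{"volume_plugin", "operation_name"},
	)
)

// instrument wraps a volume operation and records its duration and errors.
func instrument(plugin, operation string, op func() error) error {
	start := time.Now()
	err := op()
	operationDuration.WithLabelValues(plugin, operation).Observe(time.Since(start).Seconds())
	if err != nil {
		operationErrors.WithLabelValues(plugin, operation).Inc()
	}
	return err
}

func main() {
	prometheus.MustRegister(operationDuration, operationErrors)
	_ = instrument("kubernetes.io/gce-pd", "volume_attach", func() error { return nil })
}
```
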
mtanino
5ff9dc0b3b WaitForAttach refactoring for iSCSI attacher/detacher
This change is prerequisite for implementing iSCSI attacher
and detacher.

In order to use chap authentication at iSCSI plugin after
implementing attacher and detacher, secret is needed at
AttachDisk() which is called from WaitForAttach().
To obtain secret, pod information is required, but
WaitForAttach() doesn't pass pod information inside.

This patch adds 'pod' as an argument of WaitForAttach()
and adds changes to the drivers that implement WaitForAttach().

Fixes #48953
2017-08-26 17:21:34 -04:00
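
The signature change above boils down to threading the Pod through WaitForAttach so a plugin can look up per-pod secrets (e.g. CHAP credentials for iSCSI). A simplified sketch of the resulting shape, using stand-in types rather than the real volume.Spec and v1.Pod:

```go
package main

import (
	"fmt"
	"time"
)

type Spec struct{ Name string }

type Pod struct {
	Namespace, Name string
	SecretRef       string // where the CHAP secret would be looked up from
}

// Attacher shows the interface shape after the change: the pod is now
// available inside WaitForAttach, so AttachDisk can fetch the secret.
type Attacher interface {
	WaitForAttach(spec *Spec, devicePath string, pod *Pod, timeout time.Duration) (string, error)
}

type iscsiAttacher struct{}

func (iscsiAttacher) WaitForAttach(spec *Spec, devicePath string, pod *Pod, timeout time.Duration) (string, error) {
	if pod != nil && pod.SecretRef != "" {
		fmt.Printf("using CHAP secret %q from pod %s/%s\n", pod.SecretRef, pod.Namespace, pod.Name)
	}
	return devicePath, nil
}

func main() {
	var a Attacher = iscsiAttacher{}
	_, _ = a.WaitForAttach(&Spec{Name: "iscsi-pv"}, "/dev/disk/by-path/example", &Pod{
		Namespace: "default", Name: "db-0", SecretRef: "chap-secret",
	}, 10*time.Second)
}
```
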
Matthew Wong
3ed34183d0 Add volume operation metrics to operation executor and PV controller 2017-08-23 14:27:47 -04:00
Jeff Grafton
a7f49c906d Use buildozer to delete licenses() rules except under third_party/ 2017-08-11 09:32:39 -07:00
Jeff Grafton
33276f06be Use buildozer to remove deprecated automanaged tags 2017-08-11 09:31:50 -07:00
Hemant Kumar
0b1d61db00 Fix controller crash because of nil volume spec
For volumes that don't support bulk volume verification, a nil
volume spec can crash the controller.
2017-07-21 18:42:11 -04:00
Jacob Simpson
29c1b81d4c Scripted migration from clientset_generated to client-go. 2017-07-17 15:05:37 -07:00
Chao Xu
60604f8818 run hack/update-all 2017-06-22 11:31:03 -07:00
Chao Xu
f4989a45a5 run root-rewrite-v1-..., compile 2017-06-22 10:25:57 -07:00
mbohlool
c91a12d205 Remove all references to types.UnixUserID and types.UnixGroupID 2017-06-21 04:09:07 -07:00
FengyunPan
1f47323187 Wait for the attach operation to finish rather than returning nil 2017-06-06 22:58:44 +08:00
ailusazh
f57224c0d2 Add SuccessfulMountVolume message to the events of pod 2017-06-01 17:56:47 +08:00
Kubernetes Submit Queue
0aad9d30e3 Merge pull request #44897 from msau42/local-storage-plugin
Automatic merge from submit-queue (batch tested with PRs 46076, 43879, 44897, 46556, 46654)

Local storage plugin

**What this PR does / why we need it**:
Volume plugin implementation for local persistent volumes.  Scheduler predicate will direct already-bound PVCs to the node that the local PV is at.  PVC binding still happens independently.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: 
Part of #43640

**Release note**:

```
Alpha feature: Local volume plugin allows local directories to be created and consumed as a Persistent Volume.  These volumes have node affinity and pods will only be scheduled to the node that the volume is at.
```
2017-05-30 23:20:02 -07:00
Kubernetes Submit Queue
ef1febf789 Merge pull request #46367 from bobveznat/master
Automatic merge from submit-queue (batch tested with PRs 46450, 46272, 46453, 46019, 46367)

Move MountVolume.SetUp succeeded to debug level

This message is verbose and repeated over and over again in log files
creating a lot of noise. Leave the message in, but require a -v in
order to actually log it.

**What this PR does / why we need it**: Moves a verbose log message to actually be verbose.

**Which issue this PR fixes** fixes #46364
Fixes #29059
2017-05-26 18:49:04 -07:00
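
Moving a message "to debug level" here means gating it behind a verbosity level so it only appears when the kubelet runs with a high enough -v. A one-line sketch with the glog package kubelet used at the time (assuming glog is available in the build):

```go
package main

import (
	"flag"

	"github.com/golang/glog"
)

func main() {
	flag.Parse() // glog registers -v and related flags on the default FlagSet

	// Logged unconditionally: too noisy for a per-mount success message.
	glog.Infof("MountVolume.SetUp succeeded for volume %q", "pv-1")

	// Only logged when running with -v=4 or higher, which is the change the PR makes.
	glog.V(4).Infof("MountVolume.SetUp succeeded for volume %q", "pv-1")
}
```
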
Kubernetes Submit Queue
c34b359bd7 Merge pull request #45923 from verult/cxing/NodeStatusUpdaterFix
Automatic merge from submit-queue (batch tested with PRs 46383, 45645, 45923, 44884, 46294)

Node status updater now deletes the node entry in attach updates...

… when node is missing in NodeInformer cache.

- Added RemoveNodeFromAttachUpdates as part of node status updater operations.



**What this PR does / why we need it**: Fixes issue of unnecessary node status updates when node is deleted.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #42438

**Special notes for your reviewer**: Unit test added, but a more comprehensive test involving the attach/detach controller requires testing functionality that is currently absent and will require a larger effort. It will be added at a later time.

There is an edge case caused by the following steps:
1) A node is deleted and restarted. The node exists, but is not yet recognized by Kubernetes.
2) A pod requiring a volume attach is created with nodeName explicitly set to this node.

This would leave the pod stuck in the ContainerCreating state. This is low priority since it's a specific edge case that can be avoided.

**Release note**:

```release-note
NONE
```
2017-05-26 12:58:02 -07:00
Bob Van Zant
aca05c922c Move MountVolume.SetUp succeeded to debug level
This message is verbose and repeated over and over again in log files
creating a lot of noise. Leave the message in, but require a -v in
order to actually log it.

Fixes #29059
2017-05-26 10:54:34 -07:00
Cheng Xing
f9dc2d5ca3 Node status updater now deletes the node entry in attach updates when node is missing in NodeInformer cache. Fixes #42438.
- Added RemoveNodeFromAttachUpdates as part of node status updater operations.
2017-05-24 18:31:47 -07:00
NickrenREN
add091b1fb fix UX regression for double attach volume
send an event when a volume is not allowed to multi-attach
2017-05-25 09:27:24 +08:00
Michelle Au
dd46c7f88e Local volume plugin 2017-05-22 14:44:51 -07:00
Michelle Au
06f25b03eb Check volume node affinity before mount 2017-05-22 14:44:06 -07:00
FengyunPan
4a6e1f2a1d Don't return err when volume's status is 'attaching'
When a volume's status is 'attaching', its attachments will be None, so
the controller-manager can't get the device path and emits failure events.
But this situation is normal; let's fix it.
2017-05-12 19:53:50 +08:00
Kubernetes Submit Queue
5b3d0bbe66 Merge pull request #44714 from jamiehannaford/unix_user_type
Automatic merge from submit-queue (batch tested with PRs 44590, 44969, 45325, 45208, 44714)

Use dedicated UnixUserID and UnixGroupID types

**What this PR does / why we need it**:

DRYs up type definitions by using the dedicated types in apimachinery 

**Which issue this PR fixes**

#38120

**Release note**:
```release-note
UIDs and GIDs now use apimachinery types
```
2017-05-05 14:08:17 -07:00
Jamie Hannaford
9440a68744 Use dedicated Unix User and Group ID types 2017-05-05 14:07:38 +02:00
Ian Chakeres
bbc8859176 Refactor volume operation log and error messages 2017-05-04 13:29:01 -07:00
Tomas Smetana
852c44ae59 Fix issue #34242: Attach/detach should recover from a crash
When the attach/detach controller crashes and a pod with attached PV is deleted
afterwards the controller will never detach the pod's attached volumes. To
prevent this the controller should try to recover the state from the nodes
status.
2017-04-20 13:04:50 +02:00
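
Recovering after a controller crash, as described above, means rebuilding the actual state of the world from what the API server already knows: each node's status.volumesAttached list. A simplified sketch of that repopulation step, with stand-in types instead of the real API objects:

```go
package main

import "fmt"

type AttachedVolume struct{ Name, DevicePath string }

type NodeStatus struct{ VolumesAttached []AttachedVolume }

type Node struct {
	Name   string
	Status NodeStatus
}

// actualStateOfWorld maps node name -> set of attached volume names.
type actualStateOfWorld map[string]map[string]bool

// populateFromNodeStatus seeds the cache from node status after a restart,
// so volumes attached before the crash can still be detached later.
func (asw actualStateOfWorld) populateFromNodeStatus(nodes []Node) {
	for _, node := range nodes {
		if asw[node.Name] == nil {
			asw[node.Name] = map[string]bool{}
		}
		for _, av := range node.Status.VolumesAttached {
			asw[node.Name][av.Name] = true
		}
	}
}

func main() {
	asw := actualStateOfWorld{}
	asw.populateFromNodeStatus([]Node{{
		Name:   "node-1",
		Status: NodeStatus{VolumesAttached: []AttachedVolume{{Name: "pv-1", DevicePath: "/dev/sdb"}}},
	}})
	fmt.Println(asw)
}
```
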
Mike Danese
a05c3c0efd autogenerated 2017-04-14 10:40:57 -07:00
Kubernetes Submit Queue
b625085230 Merge pull request #42325 from tsmetana/remove-unused-method-from-og
Automatic merge from submit-queue

Remove unused method from operation_generator

This is only a removal of the GerifyVolumeIsSafeToDetach [sic] method from operation_executor. The method is not called from anywhere; moreover, there is a private method named verifyVolumeIsSafeToDetach (which is being used). This looks like a copy-and-paste mistake that deserves to be cleaned up.
```release-note
NONE
```
2017-03-31 10:56:40 -07:00
Kubernetes Submit Queue
803369b9cc Merge pull request #42006 from screeley44/error-events3
Automatic merge from submit-queue (batch tested with PRs 42522, 42545, 42556, 42006, 42631)

Fixes MountVolume.NewMounter errors not displayed to users via describe events

Fixes #42004 

This fixes the problem of mount errors being eaten and not displayed to users again. Specifically, errors caught in MountVolume.NewMounter (like missing endpoints, etc.).

Current behavior for any mount failure:

```
Events:
  FirstSeen    LastSeen    Count    From            SubObjectPath    Type        Reason        Message
  ---------    --------    -----    ----            -------------    --------    ------        -------
  12m        12m        1    default-scheduler            Normal        Scheduled    Successfully assigned glusterfs-bb-pod1 to 127.0.0.1
  10m        1m        5    kubelet, 127.0.0.1            Warning        FailedMount    Unable to mount volumes for pod "glusterfs-bb-pod1_default(67c9dfa7-f9f5-11e6-aee2-5254003a59cf)": timeout expired waiting for volumes to attach/mount for pod "default"/"glusterfs-bb-pod1". list of unattached/unmounted volumes=[glusterfsvol]
  10m        1m        5    kubelet, 127.0.0.1            Warning        FailedSync    Error syncing pod, skipping: timeout expired waiting for volumes to attach/mount for pod "default"/"glusterfs-bb-pod1". list of unattached/unmounted volumes=[glusterfsvol]
```

New Behavior:

For example on glusterfs - deliberately didn't create endpoints, now correct message is displayed:
```
Events:
  FirstSeen	LastSeen	Count	From			SubObjectPath	Type		Reason		Message
  ---------	--------	-----	----			-------------	--------	------		-------
  2m		2m		1	default-scheduler			Normal		Scheduled	Successfully assigned glusterfs-bb-pod1 to 127.0.0.1
  54s		54s		1	kubelet, 127.0.0.1			Warning		FailedMount	Unable to mount volumes for pod "glusterfs-bb-pod1_default(8edd2c25-fa09-11e6-92ae-5254003a59cf)": timeout expired waiting for volumes to attach/mount for pod "default"/"glusterfs-bb-pod1". With error timed out waiting for the condition. list of unattached/unmounted volumes=[glusterfsvol]
  54s		54s		1	kubelet, 127.0.0.1			Warning		FailedSync	Error syncing pod, skipping: timeout expired waiting for volumes to attach/mount for pod "default"/"glusterfs-bb-pod1". With error timed out waiting for the condition. list of unattached/unmounted volumes=[glusterfsvol]
  2m		6s		814	kubelet, 127.0.0.1			Warning		FailedMount	MountVolume.NewMounter failed for volume "kubernetes.io/glusterfs/8edd2c25-fa09-11e6-92ae-5254003a59cf-glusterfsvol" (spec.Name: "glusterfsvol") pod "8edd2c25-fa09-11e6-92ae-5254003a59cf" (UID: "8edd2c25-fa09-11e6-92ae-5254003a59cf") with: endpoints "glusterfs-cluster" not found
```
2017-03-24 15:10:33 -07:00
Hemant Kumar
786da1de12 Implement bulk polling of volumes
This implements bulk volume polling using ideas presented by
Justin in https://github.com/kubernetes/kubernetes/pull/39564

But it changes the implementation to use an interface
and doesn't affect other implementations.
2017-03-02 14:59:59 -05:00
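
The bulk-polling change above replaces per-volume attachment checks with one call that verifies a whole node-to-volumes map at once, behind an optional interface so providers that don't implement it are unaffected. A sketch of that shape (the interface and method names are illustrative, not the exact ones in the tree):

```go
package main

import "fmt"

type VolumeID string
type NodeName string

// BulkVolumeVerifier is the optional capability: providers that can answer
// "which of these volumes are attached to which nodes?" in one API call
// implement it; everyone else keeps the one-request-per-volume path.
type BulkVolumeVerifier interface {
	BulkVerifyVolumes(volumesByNode map[NodeName][]VolumeID) (map[NodeName]map[VolumeID]bool, error)
}

func verifyAttached(verifier interface{}, volumesByNode map[NodeName][]VolumeID) {
	if bv, ok := verifier.(BulkVolumeVerifier); ok {
		attached, err := bv.BulkVerifyVolumes(volumesByNode)
		if err == nil {
			fmt.Println("bulk result:", attached)
			return
		}
	}
	fmt.Println("falling back to per-volume checks")
}

type fakeCloud struct{}

func (fakeCloud) BulkVerifyVolumes(v map[NodeName][]VolumeID) (map[NodeName]map[VolumeID]bool, error) {
	out := map[NodeName]map[VolumeID]bool{}
	for node, vols := range v {
		out[node] = map[VolumeID]bool{}
		for _, vol := range vols {
			out[node][vol] = true // pretend everything is still attached
		}
	}
	return out, nil
}

func main() {
	verifyAttached(fakeCloud{}, map[NodeName][]VolumeID{"node-1": {"vol-1", "vol-2"}})
}
```
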
Scott Creeley
762ca8e8a9 adding some debug 2017-03-01 13:30:21 -05:00
Hemant Kumar
2d3008fc56 Implement support for mount options in PVs
Add support for mount options via annotations on PVs
2017-03-01 11:50:40 -05:00
Tomas Smetana
58edea18de Remove unused method from operation_generator 2017-03-01 10:42:53 +01:00
James Ravn
9992bd23c2 Mark detached only if no pending operations
To safely mark a volume detached when the volume controller manager is used.

An example of one such problem:

1. pod is created, volume is added to desired state of the world
2. reconciler process starts
3. reconciler starts MountVolume, which is kicked off asynchronously via
   operation_executor.go
4. MountVolume mounts the volume, but hasn't yet marked it as mounted
5. pod is deleted, volume is removed from desired state of the world
6. reconciler detects volume is no longer in desired state of world,
   removes it from volumes in use
7. MountVolume tries to mark volume in use, throws an error because
   volume is no longer in actual state of world list.
8. controller-manager tries to detach the volume, this fails because it
   is still mounted to the OS.
9. EBS gets stuck indefinitely in busy state trying to detach.
2017-02-13 11:51:44 +00:00
Luca Bruno
85b1def175
test: update to use mounttest:0.8 and mounttest-user:0.5 2017-02-02 20:41:18 +00:00
deads2k
8a12000402 move client/record 2017-01-31 19:14:13 -05:00
Dr. Stefan Schimanski
a0137e9b28 Update generated files 2017-01-25 19:49:45 +01:00
Dr. Stefan Schimanski
d7eb3b6870 pkg/util: move uuid and strategicpatch into k8s.io/apimachinery 2017-01-25 19:45:09 +01:00
Kubernetes Submit Queue
f56b606985 Merge pull request #36520 from apelisse/owners-pkg-volume
Automatic merge from submit-queue

Curating Owners: pkg/volume

cc @jsafrane @spothanis @agonzalezro @justinsb @johscheuer @simonswine @nelcy @pmorie @quofelix @sdminonne @thockin @saad-ali @rootfs

In an effort to expand the existing pool of reviewers and establish a
two-tiered review process (first someone lgtms and then someone
experienced in the project approves), we are adding new reviewers to
existing owners files.


If You Care About the Process:
------------------------------

We did this by algorithmically figuring out who’s contributed code to
the project and in what directories.  Unfortunately, that doesn’t work
well: people who have made mechanical code changes (e.g. change the
copyright header across all directories) end up as reviewers in lots of
places.

Instead of using pure commit data, we generated an excessively large
list of reviewers and pruned based on all time commit data, recent
commit data and review data (number of PRs commented on).

At this point we have a decent list of reviewers, but it needs one last
pass for fine tuning.

Also, see https://github.com/kubernetes/contrib/issues/1389.

TLDR:
-----

As an owner of a sig/directory and a leader of the project, here’s what
we need from you:

1. Use PR https://github.com/kubernetes/kubernetes/pull/35715 as an example.

2. The pull-request is made editable, please edit the `OWNERS` file to
remove the names of people that shouldn't be reviewing code in the
future in the **reviewers** section. You probably do NOT need to modify
the **approvers** section. Names are sorted by relevance, using some
secret statistics.

3. Notify me if you want some OWNERS file to be removed.  Being an
approver or reviewer of a parent directory makes you a reviewer/approver
of the subdirectories too, so not all OWNERS files may be necessary.

4. Please use ALIAS if you want to use the same list of people over and
over again (don't hesitate to ask me for help, or use the pull-request
above as an example)
2017-01-17 19:56:39 -08:00
Clayton Coleman
9a2a50cda7
refactor: use metav1.ObjectMeta in other types 2017-01-17 16:17:19 -05:00
deads2k
77b4d55982 mechanical 2017-01-16 09:35:12 -05:00
deads2k
6a4d5cd7cc start the apimachinery repo 2017-01-11 09:09:48 -05:00
Jeff Grafton
20d221f75c Enable auto-generating sources rules 2017-01-05 14:14:13 -08:00
Kubernetes Submit Queue
fd7408d076 Merge pull request #39288 from rkouj/unit-test-operation-executor
Automatic merge from submit-queue

Add unit tests for operation_executor

Add unit test for `Unmount operations should start in parallel for all volume plugins`

cc: @saad-ali
2017-01-04 18:52:22 -08:00
rkouj
7ebcab8c19 Add unit tests for operation_executor 2016-12-29 18:37:22 -08:00
Mike Danese
161c391f44 autogenerated 2016-12-29 13:04:10 -08:00
rkouj
e7e3c55ad7 Add unit tests for MountVolume() of operation executor 2016-12-27 16:07:06 -08:00
rkouj
d5f7610b82 Refactor operation_executor to make it unit testable 2016-12-27 15:12:16 -08:00
Harry Zhang
5a7661b483 Raise markVolMountedErr instead of mount err 2016-12-26 07:52:51 +00:00
Chao Xu
03d8820edc rename /release_1_5 to /clientset 2016-12-14 12:39:48 -08:00
Mike Danese
c87de85347 autoupdate BUILD files 2016-12-12 13:30:07 -08:00
Wojciech Tyczynski
aa7da5231f Update bazel files 2016-12-09 09:42:02 +01:00
Wojciech Tyczynski
e8d1cba875 GetOptions in client calls 2016-12-09 09:42:01 +01:00
Jing Xu
bb8b54af18 Fix unmountDevice issue caused by shared mount in GCI
This is a fix on top of #38124. In this fix, we move the logic that filters
out shared mount references into operation_executor's UnmountDevice
function, to avoid it being used by other volume types such as
rbd, azure, etc. This filter function should only be needed during
unmount device for the GCI image.
2016-12-08 13:34:45 -08:00
Jess Frazelle
4d27212149
fix golint errors on master for 1.6
Signed-off-by: Jess Frazelle <acidburn@google.com>
2016-12-05 15:01:33 -08:00
Kubernetes Submit Queue
e94118411c Merge pull request #36900 from vwfs/volume_reconciler_verbosity
Automatic merge from submit-queue

Reduce verbosity of volume reconciler

**What this PR does / why we need it**:
It reduces the log verbosity for attaching of volumes

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:
```release-note
Reduce verbosity of volume reconciler when attaching volumes
```

Set the logging level for information about attaching of volumes from 1 to 4.
Otherwise the log is spammed with one line per 100ms while attaching is
in progress, and afterwards for as long as the volume is attached.
2016-11-28 11:42:10 -08:00
Chao Xu
bcc783c594 run hack/update-all.sh 2016-11-23 15:53:09 -08:00
Chao Xu
bb675d395f dependencies: pkg/volume 2016-11-23 15:53:09 -08:00
Alexander Block
1c35e3c275 Add missing nodeName parameter to log call 2016-11-19 13:56:38 +01:00
Rajat Ramesh Koujalagi
d81e216fc6 Better messaging for missing volume components on host to perform mount 2016-11-09 15:16:11 -08:00
Antoine Pelisse
fd510b1207 Update OWNERS approvers and reviewers: pkg/volume 2016-11-09 10:17:36 -08:00
Kubernetes Submit Queue
00269a6c60 Merge pull request #35434 from rootfs/deviceopen
Automatic merge from submit-queue

refactor DeviceOpened() so it won't return error if device doesn't exist

**What this PR does / why we need it**:
DeviceOpened() is called after the device is unmounted but before it is detached. Some volumes such as rbd don't support third-party detach; they have to be detached during unmount. Once detached, the device path vanishes. This causes a false alarm when DeviceOpened() is called.

The fix is to ignore IsNotExist errors.

**Which issue this PR fixes** _(optional, in `fixes #<issue number>(, #<issue_number>, ...)` format, will close that issue when PR gets merged)_: fixes #

**Special notes for your reviewer**:
@kubernetes/sig-storage 

**Release note**:

``` release-note
```

Signed-off-by: Huamin Chen hchen@redhat.com
2016-11-04 07:40:03 -07:00
Huamin Chen
901e084a98 Check that the device path is valid before calling DeviceOpened() to avoid false negatives on devices that no longer exist
Signed-off-by: Huamin Chen <hchen@redhat.com>
2016-11-03 15:02:15 -04:00
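
The fix described in the two entries above is simply to treat a vanished device path as "not opened" rather than an error, since plugins like rbd remove the device during unmount. A stdlib sketch of that guard (the surrounding logic is a placeholder):

```go
package main

import (
	"fmt"
	"os"
)

// deviceOpened reports whether the device at path is still in use.
// A missing path is not an error: for plugins that detach during unmount,
// the device node is expected to disappear.
func deviceOpened(path string) (bool, error) {
	if _, err := os.Stat(path); err != nil {
		if os.IsNotExist(err) {
			return false, nil // device is gone; nothing can be holding it open
		}
		return false, err
	}
	// ... a real implementation would check open handles / the mount table here ...
	return false, nil
}

func main() {
	opened, err := deviceOpened("/dev/does-not-exist")
	fmt.Println(opened, err) // false <nil>
}
```
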
Jing Xu
abbde43374 Add sync state loop in master's volume reconciler
At the master volume reconciler, the information about which volumes are
attached to nodes is cached in the actual state of the world. However, this
information might be out of date in case the node is terminated (the volume
is detached automatically). In this situation, the reconciler assumes the volume
is still attached and will not issue an attach operation when the node comes
back. Pods created on those nodes will fail to mount.

This PR adds the logic to periodically sync up the truth for attached volumes kept in the actual state cache. If the volume is no longer attached to the node, the actual state will be updated to reflect the truth. In turn, reconciler will take actions if needed.

To avoid issuing many concurrent operations on the cloud provider, this PR
adds a batch operation to check whether a list of volumes is
attached to the node instead of making one request per volume.

More details are explained in PR #33760
2016-10-28 09:24:53 -07:00
Jing Xu
b02481708a Fix volume states out of sync problem after kubelet restarts
When kubelet restarts, all the information about the volumes will be
gone from the actual/desired states. When updating node status with mounted
volumes, the volume list might be empty although there are still volumes
mounted, which in turn causes the master to detach those volumes since
they are not in the mounted volumes list. This fix makes sure the
mounted volumes list is only updated after the reconciler starts its sync states process.
This sync states process scans the existing volume directories and
reconstructs the actual state if entries are missing.

This PR also fixes a problem with orphaned pods' directories. In
case the pod directory is unmounted but has not yet been deleted (e.g.,
interrupted by a kubelet restart), the clean-up routine will delete the
directory so that the pod directory can be cleaned up (it is safe to
delete the directory since it is no longer mounted).

The third issue this PR fixes is that when reconstructing a volume in the
actual state, the mounter cannot be nil since it is required for creating
container.VolumeMap. If it is nil, it might cause a nil pointer exception
in the kubelet.

Details are in proposal PR #33203
2016-10-25 12:29:12 -07:00
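
Reconstruction, as described above, works by scanning the on-disk pod volume directories under the kubelet root and turning each one back into an actual-state entry. A rough sketch of that scan (paths and layout simplified; the real code also consults each plugin to rebuild a volume spec):

```go
package main

import (
	"fmt"
	"path/filepath"
)

type reconstructedVolume struct {
	PodUID     string
	PluginName string
	VolumeName string
	MountPath  string
}

// scanPodVolumeDirs walks <root>/pods/<uid>/volumes/<plugin>/<volume> and
// returns one entry per directory found, which the reconciler can then
// either re-add to the actual state or clean up.
func scanPodVolumeDirs(kubeletRoot string) ([]reconstructedVolume, error) {
	pattern := filepath.Join(kubeletRoot, "pods", "*", "volumes", "*", "*")
	dirs, err := filepath.Glob(pattern)
	if err != nil {
		return nil, err
	}
	var out []reconstructedVolume
	for _, dir := range dirs {
		volume := filepath.Base(dir)
		plugin := filepath.Base(filepath.Dir(dir))
		podUID := filepath.Base(filepath.Dir(filepath.Dir(filepath.Dir(dir))))
		out = append(out, reconstructedVolume{
			PodUID: podUID, PluginName: plugin, VolumeName: volume, MountPath: dir,
		})
	}
	return out, nil
}

func main() {
	vols, _ := scanPodVolumeDirs("/var/lib/kubelet")
	fmt.Println(vols)
}
```
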
Mike Danese
3b6a067afc autogenerated 2016-10-21 17:32:32 -07:00
Jedrzej Nowak
f0988b95e7 Typos and englishify pkg/volume 2016-10-03 22:39:33 +02:00
Justin Santa Barbara
54195d590f Use strongly-typed types.NodeName for a node name
We had another bug where we confused the hostname with the NodeName.

To avoid this happening again, and to make the code more
self-documenting, we use types.NodeName (a typedef alias for string)
whenever we are referring to the Node.Name.

A tedious but mechanical commit therefore, to change all uses of the
node name to use types.NodeName

Also clean up some of the (many) places where the NodeName is referred
to as a hostname (not true on AWS), or an instanceID (not true on GCE),
etc.
2016-09-27 10:47:31 -04:00
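
The typedef itself is trivial, but it makes the compiler catch hostname/node-name mix-ups instead of leaving them to be found at runtime. A small illustration of the idea:

```go
package main

import "fmt"

// NodeName is a distinct type, so a plain hostname string can no longer be
// passed where a node name is expected without an explicit conversion.
type NodeName string

func detachVolume(volumeID string, node NodeName) {
	fmt.Printf("detaching %s from node %s\n", volumeID, node)
}

func main() {
	hostname := "ip-10-0-0-1.ec2.internal" // on AWS this is NOT necessarily the node name
	// detachVolume("vol-1", hostname)     // compile error: cannot use string as NodeName
	detachVolume("vol-1", NodeName(hostname)) // the conversion is now a visible, deliberate act
}
```
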
Jing Xu
efaceb28cc Fix race condition in updating attached volume between master and node
This PR tries to fix issue #29324. The cause of this issue is a race
condition that happens when marking volumes as attached for node status. This
PR tries to clean up the logic of when and where to mark volumes as
attached/detached. Basically the workflow is as follows:
1. When a volume is attached successfully, the volume and node info is
added into nodesToUpdateStatusFor to mark the volume as attached to the
node.
2. When a detach request comes in, it will check whether it is safe to
detach now. If the check passes, remove the volume from volumesToReportAsAttached
to indicate the volume is no longer considered attached.
Afterwards, the reconciler tries to update the node status and trigger the detach
operation. If any of these operations fails, the volume is added back to
the volumesToReportAsAttached list, showing that it is still attached.

These steps should make sure that the kubelet gets the right (though possibly
outdated) information about which volumes are attached. It also
guarantees that if a detach operation is pending, the kubelet will not
trigger any mount operations.
2016-09-12 13:51:08 -07:00
Jing Xu
b9157b7524 Post event message for volume attachment
This PR adds an event message when attaching a volume fails, to help
users debug. Detach failures may be addressed in a different PR since
they require more data structure changes.
2016-09-01 16:24:36 -07:00
Kubernetes Submit Queue
6ce405c6ee Merge pull request #27778 from screeley44/k8-vol-executor
Automatic merge from submit-queue

Add Events for operation_executor to show status of mounts, failed/successful to show in describe events

Fixes #27590 
@saad-ali @pmorie @erinboyd

After talking with @pmorie last week about the above issue, I decided to poke around and see if I could remedy it. The refactoring broke my previously merged UXP PRs that correctly showed failed mount errors in the describe events. I'm not sure I implemented this correctly, but it tested out and seems to be working; let me know what I missed or if this is not the correct approach.

```
Events:
  FirstSeen	LastSeen	Count	From			SubobjectPath	Type		Reason		Message
  ---------	--------	-----	----			-------------	--------	------		-------
  2m		2m		1	{default-scheduler }			Normal		Scheduled	Successfully assigned nfs-bb-pod1 to 127.0.0.1
  44s		44s		1	{kubelet 127.0.0.1}			Warning		FailedMount	Unable to mount volumes for pod "nfs-bb-pod1_default(a94f64f1-37c9-11e6-9aa5-52540073d346)": timeout expired waiting for volumes to attach/mount for pod "nfs-bb-pod1"/"default". list of unattached/unmounted volumes=[nfsvol]
  44s		44s		1	{kubelet 127.0.0.1}			Warning		FailedSync	Error syncing pod, skipping: timeout expired waiting for volumes to attach/mount for pod "nfs-bb-pod1"/"default". list of unattached/unmounted volumes=[nfsvol]
  38s		38s		1	{kubelet }				Warning		FailedMount	Unable to mount volumes for pod "a94f64f1-37c9-11e6-9aa5-52540073d346": Mount failed: exit status 32
Mounting arguments: nfs1.rhs:/opt/data99 /var/lib/kubelet/pods/a94f64f1-37c9-11e6-9aa5-52540073d346/volumes/kubernetes.io~nfs/nfsvol nfs []
Output: mount.nfs: Connection timed out

Resolution hint: Check and make sure the NFS Server exists (ensure that correct IPAddress/Hostname was given) and is available/reachable.
Also make sure firewall ports are open on both client and NFS Server (2049 v4 and 2049, 20048 and 111 for v3).
Use commands telnet <nfs server> <port> and showmount <nfs server> to help test connectivity.
```
2016-08-19 08:27:48 -07:00
Scott Creeley
782d7d9815 Add Events for operation_executor to show status of mounts, failed or successful 2016-08-17 09:53:47 -04:00
saadali
0c72568247 Skip safe to detach if node api obj doesn't exist 2016-08-16 21:30:51 -07:00
Jing Xu
f19a1148db This change supports robust kubelet volume cleanup
Currently kubelet volume management works on the concept of desired
and actual worlds of state. The volume manager periodically compares the
two worlds and performs volume mount/unmount and/or attach/detach
operations. When kubelet restarts, the caches of those two worlds are
gone. Although the desired world can be recovered through the apiserver, the actual
world cannot be recovered, which may mean some volumes cannot be cleaned
up if their information is deleted by the apiserver. This change adds
reconstruction of the actual world by reading the pod directories from
disk. The reconstructed volume information is added to both the desired
world and the actual world if it cannot be found in either world. The rest of the
logic is the same as before: the desired world populator may clean up
the volume entry if it is no longer in the apiserver, and then the volume
manager should invoke unmount to clean it up.
2016-08-15 11:29:15 -07:00
Paul Morie
c884297990 Fix collisions issues / timeouts for mounts
For non-attachable volumes, do not call GetVolumeName on the plugin and instead
generate a unique name based on the identity of the pod and the name of the volume
within the pod.
2016-07-27 17:53:50 -04:00
saadali
88d495026d Allow mounts to run in parallel for non-attachable
Allow mount volume operations to run in parallel for non-attachable
volume plugins.

Allow unmount volume operations to run in parallel for all volume
plugins.
2016-07-19 21:54:26 -07:00
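
The parallelism change above comes down to how pending operations are keyed: attachable volumes serialize on the volume name alone, while non-attachable volumes (and all unmounts) key on volume plus pod, so independent pods can proceed in parallel. A simplified sketch of that keying (the real code lives in the nested pending operations / goroutinemap machinery; names here are stand-ins):

```go
package main

import (
	"fmt"
	"sync"
)

type operationKey struct {
	volumeName string
	podName    string // empty => the operation serializes per volume
}

type pendingOperations struct {
	mu      sync.Mutex
	pending map[operationKey]bool
}

// tryStart returns true if no conflicting operation is already running.
func (p *pendingOperations) tryStart(volumeName, podName string, attachable bool) bool {
	key := operationKey{volumeName: volumeName}
	if !attachable {
		key.podName = podName // non-attachable: independent pods don't conflict
	}
	p.mu.Lock()
	defer p.mu.Unlock()
	if p.pending[key] {
		return false
	}
	p.pending[key] = true
	return true
}

func main() {
	p := &pendingOperations{pending: map[operationKey]bool{}}
	fmt.Println(p.tryStart("nfs-vol", "pod-a", false)) // true
	fmt.Println(p.tryStart("nfs-vol", "pod-b", false)) // true: runs in parallel
	fmt.Println(p.tryStart("ebs-vol", "pod-a", true))  // true
	fmt.Println(p.tryStart("ebs-vol", "pod-b", true))  // false: serialized per volume
}
```
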
Cindy Wang
e13c678e3b Make volume unmount more robust using exclusive mount w/ O_EXCL 2016-07-18 16:20:08 -07:00
saadali
0dd17fff22 Reorganize volume controllers and manager 2016-07-01 18:50:25 -07:00
David McMahon
ef0c9f0c5b Remove "All rights reserved" from all the headers. 2016-06-29 17:47:36 -07:00
saadali
e06b32b1ef Mark VolumeInUse before checking if it is Attached
Ensure that the kubelet marks VolumeInUse before checking if it is Attached.
Also ensure that the attach/detach controller always fetches a fresh
copy of the node object before detach (instead of the kubelet relying on the node
informer cache).
2016-06-28 14:05:59 -07:00
saadali
dfe8e606c1 Fix device path used by volume WaitForAttach 2016-06-22 12:56:58 -07:00
saadali
e716ddc771 Controller wait for attach and exponential backoff
Modify attach/detach controller to keep track of volumes to report
attached in Node VolumeToAttach status.

Modify kubelet volume manager to wait for volume to show up in Node
VolumeToAttach status.

Implement exponential backoff for errors in volume manager and attach
detach controller
2016-06-20 18:19:55 -07:00
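
Exponential backoff here means a repeatedly failing volume operation waits twice as long before each retry, up to a cap, instead of hammering the cloud provider and API server. A self-contained sketch of the pattern (the real code tracks backoff state per volume rather than in a simple loop):

```go
package main

import (
	"errors"
	"fmt"
	"time"
)

// retryWithBackoff retries op, doubling the wait after every failure,
// capped at maxDelay.
func retryWithBackoff(op func() error, initial, maxDelay time.Duration, attempts int) error {
	delay := initial
	var err error
	for i := 0; i < attempts; i++ {
		if err = op(); err == nil {
			return nil
		}
		fmt.Printf("attempt %d failed (%v); retrying in %v\n", i+1, err, delay)
		time.Sleep(delay)
		delay *= 2
		if delay > maxDelay {
			delay = maxDelay
		}
	}
	return err
}

func main() {
	i := 0
	_ = retryWithBackoff(func() error {
		i++
		if i < 3 {
			return errors.New("attach still pending")
		}
		return nil
	}, 100*time.Millisecond, 2*time.Second, 5)
}
```
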
saadali
d72f88bf3a Modify Attach method to return device path 2016-06-19 23:54:02 -07:00
saadali
542f2dc708 Introduce new kubelet volume manager
This commit adds a new volume manager in kubelet that synchronizes
volume mount/unmount (and attach/detach, if attach/detach controller
is not enabled).

This eliminates the race conditions between the pod creation loop
and the orphaned volumes loops. It also removes the unmount/detach
from the `syncPod()` path so volume clean up never blocks the
`syncPod` loop.
2016-06-15 09:34:08 -07:00
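
The volume manager introduced above is a desired-state vs actual-state reconciler: a loop that mounts whatever is desired but not yet mounted, and unmounts whatever is mounted but no longer desired. A bare-bones sketch of that loop, with sets of volume names standing in for the real state caches:

```go
package main

import (
	"fmt"
	"time"
)

type stringSet map[string]bool

// reconcile performs one pass: mount what's missing, unmount what's stale.
// In the kubelet both sides go through the operation executor asynchronously,
// which is what keeps syncPod from ever blocking on volume cleanup.
func reconcile(desired, actual stringSet) {
	for vol := range desired {
		if !actual[vol] {
			fmt.Println("mounting", vol)
			actual[vol] = true
		}
	}
	for vol := range actual {
		if !desired[vol] {
			fmt.Println("unmounting", vol)
			delete(actual, vol)
		}
	}
}

func main() {
	desired := stringSet{"pv-logs": true, "pv-data": true}
	actual := stringSet{"pv-old": true}
	for i := 0; i < 2; i++ { // the real reconciler runs this on a timer
		reconcile(desired, actual)
		time.Sleep(10 * time.Millisecond)
	}
	fmt.Println("actual state:", actual)
}
```
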