Commit Graph

1490 Commits

Author SHA1 Message Date
Kubernetes Prow Robot
a331cf74b7 Merge pull request #102050 from pohly/fix/deflake-metrics-proxy
remove metrics proxy
2021-07-02 12:30:10 -07:00
ravisantoshgudimetla
67bc23411b [storage] [test] Ensure proper resource creation
Ensure resources are created in a zone with schedulable
nodes. For example, if we have 4 zones with 3 zones
having worker nodes and 1 zone having master nodes (unschedulable
for workloads), we should not create resources like PV, PVC or
pods in that zone.
2021-06-30 18:01:57 -04:00
Kubernetes Prow Robot
7ad7c0757a Merge pull request #103160 from ravisantoshgudimetla/fix-ubernetes-tests
Run ubernetes tests on gke only
2021-06-29 18:29:14 -07:00
ravisantoshgudimetla
c65b80a637 [storage] [test] Remove extra zone test
We're running ubernetes tests
`should only be allowed to provision PDs in zones
where nodes exist`
on GCP and GKE. While the test is useful for exercising
the scenario of identifying an extra zone and
creating a node in it, not every Kubernetes
distribution uses the same approach to create a node.
Furthermore, even if there is an extra zone, we cannot
guarantee that the zone has enough quota. There can also
be other GCP-specific edge cases that cannot all be
covered within this test. So, remove the test,
as agreed upon with the storage team.
2021-06-29 12:52:58 -04:00
Kubernetes Prow Robot
756203fda0 Merge pull request #102576 from dobsonj/101911
kubelet: do not call RemoveAll on volumes directory for orphaned pods
2021-06-29 06:54:40 -07:00
Kubernetes Prow Robot
d47448cc85 Merge pull request #103114 from learner0810/fix/remove-errors-pkg
update to remove github.com/pkg/errors
2021-06-25 12:13:11 -07:00
Kubernetes Prow Robot
7a9c8940fd Merge pull request #102682 from chendave/e2e_storage
Check empty zone after the validation of providers
2021-06-24 18:56:44 -07:00
Kubernetes Prow Robot
164ce31e7f Merge pull request #103074 from wongma7/volumelimitsmigration
Disable deprecated volume limits test when CSI migration enabled and replace with CSI-specific test
2021-06-24 05:41:53 -07:00
lzj
7cf30d5690 fix/remove-errors-pkg 2021-06-24 16:09:59 +08:00
Matthew Wong
cd318aec0b Disable deprecated volume limits test when migration enabled and replace with CSI-specific test 2021-06-22 11:31:00 -07:00
Mauricio Poppe
7aa8a497df replaced usage of powershell commands with linux commands 2021-06-22 18:27:55 +00:00
Mauricio Poppe
f0d7e9c9d1 undo windows compatibility changes because the linux commands work 2021-06-22 18:27:54 +00:00
Mauricio Poppe
9a146bc3e9 Remove [LinuxOnly] string from ginkgo tests 2021-06-22 18:27:54 +00:00
Mauricio Poppe
54070a6628 Enable another test in windows, use a /bin/sh command instead of a PS one 2021-06-22 18:27:53 +00:00
Mauricio Poppe
ee34cd640a enable gcpdcsi multivolume tests with windows nodes 2021-06-22 18:27:50 +00:00
Kubernetes Prow Robot
7f4abd897e Merge pull request #102414 from divyenpatel/use-ga-topology-labels-for-vsphere
Update vSphere volume topology label to GA
2021-06-21 18:13:57 -07:00
Divyen Patel
518844fd25 use GA topology labels for vsphere 2021-06-21 10:37:31 -07:00
Kubernetes Prow Robot
513ae557a3 Merge pull request #103002 from TeweiLuo/remove-unmanaged-zone-tests
Remove gcepd unmanaged zone test.
2021-06-20 23:44:51 -07:00
tewei.luo
72a53288a5 Remove gcepd unmanaged zone test.
This test verifies an implementation detail in the in-tree gcepd
plugin. The behavior is not implemented in the gcepd CSI driver
and therefore the test will be obsolete after CSI migration.
2021-06-19 04:13:11 +00:00
Kubernetes Prow Robot
33c4383985 Merge pull request #102657 from pohly/subpath-test-flake
storage e2e: extend timeouts for subpath restart tests
2021-06-18 12:12:05 -07:00
Kubernetes Prow Robot
4afb72a863 Merge pull request #100183 from jsafrane/fix-unstage-retry
Mark volume as uncertain after Unmount* fails
2021-06-18 11:04:06 -07:00
Kubernetes Prow Robot
cda98f20b2 Merge pull request #102918 from jsafrane/cloning-topology
Run volume cloning tests in the same topology segment
2021-06-16 12:48:32 -07:00
Jan Safranek
54ad7e40f1 Add e2e test for NodeUnstage error cases 2021-06-16 18:39:03 +02:00
Jan Safranek
057422504a Run volume cloning tests in the same topology segment
Some CSI drivers can't clone a volume into another topology segment (e.g. a
cloud availability zone). The scheduler does not know about these
restrictions and schedules pods with PVCs that clone a volume mostly
randomly.

Run all volume cloning tests in the same topology segment, if such a segment
is available and has at least one schedulable node.
2021-06-16 15:41:33 +02:00
Patrick Ohly
5e9076da93 e2e: grab controller and scheduler metrics via port forwarding
The previous approach of grabbing via an nginx proxy had some
drawbacks:
- it did not work when the pods only listened on localhost (as
  configured by kubeadm) and the proxy got deployed on a different
  node
- starting the proxy raced with starting the pods, causing
  sporadic test failures because the proxy was not set up
  properly unless it saw all pods when starting the e2e.test
- the proxy was always started, whether it is needed or not
- the proxy was left running after a test and then the next
  test run triggered potentially confusing messages when
  it failed to create objects for the proxy

The new approach is similar to "kubectl port-forward" + "kubectl get
--raw". It uses the port forwarding feature to establish a TCP
connection via a custom dialer, then lets client-go handle TLS and
credentials.

Somehow verifying the server certificate did not work. As this
shouldn't be a big concern for E2E testing, certificate checking gets
disabled on the client side instead of investigating this further.
2021-06-16 12:02:40 +02:00
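The general pattern in the commit above (a custom dialer supplies the TCP connection, the HTTP client handles TLS, and certificate verification is switched off) can be sketched with the Go standard library alone. This is not the e2e framework code: `fetchViaDialer` and `metricsViaStandIn` are hypothetical names, the URL is made up, and a local `httptest` TLS server stands in for the port-forwarded pod connection.

```go
package main

import (
	"context"
	"crypto/tls"
	"fmt"
	"io"
	"net"
	"net/http"
	"net/http/httptest"
)

// fetchViaDialer issues a GET for url, but every TCP connection is
// opened by dial instead of by resolving the host named in url.
// TLS runs on top of that connection, with certificate checking disabled.
func fetchViaDialer(url string, dial func(ctx context.Context) (net.Conn, error)) (string, error) {
	client := &http.Client{
		Transport: &http.Transport{
			DialContext: func(ctx context.Context, network, addr string) (net.Conn, error) {
				return dial(ctx) // addr is ignored on purpose
			},
			// Acceptable only for testing, as in the commit message.
			TLSClientConfig: &tls.Config{InsecureSkipVerify: true},
		},
	}
	resp, err := client.Get(url)
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()
	body, err := io.ReadAll(resp.Body)
	return string(body), err
}

// metricsViaStandIn uses a local TLS test server as a stand-in for the
// stream that port forwarding would provide in a real cluster.
func metricsViaStandIn() (string, error) {
	srv := httptest.NewTLSServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		fmt.Fprint(w, "demo_metric 1\n")
	}))
	defer srv.Close()
	target := srv.Listener.Addr().String()
	dial := func(ctx context.Context) (net.Conn, error) {
		var d net.Dialer
		return d.DialContext(ctx, "tcp", target)
	}
	// The host in the URL never gets resolved; the dialer decides where to connect.
	return fetchViaDialer("https://pod.invalid:10257/metrics", dial)
}

func main() {
	out, err := metricsViaStandIn()
	if err != nil {
		panic(err)
	}
	fmt.Print(out)
}
```

In the real tests the dialer would return a connection backed by the Kubernetes port-forwarding feature rather than a plain local TCP connection.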
Jan Safranek
3e0269ce6e Move common code to ensureTopologyRequirements
Every call to ensureTopologyRequirements has the same setup + checks.
Therefore move this common code into the call to reduce redundancy.
2021-06-16 11:24:29 +02:00
Kubernetes Prow Robot
6298e1fe6a Merge pull request #102895 from jingxu97/june/skipmatrics
Skip migrated metrics for Windows CSI tests
2021-06-15 19:50:11 -07:00
Jing Xu
8fb909291f Skip migrated metrics for Windows CSI tests
The Windows CSI migration tests are failing due to issue https://github.com/kubernetes/kubernetes/issues/102893

Change-Id: I8c4c1b376a915f1a0982a9be5e35960886cc77bf
2021-06-15 15:21:35 -07:00
Jordan Liggitt
ff2c614245 Switch to non-deprecated timestamppb.Now() 2021-06-15 09:53:06 -04:00
Jordan Liggitt
1134456c89 Fix CSI mock driver to get marshalable grpc error 2021-06-15 09:53:06 -04:00
Jan Safranek
29aa4c0ee8 Check content of volumes in snapshot/clone multivolume tests
Check that both original and its restored snapshot / clone have the same
content during the test.
2021-06-10 14:22:00 +02:00
Jan Safranek
68f139548b Remove unused parameters from TestConcurrentAccessToRelatedVolumes 2021-06-10 10:41:47 +02:00
Kubernetes Prow Robot
a3abd06ad5 Merge pull request #102690 from tsmetana/fix-e2e-csi-mock-pv-leak
CSI e2e: stop leaking pvs in CSI mock snapshot test
2021-06-09 03:05:47 -07:00
Jonathan Dobson
484eb01822 kubelet: do not call RemoveAll on volumes directory for orphaned pods 2021-06-08 13:57:35 -06:00
Patrick Ohly
528baa09f6 e2e storage: disable health-monitor controller in hostpath deployment
This reverts commit
c15fd76ee9. Most (all?) of the hostpath
tests and several other tests started to fail again in
gce-scale-master-correctness after re-enabling the controller. This
shows that it was not just the obsolete agent that caused scalability
problems, but also the controller.

It has to be disabled until the scalability problems are addressed.
2021-06-08 20:27:05 +02:00
Kubernetes Prow Robot
cc7721362c Merge pull request #102665 from gnufied/add-online-expansion-cap
Add explicit capability for online volume expansion
2021-06-08 08:33:36 -07:00
Tomas Smetana
3968ee532d CSI e2e: stop leaking pvs in CSI mock snapshot test 2021-06-08 14:14:44 +02:00
Dave Chen
8d0936b743 Check empty zone after the validation of providers
This makes sure that test cases which cannot run locally are skipped
instead of failing with a misleading message.

Signed-off-by: Dave Chen <dave.chen@arm.com>
2021-06-08 13:51:21 +08:00
Kubernetes Prow Robot
588d984407 Merge pull request #102538 from jsafrane/test-xfs
Add e2e test for a volume + its clone used on the same node
2021-06-07 17:00:03 -07:00
Hemant Kumar
95c8b02096 Add explicit capability for online volume expansion 2021-06-07 13:43:18 -04:00
Jan Safranek
28511e82ad Add e2e test for a volume + its clone used on the same node
CSI drivers need to pass special mount options to the XFS filesystem to be able to
mount a volume + its clone or its restored snapshot on the same node. Add a
test to exhibit this behavior.

The test is optional for now, giving CSI drivers time to fix it.
2021-06-07 18:29:47 +02:00
Patrick Ohly
2067b69b92 storage e2e: extend timeouts for subpath restart tests
These tests have been flaky for a long time, with a relatively low
rate of flakes. Nonetheless it seems better to extend the timeouts to
reduce the flakiness.
2021-06-07 11:39:17 +02:00
Patrick Ohly
c15fd76ee9 e2e storage: enable health-check controller in hostpath deployment
It was disabled together with the agent to avoid test failures in
gce-master-scale-correctness (https://github.com/kubernetes/kubernetes/issues/102452). That
solved the problem, but we still need to check whether the controller
alone works.
2021-06-05 18:16:19 +02:00
Patrick Ohly
c26c423b1c storage e2e: disable health check containers
They are not needed for any of the tests and may be causing too much
overhead (see
https://github.com/kubernetes/kubernetes/issues/102452#issuecomment-854452816).

We already disabled them earlier and then re-enabled them again
because it wasn't clear how much overhead they were causing. A recent
change in how the sidecars get
deployed (https://github.com/kubernetes/kubernetes/pull/102282) seems
to have made the situation worse again. There's no logical explanation
for that yet, though.

(cherry picked from commit 0c2cee5676e64976f9e767f40c4c4750a8eeb11f)
2021-06-04 09:57:02 +02:00
Patrick Ohly
53af002742 storage e2e: capture driver pod changes, including all events
As seen in https://github.com/kubernetes/kubernetes/issues/102452, we
currently don't have pod events for the CSI driver pods because of the
different namespace and would need them to determine whether the
driver gets evicted.

Previously, only changes of the pods were logged. Perhaps even more
interesting are events in the namespace.
2021-06-02 17:37:03 +02:00
Kubernetes Prow Robot
c4df930252 Merge pull request #102292 from tsmetana/fix-local-pv-e2e-pv-leak
Local PV e2e: fix leaked local volumes
2021-05-31 08:28:26 -07:00
Kubernetes Prow Robot
31f6cca256 Merge pull request #102282 from pohly/csi-driver-update
storage e2e: use csi-driver-host-path v1.7.2 in single pod
2021-05-26 13:46:27 -07:00
Kubernetes Prow Robot
3e9c865d9d Merge pull request #100531 from sjenning/storage-e2e-reduce-pod-count
e2e: reduce pods used in shared local pv test
2021-05-26 12:07:37 -07:00
Patrick Ohly
4acb6a865c storage e2e: use csi-driver-host-path v1.7.2 in single pod
The new default deployment in that release puts sidecars into the same
pod as the driver. This is expected to reduce load during testing.
2021-05-26 09:07:46 +02:00
Tomas Smetana
2c79f52de9 Local PV e2e: fix leaked local volumes 2021-05-25 16:24:30 +02:00