Commit Graph

22546 Commits

Author SHA1 Message Date
Jonathan Dobson
5d1725f17b CSI Inline Volumes: promote API tests to conformance 2022-10-26 07:41:01 -06:00
Artur Żyliński
9f31669a53 New histogram: Pod start SLI duration 2022-10-26 11:28:17 +02:00
Kubernetes Prow Robot
bb12faf15b Merge pull request #113141 from dobsonj/csi-inline-pod-test-flake
test: remove flaky pod update test in CSIInlineVolumes e2e
2022-10-25 16:26:45 -07:00
Han Kang
fa2eb8e3ea make changes suggested by sig docs
Change-Id: I793eea636e7a83da7f87b23a3aa4e7ddfaf42ec6
2022-10-25 17:08:00 -04:00
Han Kang
1e99f54bb6 adjust sizes and documentation
Change-Id: Icd89c0c5bd6fbfb616255132db602b4db5ee2fe9
2022-10-25 16:32:55 -04:00
Han Kang
eb7253dfd5 ignore golint error
Change-Id: I696ab9ae8cb47407208e5a848b5ab852f1319037
2022-10-25 16:11:19 -04:00
Han Kang
8dffb57188 add documentation about auto-generating auto-documentation
Change-Id: I6bfa6a7e8d83d211f4e52865e033b9911f607c58
2022-10-25 15:09:54 -04:00
Han Kang
325f09045f add automated documentation generation from our static analysis pipeline
Change-Id: I66a0b4fd836ad3b51873517f16eb5a73b05bf329
2022-10-25 14:44:09 -04:00
Jonathan Dobson
e377d071bb test: remove flaky pod update test in CSIInlineVolumes e2e 2022-10-25 11:55:59 -06:00
Kubernetes Prow Robot
74e33de9aa Merge pull request #113297 from claudiubelu/test-images/fixes-failing-images
test images: Fixes httpd-new image
2022-10-24 14:40:45 -07:00
Shihang Zhang
569cd70a52 track legacy service account tokens 2022-10-24 09:37:53 -07:00
Kubernetes Prow Robot
7ad4b04632 Merge pull request #113303 from claudiubelu/update-windowsservercore-cache
test images: Revert windows-servercore-cache version
2022-10-24 08:28:25 -07:00
Kubernetes Prow Robot
9b9a963a8d Merge pull request #113238 from pacoxu/storage-quantity-warning
pvc storage request warning for fractional byte value
2022-10-24 06:30:37 -07:00
Claudiu Belu
ae3885b889 test images: Revert windows-servercore-cache version
The Windows Server Core images are quite large (~2GB each), and pulling
them for multiple build jobs / E2E images is inefficient, especially if
we have to build for multiple OS versions.

The windows-servercore-cache image is meant to simply cache the Windows files
we need from the Windows Server core images, so we can pull the small cache image
instead of the entire image. It is never meant to be a promotable image,
and its version is not meant to be bumped.

The other images (e.g.: agnhost) rely on the version 1.0 images.
2022-10-23 13:50:42 -07:00
Brian McQueen
08c22d6d9a bumped version of tf-wide-deep image to 1.3 in test manifest, and removed the data download from the tf-wide-deep pod spec command 2022-10-23 10:13:13 -07:00
Claudiu Belu
fb7aaab3b9 test images: Fixes httpd-new image
httpd-2.4.48-win64-VC15.zip no longer exists. Use httpd-2.4.54-win64-VC15.zip
instead.
2022-10-23 10:07:17 -07:00
Claudiu Belu
fab91f2de2 test images: Retrigger failing images
Building multiarch images may flake if multiple images are being built
on the same host. Some of the E2E test images failed to build because of
this.
2022-10-23 09:49:57 -07:00
Paco Xu
140502af8c add warning for PVC template in statefulset and in pod ephemeral volume source 2022-10-22 09:14:03 +08:00
Mark Rossetti
7a50b719cf Adding e2e test for running Windows hostprocess containers as members of a local usergroup
Signed-off-by: Mark Rossetti <marosset@microsoft.com>
2022-10-21 17:17:44 -07:00
David Porter
efc9e73266 test: Fix e2e_node restart_test flake
In the `should correctly account for terminated pods after restart` test,
the test first creates a set of `restartNever` pods, followed by a set of
`restartAlways` pods. Both the `restartNever` and `restartAlways` pods
request an entire CPU. As a result, the `restartAlways` pods will not be
admitted if the `restartNever` pods have not terminated yet.

Depending on the timing/how fast the pods terminate, the test sometimes
passes and sometimes fails, which results in flakes. To de-flake the test,
it should wait until the `restartNever` pods enter a terminal `Succeeded`
phase before creating the `restartAlways` pods.

To do this, generalize the function `waitForPods` to accept a pod
condition (`testutils.PodRunningReadyOrSucceeded`, or
`testutils.PodSucceeded`). Also introduce a new "Succeeded" pod
condition, so the test can explicitly wait until the pods enter the
Succeeded phase.

Signed-off-by: David Porter <david@porter.me>
2022-10-21 17:14:56 -07:00
Kubernetes Prow Robot
76fe4039e5 Merge pull request #113149 from marosset/hpc-inclusterconfig-e2e
Add e2e test to validate that hostProcess containers can use inclusterclient
2022-10-21 16:21:56 -07:00
Kubernetes Prow Robot
19aaafbe10 Merge pull request #113219 from marosset/fix-hpc-e2e-metrics
HostProcess containers metrics e2e test fixes
2022-10-21 15:05:56 -07:00
Mark Rossetti
8d915ea979 Add e2e test to validate that hostProcess containers can use inclusterclient
Signed-off-by: Mark Rossetti <marosset@microsoft.com>
2022-10-21 14:15:31 -07:00
Mark Rossetti
da0b66b3fe Add e2e to validate stats for Windows hostprocess containers
Signed-off-by: Mark Rossetti <marosset@microsoft.com>
2022-10-21 10:07:38 -07:00
Kubernetes Prow Robot
4d50d28777 Merge pull request #113193 from kerthcet/cleanup/remove-unnecessary-tests
Remove unuseful multi scheduler testcases
2022-10-21 08:32:03 -07:00
Kubernetes Prow Robot
18b81513b6 Merge pull request #112025 from kerthcet/refactor/handle-scheduling-failure
Refactor schedulingCycle and bindingCycle in scheduler
2022-10-21 08:31:51 -07:00
kerthcet
f7f857814f Refactor schedulingCycle and bindingCycle in scheduler
Signed-off-by: kerthcet <kerthcet@gmail.com>
2022-10-21 13:53:18 +08:00
Kubernetes Prow Robot
2313e2b825 Merge pull request #113176 from alculquicondor/finalizer_metric
Add metric for terminated pods with tracking finalizer
2022-10-20 20:52:01 -07:00
Kubernetes Prow Robot
f3ae27f5ef Merge pull request #113213 from pohly/e2e-instrumentation-dead-code-removal
test/e2e/instrumentation/logging/utils: remove dead package
2022-10-20 18:25:14 -07:00
Shingo Omura
d59af27d69 change the username in the image that is aligned with the group name 2022-10-21 10:18:30 +09:00
David Porter
048ed7ddc0 test: Stop kubelet systemd service after node e2e
Currently, when running node e2e it's not possible to use the ginkgo `--repeat`
flag to run the test suite multiple times. Repeated runs are useful when debugging
tests and ensuring they are not flaky. When the `--repeat` flag is used, the 2nd
run of the test fails because the kubelet does not start, with a message
like:

```
Failed to start transient service unit: Unit kubelet-20221020T040841.service already exists.
```

This is because during the test startup, kubelet is started as a transient unit
file via `systemd-run`. The unit is started with the `--remain-after-exit` flag
to ensure that the unit will remain even if the kubelet is restarted. The test
suite currently uses the `systemctl kill` command to stop kubelet. This works fine for
stopping the kubelet, but on the second run, when `systemd-run` is used to start the
systemd unit again, it will fail because the unit already exists. This is because
`systemctl kill` will not delete the systemd unit; it only sends a SIGTERM signal to it.

To fix this, add `unitName` as a field to the `server` struct. When
kubelet server is constructed, set the unit name. As part of e2e test
termination, in `E2EServices.Stop()`, stop the kubelet systemd unit. By
stopping the kubelet systemd unit, systemd will delete the systemd
transient unit, allowing it to be created and started again in a
subsequent e2e run.

Signed-off-by: David Porter <david@porter.me>
2022-10-20 13:31:23 -07:00
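
A minimal sketch of the fix described in 048ed7ddc0: the server remembers its unit name and stops the unit on shutdown. The `server` type, `newServer`, `stop`, and the injectable `runner` are hypothetical names chosen for illustration and do not mirror the real e2e_node code; the only external command assumed is `systemctl stop <unit>`.

```go
package main

import (
	"fmt"
	"os/exec"
)

// runner executes an external command; it is injectable so the sketch can run without systemd.
type runner func(name string, args ...string) error

// execRunner runs the command for real via os/exec.
func execRunner(name string, args ...string) error {
	return exec.Command(name, args...).Run()
}

// server is a hypothetical stand-in for the test suite's kubelet server wrapper.
type server struct {
	unitName string // transient systemd unit name, recorded when the server is constructed
	run      runner
}

// newServer records the unit name so it is still known at shutdown time.
func newServer(unitName string, run runner) *server {
	return &server{unitName: unitName, run: run}
}

// stop stops the systemd unit instead of only signalling the process,
// so that a later run can create a unit with the same name again.
func (s *server) stop() error {
	if s.unitName == "" {
		return nil // not started through systemd, nothing to stop
	}
	return s.run("systemctl", "stop", s.unitName)
}

func main() {
	// A fake runner keeps the example runnable on machines without systemd;
	// pass execRunner instead to invoke systemctl for real.
	fake := func(name string, args ...string) error {
		fmt.Println("would run:", name, args)
		return nil
	}
	s := newServer("kubelet-example.service", fake)
	if err := s.stop(); err != nil {
		fmt.Println("stop failed:", err)
	}
}
```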
Kubernetes Prow Robot
fc866d87dc Merge pull request #113208 from soltysh/azure_timeouts
Set default test timeouts first, only then modify the required ones
2022-10-20 12:07:20 -07:00
Kubernetes Prow Robot
fb9b466f30 Merge pull request #113159 from pacoxu/patch-3
flake: wait for crd ready in kubectl apply script
2022-10-20 12:06:56 -07:00
Kubernetes Prow Robot
dbb3b4f340 Merge pull request #113212 from pohly/e2e-progress-on-stdout-removal
e2e: remove JSON progress output on stdout
2022-10-20 10:29:12 -07:00
Kubernetes Prow Robot
0a08757181 Merge pull request #113128 from pohly/e2e-storage-cleanup
e2e storage: cleanup with context
2022-10-20 10:29:00 -07:00
Aldo Culquicondor
12d308f5c4 Add metric for terminated pods with tracking finalizer
Change-Id: I26f3169588c30ed82250cb7baff8e277f8d13bb7
2022-10-20 11:35:20 -04:00
Patrick Ohly
3e560e8341 e2e: remove JSON progress output on stdout
The original intention was to address "frustration of end users running the e2e
suite is that they take a significant amount of time and it is difficult to
gauge progress".

But Ginkgo's output is different now than it was in Kubernetes 1.19. If users
want to see progress, then "ginkgo --progress" might provide enough
information.

Printing to os.Stdout doesn't work as intended anyway when output redirection
is enabled (the default for parallel runs) and causes these JSON snippets to
appear as "show stdout" for each failed test in a Prow job, which is
distracting.
2022-10-20 16:33:10 +02:00
Maciej Szulik
24d30bf8b2 Set default test timeouts first, only then modify the required ones 2022-10-20 15:35:17 +02:00
Patrick Ohly
f15d7f6cca e2e storage: eliminate context.TODO and cleanup callbacks
Tests should accept a context from Ginkgo and pass it through to all functions
which may block for a longer period of time. In particular all Kubernetes API
calls through client-go should use that context. Then if a timeout occurs,
the test returns immediately because everything that it could block on will
return.

Cleanup code then needs to run in a separate Ginkgo node, typically
DeferCleanup, which ensures that it gets a separate context which has not timed
out yet.
2022-10-20 13:56:59 +02:00
Patrick Ohly
d8d3dc9476 e2e storage: eliminate explicit test driver cleanup
Cleanup can be handled inside the setup code with ginkgo.DeferCleanup. This
simplifies the code.
2022-10-20 13:56:59 +02:00
Patrick Ohly
f314c6b831 test/e2e/instrumentation/logging/utils: remove dead package
The code is not imported anywhere in k/k and therefore should
be removed.
2022-10-20 10:59:31 +02:00
Paco Xu
05de50e1a5 kubectl-alpha-events: e2e ignore some timeout errors (flake) 2022-10-20 16:41:29 +08:00
Michal Wozniak
b1e575aaf7 Self-nominate mimowo as a reviewer for pkg/controller/job & test/integration/job
I think I'm ready to start review and LGTM code changes within this
package, but not necessarily for the entire sig-apps.

My PRs to the packages:
https://github.com/kubernetes/kubernetes/pull/110292
https://github.com/kubernetes/kubernetes/pull/111113
https://github.com/kubernetes/kubernetes/pull/112948

PRs to the packages I contributed reviews to:
https://github.com/kubernetes/kubernetes/pull/113166
https://github.com/kubernetes/kubernetes/pull/110294
2022-10-20 09:22:35 +02:00
kerthcet
26ad401b7a Remove unuseful multi scheduler testcases
Signed-off-by: kerthcet <kerthcet@gmail.com>
2022-10-20 15:15:49 +08:00
Paco Xu
da070707de flake: wait for crd ready in kubectl apply script 2022-10-20 12:58:04 +08:00
Shingo Omura
5beae0c4dc test/images/agnhost: Adds a test user and group in the image for an E2E test case of SupplementalGroups with pre-defined group in the image 2022-10-20 11:17:59 +09:00
Kubernetes Prow Robot
ad26b315f2 Merge pull request #86139 from jasimmons/pr_normalize_probes_lifecycle_handlers
Normalize HTTP lifecycle handlers with HTTP probers
2022-10-19 17:44:56 -07:00
Mark Rossetti
fa2603906b HostProcess containers metrics e2e test fixes
Signed-off-by: Mark Rossetti <marosset@microsoft.com>
2022-10-19 16:10:09 -07:00
Kubernetes Prow Robot
05b46d7ce2 Merge pull request #113166 from alculquicondor/simply_job_metrics_test
Simplify tests for job metrics by resetting them
2022-10-19 13:57:32 -07:00
Kubernetes Prow Robot
45636684a4 Merge pull request #112897 from fromanirh/podresources-metrics-e2e-tests
register podresources metrics
2022-10-19 13:57:18 -07:00