kubernetes

Author	SHA1	Message	Date
huweiwen	c2ccb921ea	e2e pod: fail fast on failed pod no need to wait until timeout. reduce test time	2024-03-04 00:01:02 +08:00
Kubernetes Prow Robot	3516bc6f49	Merge pull request #122456 from AxeZhan/beta3960 [KEP 3960]: graduate PodLifecycleSleepAction to beta	2024-02-19 07:44:50 -08:00
AxeZhan	c74ec3df09	graduate PodLifecycleSleepAction to beta	2024-02-19 19:40:52 +08:00
Patrick Ohly	aa772d77fb	e2e pod: dump pod in unexpected phase When stopping polling, the provided messages becomes the complete failure message. This means that the code which calls gomega.StopTrying must include the pod in the message instead of just summarizing the phase. This makes the failure more useful.	2024-02-14 09:01:32 +01:00
Patrick Ohly	43539c855f	e2e framework: unify logging, support skipping helpers ginkgo.GinkgoHelper is a recent addition to ginkgo which allows functions to mark themselves as helper. This then changes which callstack gets reported for failures. It makes sense to support the same mechanism also for logging. There's also no reason why framework.Logf should produce output that is in a different format than klog log entries. Having time stamps formatted differently makes it hard to read test output which uses a mixture of both. Another user-visible advantage is that the error log entry from framework.ExpectNoError now references the test source code. With textlogger there is a simple replacement for klog that can be reconfigured to let the caller handle stack unwinding. klog itself doesn't support that and should be modified to support it (feature freeze). Emitting printf-style output via that logger would work, but become less readable because the message string would get quoted instead of printing it verbatim as before. So instead, the traditional klog header gets reproduced in the framework code. In this example, the first line is from klog, the second from Logf: I0111 11:00:54.088957 332873 factory.go:193] Registered Plugin "containerd" ... I0111 11:00:54.987534 332873 util.go:506] >>> kubeConfig: /var/run/kubernetes/admin.kubeconfig Indention is a bit different because the initial output is printed before installing the logger which writes through ginkgo.GinkgoWriter. One welcome side effect is that now "go vet" detects mismatched parameters for framework.Logf because fmt.Sprintf is called without mangling the format string. Some of the calls were incorrect.	2024-01-20 18:23:31 +01:00
Patrick Ohly	3aa366a3eb	e2e: remove dead code The dead code was found with: deadcode -test -filter=k8s.io/kubernetes/test/e2e/framework/... ./test/e2e ./test/e2e_node ./test/e2e_node ./test/e2e_kubeadm See https://go.dev/blog/deadcode for an introduction. Only dead code which is clearly not needed anymore (glog logging), questionable (skipping based on feature gates) or redundant (WaitForPodSuccessInNamespaceSlow) gets removed for now. More removals might make sense in the future.	2024-01-17 12:57:35 +01:00
Patrick Ohly	f9ceab37ca	e2e: pass context into pod helper functions This ensures that progress reports and timeouts work.	2023-11-14 15:57:55 +01:00
Patrick Ohly	fc3ee07b51	e2e pods: fix WaitForPodsResponding retry The status error was embedded inside the new error constructed by WaitForPodsResponding's get function, but not wrapped. Therefore `apierrors.IsServiceUnavailable(err)` didn't find it and returned false -> no retries. Wrapping fixes this and Gomega formatting of the error remains useful: err := &errors.StatusError{} err.ErrStatus.Code = 503 err.ErrStatus.Message = "temporary failure" err2 := fmt.Errorf("Controller %s: failed to Get from replica pod %s:\n%w\nPod status:\n%s", "foo", "bar", err, "some status") fmt.Println(format.Object(err2, 1)) fmt.Println(errors.IsServiceUnavailable(err2)) => <fmt.wrapError \| 0xc000139340>: Controller foo: failed to Get from replica pod bar: temporary failure Pod status: some status { msg: "Controller foo: failed to Get from replica pod bar:\ntemporary failure\nPod status:\nsome status", err: <errors.StatusError \| 0xc0001a01e0>{ ErrStatus: { TypeMeta: {Kind: "", APIVersion: ""}, ListMeta: { SelfLink: "", ResourceVersion: "", Continue: "", RemainingItemCount: nil, }, Status: "", Message: "temporary failure", Reason: "", Details: nil, Code: 503, }, }, } true	2023-09-11 11:54:15 +02:00
carlory	f33265cf5d	HandleRetry has already called in the GetObject	2023-09-07 15:48:18 +08:00
Kubernetes Prow Robot	d48fc2ad2d	Merge pull request #119035 from saschagrunert/critical-pod Fix `should be able to create and delete a critical pod` test	2023-07-06 00:51:03 -07:00
Patrick Ohly	c903c29c3b	e2e: support admissionapi.LevelRestricted in test/e2e/framwork/pod CreatePod and MakePod only accepted an `isPrivileged` boolean, which made it impossible to write tests using those helpers which work in a default framework.Framework, because the default there is LevelRestricted. The simple boolean gets replaced with admissionapi.Level. Passing LevelRestricted does the same as calling e2epod.MixinRestrictedPodSecurity. Instead of explicitly passing a constant to these modified helpers, most tests get updated to pass f.NamespacePodSecurityLevel. This has the advantage that if that level gets lowered in the future, tests only need to be updated in one place. In some cases, helpers taking client+namespace+timeouts parameters get replaced with passing the Framework instance to get access to f.NamespacePodSecurityEnforceLevel. These helpers don't need separate parameters because in practice all they ever used where the values from the Framework instance.	2023-07-03 16:26:28 +02:00
Sascha Grunert	bcbc12cd79	Fix `should be able to create and delete a critical pod` test The namespace the crictical pod was referring to was wrong, because it was using the generated one instead of `kube-system`. This and the resulting test condition is now fixed. The test seems to run only in `ci-crio-cgroupv1-node-e2e-flaky` for now. Closes https://github.com/kubernetes/kubernetes/issues/109296 Signed-off-by: Sascha Grunert <sgrunert@redhat.com>	2023-07-03 11:15:59 +02:00
SataQiu	4d2ff08bfa	e2e-framework: code cleanup for mismatched comments	2023-05-26 12:36:19 +08:00
carlory	20602c819b	e2e framework: remove dependency on k8s.io/kubernetes/pkg/api/v1/pod	2023-05-12 08:39:37 +08:00
Ed Bartosh	ff9ebfa90d	e2e framework: control k/k/pkg imports Modified import restrictions for the e2e framework submodules to enable control of the k/k/pkg imports.	2023-04-17 00:17:16 +03:00
Ed Bartosh	dc4f6f9da6	e2e framework: remove last dependency to k/k/pkg/util Copied and modified RemoveString function from k/k/pkg/util/slice/slice.go to e2e/framework/pod/pod_client.go This is the last dependency from e2e framework to k/k/pkg/util	2023-04-15 14:28:32 +03:00
Ed Bartosh	40521fe360	e2e framework: remove last dependency to k/k/pkg/kubelet Copied and modified pod format function from k/k/pkg/kubelet/util/format/pod.go to e2e/framework/pod/pod_client.go This is the last dependency from e2e framework to k/k/pkg/kubelet	2023-04-14 22:32:06 +03:00
Michal Wozniak	3d68f362c3	Give terminal phase correctly to all pods that will not be restarted	2023-03-16 21:25:29 +01:00
Patrick Ohly	fe59e091eb	dependencies: ginkgo v2.9.1, gomega v1.27.4 They contain some nice-to-have improvements (for example, better printing of errors with gomega/format.Object) but nothing that is critical right now. "go mod tidy" was run manually in staging/src/k8s.io/kms/internal/plugins/mock (https://github.com/kubernetes/kubernetes/pull/116613 not merged yet).	2023-03-14 22:26:27 +01:00
David Porter	c5a1f0188b	test: Add node e2e test to verify static pod termination Add node e2e test to verify that static pods can be started after a previous static pod with the same config temporarily failed termination. The scenario is: 1. Static pod is started 2. Static pod is deleted 3. Static pod termination fails (internally `syncTerminatedPod` fails) 4. At later time, pod termination should succeed 5. New static pod with the same config is (re)-added 6. New static pod is expected to start successfully To repro this scenario, setup a pod using a NFS mount. The NFS server is stopped which will result in volumes failing to unmount and `syncTerminatedPod` to fail. The NFS server is later started, allowing the volume to unmount successfully. xref: 1. https://github.com/kubernetes/kubernetes/pull/113145#issuecomment-1289587988 2. https://github.com/kubernetes/kubernetes/pull/113065 3. https://github.com/kubernetes/kubernetes/pull/113093 Signed-off-by: David Porter <david@porter.me>	2023-03-03 10:00:48 -06:00
Kubernetes Prow Robot	edea44c82e	Merge pull request #113205 from mimowo/oomkiller-e2e-node-test Add e2e_node test for oom killed container reason	2023-02-21 14:23:55 -08:00
Michal Wozniak	fd28f69ca4	Add e2e_node test for oom killed container reason	2023-02-20 08:15:45 +01:00
Patrick Ohly	3e760310b2	e2e: revise import restrictions - test/e2e/framework/.go should have very minimal dependencies. We can enforce that via import-boss. - What each test/e2e/framework/ sub-package uses is less relevant, although ideally it also should be as minimal as possible in each case. Enforcing this via import-boss ensures that new dependencies get flagged as problem and thus will get additional scrutiny. It might be okay to add them, but it needs to be considered.	2023-02-12 14:56:45 +01:00
Patrick Ohly	136f89dfc5	e2e: use error wrapping with %w The recently introduced failure handling in ExpectNoError depends on error wrapping: if an error prefix gets added with `fmt.Errorf("foo: %v", err)`, then ExpectNoError cannot detect that the root cause is an assertion failure and then will add another useless "unexpected error" prefix and will not dump the additional failure information (currently the backtrace inside the E2E framework). Instead of manually deciding on a case-by-case basis where %w is needed, all error wrapping was updated automatically with sed -i "s/fmt.Errorf$.$: '$%s\\|%v$'\",$. err)$/fmt.Errorf\1: %w\",\3/" $(git grep -l 'fmt.Errorf' test/e2e*) This may be unnecessary in some cases, but it's not wrong.	2023-02-06 15:39:13 +01:00
Patrick Ohly	9878e735dd	e2e pod: unit test for pod status + API error This covers new behavior in gomega.	2023-02-06 15:39:13 +01:00
Patrick Ohly	1bd1167d56	e2e pod: remove dead code	2023-02-06 15:39:13 +01:00
Patrick Ohly	3bb735e6fa	e2e pod: use gomega.Eventually in WaitForRestartablePods	2023-02-06 15:39:13 +01:00
Patrick Ohly	1e346c4e4a	e2e pod: convert ProxyResponseChecker into matcher Instead of pod responses being printed to the log each time polling fails, we get a consolidated failure message with all unexpected pod responses if (and only if) the check times out or a progress report gets produced.	2023-02-06 15:39:13 +01:00
Patrick Ohly	c3266cde77	e2e: consolidate pod response checking This renames PodsResponding to WaitForPodsResponding for the sake of consistency and adds a timeout parameter. That is necessary because some other users of NewProxyResponseChecker used a much lower timeout (2min vs. 15min). Besides simplifying some code, it also makes it easier to rewrite ProxyResponseChecker because it only gets used in WaitForPodsResponding.	2023-02-06 15:39:13 +01:00
Patrick Ohly	89a5d6d8af	e2e pod: use gomega.Eventually in WaitForPodNotFoundInNamespace	2023-02-06 15:39:13 +01:00
Patrick Ohly	9df3e2a47a	e2e: replace WaitForPodToDisappear with WaitForPodNotFoundInNamespace WaitForPodToDisappear was always called such that it listed all pods, which made it less efficient than trying to get just the one pod it was checking for. Being able to customize the poll interval in practice wasn't useful, therefore it can be replaced with WaitForPodNotFoundInNamespace.	2023-02-06 15:39:12 +01:00
Patrick Ohly	45d4631069	e2e: consolidate checking a pod list WaitForPods is now a generic function which lists pods and then checks the pods that it found against some provided condition. A parameter determines how many pods must be found resp. match the condition for the check to succeed.	2023-02-06 15:39:12 +01:00
Patrick Ohly	d8428c6fb1	e2e pod: use gomega.Eventually in WaitTimeoutForPodReadyInNamespace/WaitForPodCondition These get converted together because they relied on FinalErr which now isn't needed anymore.	2023-02-06 15:39:12 +01:00
Patrick Ohly	3dd185aa40	e2e pod: use gomega.Eventually in WaitForPodsRunningReady The code becomes simpler (78 insertions, 91 deletions), easier to read (all code entirely inside WaitForPodsRunningReady, no need to declare and later overwrite variables) and possibly more correct (if all API calls failed, the resulting error was ignored when allowedNotReadyPods > 0).	2023-02-06 15:39:12 +01:00
Patrick Ohly	4d63e7d4d6	e2e: remove unused label filter from WaitForPodsRunningReady None of the users of the functions passed anything other than nil or an empty map and the implementation ignore the parameter - it seems like a candidate for simplification.	2023-02-06 15:39:12 +01:00
Patrick Ohly	8181f97ecc	e2e framework: include additional stack backtrace in failures When a Gomega failure is converted to an error, the stack at the time when the failure occurs may be useful: error wrapping provides some bread crumbs that can be followed to determine where the failure really occurred, but error wrapping may be missing or ambiguous. To provide the additional information, a FailureError now includes a full stack backtrace. The backtrace intentionally makes no attempt to exclude framework functions besides the gomega support itself because helpers like e2e/framework/pod may be relevant. That backtrace is not included in the failure message for the sake of brevity. Instead, it gets logged as part of the test's output.	2023-02-06 15:39:12 +01:00
Patrick Ohly	005a9da0cc	e2e framework: implement pod polling with gomega.Eventually gomega.Eventually provides better progress reports: instead of filling up the log with rather useless one-line messages that are not enough to to understand the current state, it integrates with Gingko's progress reporting (SIGUSR1, --poll-progress-after) and then dumps the same complete failure message as after a timeout. That makes it possible to understand why progress isn't getting made without having to wait for the timeout. The other advantage is that the failure message for some unexpected pod state becomes more readable: instead of encapsulating it as "observed object" inside an error, it directly gets rendered by gomega.	2023-02-06 15:39:12 +01:00
Antonio Ojea	7f5ae1c0c1	Revert "e2e: wait for pods with gomega"	2023-02-06 12:08:22 +01:00
Patrick Ohly	222f655062	e2e: use error wrapping with %w The recently introduced failure handling in ExpectNoError depends on error wrapping: if an error prefix gets added with `fmt.Errorf("foo: %v", err)`, then ExpectNoError cannot detect that the root cause is an assertion failure and then will add another useless "unexpected error" prefix and will not dump the additional failure information (currently the backtrace inside the E2E framework). Instead of manually deciding on a case-by-case basis where %w is needed, all error wrapping was updated automatically with sed -i "s/fmt.Errorf$.$: '$%s\\|%v$'\",$. err)$/fmt.Errorf\1: %w\",\3/" $(git grep -l 'fmt.Errorf' test/e2e*) This may be unnecessary in some cases, but it's not wrong.	2023-01-31 13:01:39 +01:00
Patrick Ohly	5973e2c8cb	e2e pod: unit test for pod status + API error This covers new behavior in gomega.	2023-01-31 13:01:39 +01:00
Patrick Ohly	901928cd54	e2e pod: remove dead code	2023-01-31 13:01:39 +01:00
Patrick Ohly	f5782f1dbd	e2e pod: use gomega.Eventually in WaitForRestartablePods	2023-01-31 13:01:39 +01:00
Patrick Ohly	5d8e970be6	e2e pod: convert ProxyResponseChecker into matcher Instead of pod responses being printed to the log each time polling fails, we get a consolidated failure message with all unexpected pod responses if (and only if) the check times out or a progress report gets produced.	2023-01-31 13:01:39 +01:00
Patrick Ohly	3b579fca91	e2e: consolidate pod response checking This renames PodsResponding to WaitForPodsResponding for the sake of consistency and adds a timeout parameter. That is necessary because some other users of NewProxyResponseChecker used a much lower timeout (2min vs. 15min). Besides simplifying some code, it also makes it easier to rewrite ProxyResponseChecker because it only gets used in WaitForPodsResponding.	2023-01-31 13:01:39 +01:00
Patrick Ohly	4491c80074	e2e pod: use gomega.Eventually in WaitForPodNotFoundInNamespace	2023-01-31 13:01:39 +01:00
Patrick Ohly	6eea1b2efa	e2e: replace WaitForPodToDisappear with WaitForPodNotFoundInNamespace WaitForPodToDisappear was always called such that it listed all pods, which made it less efficient than trying to get just the one pod it was checking for. Being able to customize the poll interval in practice wasn't useful, therefore it can be replaced with WaitForPodNotFoundInNamespace.	2023-01-31 13:01:39 +01:00
Patrick Ohly	4740d34edb	e2e: consolidate checking a pod list WaitForPods is now a generic function which lists pods and then checks the pods that it found against some provided condition. A parameter determines how many pods must be found resp. match the condition for the check to succeed.	2023-01-31 07:52:26 +01:00
Patrick Ohly	cd0c756c72	e2e pod: use gomega.Eventually in WaitTimeoutForPodReadyInNamespace/WaitForPodCondition These get converted together because they relied on FinalErr which now isn't needed anymore.	2023-01-31 07:52:26 +01:00
Patrick Ohly	671835e976	e2e pod: use gomega.Eventually in WaitForPodsRunningReady The code becomes simpler (78 insertions, 91 deletions), easier to read (all code entirely inside WaitForPodsRunningReady, no need to declare and later overwrite variables) and possibly more correct (if all API calls failed, the resulting error was ignored when allowedNotReadyPods > 0).	2023-01-31 07:52:26 +01:00
Patrick Ohly	3ebab68c8a	e2e: remove unused label filter from WaitForPodsRunningReady None of the users of the functions passed anything other than nil or an empty map and the implementation ignore the parameter - it seems like a candidate for simplification.	2023-01-31 07:52:26 +01:00

1 2 3 4 5

222 Commits