Kubernetes Prow Robot
884bc96fec
Merge pull request #120773 from swatisehgal/tm-metrics-e2e-deflake
...
topology-mgr: metrics: Deflake Topology Manager metrics e2e tests
2023-09-20 11:26:26 -07:00
Kubernetes Prow Robot
7fb7e2625b
Merge pull request #120401 from shijinye/e2eclean-node-notequal
...
cleanup:e2e:stop using deprecated framework.ExpectNotEqual
2023-09-20 11:26:19 -07:00
Kubernetes Prow Robot
3191493cea
Merge pull request #119402 from Tal-or/e2e_podres_terminal_pods
...
e2e:podresources: verify count for terminal pods
2023-09-20 11:26:11 -07:00
Swati Sehgal
f5d915b594
topology-mgr: metrics: Deflake Topology Manager metrics e2e tests
...
On local execution of Topology Manager metrics tests, the tests pass rate was 100%.
Yet, we can see that the Topology Manager metrics tests are failing in upstream
CI consistently: https://testgrid.k8s.io/sig-node-presubmits#pr-kubelet-serial-gce-e2e-topology-manager .
From the logs, it was identified that these failures are because of timeouts,
so we are increasing the default timeout as well as polling interval frequency
of obtaining KubeletMetrics to deflake this test.
We have noticed a similar flake in case of CPU manager metrics tests as well:
https://prow.k8s.io/view/gs/kubernetes-jenkins/pr-logs/directory/pull-kubernetes-node-kubelet-serial-cpu-manager/1701615009836044288 .
Once it is confirmed that the issue is resolved for Topology Manager test,
we will be fix this for CPU Manager as well in a follow-up PR.
Signed-off-by: Swati Sehgal <swsehgal@redhat.com >
2023-09-20 13:37:27 +01:00
Kubernetes Prow Robot
a68093a3ff
Merge pull request #120506 from alexzielenski/import-restrictions
...
Update e2e import restrictions
2023-09-13 21:56:22 -07:00
Kubernetes Prow Robot
160fe010f3
Merge pull request #120464 from gjkim42/deflake-container-lifecycle-e2e-test
...
e2e_node: Assign enough time to finish the postStart hook
2023-09-12 17:44:44 -07:00
Kubernetes Prow Robot
04e5914079
Merge pull request #120349 from ruquanzhao/fixTopologyManagerJobs
...
e2e-node: fix TopologyManager test jobs.
2023-09-12 17:44:37 -07:00
Kubernetes Prow Robot
8aeebda818
Merge pull request #120306 from Rei1010/nodeClean
...
e2e_node:stop using deprecated framework.ExpectError
2023-09-12 17:44:23 -07:00
Todd Neal
af151eeba2
specifically check that the pod was successful
2023-09-12 13:40:20 -05:00
Gunju Kim
1fb4eee94e
Use container log instead of termination log
...
Since the termination log cannot be accessed until the container is
terminated, use the container log.
2023-09-11 22:55:09 +09:00
Alexander Zielenski
7a13b11af0
update e2e import restrictions
2023-09-07 12:20:29 -07:00
Kubernetes Prow Robot
b27670dfbd
Merge pull request #118740 from saschagrunert/kubelet-label-types
...
Make kubelet label types public
2023-09-06 23:46:57 -07:00
Gunju Kim
b468e4eb1c
e2e_node: Assign enough time to finish the postStart hook
...
This deflakes the "Containers Lifecycle should not launch second
container before PostStart of the first container completed" test by
assigning enough time to finish the postStart hook.
2023-09-07 00:42:54 +09:00
Kubernetes Prow Robot
56cc5e77a1
Merge pull request #120441 from tzneal/revert-npd-update
...
Revert "bump npd to v0.8.14"
2023-09-06 06:39:04 -07:00
Kubernetes Prow Robot
debe30de70
Merge pull request #120281 from gjkim42/feature-gate-sidecar-containers-in-kuberuntime
...
Feature-gate SidecarContainers code in pkg/kubelet/kuberuntime
2023-09-05 18:34:54 -07:00
Todd Neal
355ae44a3c
Revert "bump npd to v0.8.14"
...
This reverts commit 7b44d73f73 .
2023-09-05 20:28:53 -05:00
jinye
a774887262
cleanup:e2e:stop using deprecated framework.ExpectNotEqual
2023-09-05 18:16:57 +08:00
RuquanZhao
bfc3c2110f
e2e-node: fix TopologyManager test jobs.
...
Signed-off-by: Ruquan Zhao <ruquan.zhao@arm.com >
2023-09-01 17:53:16 +08:00
wen.rui
3d9b5d0577
e2e_node:stop using deprecated framework.ExpectError
2023-09-01 17:42:36 +08:00
Kubernetes Prow Robot
400059d025
Merge pull request #120194 from bzsuni/bz/bump/npd
...
bump npd to v0.8.14
2023-08-31 20:52:30 -07:00
Gunju Kim
63177db32c
Add an e2e test for the pod sandbox changed scenario
...
This adds an e2e test to ensure that a pod should restart its containers
in right order after the pod sandbox is changed.
2023-09-01 00:13:47 +09:00
Todd Neal
ede524e1a6
fix a pidpressure test flake
...
With the new busybox, ash has a built-in sleep command. Prior to this
change we were creating half the pids expected since `sleep` wasn't
actually launching a new binary. Use the full path to /bin/sleep which
avoids the built-in and actually launches a new process.
2023-08-30 22:44:36 -05:00
bzsuni
7b44d73f73
bump npd to v0.8.14
...
Signed-off-by: bzsuni <bingzhe.sun@daocloud.io >
2023-08-30 19:03:33 +08:00
Fan Shang Xiang
8d9517318a
Extend npd e2e timeout to fix npd e2e error
2023-08-29 17:22:28 +08:00
Kubernetes Prow Robot
232d343d58
Merge pull request #119969 from saschagrunert/cni-plugins
...
Update CNI plugins to v1.3.0
2023-08-23 12:41:57 -07:00
Dixita Narang
d2dbc583a0
Adding coverage for OOM Kill scenario due to node allocatable memory limits, when pod level memory limits are not set
2023-08-22 00:45:17 +00:00
Davanum Srinivas
3e9a4c15a8
Restrict what imports get into code within test/e2e_node
...
Signed-off-by: Davanum Srinivas <davanum@gmail.com >
2023-08-21 15:04:23 -04:00
Kubernetes Prow Robot
4dee8398ae
Merge pull request #120078 from tzneal/investigate-test-failure
...
expect the new resource_scape_error metric
2023-08-21 04:13:34 -07:00
Todd Neal
b8512cfe24
expect the new resource_scape_error metric
2023-08-20 14:17:54 -05:00
Todd Neal
905f07f1ac
Revert "mark the OOM killer as serial to reduce flakes"
...
This reverts commit bd6f548746 .
Running as serial didn't completely eliminate the flake so I think
there's something more going on here. Reverting the change to serial
since its not a solution.
2023-08-20 13:38:07 -05:00
Todd Neal
bd6f548746
mark the OOM killer as serial to reduce flakes
...
In testing I could only reproduce the flake by running stress-ng to load
the CPU. Running it as serial should reduce and hopefully eliminate the
flakiness.
2023-08-18 13:18:50 -05:00
Todd Neal
577197559a
remove the legacy test dependency
...
This removes the import which added a bunch of apparently
old failing tests.
2023-08-17 12:54:20 -05:00
Sascha Grunert
7933368460
Update CNI plugins to v1.3.0
...
Signed-off-by: Sascha Grunert <sgrunert@redhat.com >
2023-08-17 09:50:53 +02:00
Kubernetes Prow Robot
4d166947cf
Merge pull request #119097 from pacoxu/fix-eviction-pid
...
PIDPressure condition is triggered slow on CRI-O with large PID pressure/heavy load
2023-08-16 16:36:19 -07:00
Kubernetes Prow Robot
88d14edc26
Merge pull request #119197 from saschagrunert/stop-container-runtime-err
...
Check dbus error on container runtime start/stop
2023-08-16 15:27:52 -07:00
Kubernetes Prow Robot
b1e35d5616
Merge pull request #119974 from tzneal/bump-busybox-test-version
...
bump the busybox test version to resolve test failures
2023-08-16 12:44:13 -07:00
Kubernetes Prow Robot
dd44792cec
Merge pull request #119880 from saschagrunert/seccomp-filter
...
Make seccomp status checks in e2e tests more robust
2023-08-16 12:43:54 -07:00
Todd Neal
b75c5d33e5
bump the busybox test version to resolve test failures
...
- bump busybox version
- specify the path to /bin/sleep to avoid calling a new shell
builtin
2023-08-16 08:50:20 -05:00
Kubernetes Prow Robot
c41c448b80
Merge pull request #119890 from tzneal/containers-lifecycle-flake
...
crio: increase test buffer to eliminate test flakes
2023-08-15 23:13:45 -07:00
Kubernetes Prow Robot
061ae8a68b
Merge pull request #119765 from tzneal/detect-nfsv3-and-change-mount-path
...
fix mirror pod nfs test failure due to differing NFS versions
2023-08-15 23:12:44 -07:00
Kubernetes Prow Robot
3111fee8bf
Merge pull request #119670 from lengrongfu/fix/oomkill-multi-target-container
...
fix OOM killer
2023-08-15 19:43:40 -07:00
Kubernetes Prow Robot
3525255622
Merge pull request #119212 from CoderSherlock/master
...
Added oomkill test for init container and fix typos
2023-08-15 15:17:48 -07:00
Todd Neal
e258228e4a
use a buffer equivalent to grace period to eliminate test flakes
...
This modifies the test to wait up to 2x the grace period for the pod to
be removed.
2023-08-11 14:08:11 -05:00
Todd Neal
717c149a73
fix mirror pod nfs test failure due to differing NFS versions
...
/exports *(rw,fsid=0,insecure,no_root_squash)
can be mounted as `/exports` using NFSv3 and `/` using NFSv4
Mount as '/', since clients that support both can try both.
2023-08-11 07:27:05 -05:00
Sascha Grunert
8ab6bee676
Make seccomp status checks in e2e tests more robust
...
The tests have been introduced in
ca7be7dc6d
and checked for `ecc` in `/proc/self/status` since its creation.
We got a new field `Seccomp_filters:` with the Linux commit
c818c03b66 ,
means that `ecc` would now match both and interfere with possible test
results depending on the host.
The field `Seccomp:` got introduced in
2f4b3bf6b2
and has never changed since then, means we can use it directly to make
the tests more strict.
Refers to https://github.com/kubernetes-sigs/cri-tools/pull/1236
Signed-off-by: Sascha Grunert <sgrunert@redhat.com >
2023-08-10 09:51:03 +02:00
lengrongfu
c23cee1be3
fix OOM killer
...
Signed-off-by: lengrongfu <rongfu.leng@daocloud.io >
2023-07-30 11:16:12 +08:00
Davanum Srinivas
b4ef4015a2
Avoid pulling mounter.tar through the CDN
...
Signed-off-by: Davanum Srinivas <davanum@gmail.com >
2023-07-28 22:15:55 -04:00
upodroid
a65d207507
calculate the correct machine-type
2023-07-26 23:10:06 +00:00
upodroid
7d13c9b096
set map to nil if an empty string is passed
2023-07-26 10:32:27 +03:00
Talor Itzhak
3964f71fe0
e2e:podresources: verify count for terminal pods
...
PodResourcesAPI reports in the List call about resources of pods in terminal phase.
The internal managers reassign resources assigned to pods in terminal phase, so podresources should ignore them.
Whether this behavior intended or not (the docs are not unequivocal)
this e2e test demonstrates and verifies the mentioned above.
Signed-off-by: Talor Itzhak <titzhak@redhat.com >
2023-07-23 12:46:41 +03:00