1925 Commits

Author SHA1 Message Date
Kubernetes Prow Robot
4293a6329b Merge pull request #99805 from bobbypage/shutdown-test
Fixes to node shutdown e2e test
2021-03-04 16:06:35 -08:00
David Porter
bd2e557b25 Fixes to node shutdown e2e test
- The test was failing because it ran `sleep infinity` inside the busybox
  container, which went into a crash loop. The busybox version of `sleep`
  doesn't support `sleep infinity`, so replace it with a `while true` sleep
  loop.

- Emit the dbus message with dbus-send instead of gdbus. The test was
  failing on Ubuntu, which doesn't have gdbus installed. dbus-send is
  installed on both COS and Ubuntu, so use it instead.

- Replace the pod phase check with the test util function `PodRunningReady`,
  which checks both the phase and the pod ready condition.

- Add some more verbose logging to ease future debugging.
2021-03-04 12:16:47 -08:00
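
To illustrate why the commit above checks both the phase and the ready condition, here is a minimal sketch. The `Pod` and `PodPhase` types and both helper functions are simplified, hypothetical stand-ins, not the real Kubernetes API types or the actual `PodRunningReady` implementation. It assumes a pod whose container keeps restarting can still report the Running phase while not being ready.

```go
package main

import "fmt"

// PodPhase and Pod are simplified, hypothetical stand-ins for the real
// Kubernetes API types; only the fields needed for the illustration exist.
type PodPhase string

const (
	PodPending PodPhase = "Pending"
	PodRunning PodPhase = "Running"
)

type Pod struct {
	Phase PodPhase
	Ready bool // stands in for the pod's Ready condition being true
}

// phaseOnly is the weaker check: it accepts any pod in the Running phase.
func phaseOnly(p Pod) bool { return p.Phase == PodRunning }

// runningAndReady additionally requires the Ready condition to hold.
func runningAndReady(p Pod) bool { return p.Phase == PodRunning && p.Ready }

func main() {
	// Assumed scenario: the phase is Running but the pod is not ready.
	notReady := Pod{Phase: PodRunning, Ready: false}
	fmt.Println(phaseOnly(notReady), runningAndReady(notReady))
}
```

With the phase-only check the not-ready pod passes, while the combined check rejects it.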
Kubernetes Prow Robot
3cab9f5d74 Merge pull request #99584 from cynepco3hahue/e2e_fix_hugepages_tests
e2e node: fix hugepages test flakiness
2021-03-04 11:00:48 -08:00
wojtekt
3b438eb4e5 Cleanup test/e2e/common directory - part2 2021-03-03 10:27:35 +01:00
Artyom Lukianov
ef3e0fd02f e2e node: wait for kubelet health check to pass after kubelet restart
Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-03-02 14:48:03 +02:00
Artyom Lukianov
a6b4868b8d e2e node: stop kubelet service instead of restarting it
The server service monitors the kubelet service and restarts it
once the service is down. To avoid restarting the kubelet twice,
we stop the kubelet service and wait until the kubelet has been
restarted and the node is ready.

Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-03-02 14:48:03 +02:00
Kubernetes Prow Robot
4eca18b6b0 Merge pull request #99447 from lala123912/log-sanitization
typo fixed for terminatation
2021-03-01 19:23:58 -08:00
Benjamin Elder
56e092e382 hack/update-bazel.sh 2021-02-28 15:17:29 -08:00
lala123912
652f8fd3e9 typo fixed 2021-02-25 17:52:17 +08:00
Kubernetes Prow Robot
6235ad8c74 Merge pull request #98373 from cynepco3hahue/e2e_fix_cpus_assumptions
e2e: fix CPU manager methods to be more flexible to different CPU topology
2021-02-24 11:11:03 -08:00
Kubernetes Prow Robot
bd902db13d Merge pull request #98342 from cynepco3hahue/e2e_move_delete_state_file_to_after_each
e2e: move deleteState file to the AfterEach
2021-02-24 11:10:50 -08:00
wojtekt
a74737eb03 Mark remaining e2e_node tests with [sig-*] label 2021-02-23 20:11:09 +01:00
Kubernetes Prow Robot
89b9cf7d6f Merge pull request #98658 from wzshiming/add-e2e-for-graceful-node-shutdown
Add GracefulNodeShutdown e2e test
2021-02-21 06:45:41 -08:00
wzshiming
e367d2fe0c Add GracefulNodeShutdown e2e test 2021-02-18 14:46:49 +08:00
Nikhita Raghunath
c3c45b9b8c *: move balajismaniam to emeritus_approvers 2021-02-16 10:55:47 +05:30
Artyom Lukianov
932134034c memory manager: update API constant to have camel case format
Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-02-09 01:10:02 +02:00
Artyom Lukianov
e8ea461bfd memory manager: update all relevant part of code to use []MemoryReservation
Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-02-09 01:10:01 +02:00
Artyom Lukianov
74eeef2a0a memory manager: provide additional e2e tests
With the memory manager static policy:

- start multiple guaranteed pods and verify that pods succeeded to start
- start a workload pod on each NUMA node to load the memory, then start a
  pod that requests more memory than any single NUMA node has; the pod should
  fail to start with an admission error, because no single NUMA node has
  enough memory to start the pod and each NUMA node is already used for
  single NUMA node allocation.
  The test requires at least two NUMA nodes.

Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-02-09 01:10:00 +02:00
Pawel Rapacz
606fea29f5 memory manager: add e2e test to run guaranteed pod with init containers
Signed-off-by: Pawel Rapacz <p.rapacz@partner.samsung.com>
2021-02-09 01:10:00 +02:00
Artyom Lukianov
f3d4ac2f7e memory manager: add basic e2e tests
Provides basic e2e tests to verify that a pod succeeds
in starting with MemoryManager enabled.

Verifies both MemoryManager policies and, when the node has
multiple NUMA nodes, also verifies the memory pinning.

Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-02-09 01:10:00 +02:00
Kubernetes Prow Robot
ee2b7a5049 Merge pull request #98049 from pacoxu/enhancements/redir-links
fix all keps links 404 for kep folder migration
2021-02-01 21:48:28 -08:00
pacoxu
a10bdfed09 fix all keps links 404 for kep folder migration
Signed-off-by: pacoxu <paco.xu@daocloud.io>
2021-02-01 19:41:59 +08:00
Ernest Wong
40713f7666 test images: use recently promoted images
Signed-off-by: Ernest Wong <chuwon@microsoft.com>
2021-01-28 10:09:00 -08:00
Artyom Lukianov
97ac255513 e2e: move deleteState file to the AfterEach
In the CPU manager and topology manager e2e tests, one of the test steps
can fail before the CPU manager state file is cleaned up. Move the deletion
of the state file to `AfterEach` to guarantee that the state file is always
removed from the node.

Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-01-26 20:34:17 +02:00
Artyom Lukianov
d2c3f52532 e2e: fix CPU manager methods to be more flexible to different CPU topology
- fix the issue that occurs when the test runs on a node with a single CPU
- fix the issue that occurs when the CPU topology has only one core per
  socket; it can be easily reproduced by configuring a VM with multiple NUMA
  nodes where each socket has only one core

Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-01-25 18:02:38 +02:00
Ikko Ashimine
5155decbbf Fix typo in device_plugin_test.go
assignement -> assignment
2021-01-24 17:42:34 +09:00
hasheddan
40ca31f524 Default cluster dns-domain to empty string in e2e_node suite
Changes default cluster DNS domain to empty string to align with the
default kubelet configuration value.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2021-01-13 10:53:11 -06:00
Kubernetes Prow Robot
cbcb26e7a5 Merge pull request #97653 from heqg/unused-kubeletPort
remove unused constant of kubeletPort in e2e test
2021-01-12 11:56:35 -08:00
Kubernetes Prow Robot
e414d4e5c2 Merge pull request #96315 from twosigma/dnsdomainteste2e
Ensuring default DNS cluster-domain of Kubelet is cluster.local
2021-01-12 10:28:35 -08:00
he.qingguo
ced98afe56 remove unused constant of kubeletPort in e2e test
Signed-off-by: he.qingguo <he.qingguo@zte.com.cn>
2021-01-02 12:16:07 +08:00
Kubernetes Prow Robot
34baf6aeb7 Merge pull request #96666 from harche/fix_typo
Remove the typo in the logs while configuring firewall for node e2e
2020-12-08 18:35:13 -08:00
Kubernetes Prow Robot
1fe25e9f73 Merge pull request #96637 from hasheddan/node-local-env
Update comment to not indicate check is run for docker daemon
2020-12-08 18:34:18 -08:00
Kubernetes Prow Robot
dcc863a8b3 Merge pull request #96219 from fromanirh/tm-e2e-sriovdp-usage
node: e2e: bring up/down SRIOV DP just once
2020-12-08 16:29:53 -08:00
Kubernetes Prow Robot
5cdc3e6e20 Merge pull request #95613 from MHBauer/CAP_NET_RAW
CAP_NET_RAW should be NET_RAW
2020-12-08 16:29:28 -08:00
Harshal Patil
b76abcd243 Remove the typo in the logs while configuring firewall for node e2e
Signed-off-by: Harshal Patil <harpatil@redhat.com>
2020-11-18 14:20:59 +05:30
Kubernetes Prow Robot
36d12390a7 Merge pull request #95906 from harche/iptables_fix
Verify iptable rules are applied for tcp, udp and icmp
2020-11-17 22:08:04 -08:00
hasheddan
d2f7cea12a Update comment to not indicate check is run for docker daemon
Removes comment from daemons function that previously indicated that a
check was being run to make sure docker daemon was running.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2020-11-17 08:31:26 -06:00
Kubernetes Prow Robot
d20e3246ba Merge pull request #96574 from hasheddan/cleanup-resource-tests
Fix test name in e2e resource metrics api test
2020-11-14 05:09:04 -08:00
hasheddan
bf270d1070 Fix test name in e2e resource metrics api test
Updates resouce to resource.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2020-11-13 15:19:55 -06:00
hasheddan
56c104a440 Relax matching on pod_memory_working_set_bytes metrics
Relaxes matching of pod_memory_working_set_bytes metric so that we won't
error due to presence of other pods.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2020-11-13 12:29:54 -06:00
hasheddan
25a04096e8 Match pod resource metrics to pod not container
Update node resource metrics e2e tests to match pod metrics to the pod
instead of containers.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2020-11-13 10:29:21 -06:00
Francesco Romani
56106439cf node: e2e: bring up/down SRIOV DP just once
The topology manager e2e tests want to test resource alignment using
devices, and the easiest devices to use at the moment are SRIOV
devices.

The resource alignment test cases are run for each supported policy,
in a loop.

The tests manage the SRIOV device plugin; up until now, the plugin
was set up and torn down on each loop iteration.
There is no real need for that. Each iteration must reconfigure (thus
restart) the kubelet, but the device plugin can be set up and torn down
just once for all the policies.
The kubelet can reconnect just fine to a running device plugin.

This way, we greatly reduce the interactions and the complexity of the
test environment, making it easier to understand and more robust, and
we trim down some minutes from execution time.

However, this patch also hides (but does not solve) a test flake we
observed in some environments. The issue is hard to reproduce and not well
understood, but seems to be caused by doing the sriov dp setup/teardown
in each policy testing loop.
Investigation so far suggests that the kubelet sometimes has stale
state after the sriovdp teardown/setup cycle, leading to flakes and
false negatives.
We tried to address this in https://github.com/kubernetes/kubernetes/pull/95611
with no conclusive results yet.

This patch was posted because overall we believe its gains
outweigh the drawbacks (hiding the aforementioned flake) and
because understanding the potential interaction issues between the
sriovdp and the kubelet deserves a separate test.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2020-11-13 10:04:31 +01:00
Kubernetes Prow Robot
0e0cc1ead8 Merge pull request #95839 from egernst/pod-usage
resource-metrics: add pod/sandbox metrics to endpoint
2020-11-12 19:36:23 -08:00
Kubernetes Prow Robot
7edf621277 Merge pull request #96262 from knight42/fix/npd-test
fix(test::npd): provide NPD with proper kubeconfig
2020-11-12 11:20:36 -08:00
Pawel Rapacz
16c7bf4db4 Implement e2e tests for pod scope alignment
A suite of e2e tests was created for Topology Manager
to test the pod scope alignment feature.

Co-authored-by: Pawel Rapacz <p.rapacz@partner.samsung.com>
Co-authored-by: Krzysztof Wiatrzyk <k.wiatrzyk@samsung.com>
Signed-off-by: Cezary Zukowski <c.zukowski@samsung.com>
2020-11-12 12:25:55 +01:00
Harshal Patil
f3bfc9fd67 Verify iptable rules are applied for tcp, udp and icmp
Signed-off-by: Harshal Patil <harpatil@redhat.com>
2020-11-12 13:29:13 +05:30
Sergey Kanzhelev
06da0e5e74 GA of RuntimeClass feature gate and API 2020-11-11 19:22:32 +00:00
knight42
1a9600dc16 fix(test::npd): provide NPD with proper kubeconfig
Signed-off-by: knight42 <anonymousknight96@gmail.com>
2020-11-11 09:47:09 +08:00
Kubernetes Prow Robot
84745e2db4 Merge pull request #96381 from karan/npd-test-cmd
Fix command and arg in NPD e2e
2020-11-10 12:44:16 -08:00
Kubernetes Prow Robot
6e95025994 Merge pull request #92165 from RenaudWasTaken/v1-pod-resources
Graduate the Pod Resources API to G.A
2020-11-09 18:24:41 -08:00