Commit Graph

1937 Commits

Author SHA1 Message Date
tanjing2020
d0882e69e2 Fix the wrong judgment of oom_score_adj 2021-03-24 16:13:20 +08:00
Francesco Romani
d7a30e1b08 podresources: getallocatable: add feature gate
Add feature gate to disable the GetAllocatableResources API.
The feature gate isd alpha stage, disabled by default.

Add e2e test to demonstrate the behaviour with feature gate disabled.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2021-03-09 13:14:56 +01:00
Francesco Romani
16d5ac3689 node: e2e: docs and fix for teardownSRIOVConfig
Document why teardownSRIOVPod has to wait for all the containers
to be gone before to end, and why is important.

Additionally, change the code to wait for all the containers to be gone,
not just the first. This is both a little cleaner and a little safer,
even though it seems the current code caused no issues so far.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2021-03-09 13:13:36 +01:00
Francesco Romani
adfff27279 node: e2e: run deleteSync in parallel
speedup the cleanup after testcases deleting pods in separate
goroutines.
The post-test cleanup stage must be done carefully since pod require
exclusive allocation - so pods must take all the steps to properly
cleanup the tests to avoid to pollute the environment, but
this has a negative effect on test duration (take longer).

Hence, we add safe speedups like doing pod deletions in parallel.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2021-03-09 13:13:36 +01:00
Francesco Romani
9c69db3f04 e2e: node: add tests for GetAllocatableResources
Add e2e tests for the new GetAllocatableResources API.
The tests are added in the `podresources_test` suite
created previously in this series.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2021-03-09 13:13:36 +01:00
Francesco Romani
4e7434028c e2e: node: bootstrap podresources tests
Start e2e tests for the existing List() API.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2021-03-09 13:13:35 +01:00
Kubernetes Prow Robot
40a411a61a
Merge pull request #99912 from dims/capture-logs-from-containerd-installation-service
Capture logs from containerd-installation service
2021-03-08 22:53:38 -08:00
Kubernetes Prow Robot
0df8c69731
Merge pull request #99960 from knabben/fix-runtime-config
Enabling runtime config on E2E node tests
2021-03-08 16:28:00 -08:00
Amim Knabben
0341e4c2f3 Enabling runtime config on E2E node tests 2021-03-08 15:45:06 -05:00
Kubernetes Prow Robot
eb4dafb7f1
Merge pull request #99651 from umohnani8/cri
Move CRIContainerLogRotation to GA
2021-03-08 12:07:20 -08:00
Davanum Srinivas
ce35b2f2e7
capture additional logs related to containerd
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2021-03-07 13:48:07 -05:00
Kubernetes Prow Robot
4293a6329b
Merge pull request #99805 from bobbypage/shutdown-test
Fixes to node shutdown e2e test
2021-03-04 16:06:35 -08:00
David Porter
bd2e557b25 Fixes to node shutdown e2e test
- Test was failing due to using `sleep infinity` inside the busybox
  container which was going into a crash loop. `sleep infinity` isn't
  supported by the sleep version in busybox, so replace it with a `while
  true; sleep loop`.

- Replace usage of dbus message emitting from gdbus to dbus-send. The
  test was failing on ubuntu which doesn't have gdbus installed.
  dbus-send is installed on COS and Ubuntu, so use it instead.

- Replace check of pod phase with the test util function `PodRunningReady`
  which checks both phase as well as pod ready condition.

- Add some more verbose logging to ease future debugging.
2021-03-04 12:16:47 -08:00
Kubernetes Prow Robot
3cab9f5d74
Merge pull request #99584 from cynepco3hahue/e2e_fix_hugepages_tests
e2e node: fix hugepages test flakiness
2021-03-04 11:00:48 -08:00
Urvashi Mohnani
ca99aa587d Move CRIContainerLogRotation to GA
Graduate the CRIContainerLogRotation feature gate
from beta to GA.

Signed-off-by: Urvashi Mohnani <umohnani@redhat.com>
2021-03-04 09:40:02 -05:00
wojtekt
3b438eb4e5 Cleanup test/e2e/common directory - part2 2021-03-03 10:27:35 +01:00
Artyom Lukianov
ef3e0fd02f e2e node: wait for kubelet health check to pass after kubelet restart
Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-03-02 14:48:03 +02:00
Artyom Lukianov
a6b4868b8d e2e node: stop kubelet service instead of restarting it
The server service monitors the kubelet service and restart it
once the service is down, to avoid kubelet double restarting
we will stop the kubelet service and wait until the kubelet will be
restarted and the node will be ready.

Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-03-02 14:48:03 +02:00
Kubernetes Prow Robot
4eca18b6b0
Merge pull request #99447 from lala123912/log-sanitization
typo fixed for terminatation
2021-03-01 19:23:58 -08:00
Benjamin Elder
56e092e382 hack/update-bazel.sh 2021-02-28 15:17:29 -08:00
lala123912
652f8fd3e9 typo fixed 2021-02-25 17:52:17 +08:00
Kubernetes Prow Robot
6235ad8c74
Merge pull request #98373 from cynepco3hahue/e2e_fix_cpus_assumptions
e2e: fix CPU manager methods to be more flexible to different CPU topology
2021-02-24 11:11:03 -08:00
Kubernetes Prow Robot
bd902db13d
Merge pull request #98342 from cynepco3hahue/e2e_move_delete_state_file_to_after_each
e2e: move deleteState file to the AfterEach
2021-02-24 11:10:50 -08:00
wojtekt
a74737eb03 Mark remaining e2e_node tests with [sig-*] label 2021-02-23 20:11:09 +01:00
Kubernetes Prow Robot
89b9cf7d6f
Merge pull request #98658 from wzshiming/add-e2e-for-graceful-node-shutdown
Add GracefulNodeShutdown e2e test
2021-02-21 06:45:41 -08:00
wzshiming
e367d2fe0c Add GracefulNodeShutdown e2e test 2021-02-18 14:46:49 +08:00
Nikhita Raghunath
c3c45b9b8c *: move balajismaniam to emeritus_approvers 2021-02-16 10:55:47 +05:30
Artyom Lukianov
932134034c memory manager: update API constant to have camel case format
Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-02-09 01:10:02 +02:00
Artyom Lukianov
e8ea461bfd memory manager: update all relevant part of code to use []MemoryReservation
Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-02-09 01:10:01 +02:00
Artyom Lukianov
74eeef2a0a memory manager: provide additional e2e tests
With the memory manager static policy:

- start multiple guaranteed pods and verify that pods succeeded to start
- start workload pod on each NUMA node to load the memory and start the
  pod that requested more memory than each NUMA node have, the pod should fail
  to start with the admission error, because no single NUMA node has enough
  memory to start the pod and also each NUMA node already used for single
  NUMA node allocation
  The test requires at least two NUMA nodes

Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-02-09 01:10:00 +02:00
Pawel Rapacz
606fea29f5 memory manager: add e2e test to run guaranteed pod with init containers
Signed-off-by: Pawel Rapacz <p.rapacz@partner.samsung.com>
2021-02-09 01:10:00 +02:00
Artyom Lukianov
f3d4ac2f7e memory manager: add basice e2e tests
Provides basic tests e2e to verify that pod succeeds
to start with MemoryManager enabled.

Verifies both MemoryManager policies and when the node has
multiple NUMA nodes it will verify the memory pinning.

Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-02-09 01:10:00 +02:00
Kubernetes Prow Robot
ee2b7a5049
Merge pull request #98049 from pacoxu/enhancements/redir-links
fix all keps links 404 for kep folder migration
2021-02-01 21:48:28 -08:00
pacoxu
a10bdfed09 fix all keps links 404 for kep folder migration
Signed-off-by: pacoxu <paco.xu@daocloud.io>
2021-02-01 19:41:59 +08:00
Ernest Wong
40713f7666
test images: use recently promoted images
Signed-off-by: Ernest Wong <chuwon@microsoft.com>
2021-01-28 10:09:00 -08:00
Artyom Lukianov
97ac255513 e2e: move deleteState file to the AfterEach
Under the CPU manager and topology manager e2e tests possible the situation
when one of steps under the test will fail and it will not clean the CPU manager
state file. Move the deletion of the state file to `AfterEach` to guarantee that
the state file will be always removed from the node.

Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-01-26 20:34:17 +02:00
Artyom Lukianov
d2c3f52532 e2e: fix CPU manager methods to be more flexible to different CPU topology
- fix the issue when the test runs on the node with the single CPU
- fix the issue when the CPU topology has only one core per socket, it can
  be easily reproduced by configuring VM with multi NUMA, but when each socket
  has only one core

Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-01-25 18:02:38 +02:00
Ikko Ashimine
5155decbbf
Fix typo in device_plugin_test.go
assignement -> assignment
2021-01-24 17:42:34 +09:00
hasheddan
40ca31f524
Default cluster dns-domain to empty string in e2e_node suite
Changes default cluster DNS domain to empty string to align with the
default kubelet configuration value.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2021-01-13 10:53:11 -06:00
Kubernetes Prow Robot
cbcb26e7a5
Merge pull request #97653 from heqg/unused-kubeletPort
remove unused constant of kubeletPort in e2e test
2021-01-12 11:56:35 -08:00
Kubernetes Prow Robot
e414d4e5c2
Merge pull request #96315 from twosigma/dnsdomainteste2e
Ensuring default DNS cluster-domain of Kubelet is cluster.local
2021-01-12 10:28:35 -08:00
he.qingguo
ced98afe56 remove unused constant of kubeletPort in e2e test
Signed-off-by: he.qingguo <he.qingguo@zte.com.cn>
2021-01-02 12:16:07 +08:00
Kubernetes Prow Robot
34baf6aeb7
Merge pull request #96666 from harche/fix_typo
Remove the typo in the logs while configuring firewall for node e2e
2020-12-08 18:35:13 -08:00
Kubernetes Prow Robot
1fe25e9f73
Merge pull request #96637 from hasheddan/node-local-env
Update comment to not indicate check is run for docker daemon
2020-12-08 18:34:18 -08:00
Kubernetes Prow Robot
dcc863a8b3
Merge pull request #96219 from fromanirh/tm-e2e-sriovdp-usage
node: e2e: bring up/down SRIOV DP just once
2020-12-08 16:29:53 -08:00
Kubernetes Prow Robot
5cdc3e6e20
Merge pull request #95613 from MHBauer/CAP_NET_RAW
CAP_NET_RAW should be NET_RAW
2020-12-08 16:29:28 -08:00
Harshal Patil
b76abcd243 Remove the typo in the logs while configuring firewall for node e2e
Signed-off-by: Harshal Patil <harpatil@redhat.com>
2020-11-18 14:20:59 +05:30
Kubernetes Prow Robot
36d12390a7
Merge pull request #95906 from harche/iptables_fix
Verify iptable rules are applied for tcp, udp and icmp
2020-11-17 22:08:04 -08:00
hasheddan
d2f7cea12a
Update comment to not indicate check is run for docker daemon
Removes comment from daemons function that previously indicated that a
check was being run to make sure docker daemon was running.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2020-11-17 08:31:26 -06:00
Kubernetes Prow Robot
d20e3246ba
Merge pull request #96574 from hasheddan/cleanup-resource-tests
Fix test name in e2e resource metrics api test
2020-11-14 05:09:04 -08:00