Commit Graph

2807 Commits

Author SHA1 Message Date
Patrick Ohly
bde9b64cdf DRA: remove "source" indirection from v1 Pod API
This makes the API nicer:

    resourceClaims:
    - name: with-template
      resourceClaimTemplateName: test-inline-claim-template
    - name: with-claim
      resourceClaimName: test-shared-claim

Previously, this was:

    resourceClaims:
    - name: with-template
      source:
        resourceClaimTemplateName: test-inline-claim-template
    - name: with-claim
      source:
        resourceClaimName: test-shared-claim

A more long-term benefit is that other, future alternatives
might not make sense under the "source" umbrella.

This is a breaking change. It's justified because DRA is still
alpha and will have several other API breaks in 1.31.
2024-06-27 17:53:24 +02:00
Kubernetes Prow Robot
0c955f7cbb Merge pull request #124617 from bart0sh/PR144-e2e_node-DRA-test-plugin-failures
e2e_node: DRA: test plugin failures
2024-06-18 01:14:19 -07:00
Francesco Romani
086a500d8f e2e: node: use test owner tracking
e2e_node test depend on very specific shared state (node state).
Pod leakages between tests oftentimes cause the test preconditions
to be silently corrupted, causing hard to debug CI failures.

Use the new facility to annotate pods with test owner (= the
test code which created the test) to help debug these failures.

For more context, please check the conversation in #123468

Signed-off-by: Francesco Romani <fromani@redhat.com>
2024-06-14 13:42:14 +02:00
Ed Bartosh
ee0340a828 e2e_node: add tests for 2 Kubelet plugins 2024-06-07 22:53:35 +03:00
Ed Bartosh
ce6faef8d8 e2e_node: change DRA test APIs to work with multiple plugins 2024-06-07 22:53:31 +03:00
Ed Bartosh
118158d8df e2e_node: DRA: test plugin failures 2024-06-07 22:51:53 +03:00
Ed Bartosh
ffc407b4dd e2e_node: DRA: reimplement call blocking 2024-06-07 22:47:20 +03:00
Matthias Bertschy
367e8c5578 ignore starting order in RunTogether, add another that does
Signed-off-by: Matthias Bertschy <matthias.bertschy@gmail.com>
2024-06-02 22:00:40 +02:00
Humble Chirammal
d87bfc8392 update ip-masq-agent and npd to latest versions in cluster addons
Signed-off-by: Humble Chirammal <humble.devassy@gmail.com>
2024-05-30 09:44:02 +05:30
Kubernetes Prow Robot
4c0afd74c3 Merge pull request #125113 from bzsuni/bz/build/cni
update cni from 1.4.1 to 1.5.0
2024-05-29 13:06:08 -07:00
bzsuni
61a35c2186 update cni from 1.4.1 to 1.5.0
Signed-off-by: bzsuni <bingzhe.sun@daocloud.io>
2024-05-25 10:33:52 +08:00
Ed Bartosh
d6c78f853a e2e_node: add deferPodDeletion parameter 2024-05-25 01:02:31 +03:00
Ed Bartosh
f609aa8310 e2e: test-driver: add new matchers 2024-05-25 01:02:25 +03:00
Kubernetes Prow Robot
ec93d3b71a Merge pull request #123160 from bart0sh/PR133-e2e-node-device-plugin-fix-features
e2e_node: fix DevicePlugin feature flags
2024-05-24 09:23:10 -07:00
Kubernetes Prow Robot
028b457cb5 Merge pull request #125090 from gjkim42/deflake-container-lifecycle-e2e-test
e2e: Increase preStop hook delay to deflake the test
2024-05-23 11:58:46 -07:00
Gunju Kim
65b9e560ce e2e: Increase preStop hook delay to deflake the test
This delays the preStop hook to allow sufficient time for the readiness
probe to be executed.
2024-05-23 21:31:02 +09:00
Kubernetes Prow Robot
5cdab888bb Merge pull request #125027 from haircommander/e2e-swap-v1
e2e_node: skip swap tests on cgroupv1
2024-05-22 13:09:42 -07:00
Kubernetes Prow Robot
790d65b28d Merge pull request #125006 from matthyx/fixoom
increase memory limit for e2e test containers
2024-05-21 15:48:15 -07:00
Peter Hunt
ba80f252bf e2e_node: skip swap tests on cgroupv1
Signed-off-by: Peter Hunt <pehunt@redhat.com>
2024-05-21 11:39:46 -04:00
Itamar Holder
e9b1a5e185 Expect NoSwap on NodeConformance, test LimitedSwap only in serial tests
Signed-off-by: Itamar Holder <iholder@redhat.com>
2024-05-21 11:19:54 +03:00
Itamar Holder
ab5f84e8ed Refactor: Better pod image, read cgroup file from container
- Improve cgroup file read: execute from container instead of host
- Clean unused variables/functions
- BeTrue/BeFalse -> BeTrueBecause/BeFalseBecause
- Use agnhost instread of stress image
- Improve description and fix typo

Signed-off-by: Itamar Holder <iholder@redhat.com>
2024-05-21 11:19:54 +03:00
Itamar Holder
b17050927c Update node conformance to use NoSwap
Signed-off-by: Itamar Holder <iholder@redhat.com>
2024-05-21 11:19:54 +03:00
Itamar Holder
bdeb80a846 Add serial tests
Signed-off-by: Itamar Holder <iholder@redhat.com>
2024-05-21 11:19:54 +03:00
Itamar Holder
2230ed7dc6 Refactor: helper functions and quantity improvement
- Add getSleepingPod() helper function
- Refactor: quantity functions to return resource.quantity instead of int64
- Improve helper functions for memory capacity

Signed-off-by: Itamar Holder <iholder@redhat.com>
2024-05-21 11:19:54 +03:00
Itamar Holder
13403e836a Fix swap feature gate check by introduting IsFeatureGateEnabled()
Signed-off-by: Itamar Holder <iholder@redhat.com>
2024-05-21 11:19:54 +03:00
Itamar Holder
959d01cbbf Remove cgroup v1 support for swap tests
Signed-off-by: Itamar Holder <iholder@redhat.com>
2024-05-21 11:19:54 +03:00
Itamar Holder
eb5d647655 Move current test under its own NodeConformance context
Signed-off-by: Itamar Holder <iholder@redhat.com>
2024-05-21 11:19:54 +03:00
Itamar Holder
9f344f23fb Add NodeSwap as a node feature in nodefeature.go
Also, Remove wrong documentation about
tempSetCurrentKubeletConfig() returning bool

Signed-off-by: Itamar Holder <iholder@redhat.com>
2024-05-21 11:19:54 +03:00
Matthias Bertschy
893ab296d7 increase memory limit for e2e test containers
Signed-off-by: Matthias Bertschy <matthias.bertschy@gmail.com>
2024-05-20 22:40:03 +02:00
Kubernetes Prow Robot
06b813fd29 Merge pull request #124634 from saschagrunert/cri-staging-code
Move `pkg/kubelet/cri/remote` to `cri-client`
2024-05-16 07:33:06 -07:00
Maciej Szulik
ceb0387e39 Update PodSecurityLevel used during tests 2024-05-15 16:50:30 +02:00
Sascha Grunert
2aa9e76be1 Move pkg/kubelet/cri/remote to cri-client
Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2024-05-14 10:58:18 +02:00
Kubernetes Prow Robot
63e85caac5 Merge pull request #124741 from saschagrunert/remote-runtime-logging
Make remote runtime and image service logging independent
2024-05-13 08:52:32 -07:00
Sascha Grunert
9c712466f6 Make remote runtime and image service logging independent
It's now possible to pass around the `*klog.Logger` which can also be
`nil` to disable logging at all.

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2024-05-08 10:32:21 +02:00
Matthias Bertschy
18f9a08463 add coverage tests for probes behavior
Signed-off-by: Matthias Bertschy <matthias.bertschy@gmail.com>
2024-05-07 07:18:27 +02:00
Kubernetes Prow Robot
65f8129e51 Merge pull request #124668 from bart0sh/PR143-e2e-node-fix-containers-lifecycle
node_e2e: refactor RunTogether function
2024-05-06 15:11:46 -07:00
Ed Bartosh
ba7a74a0be e2e_node: fix DevicePlugin feature flags
Feature:DevicePluginProbe and NodeFeature:DevicePluginProbe
are not used by any of the test-infra jobs.

This commit renames NodeFeature:DevicePluginProbe to NodeFeature:DevicePlugin
and removes Feature:DevicePlugin and Feature:DeviceManager to avoid
having both Feature and NodeFeature tags for the same feature.

NOTE: Test-infra SIG-Node jobs should focus on the
NodeFeature:DevicePlugin to run generic Device Plugins tests.
2024-05-05 23:19:50 +03:00
Kubernetes Prow Robot
de1674829c Merge pull request #123886 from adrianreber/2024-03-12-criu-not-found
Handle containerd "CRIU not found" error message
2024-05-04 06:54:28 -07:00
Ed Bartosh
6ecf0da1a5 node_e2e: refactor RunTogether function 2024-05-02 13:41:47 +03:00
Matthias Bertschy
f7ea5f3fe1 e2e lifecycle: increase delay for restartable init containers
Signed-off-by: Matthias Bertschy <matthias.bertschy@gmail.com>
2024-05-01 22:12:04 +02:00
Matthias Bertschy
8833b4def0 e2e lifecycle: fix finishing -> exiting
Signed-off-by: Matthias Bertschy <matthias.bertschy@gmail.com>
2024-05-01 18:27:13 +02:00
Matthias Bertschy
851d149a88 e2e lifecycle: use millisecond resolution for logs
Signed-off-by: Matthias Bertschy <matthias.bertschy@gmail.com>
2024-05-01 18:27:10 +02:00
Kubernetes Prow Robot
d0fddf143b Merge pull request #122148 from pohly/controllers-context-support
controllers + apiserver: enhance context support
2024-04-30 01:30:09 -07:00
Kubernetes Prow Robot
1fd835ce59 Merge pull request #123398 from ffromani/remove-legacy-checkpoint
node: devicemgr: remove obsolete pre-1.20 checkpoint file support
2024-04-29 14:46:53 -07:00
Patrick Ohly
b92273a760 apiserver + controllers: enhance context support
27a68aee3a introduced context support for events. Creating an event
broadcaster with context makes tests more resilient against leaking goroutines
when that context gets canceled at the end of a test and enables per-test
output via ktesting.

The context could get passed to the constructor. A cleaner solution is to
enhance context support for the apiserver and then pass the context into the
controller's run method. This ripples up the call stack to all places which
start an apiserver.
2024-04-29 20:59:21 +02:00
Ed Bartosh
e4c6adacf0 Revert "add coverage tests for probes behavior"
This reverts commit 9be9832184.
2024-04-24 20:56:46 +03:00
Kubernetes Prow Robot
9db6aac7f3 Merge pull request #124086 from matthyx/probes
add coverage tests for probes behavior
2024-04-23 17:02:17 -07:00
Kubernetes Prow Robot
1a4f5a30f0 Merge pull request #124097 from Nordix/esotsal/cpu_manager_test_clean_code
e2e_node: clean cpu_manager test
2024-04-22 18:26:12 -07:00
Kubernetes Prow Robot
11ca079137 Merge pull request #124396 from mimowo/make-sure-traps-are-registered
Make e2e node tests more resiliant by ensuring the SIGTERM trap is registered
2024-04-22 17:25:40 -07:00
Kubernetes Prow Robot
d8f8c7fae0 Merge pull request #124288 from pohly/test-e2e-node-debugger
e2e node: debugger support
2024-04-22 08:43:27 -07:00