Elana Hashman
d2ed3b28b7
Revert "revert Bump DynamicKubeConfig metric deprecation to 1.23 by delta update"
2021-08-06 08:38:56 -07:00
Kubernetes Prow Robot
d4179be611
Merge pull request #104183 from SergeyKanzhelev/SergeyKanzhelev-node-e2e-approver
...
Add SergeyKanzhelev to node e2e test approvers
2021-08-05 20:55:28 -07:00
Kubernetes Prow Robot
4d87be3ec4
Merge pull request #104121 from dims/skip-node-e2e-test-for-recovering-from-ip-leak-with-docker
...
Skip node e2e test for recovering from ip leak with docker/ubuntu
2021-08-05 16:36:46 -07:00
Sergey Kanzhelev
023f6a90db
Add SergeyKanzhelev to node e2e test approvers
2021-08-05 21:32:55 +00:00
Kubernetes Prow Robot
7f231f899b
Merge pull request #103883 from ehashman/slow-e2es
...
Mark "update Node.Spec.ConfigSource" node e2es as slow
2021-08-05 14:10:37 -07:00
Kubernetes Prow Robot
01cd315f3e
Merge pull request #104106 from ehashman/ehashman-node-e2e-owners
...
Add ehashman to node e2e test approvers
2021-08-05 08:18:49 -07:00
Kubernetes Prow Robot
3b84cc9e6b
Merge pull request #104075 from kerthcet/cleanup/revert-dynamickubeconfig-metric
...
revert Bump DynamicKubeConfig metric deprecation to 1.23 by delta update
2021-08-05 08:18:40 -07:00
Davanum Srinivas
9351b57def
Skip node e2e test for recovering from ip leak with docker
...
Signed-off-by: Davanum Srinivas <davanum@gmail.com >
2021-08-05 07:11:07 -04:00
kerthcet
8cf10d9a20
set showHiddenMetricsForVersion=1.22 in dynamicKubeletConfiguration test
...
Signed-off-by: kerthcet <kerthcet@gmail.com >
2021-08-05 01:04:54 +08:00
Elana Hashman
ac076838c8
Add ehashman to node e2e test approvers
...
List of files raised by matthyx in SIG Node during the 2021-08-03
meeting.
2021-08-03 10:48:06 -07:00
Davanum Srinivas
3463c2dfa9
Skip NVidia GPU test in node e2e CI jobs for containerd and other runtimes
...
Signed-off-by: Davanum Srinivas <davanum@gmail.com >
2021-08-03 08:48:44 -04:00
Kubernetes Prow Robot
9ff3b7e744
Merge pull request #104047 from ehashman/fix-node-e2e-logs
...
Log e2e-node kubelet output directly to file
2021-08-02 12:30:19 -07:00
Davanum Srinivas
dab19517e5
Explicitly restart kubelet to stabilize serial-containerd job
...
Signed-off-by: Davanum Srinivas <davanum@gmail.com >
2021-08-02 11:24:11 -04:00
Elana Hashman
a77f4f4c29
Log e2e-node kubelet output directly to file
...
For some reason when we send them to journald, many log lines are
consistently dropped as soon as the PLEG is started.
If we log directly to file, we don't have this problem. As a bonus, if
the tests crash, the kubelet logs will always be available since they
were already written; otherwise we normally wait until the end of the
test run to collect them from journald, meaning that we often end up
with empty logs.
2021-07-30 15:35:42 -07:00
Ryan Phillips
163e4974b6
e2e node server: fix crash in log line
2021-07-30 12:36:00 -05:00
Elana Hashman
59a7cc12c9
Mark failing node serial tests as flaky
...
Tracked in:
- https://github.com/kubernetes/kubernetes/issues/103690
- https://github.com/kubernetes/kubernetes/issues/103691
2021-07-28 10:39:30 -07:00
Elana Hashman
93146048b4
Mark "update Node.Spec.ConfigSource" node e2es as slow
...
- recover to last-known-good ConfigMap.KubeletConfigKey
~12m to run in CI, 13m locally
- non-nil last-known-good to a new non-nil last-known-good
~24m to run in CI
- recover to last-known-good ConfigMap
~12m to run in CI
- state transitions
~8m to run in CI
2021-07-23 12:40:24 -07:00
Nabarun Pal
77afa53f9d
Add e2e testing manifest bundle to e2e_node test suite
...
Ref: https://kubernetes.slack.com/archives/C0BP8PW9G/p1627003199187100?thread_ts=1626988113.184100&cid=C0BP8PW9G
Signed-off-by: Nabarun Pal <pal.nabarun95@gmail.com >
2021-07-23 09:49:33 +05:30
David Porter
3af4fe8c9b
Use pointer gomega comparison for UsageNanoCores
2021-07-22 01:08:36 -07:00
Kubernetes Prow Robot
ac8dca79af
Merge pull request #103566 from wzshiming/fix/e2e-dbus-config-path
...
Fix dbus config path for GracefulNodeShutdown e2e
2021-07-15 12:39:14 -07:00
Kubernetes Prow Robot
4f9bfb39ad
Merge pull request #102169 from odinuge/rlimit-tests
...
Ensure node-e2e-test can open enough files
2021-07-15 10:20:45 -07:00
Kubernetes Prow Robot
b55c980279
Merge pull request #102395 from odinuge/node_container_manager_test_skip_systemd
...
Skip node container manager test on systemd
2021-07-09 13:26:54 -07:00
Kubernetes Prow Robot
617064d732
Merge pull request #101432 from swatisehgal/smtaware
...
node: cpumanager: add options to reject non SMT-aligned workload
2021-07-08 21:04:53 -07:00
Francesco Romani
a2fb8b0039
smtalign: e2e: add tests
...
Add e2e tests to cover the basic flows for the `full-pcpus-only` option:
negative flow to ensure rejection with proper error message, and
positive flow to verify the actual cpu allocation.
Co-authored-by: Swati Sehgal <swsehgal@redhat.com >
Signed-off-by: Francesco Romani <fromani@redhat.com >
2021-07-08 23:15:37 +02:00
Shiming Zhang
5d80665b0a
Fix dbus config path for GracefulNodeShutdown e2e
2021-07-08 10:41:44 +08:00
Sascha Grunert
2d0f99fba1
Fix resource metrics e2e test
...
Signed-off-by: Sascha Grunert <sgrunert@redhat.com >
2021-07-05 11:16:05 +02:00
Kubernetes Prow Robot
62503f254e
Merge pull request #103413 from mgutierrez98/refactor-whitelist-blacklist
...
Refactored files containing whitelist/blacklist to allowlist/denylist…
2021-07-01 18:12:25 -07:00
Kubernetes Prow Robot
062bc359ca
Merge pull request #102444 from sanwishe/resourceStartTime
...
Expose container start time in kubelet /metrics/resource endpoint
2021-07-01 14:27:51 -07:00
mgutierrez98
1cfbb0aa25
remove webhook.go to revert changes to conformance test
2021-07-01 20:24:46 +00:00
Kubernetes Prow Robot
044fd6fdf6
Merge pull request #99829 from palnabarun/migrate-to-go-embed
...
Replace go-bindata with //go:embed
2021-06-30 10:37:03 -07:00
Kubernetes Prow Robot
f2e47502fd
Merge pull request #103076 from wzshiming/fix/flake-gracefulnodeshutdown-dbus
...
Fix the GracefulNodeShutdown e2e test running on dbus that refuses to manually start
2021-06-29 11:19:50 -07:00
Nabarun Pal
bbccf2ecb4
e2e-node: move to embedded test manifests
...
Signed-off-by: Nabarun Pal <pal.nabarun95@gmail.com >
2021-06-29 19:16:49 +05:30
Nabarun Pal
68b334d02b
test: setup embedded file sources for manifests
...
Signed-off-by: Nabarun Pal <pal.nabarun95@gmail.com >
2021-06-29 19:16:46 +05:30
Kubernetes Prow Robot
9866f9364e
Merge pull request #103112 from fromanirh/cpumanager-e2e-fixes
...
e2e: node: remove obsolete AlphaFeature tag
2021-06-28 19:36:39 -07:00
Kubernetes Prow Robot
ee459b8969
Merge pull request #103265 from fromanirh/e2e-node-fix-npd
...
e2e: node: fix npd test failures bumping image
2021-06-28 17:03:50 -07:00
Kubernetes Prow Robot
38f012320f
Merge pull request #101947 from cynepco3hahue/memory_manager_move_to_beta
...
memory manager: move to beta
2021-06-28 15:38:28 -07:00
Francesco Romani
889dcb5b54
e2e: node: fix npd test failures bumping image
...
The PR https://github.com/kubernetes/kubernetes/pull/100041 updated
node-problem-detector to v0.8.7, but unfortunately we didn't update
also the image using in the e2e_node tests.
As result, the tests were failing like
E2eNode Suite: [sig-node] NodeProblemDetector [NodeFeature:NodeProblemDetector] [Serial] SystemLogMonitor should generate node condition and events for corresponding errors
_output/local/go/src/k8s.io/kubernetes/test/e2e_node/node_problem_detector_linux.go:301
Timed out after 60.000s.
Expected success, but got an error:
<*errors.errorString | 0xc0011f2600>: {
s: "expected total number of events was 4, actual events counted was 7\nEvents
This in turn was one of the contributing factors in making the
pull-kubernetes-node-kubelet-serial lane constantly failing.
This patch updates the image used in the tests, fixing the failure.
Signed-off-by: Francesco Romani <fromani@redhat.com >
2021-06-28 16:32:12 +02:00
sanwishe
43f8f58895
add containers starttime metrics for metrics/resource endpoint
...
Signed-off-by: sanwishe <jiang.mingzhi35@zte.com.cn >
2021-06-24 02:53:21 +08:00
Kubernetes Prow Robot
15a60d1a19
Merge pull request #100180 from fromanirh/tm-e2e-fix-wait
...
e2e: TM: wait for SRIOV devices in pod scope tests
2021-06-23 11:42:10 -07:00
Francesco Romani
47615c2020
e2e: node: remove obsolete AlphaFeature tag
...
The CPUManager graduated to beta a while ago (k8s 1.10?)
so let's get rid of the obsolete Alpha tag on its e2e tests.
Signed-off-by: Francesco Romani <fromani@redhat.com >
2021-06-23 12:34:45 +02:00
Kubernetes Prow Robot
af60bebde3
Merge pull request #97028 from knabben/e2e-restart-kubelet
...
Adding restart kubelet flag on e2e test
2021-06-22 21:00:09 -07:00
Kubernetes Prow Robot
2453f07e93
Merge pull request #102396 from odinuge/restart_test
...
Restart test: Kill container runtime with SIGKILL
2021-06-22 13:10:10 -07:00
Artyom Lukianov
d4767ed5eb
memory manager: move to beta
...
Move the memory manager feature to beta.
Signed-off-by: Artyom Lukianov <alukiano@redhat.com >
2021-06-22 20:15:29 +03:00
Artyom Lukianov
681905706d
e2e node: provide tests for memory manager pod resources metrics
...
- verify memory manager data returned by `GetAllocatableResources`
- verify pod container memory manager data
Signed-off-by: Artyom Lukianov <alukiano@redhat.com >
2021-06-22 13:06:32 +03:00
Shiming Zhang
3daef0a534
Allows manual restart of dbus to work in Ubuntu.
2021-06-22 15:59:30 +08:00
Davanum Srinivas
7fcdbbef06
Switch to github.com/coreos/go-systemd/v22 and drop older package
...
- We use the new v22 module released on May 10
- We drop the unmaintained `github.com/coreos/pkg`
Signed-off-by: Davanum Srinivas <davanum@gmail.com >
2021-06-16 11:14:16 -04:00
Kubernetes Prow Robot
fa152d25d8
Merge pull request #102209 from odinuge/node-e2e-fix
...
Ignore first SIGINT in node-e2e tests
2021-06-15 11:31:23 -07:00
Kubernetes Prow Robot
4e7fc6df63
Merge pull request #100369 from wzshiming/fix/restart-dbus-for-graceful-node-shutdown
...
After DBus restarts, make GracefulNodeShutdown work again
2021-06-14 20:50:00 -07:00
Kubernetes Prow Robot
94707017e1
Merge pull request #102773 from bart0sh/PR0097-run_remote-report-error
...
run_remote: improve error reporting
2021-06-14 19:00:25 -07:00
Ed Bartosh
89284a1ba7
run_remote: improve error reporting
...
Included more info to the error message.
2021-06-10 14:34:05 +03:00