kubernetes

Author	SHA1	Message	Date
Kubernetes Prow Robot	9ff3b7e744	Merge pull request #104047 from ehashman/fix-node-e2e-logs Log e2e-node kubelet output directly to file	2021-08-02 12:30:19 -07:00
Davanum Srinivas	dab19517e5	Explicitly restart kubelet to stabilize serial-containerd job Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2021-08-02 11:24:11 -04:00
Elana Hashman	a77f4f4c29	Log e2e-node kubelet output directly to file For some reason when we send them to journald, many log lines are consistently dropped as soon as the PLEG is started. If we log directly to file, we don't have this problem. As a bonus, if the tests crash, the kubelet logs will always be available since they were already written; otherwise we normally wait until the end of the test run to collect them from journald, meaning that we often end up with empty logs.	2021-07-30 15:35:42 -07:00
Ryan Phillips	163e4974b6	e2e node server: fix crash in log line	2021-07-30 12:36:00 -05:00
Elana Hashman	59a7cc12c9	Mark failing node serial tests as flaky Tracked in: - https://github.com/kubernetes/kubernetes/issues/103690 - https://github.com/kubernetes/kubernetes/issues/103691	2021-07-28 10:39:30 -07:00
Nabarun Pal	77afa53f9d	Add e2e testing manifest bundle to e2e_node test suite Ref: https://kubernetes.slack.com/archives/C0BP8PW9G/p1627003199187100?thread_ts=1626988113.184100&cid=C0BP8PW9G Signed-off-by: Nabarun Pal <pal.nabarun95@gmail.com>	2021-07-23 09:49:33 +05:30
David Porter	3af4fe8c9b	Use pointer gomega comparison for UsageNanoCores	2021-07-22 01:08:36 -07:00
Kubernetes Prow Robot	ac8dca79af	Merge pull request #103566 from wzshiming/fix/e2e-dbus-config-path Fix dbus config path for GracefulNodeShutdown e2e	2021-07-15 12:39:14 -07:00
Kubernetes Prow Robot	4f9bfb39ad	Merge pull request #102169 from odinuge/rlimit-tests Ensure node-e2e-test can open enough files	2021-07-15 10:20:45 -07:00
Kubernetes Prow Robot	b55c980279	Merge pull request #102395 from odinuge/node_container_manager_test_skip_systemd Skip node container manager test on systemd	2021-07-09 13:26:54 -07:00
Kubernetes Prow Robot	617064d732	Merge pull request #101432 from swatisehgal/smtaware node: cpumanager: add options to reject non SMT-aligned workload	2021-07-08 21:04:53 -07:00
Francesco Romani	a2fb8b0039	smtalign: e2e: add tests Add e2e tests to cover the basic flows for the `full-pcpus-only` option: negative flow to ensure rejection with proper error message, and positive flow to verify the actual cpu allocation. Co-authored-by: Swati Sehgal <swsehgal@redhat.com> Signed-off-by: Francesco Romani <fromani@redhat.com>	2021-07-08 23:15:37 +02:00
Shiming Zhang	5d80665b0a	Fix dbus config path for GracefulNodeShutdown e2e	2021-07-08 10:41:44 +08:00
Sascha Grunert	2d0f99fba1	Fix resource metrics e2e test Signed-off-by: Sascha Grunert <sgrunert@redhat.com>	2021-07-05 11:16:05 +02:00
Kubernetes Prow Robot	62503f254e	Merge pull request #103413 from mgutierrez98/refactor-whitelist-blacklist Refactored files containing whitelist/blacklist to allowlist/denylist…	2021-07-01 18:12:25 -07:00
Kubernetes Prow Robot	062bc359ca	Merge pull request #102444 from sanwishe/resourceStartTime Expose container start time in kubelet /metrics/resource endpoint	2021-07-01 14:27:51 -07:00
mgutierrez98	1cfbb0aa25	remove webhook.go to revert changes to conformance test	2021-07-01 20:24:46 +00:00
Kubernetes Prow Robot	044fd6fdf6	Merge pull request #99829 from palnabarun/migrate-to-go-embed Replace go-bindata with //go:embed	2021-06-30 10:37:03 -07:00
Kubernetes Prow Robot	f2e47502fd	Merge pull request #103076 from wzshiming/fix/flake-gracefulnodeshutdown-dbus Fix the GracefulNodeShutdown e2e test running on dbus that refuses to manually start	2021-06-29 11:19:50 -07:00
Nabarun Pal	bbccf2ecb4	e2e-node: move to embedded test manifests Signed-off-by: Nabarun Pal <pal.nabarun95@gmail.com>	2021-06-29 19:16:49 +05:30
Nabarun Pal	68b334d02b	test: setup embedded file sources for manifests Signed-off-by: Nabarun Pal <pal.nabarun95@gmail.com>	2021-06-29 19:16:46 +05:30
Kubernetes Prow Robot	9866f9364e	Merge pull request #103112 from fromanirh/cpumanager-e2e-fixes e2e: node: remove obsolete AlphaFeature tag	2021-06-28 19:36:39 -07:00
Kubernetes Prow Robot	ee459b8969	Merge pull request #103265 from fromanirh/e2e-node-fix-npd e2e: node: fix npd test failures bumping image	2021-06-28 17:03:50 -07:00
Kubernetes Prow Robot	38f012320f	Merge pull request #101947 from cynepco3hahue/memory_manager_move_to_beta memory manager: move to beta	2021-06-28 15:38:28 -07:00
Francesco Romani	889dcb5b54	e2e: node: fix npd test failures bumping image The PR https://github.com/kubernetes/kubernetes/pull/100041 updated node-problem-detector to v0.8.7, but unfortunately we didn't update also the image using in the e2e_node tests. As result, the tests were failing like E2eNode Suite: [sig-node] NodeProblemDetector [NodeFeature:NodeProblemDetector] [Serial] SystemLogMonitor should generate node condition and events for corresponding errors _output/local/go/src/k8s.io/kubernetes/test/e2e_node/node_problem_detector_linux.go:301 Timed out after 60.000s. Expected success, but got an error: <*errors.errorString \| 0xc0011f2600>: { s: "expected total number of events was 4, actual events counted was 7\nEvents This in turn was one of the contributing factors in making the pull-kubernetes-node-kubelet-serial lane constantly failing. This patch updates the image used in the tests, fixing the failure. Signed-off-by: Francesco Romani <fromani@redhat.com>	2021-06-28 16:32:12 +02:00
sanwishe	43f8f58895	add containers starttime metrics for metrics/resource endpoint Signed-off-by: sanwishe <jiang.mingzhi35@zte.com.cn>	2021-06-24 02:53:21 +08:00
Kubernetes Prow Robot	15a60d1a19	Merge pull request #100180 from fromanirh/tm-e2e-fix-wait e2e: TM: wait for SRIOV devices in pod scope tests	2021-06-23 11:42:10 -07:00
Francesco Romani	47615c2020	e2e: node: remove obsolete AlphaFeature tag The CPUManager graduated to beta a while ago (k8s 1.10?) so let's get rid of the obsolete Alpha tag on its e2e tests. Signed-off-by: Francesco Romani <fromani@redhat.com>	2021-06-23 12:34:45 +02:00
Kubernetes Prow Robot	af60bebde3	Merge pull request #97028 from knabben/e2e-restart-kubelet Adding restart kubelet flag on e2e test	2021-06-22 21:00:09 -07:00
Kubernetes Prow Robot	2453f07e93	Merge pull request #102396 from odinuge/restart_test Restart test: Kill container runtime with SIGKILL	2021-06-22 13:10:10 -07:00
Artyom Lukianov	d4767ed5eb	memory manager: move to beta Move the memory manager feature to beta. Signed-off-by: Artyom Lukianov <alukiano@redhat.com>	2021-06-22 20:15:29 +03:00
Artyom Lukianov	681905706d	e2e node: provide tests for memory manager pod resources metrics - verify memory manager data returned by `GetAllocatableResources` - verify pod container memory manager data Signed-off-by: Artyom Lukianov <alukiano@redhat.com>	2021-06-22 13:06:32 +03:00
Shiming Zhang	3daef0a534	Allows manual restart of dbus to work in Ubuntu.	2021-06-22 15:59:30 +08:00
Davanum Srinivas	7fcdbbef06	Switch to github.com/coreos/go-systemd/v22 and drop older package - We use the new v22 module released on May 10 - We drop the unmaintained `github.com/coreos/pkg` Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2021-06-16 11:14:16 -04:00
Kubernetes Prow Robot	fa152d25d8	Merge pull request #102209 from odinuge/node-e2e-fix Ignore first SIGINT in node-e2e tests	2021-06-15 11:31:23 -07:00
Kubernetes Prow Robot	4e7fc6df63	Merge pull request #100369 from wzshiming/fix/restart-dbus-for-graceful-node-shutdown After DBus restarts, make GracefulNodeShutdown work again	2021-06-14 20:50:00 -07:00
Kubernetes Prow Robot	94707017e1	Merge pull request #102773 from bart0sh/PR0097-run_remote-report-error run_remote: improve error reporting	2021-06-14 19:00:25 -07:00
Ed Bartosh	89284a1ba7	run_remote: improve error reporting Included more info to the error message.	2021-06-10 14:34:05 +03:00
Odin Ugedal	1c8675fc02	Ensure node e2e apiserver and test suite can open enough files The apiserver and test suite in node e2e runs under the sshd daemon that can limit the amount of files it can open. Set a higher limit to address the issues. Signed-off-by: Odin Ugedal <odin@uged.al>	2021-06-10 13:12:03 +02:00
Giuseppe Scrivano	c98306a09e	test: adjust summary test for cgroup v2 on cgroup v2 the reported metric is recursive for the entire and it includes all the sub cgroups. Adjust the test accordingly. Closes: https://github.com/kubernetes/kubernetes/issues/99230 Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2021-06-09 14:04:06 +02:00
Odin Ugedal	c0c9f1f318	Ignore first SIGINT in node-e2e tests Node e2e tests exceeding the global timeout are sent SIGINT, resulting in no artifacts or console output. This will ignore the first SIGINT, and since all children processes are being stopped due to SIGINT, we can clean up before exiting.	2021-06-09 10:12:05 +02:00
Odin Ugedal	d5cb5065c4	Skip node container manager test on systemd	2021-05-28 10:24:24 +02:00
Odin Ugedal	2787e8c18c	Kill container runtime with SIGKILL Make sure to use SIGKILL so that the service is killed in a dirty way. In case container runtime use "Restart=on-abnormal" in systemd, killing with SIGTERM will not restart the service, as the kill looks intentional and clean. This is used by cri-o by default.	2021-05-28 10:16:23 +02:00
Lennart Jern	507710b50f	Update CNI plugins v0.9.1 ref: https://github.com/containernetworking/plugins/releases/tag/v0.9.1 Signed-off-by: Lennart Jern <lennart.jern@est.tech>	2021-05-26 11:02:04 +03:00
Ed Bartosh	38c56883f1	e2e: hugepages: delete test pod after the test Current test assumes that test pod is deleted when the test namespace is deleted. However, namespace deletion is an asynchronous operation. The pod may still be running and allocating hugepages resources when next test case creates another pod that requests the same hugepages resources. This can cause kubelet to fail the test pod with this kind of error: OutOfhugepages-2Mi: Node didn't have enough resource: hugepages-2Mi requested: 6291456, used: 6291456, capacity: 10485760 Explicitly deleting test pod should fix this issue.	2021-05-25 17:09:55 +03:00
Shiming Zhang	990d0949c4	Add test, after restart dbus, should be able to gracefully shutdown	2021-05-19 10:06:06 +08:00
Jordan Liggitt	4b45d0d921	Revert "Merge pull request 101888 from kolyshkin/update-runc-rc94" This reverts commit `b1b06fe0a4`, reversing changes made to `382a33986b`.	2021-05-18 09:13:47 -04:00
Kubernetes Prow Robot	4d4b530114	Merge pull request #101903 from cynepco3hahue/e2e_remote_kernel_args e2e node: make possible to add additional kernel arguments	2021-05-17 13:39:59 -07:00
Kubernetes Prow Robot	b1b06fe0a4	Merge pull request #101888 from kolyshkin/update-runc-rc94 vendor: bump runc to rc94	2021-05-17 09:43:30 -07:00
Kubernetes Prow Robot	f35e587087	Merge pull request #99899 from hasheddan/update-node-e2e-note Update dependencies in local node test runner	2021-05-13 03:54:25 -07:00

1 2 3 4 5 ...

2011 Commits