kubernetes

Author	SHA1	Message	Date
Kubernetes Prow Robot	b27670dfbd	Merge pull request #118740 from saschagrunert/kubelet-label-types Make kubelet label types public	2023-09-06 23:46:57 -07:00
Francesco Romani	2ea47038b9	podresources: e2e: force eager connection Add and use more facilities to the internal podresources client. Checking e2e test runs, we have quite some ``` rpc error: code = Unavailable desc = connection error: desc = "transport: Error while dialing: dial unix /var/lib/kubelet/pod-resources/kubelet.sock: connect: connection refused": rpc error: code = Unavailable desc = connection error: desc = "transport: Error while dialing: dial unix /var/lib/kubelet/pod-resources/kubelet.sock: connect: connection refused" ``` This is likely caused by kubelet restarts, which we do plenty in e2e tests, combined with the fact gRPC does lazy connection AND we don't really check the errors in client code - we just bubble them up. While it's arguably bad we don't check properly error codes, it's also true that in the main case, e2e tests, the functions should just never fail besides few well known cases, we're connecting over a super-reliable unix domain socket after all. So, we centralize the fix adding a function (alongside with minor cleanups) which wants to trigger and ensure the connection happens, localizing the changes just here. The main advantage is this approach is opt-in, composable, and doesn't leak gRPC details into the client code. Signed-off-by: Francesco Romani <fromani@redhat.com>	2023-09-07 08:24:49 +02:00
Todd Neal	94afd6e3a4	skip the reason check for OOM tests if it will fail This is currently flaking badly due to a race between cgroup deletion and the runtime detecting the OOM kill.	2023-09-06 12:20:02 -05:00
Gunju Kim	b468e4eb1c	e2e_node: Assign enough time to finish the postStart hook This deflakes the "Containers Lifecycle should not launch second container before PostStart of the first container completed" test by assigning enough time to finish the postStart hook.	2023-09-07 00:42:54 +09:00
Kubernetes Prow Robot	56cc5e77a1	Merge pull request #120441 from tzneal/revert-npd-update Revert "bump npd to v0.8.14"	2023-09-06 06:39:04 -07:00
Kubernetes Prow Robot	debe30de70	Merge pull request #120281 from gjkim42/feature-gate-sidecar-containers-in-kuberuntime Feature-gate SidecarContainers code in pkg/kubelet/kuberuntime	2023-09-05 18:34:54 -07:00
Todd Neal	355ae44a3c	Revert "bump npd to v0.8.14" This reverts commit `7b44d73f73`.	2023-09-05 20:28:53 -05:00
jinye	a774887262	cleanup:e2e:stop using deprecated framework.ExpectNotEqual	2023-09-05 18:16:57 +08:00
RuquanZhao	bfc3c2110f	e2e-node: fix TopologyManager test jobs. Signed-off-by: Ruquan Zhao <ruquan.zhao@arm.com>	2023-09-01 17:53:16 +08:00
wen.rui	3d9b5d0577	e2e_node:stop using deprecated framework.ExpectError	2023-09-01 17:42:36 +08:00
Kubernetes Prow Robot	400059d025	Merge pull request #120194 from bzsuni/bz/bump/npd bump npd to v0.8.14	2023-08-31 20:52:30 -07:00
Gunju Kim	63177db32c	Add an e2e test for the pod sandbox changed scenario This adds an e2e test to ensure that a pod should restart its containers in right order after the pod sandbox is changed.	2023-09-01 00:13:47 +09:00
Todd Neal	ede524e1a6	fix a pidpressure test flake With the new busybox, ash has a built-in sleep command. Prior to this change we were creating half the pids expected since `sleep` wasn't actually launching a new binary. Use the full path to /bin/sleep which avoids the built-in and actually launches a new process.	2023-08-30 22:44:36 -05:00
bzsuni	7b44d73f73	bump npd to v0.8.14 Signed-off-by: bzsuni <bingzhe.sun@daocloud.io>	2023-08-30 19:03:33 +08:00
Fan Shang Xiang	8d9517318a	Extend npd e2e timeout to fix npd e2e error	2023-08-29 17:22:28 +08:00
Kubernetes Prow Robot	232d343d58	Merge pull request #119969 from saschagrunert/cni-plugins Update CNI plugins to v1.3.0	2023-08-23 12:41:57 -07:00
Dixita Narang	d2dbc583a0	Adding coverage for OOM Kill scenario due to node allocatable memory limits, when pod level memory limits are not set	2023-08-22 00:45:17 +00:00
Davanum Srinivas	3e9a4c15a8	Restrict what imports get into code within test/e2e_node Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2023-08-21 15:04:23 -04:00
Kubernetes Prow Robot	4dee8398ae	Merge pull request #120078 from tzneal/investigate-test-failure expect the new resource_scape_error metric	2023-08-21 04:13:34 -07:00
Todd Neal	b8512cfe24	expect the new resource_scape_error metric	2023-08-20 14:17:54 -05:00
Todd Neal	905f07f1ac	Revert "mark the OOM killer as serial to reduce flakes" This reverts commit `bd6f548746`. Running as serial didn't completely eliminate the flake so I think there's something more going on here. Reverting the change to serial since its not a solution.	2023-08-20 13:38:07 -05:00
Todd Neal	bd6f548746	mark the OOM killer as serial to reduce flakes In testing I could only reproduce the flake by running stress-ng to load the CPU. Running it as serial should reduce and hopefully eliminate the flakiness.	2023-08-18 13:18:50 -05:00
Todd Neal	577197559a	remove the legacy test dependency This removes the import which added a bunch of apparently old failing tests.	2023-08-17 12:54:20 -05:00
Sascha Grunert	7933368460	Update CNI plugins to v1.3.0 Signed-off-by: Sascha Grunert <sgrunert@redhat.com>	2023-08-17 09:50:53 +02:00
Kubernetes Prow Robot	4d166947cf	Merge pull request #119097 from pacoxu/fix-eviction-pid PIDPressure condition is triggered slow on CRI-O with large PID pressure/heavy load	2023-08-16 16:36:19 -07:00
Kubernetes Prow Robot	88d14edc26	Merge pull request #119197 from saschagrunert/stop-container-runtime-err Check dbus error on container runtime start/stop	2023-08-16 15:27:52 -07:00
Kubernetes Prow Robot	b1e35d5616	Merge pull request #119974 from tzneal/bump-busybox-test-version bump the busybox test version to resolve test failures	2023-08-16 12:44:13 -07:00
Kubernetes Prow Robot	dd44792cec	Merge pull request #119880 from saschagrunert/seccomp-filter Make seccomp status checks in e2e tests more robust	2023-08-16 12:43:54 -07:00
Todd Neal	b75c5d33e5	bump the busybox test version to resolve test failures - bump busybox version - specify the path to /bin/sleep to avoid calling a new shell builtin	2023-08-16 08:50:20 -05:00
Kubernetes Prow Robot	c41c448b80	Merge pull request #119890 from tzneal/containers-lifecycle-flake crio: increase test buffer to eliminate test flakes	2023-08-15 23:13:45 -07:00
Kubernetes Prow Robot	061ae8a68b	Merge pull request #119765 from tzneal/detect-nfsv3-and-change-mount-path fix mirror pod nfs test failure due to differing NFS versions	2023-08-15 23:12:44 -07:00
Kubernetes Prow Robot	3111fee8bf	Merge pull request #119670 from lengrongfu/fix/oomkill-multi-target-container fix OOM killer	2023-08-15 19:43:40 -07:00
Kubernetes Prow Robot	3525255622	Merge pull request #119212 from CoderSherlock/master Added oomkill test for init container and fix typos	2023-08-15 15:17:48 -07:00
Todd Neal	e258228e4a	use a buffer equivalent to grace period to eliminate test flakes This modifies the test to wait up to 2x the grace period for the pod to be removed.	2023-08-11 14:08:11 -05:00
Todd Neal	717c149a73	fix mirror pod nfs test failure due to differing NFS versions /exports *(rw,fsid=0,insecure,no_root_squash) can be mounted as `/exports` using NFSv3 and `/` using NFSv4 Mount as '/', since clients that support both can try both.	2023-08-11 07:27:05 -05:00
Sascha Grunert	8ab6bee676	Make seccomp status checks in e2e tests more robust The tests have been introduced in `ca7be7dc6d` and checked for `ecc` in `/proc/self/status` since its creation. We got a new field `Seccomp_filters:` with the Linux commit `c818c03b66`, means that `ecc` would now match both and interfere with possible test results depending on the host. The field `Seccomp:` got introduced in `2f4b3bf6b2` and has never changed since then, means we can use it directly to make the tests more strict. Refers to https://github.com/kubernetes-sigs/cri-tools/pull/1236 Signed-off-by: Sascha Grunert <sgrunert@redhat.com>	2023-08-10 09:51:03 +02:00
lengrongfu	c23cee1be3	fix OOM killer Signed-off-by: lengrongfu <rongfu.leng@daocloud.io>	2023-07-30 11:16:12 +08:00
Davanum Srinivas	b4ef4015a2	Avoid pulling mounter.tar through the CDN Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2023-07-28 22:15:55 -04:00
upodroid	a65d207507	calculate the correct machine-type	2023-07-26 23:10:06 +00:00
upodroid	7d13c9b096	set map to nil if an empty string is passed	2023-07-26 10:32:27 +03:00
Talor Itzhak	3964f71fe0	e2e:podresources: verify count for terminal pods PodResourcesAPI reports in the List call about resources of pods in terminal phase. The internal managers reassign resources assigned to pods in terminal phase, so podresources should ignore them. Whether this behavior intended or not (the docs are not unequivocal) this e2e test demonstrates and verifies the mentioned above. Signed-off-by: Talor Itzhak <titzhak@redhat.com>	2023-07-23 12:46:41 +03:00
upodroid	1c99f9591b	add node-env and instance-type flags to node-e2e tests	2023-07-21 21:46:37 +00:00
Gunju Kim	e0a6eb93a1	node_e2e: Fix createStaticSystemNodeCriticalPod's invalid spec This fixes `createStaticSystemNodeCriticalPod` to set pod's restartPolicy instead of container's restartPolicy.	2023-07-20 20:18:05 +09:00
Itamar Holder	ee82654e39	Add pod_swap_usage_bytes as an expected metric in e2e test Use haveKeys() matcher from previous commit to ensure required keys exist. Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-19 14:44:05 +03:00
Itamar Holder	81abfca407	Add a haveKeys() helper function to match multiple keys Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-19 14:44:04 +03:00
Kubernetes Prow Robot	b4d793c450	Merge pull request #118865 from iholder101/kubelet/add-swap-to-summary-stats Add swap to stats to Summary API and Prometheus endpoints (`/stats/summary` and `/metrics/resource`)	2023-07-17 19:49:18 -07:00
Kubernetes Prow Robot	da2fdf8cc3	Merge pull request #118764 from iholder101/Swap/burstableQoS-impl Add full cgroup v2 swap support with automatically calculated swap limit for LimitedSwap and Burstable QoS Pods	2023-07-17 19:49:07 -07:00
Kubernetes Prow Robot	d17f3ba2cf	Merge pull request #119168 from gjkim42/sidecar-allow-probes-and-lifecycle-hooks Allow all probes and lifecycle for restartable init containers	2023-07-17 18:11:07 -07:00
Itamar Holder	4cb5547f93	Adjust summary API e2e test Signed-off-by: Itamar Holder <iholder@redhat.com>	2023-07-18 02:55:56 +03:00
Gunju Kim	3bf282652f	Allow restartable init containers to have lifecycle	2023-07-18 08:12:24 +09:00

... 4 5 6 7 8 ...

2847 Commits