kubernetes

Author	SHA1	Message	Date
Kubernetes Prow Robot	45636684a4	Merge pull request #112897 from fromanirh/podresources-metrics-e2e-tests register podresources metrics	2022-10-19 13:57:18 -07:00
Kubernetes Prow Robot	42c1f881cc	Merge pull request #113165 from swatisehgal/e2e-deviceplugin-logs node: e2e: device plugins: Add more logs for clarity	2022-10-19 08:59:14 -07:00
Swati Sehgal	ef54dbb5cc	node: e2e: device plugins: Add more logs for clarity The device plugin test in https://testgrid.k8s.io/sig-node-release-blocking#node-kubelet-serial-containerd has been flaky for a while now when it runs on the test infrastructure. Locally running this test resulted in test passing without issues. Based on the existing logs, it is not clear why podresource API endpoint is returning 3 pods rather than the expected two pods (device plugin pod and the test pod requesting devices). For more clarity and debuggability on why an additional pod seems to be appearing we expose the output from podresource API endpoint. Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2022-10-19 13:57:47 +01:00
Kubernetes Prow Robot	e1812683e3	Merge pull request #113042 from swatisehgal/memorymgr-fix-rejection-test node: e2e: memorymgr: Fix test failure	2022-10-17 04:39:07 -07:00
Kubernetes Prow Robot	6f579d3ceb	Merge pull request #111616 from ndixita/credential-api-ga Move the Kubelet Credential Provider feature to GA and Update the Credential Provider API to GA	2022-10-15 07:53:09 -07:00
Swati Sehgal	6c6865af28	node: e2e: memorymgr: Fix test failure The change made in https://github.com/kubernetes/kubernetes/pull/112644 resulted in an update to the rejection message. In the memory manager node e2e test, we still checked against the old expected error message giving the impression that the pod succeeded to run even though it failed as expected mainly because the check wasn't performed correctly. In this patch, we update to the correct rejection message to make sure that the memory manager is no longer failing. NOTE: This test is supposed to run on multi NUMA systems and if the underlying node does not have multi NUMA nodes, the test is skipped which is what happens in upstream test infrastructure as it is mainly composed of single NUMA nodes. Because of this, this test failure wasn't evident via testgrid. Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2022-10-13 12:45:14 +01:00
Francesco Romani	3c60c1a10c	node: e2e: add podresources metrics tests add tests to ensure the podresources metrics are exposed, and basic sanity tests for their values. Signed-off-by: Francesco Romani <fromani@redhat.com>	2022-10-06 15:14:56 +02:00
Patrick Ohly	dfdf88d4fa	e2e: adapt to moved code This is the result of automatically editing source files like this: go install golang.org/x/tools/cmd/goimports@latest find ./test/e2e* -name ".go" \| xargs env PATH=$GOPATH/bin:$PATH ./e2e-framework-sed.sh with e2e-framework-sed.sh containing this: sed -i \ -e "s/$f\\|fr\\|\w\w\.[fF]\w$\.ExecCommandInContainer(/e2epod.ExecCommandInContainer(\1, /" \ -e "s/$f\\|fr\\|\w\w\.[fF]\w$\.ExecCommandInContainerWithFullOutput(/e2epod.ExecCommandInContainerWithFullOutput(\1, /" \ -e "s/$f\\|fr\\|\w\w\.[fF]\w$\.ExecShellInContainer(/e2epod.ExecShellInContainer(\1, /" \ -e "s/$f\\|fr\\|\w\w\.[fF]\w$\.ExecShellInPod(/e2epod.ExecShellInPod(\1, /" \ -e "s/$f\\|fr\\|\w\w\.[fF]\w$\.ExecShellInPodWithFullOutput(/e2epod.ExecShellInPodWithFullOutput(\1, /" \ -e "s/$f\\|fr\\|\w\w\.[fF]\w$\.ExecWithOptions(/e2epod.ExecWithOptions(\1, /" \ -e "s/$f\\|fr\\|\w\w\.[fF]\w$\.MatchContainerOutput(/e2eoutput.MatchContainerOutput(\1, /" \ -e "s/$f\\|fr\\|\w\w\.[fF]\w$\.PodClient(/e2epod.NewPodClient(\1, /" \ -e "s/$f\\|fr\\|\w\w\.[fF]\w$\.PodClientNS(/e2epod.PodClientNS(\1, /" \ -e "s/$f\\|fr\\|\w\w\.[fF]\w$\.TestContainerOutput(/e2eoutput.TestContainerOutput(\1, /" \ -e "s/$f\\|fr\\|\w\w\.[fF]\w*$\.TestContainerOutputRegexp(/e2eoutput.TestContainerOutputRegexp(\1, /" \ -e "s/framework.AddOrUpdateLabelOnNode\b/e2enode.AddOrUpdateLabelOnNode/" \ -e "s/framework.AllNodes\b/e2edebug.AllNodes/" \ -e "s/framework.AllNodesReady\b/e2enode.AllNodesReady/" \ -e "s/framework.ContainerResourceGatherer\b/e2edebug.ContainerResourceGatherer/" \ -e "s/framework.ContainerResourceUsage\b/e2edebug.ContainerResourceUsage/" \ -e "s/framework.CreateEmptyFileOnPod\b/e2eoutput.CreateEmptyFileOnPod/" \ -e "s/framework.DefaultPodDeletionTimeout\b/e2epod.DefaultPodDeletionTimeout/" \ -e "s/framework.DumpAllNamespaceInfo\b/e2edebug.DumpAllNamespaceInfo/" \ -e "s/framework.DumpDebugInfo\b/e2eoutput.DumpDebugInfo/" \ -e 
"s/framework.DumpNodeDebugInfo\b/e2edebug.DumpNodeDebugInfo/" \ -e "s/framework.EtcdUpgrade\b/e2eproviders.EtcdUpgrade/" \ -e "s/framework.EventsLister\b/e2edebug.EventsLister/" \ -e "s/framework.ExecOptions\b/e2epod.ExecOptions/" \ -e "s/framework.ExpectNodeHasLabel\b/e2enode.ExpectNodeHasLabel/" \ -e "s/framework.ExpectNodeHasTaint\b/e2enode.ExpectNodeHasTaint/" \ -e "s/framework.GCEUpgradeScript\b/e2eproviders.GCEUpgradeScript/" \ -e "s/framework.ImagePrePullList\b/e2epod.ImagePrePullList/" \ -e "s/framework.KubectlBuilder\b/e2ekubectl.KubectlBuilder/" \ -e "s/framework.LocationParamGKE\b/e2eproviders.LocationParamGKE/" \ -e "s/framework.LogSizeDataTimeseries\b/e2edebug.LogSizeDataTimeseries/" \ -e "s/framework.LogSizeGatherer\b/e2edebug.LogSizeGatherer/" \ -e "s/framework.LogsSizeData\b/e2edebug.LogsSizeData/" \ -e "s/framework.LogsSizeDataSummary\b/e2edebug.LogsSizeDataSummary/" \ -e "s/framework.LogsSizeVerifier\b/e2edebug.LogsSizeVerifier/" \ -e "s/framework.LookForStringInLog\b/e2eoutput.LookForStringInLog/" \ -e "s/framework.LookForStringInPodExec\b/e2eoutput.LookForStringInPodExec/" \ -e "s/framework.LookForStringInPodExecToContainer\b/e2eoutput.LookForStringInPodExecToContainer/" \ -e "s/framework.MasterAndDNSNodes\b/e2edebug.MasterAndDNSNodes/" \ -e "s/framework.MasterNodes\b/e2edebug.MasterNodes/" \ -e "s/framework.MasterUpgradeGKE\b/e2eproviders.MasterUpgradeGKE/" \ -e "s/framework.NewKubectlCommand\b/e2ekubectl.NewKubectlCommand/" \ -e "s/framework.NewLogsVerifier\b/e2edebug.NewLogsVerifier/" \ -e "s/framework.NewNodeKiller\b/e2enode.NewNodeKiller/" \ -e "s/framework.NewResourceUsageGatherer\b/e2edebug.NewResourceUsageGatherer/" \ -e "s/framework.NodeHasTaint\b/e2enode.NodeHasTaint/" \ -e "s/framework.NodeKiller\b/e2enode.NodeKiller/" \ -e "s/framework.NodesSet\b/e2edebug.NodesSet/" \ -e "s/framework.PodClient\b/e2epod.PodClient/" \ -e "s/framework.RemoveLabelOffNode\b/e2enode.RemoveLabelOffNode/" \ -e 
"s/framework.ResourceConstraint\b/e2edebug.ResourceConstraint/" \ -e "s/framework.ResourceGathererOptions\b/e2edebug.ResourceGathererOptions/" \ -e "s/framework.ResourceUsagePerContainer\b/e2edebug.ResourceUsagePerContainer/" \ -e "s/framework.ResourceUsageSummary\b/e2edebug.ResourceUsageSummary/" \ -e "s/framework.RunHostCmd\b/e2eoutput.RunHostCmd/" \ -e "s/framework.RunHostCmdOrDie\b/e2eoutput.RunHostCmdOrDie/" \ -e "s/framework.RunHostCmdWithFullOutput\b/e2eoutput.RunHostCmdWithFullOutput/" \ -e "s/framework.RunHostCmdWithRetries\b/e2eoutput.RunHostCmdWithRetries/" \ -e "s/framework.RunKubectl\b/e2ekubectl.RunKubectl/" \ -e "s/framework.RunKubectlInput\b/e2ekubectl.RunKubectlInput/" \ -e "s/framework.RunKubectlOrDie\b/e2ekubectl.RunKubectlOrDie/" \ -e "s/framework.RunKubectlOrDieInput\b/e2ekubectl.RunKubectlOrDieInput/" \ -e "s/framework.RunKubectlWithFullOutput\b/e2ekubectl.RunKubectlWithFullOutput/" \ -e "s/framework.RunKubemciCmd\b/e2ekubectl.RunKubemciCmd/" \ -e "s/framework.RunKubemciWithKubeconfig\b/e2ekubectl.RunKubemciWithKubeconfig/" \ -e "s/framework.SingleContainerSummary\b/e2edebug.SingleContainerSummary/" \ -e "s/framework.SingleLogSummary\b/e2edebug.SingleLogSummary/" \ -e "s/framework.TimestampedSize\b/e2edebug.TimestampedSize/" \ -e "s/framework.WaitForAllNodesSchedulable\b/e2enode.WaitForAllNodesSchedulable/" \ -e "s/framework.WaitForSSHTunnels\b/e2enode.WaitForSSHTunnels/" \ -e "s/framework.WorkItem\b/e2edebug.WorkItem/" \ "$@" for i in "$@"; do # Import all sub packages and let goimports figure out which of those # are redundant (= already imported) or not needed. 
sed -i -e '/"k8s.io.kubernetes.test.e2e.framework"/a e2edebug "k8s.io/kubernetes/test/e2e/framework/debug"' "$i" sed -i -e '/"k8s.io.kubernetes.test.e2e.framework"/a e2ekubectl "k8s.io/kubernetes/test/e2e/framework/kubectl"' "$i" sed -i -e '/"k8s.io.kubernetes.test.e2e.framework"/a e2enode "k8s.io/kubernetes/test/e2e/framework/node"' "$i" sed -i -e '/"k8s.io.kubernetes.test.e2e.framework"/a e2eoutput "k8s.io/kubernetes/test/e2e/framework/pod/output"' "$i" sed -i -e '/"k8s.io.kubernetes.test.e2e.framework"/a e2epod "k8s.io/kubernetes/test/e2e/framework/pod"' "$i" sed -i -e '/"k8s.io.kubernetes.test.e2e.framework"/a e2eproviders "k8s.io/kubernetes/test/e2e/framework/providers"' "$i" goimports -w "$i" done	2022-10-06 08:19:47 +02:00
Patrick Ohly	92047da152	e2e: make import blocks consistent	2022-10-06 08:16:47 +02:00
Patrick Ohly	5614a9d064	e2e framework: eliminate interim sub packages The "todo" packages were necessary while moving code around to avoid hitting cyclic dependencies. Now that any sub package can depend on the framework, they are no longer needed and the code can be moved into the normal sub packages.	2022-10-06 08:16:47 +02:00
Patrick Ohly	802451b6ca	e2e framework: move metrics gathering into sub package This reduces the size of the test/e2e/framework itself. Because it does not gather metrics data anymore by default, E2E test suites must set their callbacks function or set the original one by importing "k8s.io/kubernetes/test/e2e/framework/todo/metrics/init".	2022-10-06 08:16:47 +02:00
Patrick Ohly	b8d28cb6c3	e2e framework: move node helper code into sub package This reduces the size of the test/e2e/framework itself. Because it does not check nodes anymore by default, E2E test suites must set their own check function or set the original one by importing "k8s.io/kubernetes/test/e2e/framework/todo/node/init".	2022-10-06 08:16:47 +02:00
Patrick Ohly	c45a924c5e	e2e framework: move dumping of information into sub package This reduces the size of the test/e2e/framework itself. Because it does not dump anything anymore by default, E2E test suites must set their own dump function or set the original one by importing "k8s.io/kubernetes/test/e2e/framework/debug/init".	2022-10-06 08:16:47 +02:00
Kubernetes Prow Robot	98233be715	Merge pull request #112709 from swagatbora90/kubelet-tracing Support otel tracing in cri remote image service	2022-10-04 14:12:00 -07:00
Tim Hockin	70c1c795e8	Remove generated file rules in make This is all covered by update-codegen.sh now. The old `make generated_files` rule still exists, but just prints a warning.	2022-10-04 08:50:30 -07:00
Dixita Narang	d6ab1da1b5	Update test to validate against v1 kubelet APIs	2022-10-03 17:57:25 +00:00
Dixita Narang	a016b06bbd	Update test plugin to use v1 kubelet APIs	2022-10-03 17:57:14 +00:00
Dixita Narang	1ac4fc779b	Update kubelet credential provider tests to use new v1 APIs	2022-09-30 20:51:39 +00:00
Swagat Bora	caa83c25ae	Support otel tracing in cri remote image service Signed-off-by: Swagat Bora <sbora@amazon.com>	2022-09-29 22:15:07 +00:00
Paco Xu	ad3083b51e	LocalStorageCapacityIsolationFSQuotaMonitoring: feature gate typo in e2e node test	2022-09-22 10:06:48 +08:00
Patrick Ohly	41619ace15	stop using deprecated klog flags Some scripts and tools still relied on the deprecated flags, the ones which are about to be removed. This is intentionally not a complete removal of all those flags in the entire repo. This would lead to much more code churn also in places where commands still accept the flags because they use klog directly.	2022-09-04 21:02:43 +02:00
Ryan Phillips	32a90f5f35	Revert "promote LocalStorageCapacityIsolationFSQuotaMonitoring to beta"	2022-08-26 16:25:00 -05:00
Danielle Lancashire	e8442054fe	node_e2e: add a dbus restart test	2022-08-08 16:56:13 +00:00
Kubernetes Prow Robot	d40bc18461	Merge pull request #105126 from sallyom/tracing-kubelet kubelet tracing instrumentation	2022-08-02 11:38:06 -07:00
Kubernetes Prow Robot	2e1a4da8df	Merge pull request #111358 from ddebroy/hasnet1 Introduce PodHasNetwork condition for pods	2022-08-01 15:04:52 -07:00
Sally O'Malley	47e7d8034f	kubelet tracing Signed-off-by: Sally O'Malley <somalley@redhat.com> Co-authored-by: David Ashpole <dashpole@google.com>	2022-08-01 12:55:02 -04:00
Deep Debroy	0ac7cce38a	Node e2e test for pod conditions managed by Kubelet Signed-off-by: Deep Debroy <ddebroy@gmail.com>	2022-08-01 09:52:07 -07:00
Kubernetes Prow Robot	95ed6820ea	Merge pull request #107329 from pacoxu/promote-e2e-quota promote LocalStorageCapacityIsolationFSQuotaMonitoring to beta	2022-07-28 19:35:10 -07:00
Davanum Srinivas	a9593d634c	Generate and format files - Run hack/update-codegen.sh - Run hack/update-generated-device-plugin.sh - Run hack/update-generated-protobuf.sh - Run hack/update-generated-runtime.sh - Run hack/update-generated-swagger-docs.sh - Run hack/update-openapi-spec.sh - Run hack/update-gofmt.sh Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2022-07-26 13:14:05 -04:00
Adrian Reber	92ea6e32b8	Add e2e tests for checkpointing Signed-off-by: Adrian Reber <areber@redhat.com>	2022-07-14 10:27:41 +00:00
Kubernetes Prow Robot	a455c296fd	Merge pull request #111015 from xmcqueen/master Capture the Container Logs for a Flaky Test	2022-07-11 16:58:50 -07:00
Brian McQueen	37d246bac1	capture the container logs on pod error to assist in debugging test failures #109295	2022-07-11 09:46:46 -07:00
Dave Chen	82ac6be0e9	Custom reporter of Junit report is no longer needed Ginkgo is now writing the JUnit file itself. The -report-dir parameter is used as fallback for enabling JUnit output in case that users haven't migrated to the new -junit-report parameter. Co-authored-by: Patrick Ohly <patrick.ohly@intel.com> Signed-off-by: Dave Chen <dave.chen@arm.com>	2022-07-08 10:46:11 +08:00
Dave Chen	5ac8105b86	Set Ginkgo config by the method of `GinkgoConfiguration()` Signed-off-by: Dave Chen <dave.chen@arm.com>	2022-07-08 10:46:11 +08:00
Dave Chen	fd4b5b629b	Stop using the deprecated method `CurrentGinkgoTestDescription` Besides, the using of method might lead to a `concurrent map writes` issue per the discussion here: https://github.com/onsi/ginkgo/issues/970 Signed-off-by: Dave Chen <dave.chen@arm.com>	2022-07-08 10:46:11 +08:00
Dave Chen	857458cfa5	update ginkgo from v1 to v2 and gomega to 1.19.0 - update all the import statements - run hack/pin-dependency.sh to change pinned dependency versions - run hack/update-vendor.sh to update go.mod files and the vendor directory - update the method signatures for custom reporters Signed-off-by: Dave Chen <dave.chen@arm.com>	2022-07-08 10:44:46 +08:00
Kubernetes Prow Robot	2b657a0f3b	Merge pull request #110805 from saschagrunert/seccomp-default-beta Graduate SeccompDefault feature to beta	2022-07-07 17:54:11 -07:00
Kubernetes Prow Robot	91aca10d59	Merge pull request #108958 from 249043822/e2e-density Fix:[Flaky test] create a sequence of pods latency/resource should be within limit when create 10 pods with 50 background pods	2022-06-29 20:18:06 -07:00
Paul S. Schweigert	b6675fce4a	fix link to eviction policy in e2enode eviction test Signed-off-by: Paul S. Schweigert <paulschw@us.ibm.com>	2022-06-29 19:23:49 -04:00
Sascha Grunert	a4f966aada	Graduate SeccompDefault feature to beta As outlined in the KEP, we now graduate the Kubelet feature to beta which means that it is enabled by default. The corresponding Kubelet flag still defaults to `false`, but we now have the chance to e2e test the feature by using a new serial test case. KEP: https://github.com/kubernetes/enhancements/issues/2413 Signed-off-by: Sascha Grunert <sgrunert@redhat.com>	2022-06-27 14:39:55 +02:00
Paco Xu	b36786e96e	promote LSCIQuotaFeature to beta	2022-06-23 10:00:54 +08:00
ZhangKe10140699	a945b6f066	Fix:[Flaky test] ci-kubernetes-node-kubelet-serial-cri-o job: [sig-node] Density [Serial] [Slow] create a sequence of pods latency/resource should be within limit when create 10 pods with 50 background pods	2022-06-22 08:14:43 +08:00
David Porter	b4b338d4eb	test: update graceful node shutdown e2e with watch Use a watch to detect invalid pod status updates in graceful node shutdown node e2e test. By using a watch, all pod updates will be captured while the previous logic required polling the api-server which could miss some intermediate updates. Signed-off-by: David Porter <david@porter.me>	2022-06-08 16:19:16 -07:00
Kubernetes Prow Robot	19ca12cb3e	Merge pull request #109820 from fromanirh/e2e-node-enable-device-plugin-test e2e: node: re-enable the device plugin tests	2022-06-01 12:03:40 -07:00
Davanum Srinivas	50bea1dad8	Move from k8s.gcr.io to registry.k8s.io Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2022-05-31 10:16:53 -04:00
Francesco Romani	f3e157d168	e2e: node: re-enable the device plugin tests Signed-off-by: Francesco Romani <fromani@redhat.com>	2022-05-16 16:05:13 +02:00
Francesco Romani	48b5af49e0	e2e: node: reorder imports trivial cleanup Signed-off-by: Francesco Romani <fromani@redhat.com>	2022-05-16 16:04:01 +02:00
Francesco Romani	98eb6db7c0	e2e: node: fix plugins directory Previously, the e2e test was overriding the plugins socket directory to "/var/lib/kubelet/plugins_registry". This seems wrong, and with that setting the e2e test was already failing, because the registration process was timing out, in turn because the kubelet was trying to call back the device plugin in the wrong place (see below for details). I can't explain why it worked before - or if it worked at all - but it really seems that `pluginapi.DevicePluginPath` is the right setting here. +++ In a nutshell, the device plugin registration process works like this: 1. The kubelet runs and creates the device plugin socket registration endpoint: KubeletSocket = DevicePluginPath + "kubelet.sock" DevicePluginPath = "/var/lib/kubelet/device-plugins/" 2. Each device plugin will listen to an ENDPOINT the kubelet will connect back to. IOW the kubelet will act like a client to each device plugin, to perform allocation requests (and more) Each device plugin will serve from an endpoint. The endpoint name is plugin-specific, but they all must be inside a well-known directory: pluginapi.DevicePluginPath 3. The kubelet creates the device plugin pod, like any other pod 4. During the startup, each device plugin wants to register itself in the kubelet. So it sends a request through the registration endpoint. Key details: grpc.Dial(kubelet registration socket) registration request reqt := &pluginapi.RegisterRequest{ Version: pluginapi.Version, Endpoint: endpointSocket, <- socket relative to pluginapi.DevicePluginPath ResourceName: resourceName, <- resource name to be exposed } 5. While handling the registration request, kubelet dials back the device plugin on socketDir + req.Endpoint. But socketDir is hardcoded in the device manager code to pluginapi.KubeletSocket Signed-off-by: Francesco Romani <fromani@redhat.com>	2022-05-16 16:03:50 +02:00
Francesco Romani	23147ff4b3	e2e: node: devplugin: tolerate node readiness flip In the AfterEach check of the e2e node device plugin tests, the tests want really bad to clean up after themselves: - delete the sample device plugin - restart again the kubelet - ensure that after the restart, no stale sample devices (provided by the sample device plugin) are reported anymore. We observed that in the AfterEach block of these e2e tests we have quite reliably a flip/flop of the kubelet readiness state, possibly related to a race with/ a slow runtime/PLEG check. What happens is that the kubelet readiness state is true, but goes false for a quick interval and then goes true again and it's pretty stable after that (observed adding more logs to the check loop). The key factor here is the function `getLocalNode` aborts the test (as in `framework.ExpectNoError`) if the node state is not ready. So any occurrence of this scenario, even if it is transient, will cause a test failure. I believe this will make the e2e test unnecessarily fragile without making it more correct. For the purpose of the test we can tolerate this kind of glitches, with kubelet flip/flopping the ready state, granted that we meet eventually the final desired condition on which the node reports ready AND reports no sample devices present - which was the condition the code was trying to check. So, we add a variant of `getLocalNode`, which just fetches the node object the e2e_node framework created, alongside to a flag reporting the node readiness. The new helper does not make implicitly the test abort if the node is not ready, just bubbles up this information. Signed-off-by: Francesco Romani <fromani@redhat.com>	2022-05-16 14:22:25 +02:00
Francesco Romani	56c539bff0	e2e: node: deviceplug: deepcopy the pod dev template Let's avoid unexpected side effects Signed-off-by: Francesco Romani <fromani@redhat.com>	2022-05-16 14:22:24 +02:00


2303 Commits