21223 Commits

Author SHA1 Message Date
Kubernetes Prow Robot
2face135c7 Merge pull request #97415 from AlexeyPerevalov/ExcludeSharedPoolFromPodResources
Return only isolated cpus in podresources interface
2021-10-08 05:58:58 -07:00
Kubernetes Prow Robot
60ab733932 Merge pull request #105546 from Huang-Wei/fix-evt-volumebinding
sched: adjust events to register for VolumeBinding plugin
2021-10-08 02:12:57 -07:00
Kubernetes Prow Robot
dd650bd41f Merge pull request #105527 from rphillips/fixes/filter_terminated_pods
kubelet: set terminated podWorker status for terminated pods
2021-10-07 22:19:51 -07:00
Ryan Phillips
3982fcae64 go fmt 2021-10-07 20:13:43 -05:00
Sahil Vazirani
3988405c8d GA TTLAfterFinish 2021-10-07 16:58:50 -07:00
Elana Hashman
c771698de3 Add e2e test to verify kubelet restart behaviour
Succeeded pods should not be counted as running on restart.
2021-10-07 18:30:17 -05:00
Kubernetes Prow Robot
956da16a8d Merge pull request #105344 from jonyhy96/add-benchmark-test
add benchmark test for replaceRegistryInImageURLWithList
2021-10-07 14:21:32 -07:00
Francesco Romani
d15bff2839 e2e: node: expose the running flag
Each e2e test knows whether it wants to restart a running kubelet or a
non-running kubelet. The vast majority of the time, we want to
restart a running kubelet (e.g. to change config or to check that
some properties hold across kubelet crashes/restarts), but sometimes
we stop the kubelet, do some actions and only then restart.

To accommodate both use cases, we just expose the `running` boolean
flag to the e2e tests.

Having `restartKubelet` explicitly restart a running kubelet
helps us troubleshoot e2e failures in which the kubelet
was supposed to be running but was not; attempting a restart
in such cases only muddied the waters further, making
troubleshooting and the eventual fix harder.

In the happy path, no expected change in behaviour.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2021-10-07 22:15:28 +02:00
Francesco Romani
e878c20ac7 e2e: node: improve error logging
In the `restartKubelet` helper, we use `exec.Command`, and the
command's output is returned as `[]byte`.
We logged that output as a raw value, making
output that is meant to be human readable unnecessarily hard to read.

We fix this annoying behaviour by converting the output to a string before
logging it, which makes the outcome of the command obvious.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2021-10-07 22:13:49 +02:00
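
As a minimal sketch of the logging change described in the commit above (not the actual `restartKubelet` code), the following Go program contrasts formatting command output as a raw `[]byte` value with formatting it as a string. The `describeOutput` helper and the `echo` stand-in command are assumptions for illustration only; neither appears in the commit.

```go
package main

import (
	"fmt"
	"os/exec"
)

// describeOutput is a hypothetical helper (not from the commit). It renders
// command output two ways: as a raw byte slice via %v, and as readable text
// by converting the bytes to a string.
func describeOutput(out []byte) (raw, text string) {
	raw = fmt.Sprintf("%v", out) // prints the numeric byte values
	text = string(out)           // prints the human-readable text
	return raw, text
}

func main() {
	// "echo" stands in for whatever command a real helper would run.
	out, err := exec.Command("echo", "kubelet restarted").CombinedOutput()
	if err != nil {
		fmt.Printf("command failed: %v\n", err)
		return
	}
	raw, text := describeOutput(out)
	fmt.Println("logged as value: ", raw)
	fmt.Println("logged as string:", text)
}
```

Formatting a `[]byte` with `%v` yields a list of decimal byte values, which is why the string conversion makes the log line readable.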
Kubernetes Prow Robot
9b45983d3c Merge pull request #104251 from ravisantoshgudimetla/scheduling-v1beta3
Scheduling v1beta3
2021-10-07 10:47:32 -07:00
Wei Huang
b7d90ca991 sched: adjust events to register for VolumeBinding plugin 2021-10-07 08:51:04 -07:00
ravisantoshgudimetla
283b176102 [test]: modify integration tests to use v1beta3 2021-10-07 10:58:07 -04:00
Swati Sehgal
5043b431b4 excludesharedpool: e2e tests: Test cases for pods with non-integral CPUs
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2021-10-07 15:39:41 +01:00
Swati Sehgal
42dd01aa3f excludesharedpool: e2e tests: code refactor to handle non-integral CPUs
This patch changes cpuCount to cpuRequest in order to cater to cases
where guaranteed pods make non-integral CPU Requests.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2021-10-07 15:39:40 +01:00
Kubernetes Prow Robot
c4d802b0b5 Merge pull request #103289 from AlexeyPerevalov/DoNotExportEmptyTopology
podresources: do not export empty NUMA topology
2021-10-07 07:11:46 -07:00
Yaakov Selkowitz
acaa24b254 Enable more test images for s390x 2021-10-07 01:25:55 -04:00
Yaakov Selkowitz
82920d99f8 Fix build of apparmor-loader test image
apparmor is no longer found in Alpine edge/testing but in
edge/community, presumably in preparation for full-fledged inclusion in
3.15.  If so, once that is released, BASEIMAGE can be updated again and
the explicit --repository flag to 'apk add' dropped.

Fixes: https://github.com/kubernetes/kubernetes/issues/105528
2021-10-07 01:25:49 -04:00
Kubernetes Prow Robot
79ee735bad Merge pull request #105337 from Huang-Wei/pr-105303
sched: integration test to cover event registration
2021-10-06 19:53:56 -07:00
Kubernetes Prow Robot
c20ff9839e Merge pull request #97045 from claudiubelu/tests-windows-containerd-enabled
tests: Enable some [LinuxOnly] tests for Windows Containerd
2021-10-06 19:53:45 -07:00
Wei Huang
3283e6bb72 sched: integration test to cover event registration 2021-10-06 15:17:38 -07:00
ravisantoshgudimetla
9476237923 [scheduler] [test]: Fix PreferNominatedNode test
Once the node gets deleted, the nodelifecycle controller
races to update the pod status and the pod deletion logic
fails, causing tests to flake. This commit moves
the testContext creation into the test loop and deletes the nodes and
namespace within the test loop. We don't explicitly call the node
deletion within the loop, but the `testutils.CleanupTest(t, testCtx)`
call ensures that the namespace and nodes get deleted.
2021-10-06 09:07:16 -04:00
Swati Sehgal
9337902648 podresource: move the checkForTopology logic inline
As per the recommendation here: https://github.com/kubernetes/kubernetes/pull/103289#pullrequestreview-766949859
we move the check inline.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2021-10-06 11:31:48 +01:00
Kubernetes Prow Robot
4af19756bd Merge pull request #105217 from dbgoytia/refactor/102787-enable-block-tests
Refactoring TestConcurrentAccessToRelatedVolumes for enabling Block Volume tests
2021-10-05 14:18:57 -07:00
Kubernetes Prow Robot
907d62eac8 Merge pull request #105462 from ehashman/merge-terminal-phase
Ensure terminal pods maintain terminal status
2021-10-05 13:12:58 -07:00
Elana Hashman
f9a827b895 Fix bugs in e2e pod test
- Update test to account for late synced statuses
- Terminated containers are not running
- Add missing format values to test logline
2021-10-05 11:51:12 -07:00
Kubernetes Prow Robot
c4c3062c07 Merge pull request #102333 from mcbenjemaa/fix/102295
Add Log statement in exec_util.go for ClientSet Creation
2021-10-05 11:21:53 -07:00
Kubernetes Prow Robot
b28bf04cd0 Merge pull request #105468 from ehashman/missing-format-values
Add missing format values to test logline
2021-10-05 09:23:40 -07:00
Kubernetes Prow Robot
b4a0342714 Merge pull request #105388 from mauriciopoppe/copy-volumesnapshotcontent-annotations
Copy VolumeSnapshotContent annotations in snapshottable.go test
2021-10-05 09:23:33 -07:00
Giuseppe Scrivano
f23e2a8c7f test, cgroupv2: adjust pagefaults test
On cgroup v2 the reported metric is recursive and includes all the
sub cgroups.

Closes: https://github.com/kubernetes/kubernetes/issues/105301

Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>
2021-10-05 18:00:57 +02:00
Kubernetes Prow Robot
519b164db1 Merge pull request #105222 from cyclinder/remove_node_lease_GA
remove nodeLease feature GA
2021-10-05 05:41:21 -07:00
Kubernetes Prow Robot
c5ad58d8a1 Merge pull request #103372 from verb/1.22-e2e-node
Create node_e2e test for ephemeral containers
2021-10-05 05:41:09 -07:00
Danielle Lancashire
742d3d36f5 e2e_node: cleanup features in podresources 2021-10-05 14:39:59 +02:00
Danielle Lancashire
f28dd90810 e2e_node: NodeGracefulShutdown is a Beta feature 2021-10-05 14:39:59 +02:00
Danielle Lancashire
71e6d9cbe0 e2e_node: remove no-op config change from critical_pod_test 2021-10-05 10:36:32 +02:00
Danielle Lancashire
8b1b06c507 e2e_node: Remove KubeletPodResources enablement as it is a default gate 2021-10-05 10:26:10 +02:00
cyclinder
e61b901628 remove nodeLease feature GA
Signed-off-by: cyclinder <qifeng.guo@daocloud.io>
2021-10-05 12:23:27 +08:00
Claudiu Belu
f7cf747e48 tests: Wait for pod collection to enter a Running state
While running tests in parallel, especially those with higher loads
than others, it might take some time for Pods to be Running, even more
so if the image has to be pulled as well.

The test [sig-node] Pods should delete a collection of pods [Conformance]
only waits for the pods to be scheduled before deleting them, and
expects them to be gone in 1 minute, which can flake because of the above
reasons. Note that the operations are in order, and kubelet runs them in
order, which means that the pod first has to enter the Running state
before attempting to delete it.

This commit waits for the Pods to enter the Running state first before
deleting the entire collection.

Co-Authored-By: Antonio Ojea <aojea@redhat.com>
2021-10-04 20:42:16 -07:00
Kubernetes Prow Robot
0a29e2a73a Merge pull request #105197 from alculquicondor/job-tracking
Roll-forward: Beta requirements for JobTrackingWithFinalizers
2021-10-04 18:57:49 -07:00
Kubernetes Prow Robot
9eaabb6b2e Merge pull request #104304 from endocrimes/dani/eviction
[Failing Test] Fix Kubelet Storage Eviction Tests
2021-10-04 15:16:40 -07:00
Kubernetes Prow Robot
70d1360ab8 Merge pull request #105103 from tkashem/extend-audit-policy
apiserver: refactor PolicyRuleEvaluator interface
2021-10-04 14:12:40 -07:00
Elana Hashman
7bbf40f1cc Add missing format values to test logline 2021-10-04 13:44:15 -07:00
Kubernetes Prow Robot
04f747d09f Merge pull request #104782 from kerthcet/cleanup/remove-cc-v1beta1
remove scheduler component config v1beta1
2021-10-04 08:53:08 -07:00
Lee Verberne
2a82228e33 Apply suggestions from code review
Co-authored-by: Sergey Kanzhelev <S.Kanzhelev@live.com>
2021-10-04 15:07:37 +02:00
Kubernetes Prow Robot
f31453fe5b Merge pull request #104894 from pacoxu/windows-arm
add windows arm support on client-side since go 1.17 supports it
2021-10-04 04:19:07 -07:00
Danielle Lancashire
7b91337068 e2e_node: eviction: Include names of pending-eviction pods in error 2021-10-04 13:07:40 +02:00
Danielle Lancashire
b5c2d3b389 e2e_node: eviction: Memory-backed Volumes separation
This commit fixes the LocalStorageCapacityIsolationEviction test by
acknowledging that in its default configuration kubelet will no longer
evict memory-backed volume pods as they cannot use more than their
assigned limit with SizeMemoryBackedVolumes enabled.

To account for the old behaviour, we also add a test that explicitly
disables the feature to test the behaviour of memory backed local
volumes in those scenarios. That test can be removed when/if the feature
gate is removed.
2021-10-04 13:07:40 +02:00
Danielle Lancashire
a8168ed543 e2e_node: Fix LocalStorage and PriorityLocalStorage eviction tests
Currently the storage eviction tests fail for a few reasons:
- They re-enter storage exhaustion after pulling the images during
  cleanup (increasing test storage reqs, and adding verification for
  future diagnosis)
- They were timing out, as in practice it seems that eviction takes just
  over 10 minutes on an n1-standard in many cases. I'm raising these to
  15 to provide some padding.

This should ideally bring these tests to passing on CI, as they've now
passed locally for me several times with the remote GCE env.

Follow up work involves diagnosing why these take so long, and
restructuring them to be less finicky.
2021-10-04 13:07:40 +02:00
Kubernetes Prow Robot
82da9bdaab Merge pull request #105076 from pohly/log-flush-frequency-bug
initialize logging after flag parsing + refactor commands
2021-10-01 14:30:18 -07:00
Kubernetes Prow Robot
e414cf7641 Merge pull request #100482 from pohly/generic-ephemeral-volume-checks
generic ephemeral volume checks
2021-10-01 10:47:22 -07:00
Swati Sehgal
01dacd0463 podresource-api: getAllocatableResources to Beta
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2021-10-01 16:48:29 +01:00