Commit Graph

1908 Commits

Author SHA1 Message Date
Artyom Lukianov
932134034c memory manager: update API constant to have camel case format
Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-02-09 01:10:02 +02:00
Artyom Lukianov
e8ea461bfd memory manager: update all relevant part of code to use []MemoryReservation
Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-02-09 01:10:01 +02:00
Artyom Lukianov
74eeef2a0a memory manager: provide additional e2e tests
With the memory manager static policy:

- start multiple guaranteed pods and verify that pods succeeded to start
- start workload pod on each NUMA node to load the memory and start the
  pod that requested more memory than each NUMA node have, the pod should fail
  to start with the admission error, because no single NUMA node has enough
  memory to start the pod and also each NUMA node already used for single
  NUMA node allocation
  The test requires at least two NUMA nodes

Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-02-09 01:10:00 +02:00
Pawel Rapacz
606fea29f5 memory manager: add e2e test to run guaranteed pod with init containers
Signed-off-by: Pawel Rapacz <p.rapacz@partner.samsung.com>
2021-02-09 01:10:00 +02:00
Artyom Lukianov
f3d4ac2f7e memory manager: add basice e2e tests
Provides basic tests e2e to verify that pod succeeds
to start with MemoryManager enabled.

Verifies both MemoryManager policies and when the node has
multiple NUMA nodes it will verify the memory pinning.

Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
2021-02-09 01:10:00 +02:00
Kubernetes Prow Robot
ee2b7a5049 Merge pull request #98049 from pacoxu/enhancements/redir-links
fix all keps links 404 for kep folder migration
2021-02-01 21:48:28 -08:00
pacoxu
a10bdfed09 fix all keps links 404 for kep folder migration
Signed-off-by: pacoxu <paco.xu@daocloud.io>
2021-02-01 19:41:59 +08:00
Ernest Wong
40713f7666 test images: use recently promoted images
Signed-off-by: Ernest Wong <chuwon@microsoft.com>
2021-01-28 10:09:00 -08:00
Ikko Ashimine
5155decbbf Fix typo in device_plugin_test.go
assignement -> assignment
2021-01-24 17:42:34 +09:00
hasheddan
40ca31f524 Default cluster dns-domain to empty string in e2e_node suite
Changes default cluster DNS domain to empty string to align with the
default kubelet configuration value.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2021-01-13 10:53:11 -06:00
Kubernetes Prow Robot
cbcb26e7a5 Merge pull request #97653 from heqg/unused-kubeletPort
remove unused constant of kubeletPort in e2e test
2021-01-12 11:56:35 -08:00
Kubernetes Prow Robot
e414d4e5c2 Merge pull request #96315 from twosigma/dnsdomainteste2e
Ensuring default DNS cluster-domain of Kubelet is cluster.local
2021-01-12 10:28:35 -08:00
he.qingguo
ced98afe56 remove unused constant of kubeletPort in e2e test
Signed-off-by: he.qingguo <he.qingguo@zte.com.cn>
2021-01-02 12:16:07 +08:00
Kubernetes Prow Robot
34baf6aeb7 Merge pull request #96666 from harche/fix_typo
Remove the typo in the logs while configuring firewall for node e2e
2020-12-08 18:35:13 -08:00
Kubernetes Prow Robot
1fe25e9f73 Merge pull request #96637 from hasheddan/node-local-env
Update comment to not indicate check is run for docker daemon
2020-12-08 18:34:18 -08:00
Kubernetes Prow Robot
dcc863a8b3 Merge pull request #96219 from fromanirh/tm-e2e-sriovdp-usage
node: e2e: bring up/down SRIOV DP just once
2020-12-08 16:29:53 -08:00
Kubernetes Prow Robot
5cdc3e6e20 Merge pull request #95613 from MHBauer/CAP_NET_RAW
CAP_NET_RAW should be NET_RAW
2020-12-08 16:29:28 -08:00
Harshal Patil
b76abcd243 Remove the typo in the logs while configuring firewall for node e2e
Signed-off-by: Harshal Patil <harpatil@redhat.com>
2020-11-18 14:20:59 +05:30
Kubernetes Prow Robot
36d12390a7 Merge pull request #95906 from harche/iptables_fix
Verify iptable rules are applied for tcp, udp and icmp
2020-11-17 22:08:04 -08:00
hasheddan
d2f7cea12a Update comment to not indicate check is run for docker daemon
Removes comment from daemons function that previously indicated that a
check was being run to make sure docker daemon was running.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2020-11-17 08:31:26 -06:00
Kubernetes Prow Robot
d20e3246ba Merge pull request #96574 from hasheddan/cleanup-resource-tests
Fix test name in e2e resource metrics api test
2020-11-14 05:09:04 -08:00
hasheddan
bf270d1070 Fix test name in e2e resource metrics api test
Updates resouce to resource.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2020-11-13 15:19:55 -06:00
hasheddan
56c104a440 Relax matching on pod_memory_working_set_bytes metrics
Relaxes matching of pod_memory_working_set_bytes metric so that we won't
error due to presence of other pods.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2020-11-13 12:29:54 -06:00
hasheddan
25a04096e8 Match pod resource metrics to pod not container
Update node resource metrics e2e tests to match pod metrics to pod
instread of containers.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2020-11-13 10:29:21 -06:00
Francesco Romani
56106439cf node: e2e: bring up/down SRIOV DP just once
The e2e topology manager want to test the resource alignment using
devices, and the easiest devices to use are the SRIOV devices at this
moment.

The resource alignment test cases are run for each supported policies,
in a loop.

The tests manage the SRIOV device plugin; up until now, the plugin
was set up and tore down at each loop.
There is no real need for that. Each loop must reconfigure (thus
restart) the kubelet, but the device plugin can set up and tore down
just once for all the policies, thus once.
The kubelet can reconnect just fine to a running device plugin.

This way, we greatly reduce the interactions and the complexity of the
test environment, making it easier to understand and more robust, and
we trim down some minutes from execution time.

However, this patch also hides (not solves) a test flake we observed
on some environment. The issue is hardly reproduceable and not well
understood, but seems caused by doing the sriov dp setup/teardown
in each policy testing loop.
Investigation so far suggests that the kubelet sometimes have a stale
state after the sriovdp teardown/setup cycle, leading to flakes and
false negatives.
We tried to address this in https://github.com/kubernetes/kubernetes/pull/95611
with no conclusive results yet.

This patch was posted because overall we believe this patch gains
exceeds the drawbacks (hiding the aforementioned flake) and
because understanding the potential interaction issues between the
sriovdp and the kubelet deserve a separate test.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2020-11-13 10:04:31 +01:00
Kubernetes Prow Robot
0e0cc1ead8 Merge pull request #95839 from egernst/pod-usage
resource-metrics: add pod/sandbox metrics to endpoint
2020-11-12 19:36:23 -08:00
Kubernetes Prow Robot
7edf621277 Merge pull request #96262 from knight42/fix/npd-test
fix(test::npd): provide NPD with proper kubeconfig
2020-11-12 11:20:36 -08:00
Pawel Rapacz
16c7bf4db4 Implement e2e tests for pod scope alignment
A suite of e2e tests was created for Topology Manager
so as to test pod scope alignment feature.

Co-authored-by: Pawel Rapacz <p.rapacz@partner.samsung.com>
Co-authored-by: Krzysztof Wiatrzyk <k.wiatrzyk@samsung.com>
Signed-off-by: Cezary Zukowski <c.zukowski@samsung.com>
2020-11-12 12:25:55 +01:00
Harshal Patil
f3bfc9fd67 Verify iptable rules are applied for tcp, udp and icmp
Signed-off-by: Harshal Patil <harpatil@redhat.com>
2020-11-12 13:29:13 +05:30
Sergey Kanzhelev
06da0e5e74 GA of RuntimeClass feature gate and API 2020-11-11 19:22:32 +00:00
knight42
1a9600dc16 fix(test::npd): provide NPD with proper kubeconfig
Signed-off-by: knight42 <anonymousknight96@gmail.com>
2020-11-11 09:47:09 +08:00
Kubernetes Prow Robot
84745e2db4 Merge pull request #96381 from karan/npd-test-cmd
Fix command and arg in NPD e2e
2020-11-10 12:44:16 -08:00
Kubernetes Prow Robot
6e95025994 Merge pull request #92165 from RenaudWasTaken/v1-pod-resources
Graduate the Pod Resources API to G.A
2020-11-09 18:24:41 -08:00
Kubernetes Prow Robot
1d6bd259d2 Merge pull request #95608 from MHBauer/re-enable-benchmark
enable benchmark by matching test config
2020-11-09 12:48:41 -08:00
Karan Goel
ee93b3f837 Fix command and arg in NPD e2e 2020-11-09 10:58:09 -08:00
Kubernetes Prow Robot
23839ba649 Merge pull request #96305 from twosigma/fixe2etestbug
Fixing expected pod subdomain to match framework.TestContext.ClusterD…
2020-11-09 08:22:26 -08:00
Kubernetes Prow Robot
ba39cf58ec Merge pull request #96302 from harche/summary_test_fix
Increase the timeout to allow summary test pods to start
2020-11-06 11:29:25 -08:00
Javier Diaz-Montes
c8da7128eb Ensuring default DNS cluster-domain of Kubelet is cluster.local
Currently e2e tests run under test-e2e-node have a cluster-domain
equals to "". This change makes test-e2e-node consistent with other
e2e tests. For example, in hack/ginkgo-e2e.sh, cluster-domain
defaults to cluster.local and it can be changed by defining KUBE_DNS_DOMAIN.
2020-11-06 13:42:13 -05:00
Javier Diaz-Montes
d7e47cfb19 Fixing expected pod subdomain to match framework.TestContext.ClusterDNSDomain 2020-11-06 13:20:46 -05:00
Harshal Patil
199b49941f Increase the timeout to allow summary test pod to start
Signed-off-by: Harshal Patil <harpatil@redhat.com>
2020-11-06 15:54:13 +05:30
Kubernetes Prow Robot
bb4d8b0746 Merge pull request #96176 from mrunalp/mrunal_node_approver
Add mrunalp as a node approver
2020-11-05 10:53:29 -08:00
Kubernetes Prow Robot
a6a74bf2e9 Merge pull request #96173 from twosigma/removealphae2e
Removing Alpha annotation as the feature is graduating to Beta in v1.20
2020-11-05 09:25:50 -08:00
Kubernetes Prow Robot
eca53507be Merge pull request #95187 from twosigma/addingfailscenario
Adding failure scenario for long FQDN and setHostnameAsFQDN feature
2020-11-05 08:28:53 -08:00
Mrunal Patel
b78be8446f Add mrunalp as node approver
Signed-off-by: Mrunal Patel <mpatel@redhat.com>
2020-11-04 15:48:30 -08:00
Kubernetes Prow Robot
d40b23bdbe Merge pull request #96050 from knight42/fix/npd-test
fix(test::npd): fix node problem detector test
2020-11-04 15:46:53 -08:00
Kubernetes Prow Robot
941e9242a5 Merge pull request #96128 from MHBauer/node-e2e-partial-fix
reenable e2e_node services & debugging improvements
2020-11-03 14:58:17 -08:00
Eric Ernst
b26b755e94 resource-metrics: add pod metrics e2e test
Signed-off-by: Eric Ernst <eric_ernst@apple.com>
2020-11-03 10:14:13 -08:00
Morgan Bauer
9b8c1dcd19 reenable e2e_node services & debugging improvements
- re-enable e2e_node services
 - call GenerateSecureToken for e2e_node Conformance test-suite
 - add log messages indicating location in process
 - move log messages to some more accurate locations
2020-11-03 10:09:56 -08:00
Javier Diaz-Montes
19c70c7edc Removing Alpha annotation as the feature is graduating to Beta in v1.20 2020-11-03 11:39:56 -05:00
knight42
186be6f0d2 fix(test::npd): fix node problem detector test
Since the insecure port of apiserver has been disabled in e2e node tests,
we could create a service account in the test for node problem detector
and then bind the cluster role `system:node-problem-detector` with this
service account.

Signed-off-by: knight42 <anonymousknight96@gmail.com>
2020-11-03 08:57:52 +08:00