Commit Graph

24762 Commits

Author SHA1 Message Date
Kubernetes Prow Robot
bbcbfe4b97
Merge pull request #121257 from bzsuni/ga/ProbeTerminationGracePeriod
Remove GA featuregate about ProbeTerminationGracePeriod in 1.29
2023-10-17 16:14:19 +02:00
Patrick Ohly
7d077f9271 e2e: move AllNodesReady check to DeferCleanup
Some test cases can make nodes not ready and use DeferCleanup to bring
nodes back online. Checking if all nodes are online would fail
in such cases as AfterEach runs before DeferCleanup.

Moving the node readiness check to DeferCleanup should solve this
issue, as nodes will be brought back to a `Ready` state before the
check.
2023-10-17 16:45:57 +03:00
Gunju Kim
ca6fda05ce
Restart containers in right order after the podSandboxChanged
This is a workaround for the issue that the kubelet cannot differentiate
the container statuses of the previous podSandbox from the current one.

If the node is rebooted, all containers will be in the exited state and
the kubelet will try to recreate a new podSandbox. In this case, the
kubelet should not mistakenly think that the newly created podSandbox
has been initialized.
2023-10-17 22:11:31 +09:00
Swati Sehgal
f46cdf1428 node: sample-dp: set default value for pluginSocksDir
Setting a reasonable default in case `PLUGIN_SOCK_DIR`
environment variable is not specified.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2023-10-17 12:19:10 +01:00
Swati Sehgal
211d8cc80a node: sample-dp: stubRegisterControlFunc for controlling registration
If the user specifies the intent to control registration process, we rely on
registration triggers (deletion of control file) to prompt registration.

This behaviour is expected to be consistent across kubelet restarts, and therefore
across the watch calls where we watch for changes to the unix socket, so we make
this part of the Stub object instead of a parameter.

Co-authored-by: Francesco Romani <fromani@redhat.com>
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2023-10-17 12:19:10 +01:00
Swati Sehgal
c4c9d61d66 node: sample-dp: Handle re-registration for controlled registrations
In case `REGISTER_CONTROL_FILE` is specified, we want to ensure that the
registration is triggered by deletion of the control file. This is
applicable both when the registration happens for the first time and
subsequent ones because of kubelet restarts.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2023-10-17 12:19:07 +01:00
Swati Sehgal
6714e678d3 node: sample-dp: register by default and re-register on restarts
In issue: 115107 we added an environment variable to control the registration of sample
device plugin to kubelet. The intent of this patch is to ensure that the default
behaviour of the plugin is to register to kubelet (in case no environment
variable is specified).

In addition to that, we want to ensure that the plugin registers itself not just once.
It should re-register itself to kubelet in case of node reboot or kubelet restarts.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2023-10-17 12:14:09 +01:00
Gunju Kim
d2b803246a
Don't reuse the device allocated to the restartable init container 2023-10-17 18:28:29 +09:00
wen.rui
3df3b83226 e2e_api: stop using deprecated framework.ExpectNotEqual 2023-10-17 11:03:11 +08:00
Kubernetes Prow Robot
63ae3084b7
Merge pull request #121176 from bzsuni/cleanup/e2e/expect_not_equal
remove deprecated framework.ExpectNotEqual for node
2023-10-17 02:29:06 +02:00
Kubernetes Prow Robot
ac66f3d466
Merge pull request #121010 from Jefftree/decouple-openapi-v2v3-config
Decouple openapi v2v3 config
2023-10-16 23:41:11 +02:00
Riaan Kleinhans
4ae7150f05 remove pc pvc status endpoints
Signed-off-by: Riaan Kleinhans <riaan@ii.coop>
2023-10-17 10:33:17 +13:00
Stephen Heywood
2e8f66afb3 Promote PV/PVC status e2e test to Conformance 2023-10-17 09:40:20 +13:00
Kubernetes Prow Robot
952a9cb060
Merge pull request #121174 from carlory/fix-120770
fix limitrange flaky test
2023-10-16 22:27:00 +02:00
Kubernetes Prow Robot
f6ba25fdbd
Merge pull request #119026 from AxeZhan/sleepAction
Introducing Sleep Action for PreStop Hook
2023-10-16 21:19:44 +02:00
Dejan Pejchev
f2b723a130
fix: improve assertion for Failed PodReplacementPolicy integration test cases 2023-10-16 21:16:17 +02:00
Jefftree
b30c6bdff8 Fix v3 spec 2023-10-16 15:05:13 -04:00
Kubernetes Prow Robot
e196d24910
Merge pull request #121243 from ii/fix-pv-pvc-status-test
Fix PersistentVolumeStatus & PersistentVolumeClaimStatus e2e test flake
2023-10-16 19:07:39 +02:00
carlory
5d0f8530f6 fix Huge Pages failing test 2023-10-16 23:13:32 +08:00
charles-chenzz
7d31b5ffd0 Add test case for sandbox condition if pod fails to mount volume from a missing secret 2023-10-16 22:04:04 +08:00
Kevin Hannon
dd9c3358f5
Revert "podresources: e2e: force eager connection" 2023-10-16 09:46:04 -04:00
bzsuni
27abf99148 Remove GA featuregate about ProbeTerminationGracePeriod in 1.29
Signed-off-by: bzsuni <bingzhe.sun@daocloud.io>
2023-10-16 20:12:47 +08:00
upodroid
011c65e17e configure npd to run as daemonset on cos 2023-10-16 12:34:47 +01:00
upodroid
e69a960792 skip deploying csi driver if it is already installed 2023-10-16 12:32:27 +01:00
Kubernetes Prow Robot
4254a77c21
Merge pull request #121249 from pohly/klog-call-fixes
fix invalid log calls
2023-10-16 13:14:01 +02:00
Patrick Ohly
957adfac49 fix invalid log calls
These were found with a modified klog that enables "go vet" to check klog call
parameters:

    cmd/kubeadm/app/features/features.go:149:4: printf: k8s.io/klog/v2.Warningf format %t has arg v of wrong type string (govet)
    			klog.Warningf("Setting deprecated feature gate %s=%t. It will be removed in a future release.", k, v)
    test/images/sample-device-plugin/sampledeviceplugin.go:147:5: printf: k8s.io/klog/v2.Errorf does not support error-wrapping directive %w (govet)
    				klog.Errorf("error: %w", err)
    test/images/sample-device-plugin/sampledeviceplugin.go:155:3: printf: k8s.io/klog/v2.Errorf does not support error-wrapping directive %w (govet)
    		klog.Errorf("Failed to add watch to %q: %w", triggerPath, err)
    staging/src/k8s.io/code-generator/cmd/prerelease-lifecycle-gen/prerelease-lifecycle-generators/status.go:207:5: printf: k8s.io/klog/v2.Fatalf does not support error-wrapping directive %w (govet)
    				klog.Fatalf("Package %v: unsupported %s value: %q :%w", i, tagEnabledName, ptag.value, err)
    staging/src/k8s.io/legacy-cloud-providers/vsphere/nodemanager.go:286:3: printf: (k8s.io/klog/v2.Verbose).Infof format %s reads arg #1, but call has 0 args (govet)
    		klog.V(4).Infof("Node %s missing in vSphere cloud provider cache, trying node informer")
    staging/src/k8s.io/legacy-cloud-providers/vsphere/nodemanager.go:302:3: printf: (k8s.io/klog/v2.Verbose).Infof format %s reads arg #1, but call has 0 args (govet)
    		klog.V(4).Infof("Node %s missing in vSphere cloud provider caches, trying the API server")
2023-10-16 10:21:07 +02:00
Shiming Zhang
33f2d487e2 Promote KEP-2681 to beta in 1.29 2023-10-16 10:10:35 +08:00
jinye
e20ecbe1b4 cleanup:e2e-cloud:stop using deprecated framework.ExpectNotEqual 2023-10-16 09:24:26 +08:00
bzsuni
5683660830 remove deprecated framework.ExpectNotEqual for node
Signed-off-by: bzsuni <bingzhe.sun@daocloud.io>
2023-10-16 09:21:26 +08:00
Stephen Heywood
b548bbb8cf When reading pvc status allow bound to be a valid value 2023-10-16 10:20:26 +13:00
AxeZhan
332779a3b2 add a simple e2e test 2023-10-15 13:59:46 +08:00
Kubernetes Prow Robot
378866edba
Merge pull request #120518 from saschagrunert/metrics-container-start
kubelet: fix metric `container_start_time_seconds` timestamp
2023-10-15 07:05:37 +02:00
Kubernetes Prow Robot
4d8a51acac
Merge pull request #121229 from aojea/remove_cluster_cidr
remove ClusterCIDR alpha API
2023-10-15 02:56:33 +02:00
Kubernetes Prow Robot
675a64eaa6
Merge pull request #121129 from carlory/cleanup-e2e-framework-equal
remove deprecated framework.ExpectEqual
2023-10-14 23:50:37 +02:00
Kubernetes Prow Robot
ae9dc3330e
Merge pull request #120874 from ruquanzhao/fixDevicePluginProbeCI
fix DevicePluginProbe node-e2e: pod and kubelet restarts
2023-10-14 23:50:28 +02:00
Antonio Ojea
c2d473f0d4 remove ClusterCIDR
KEP-2593 proposed to expand the existing node-ipam controller
to be configurable via ClusterCIDR objects. However, there
were reasonable doubts in the SIG about the feature, and after
several months of discussions we decided not to move forward
with the KEP in-tree. Hence, we are going to remove the existing
code, which is still in alpha.

https://groups.google.com/g/kubernetes-sig-network/c/nts1xEZ--gQ/m/2aTOUNFFAAAJ

Change-Id: Ieaf2007b0b23c296cde333247bfb672441fe6dfc
2023-10-14 19:06:22 +00:00
Kubernetes Prow Robot
b87cae907d
Merge pull request #121001 from jiahuif-forks/feature/validating-admission-policy/typed-composition-variables
ValidatingAdmissionPolicy: typed variables support.
2023-10-14 01:55:43 +02:00
Kevin Hannon
1ae5429629 add potential fixes for flakiness in eviction tests 2023-10-13 11:36:44 -04:00
cpanato
b0c7956a86
Bump distroless-iptables to v0.4.1
Signed-off-by: cpanato <ctadeu@gmail.com>
2023-10-13 11:00:04 +02:00
carlory
a34d2f3b13 fix limitrange flaky test 2023-10-13 11:46:05 +08:00
Kubernetes Prow Robot
4c8fca2f06
Merge pull request #112894 from pohly/e2e-framework-test-labels
e2e framework: test labels
2023-10-13 02:40:43 +02:00
Kubernetes Prow Robot
2b4ef19578
Merge pull request #121191 from dims/update-busybox-sha-based-image-to-match-tag-1.36-1-1
Update busybox SHA based image to match tag - 1.36.1-1
2023-10-12 22:49:43 +02:00
Kubernetes Prow Robot
8923c3c871
Merge pull request #119659 from kannon92/beta-pod-ready-to-start
[KEP-3085] Promote PodReadyToStartContainers to beta in 1.29
2023-10-12 22:49:16 +02:00
Davanum Srinivas
968d6b8a32
Update busybox SHA based image to match tag - 1.36.1-1
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2023-10-12 14:17:36 -04:00
Kubernetes Prow Robot
3bd4c1cf20
Merge pull request #120892 from ii/create-pv-pvc-status-test
Write e2e test for PersistentVolumeStatus & PersistentVolumeClaimStatus Endpoints +6 Endpoints
2023-10-12 18:50:33 +02:00
Kevin Hannon
c94240e2e2 move kubelet constant for podreadytostart to staging 2023-10-12 11:18:11 -04:00
Kubernetes Prow Robot
38a1ec75f0
Merge pull request #119882 from ffromani/podres-client-wait
podresources: e2e: force eager connection
2023-10-12 15:59:55 +02:00
Davanum Srinivas
de01788dea
Switch to newer cos-gpu-installer - v2.1.9
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2023-10-12 06:59:39 -04:00
Hao Ruan
531324cb1d allow FSGroupPolicy and PodInfoOnMount to be mutable in CSIDriver.Spec 2023-10-12 15:01:37 +08:00
Davanum Srinivas
5ad5ef1d0f
Switch to newer cos-gpu-installer - v2.1.7
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2023-10-11 23:15:51 -04:00
Kubernetes Prow Robot
5027809be8
Merge pull request #119479 from HirazawaUi/implement-secrets-empty-value-test
implement secret empty string value integration test
2023-10-12 00:51:13 +02:00
Kubernetes Prow Robot
dc1cde6e02
Merge pull request #121044 from charles-chenzz/e2e_pod_readytostart_false
[KEP-3085]: check PodReadyToStartContainers condition after gracefulshutdown
2023-10-11 20:29:32 +02:00
cpanato
5eb7fea383
[go] Bump images, dependencies and versions to go 1.21.3
Signed-off-by: cpanato <ctadeu@gmail.com>
2023-10-11 17:08:26 +02:00
Arda Güçlü
a504aed54d
Add shortname ambiguity warning in shortcut expander (#117668)
* Add warning handler callback function in shortcut expander

Currently, errors in client-go are propagated back to the callers via
function returns. However, there is no elegant way to merely warn users.
For example, when a user wants to get a resource by its short name
and multiple resources share that short name, we need to warn the user
about this ambiguity: which one is picked and which ones are discarded.

To handle this particular case, and to provide a way to raise other
warnings in the future, this commit adds a warningHandler
callback function in shortcutExpander.

* Add warningPrinter functionality in ConfigFlags

ConfigFlags can neither warn users in a standardized format nor pass
warning callback functions to other upper-level libraries such as
client-go.

This commit lets users set warningPrinters according to their
IOStreams. These warningPrinters will be used to raise possible
warnings that occur not only in cli-runtime but also in client-go.

* Pass warning callback function in ConfigFlags to shortcutExpander

This commit passes the warning callback function to the shortcut
expander so that warnings raised there are shown to the user in a
standardized format.

* Add integration test for CRDs having ambiguous short names

This commit adds an integration test to ensure that the warning message
for this ambiguity is printed when resources are retrieved via their
short names and multiple resources have the same short name.

This integration test also ensures that the logic that decides which resource
is selected has not changed, since a change may cause discrepancies in
clusters.

* Remove defaultConfigFlag global variable

* Move default config flags initialization into function

* Skip warning for versions of same group/resource

* Run update-vendor

* Warn only once when there are multiple versions registered for ambiguous resource

* Apply gocritic review

* Add multi-resource multi-version ambiguity unit test
2023-10-11 17:04:11 +02:00
wlq1212
d9545976ed using wait.Interrupted instead of deprecated wait.ErrWaitTimeout for utils 2023-10-11 19:30:14 +08:00
carlory
2c1836bc24 remove deprecated framework.ExpectEqual 2023-10-11 12:43:10 +08:00
RuquanZhao
babac47c6f fix DevicePluginProbe node-e2e: pod and kubelet restarts
The kubelet restarts working pods with an exponential back-off delay,
with a maximum cap of 5 minutes. The 1-minute wait may happen to fall
within the back-off time.

Signed-off-by: Ruquan Zhao <ruquan.zhao@arm.com>
2023-10-11 10:15:32 +08:00
Kubernetes Prow Robot
bdcb73d6b3
Merge pull request #120460 from tzneal/deflake-oom-tests-on-containerd
skip the reason check for OOM reason test if it will fail
2023-10-11 01:03:17 +02:00
Patrick Ohly
19ecf93ec3 e2e: define features and node features
The list is based on the -list-tests output.
2023-10-10 18:15:49 +02:00
Patrick Ohly
f2d34426f8 e2e: enhance SIGDescribe
framework.SIGDescribe is better because:
- Ginkgo uses the source code location of the test, not of the wrapper,
  when reporting progress.
- Additional annotations can be passed.

To make this a drop-in replacement, framework.SIGDescribe generates a function
that can be used instead of the former SIGDescribe functions.

windows.SIGDescribe contained some additional code to ensure that tests are
skipped when not running with a suitable node OS. This gets moved into a
separate wrapper generator, to allow using framework.SIGDescribe as intended.
To ensure that all callers were modified, the windows.sigDescribe isn't
exported anymore (wasn't necessary in the first place!).
2023-10-10 18:15:49 +02:00
Patrick Ohly
39b6916cbc e2e: add wrapper functions to annotate tests
These wrapper functions set labels in addition to injecting the annotation into
the test text. It then becomes possible to select tests in different ways:

    ginkgo -v --focus="should respect internalTrafficPolicy.*\[FeatureGate:ServiceInternalTrafficPolicy\]"

    ginkgo -v --label-filter="FeatureGate:ServiceInternalTrafficPolicy"

    ginkgo -v --label-filter="Beta"

When a test runs, ginkgo shows it as:

    [It] should respect internalTrafficPolicy=Local Pod to Pod [FeatureGate:ServiceInternalTrafficPolicy] [Beta] [FeatureGate:ServiceInternalTrafficPolicy, Beta]

The test name and the labels at the end are in different colors. Embedding the
annotations inside the text is redundant and only done because users of the e2e
suite might expect it. Also, our tooling that consumes test results currently
doesn't know about ginkgo labels.

Environments, features and node features as described by
https://github.com/kubernetes/enhancements/tree/master/keps/sig-testing/3041-node-conformance-and-features
are also supported.

The framework and thus (at the moment) test/e2e do not have any pre-defined
environments and features. Adding those and modifying tests will follow in
a separate commit.
2023-10-10 18:15:49 +02:00
Patrick Ohly
535ab74346 e2e framework: track and report internal bugs
If something goes wrong during the test registration phase, the only solution
so far was to panic. This is not user-friendly and only allows reporting one
problem at a time.

If initialization can continue, then a better solution is to record a bug,
continue, and then report all bugs together.

This also works when just listing tests. The new verify-e2e-suites.sh uses that
to check all test suites (identified as "packages that call
framework.AfterReadingAllFlags", with some exceptions) as part of
pull-kubernetes-verify.

Example output for a fake

    framework.RecordBug(framework.NewBug("fake bug during SIGDescribe", 0))

in test/e2e/storage/volume_metrics.go:
```
$ hack/verify-e2e-suites.sh
go version go1.21.1 linux/amd64
ERROR: E2E test suite invocation failed for test/e2e.
   ERROR: E2E suite initialization was faulty, these errors must be fixed:
   ERROR: test/e2e/storage/volume_metrics.go:49: fake bug during SIGDescribe
E2E suite test/e2e_kubeadm passed.
E2E suite test/e2e_node passed.
```
2023-10-10 18:15:46 +02:00
Patrick Ohly
3afdcc03ea e2e framework: list specs and labels
-list-tests is a more concise alternative for `ginkgo --dry-run` with one line
per test. In contrast to `--dry-run`, it really lists all tests. `--dry-run`
without additional parameters uses the default skip expression from the E2E
context, which filters out flaky and feature-gated tests. The output includes
the source code location where each test is defined. It is sorted by test
name (not source code location) because that order is independent of
reorganizing the source code and ordering by location can be achieved with
"sort".

-list-labels has no corresponding feature in Ginkgo.

One possible usage is to figure out what values might make sense for
-focus/skip/label-filter.

Unit tests will follow in a future commit.
2023-10-10 17:15:59 +02:00
Patrick Ohly
aa4d2fc5ed e2e storage: remove useless info message
Always printing "Enabling in-tree volume drivers" whenever the E2E suite is
initializing doesn't provide any useful information and makes output of the
upcoming -list-tests look weird.
2023-10-10 17:15:59 +02:00
Kubernetes Prow Robot
09edfe4ebb
Merge pull request #121067 from carlory/cleanup-e2enode-framework-equal
e2e_node: stop using deprecated framework.ExpectEqual
2023-10-10 14:44:55 +02:00
Kubernetes Prow Robot
4a92b00db9
Merge pull request #121064 from carlory/cleanup-e2e-apimachinery-framework-equal
e2e_apimachinery: stop using deprecated framework.ExpectEqual
2023-10-10 13:33:02 +02:00
Kubernetes Prow Robot
4b9e15e0fe
Merge pull request #120873 from pohly/dra-e2e-test-driver-enhancements
e2e dra: enhance test driver
2023-10-10 13:32:55 +02:00
Stephen Heywood
00ae26cda8 Create e2e test for PV/PVC status endpoints
e2e test validates the following 6 endpoints
- patchCoreV1PersistentVolumeStatus
- readCoreV1PersistentVolumeStatus
- replaceCoreV1PersistentVolumeStatus
- patchCoreV1NamespacedPersistentVolumeClaimStatus
- readCoreV1NamespacedPersistentVolumeClaimStatus
- replaceCoreV1NamespacedPersistentVolumeClaimStatus
2023-10-10 09:38:16 +13:00
Kubernetes Prow Robot
e1788034c6
Merge pull request #120046 from chansuke/feature/update-the-function-to-get-nodes
Optimize test for scalability by using `GetBoundedReadySchedulableNodes()`
2023-10-09 21:44:21 +02:00
cpanato
a6b0a6c484
Bump distroless-iptables to v0.3.3
Signed-off-by: cpanato <ctadeu@gmail.com>
2023-10-09 13:18:46 +02:00
carlory
137ad9bf2b clean up deprecated funcs in e2e framework 2023-10-09 16:51:53 +08:00
carlory
d5d7fb595e e2e_node: stop using deprecated framework.ExpectEqual 2023-10-09 16:42:42 +08:00
carlory
5f74461bcd e2e_apimachinery: stop using deprecated framework.ExpectEqual 2023-10-09 15:44:20 +08:00
Katarzyna Lach
122ff5a212 Move grpc rate limitter from podresource folder
The rate limiter file is generic and implements the
gRPC Limiter interface. It can be reused by other gRPC
APIs, not only by podresource.

Change-Id: I905a46b5b605fbb175eb9ad6c15019ffdc7f2563
2023-10-09 07:22:23 +00:00
Kubernetes Prow Robot
cdc026fad1
Merge pull request #119915 from AxeZhan/rollout_e2e
Add a kubectl rollout undo e2e test
2023-10-09 04:53:29 +02:00
Kubernetes Prow Robot
3fc8c32425
Merge pull request #121051 from carlory/cleanup-e2e-apps-framework-equal
fix incorrect Consistently due to missing assignment when create a newAsyncAssertion
2023-10-08 21:53:52 +02:00
charles-chenzz
ccc6458683 e2e_node: add testcase to check status of pod ready to start condition are set to false after terminating 2023-10-08 20:40:36 +08:00
carlory
bb02d0feba fix incorrect Consistently due to missing assignment when create a newAsyncAssertion 2023-10-08 13:23:46 +08:00
Kubernetes Prow Robot
4b2225701e
Merge pull request #120819 from fusida/fix-master-e2e
e2e: set liveness probe timeout seconds for conformance test
2023-10-08 07:02:52 +02:00
Kubernetes Prow Robot
9bea6bda05
Merge pull request #121021 from cpanato/update-go-1212
[go] Bump images, dependencies and versions to go 1.21.2
2023-10-06 22:34:58 +02:00
Kubernetes Prow Robot
e339e03ff8
Merge pull request #120814 from Jefftree/fix-openapiv2-aggregator-apiservice
Fix AddUpdateAPIService for openapiv2
2023-10-06 21:21:32 +02:00
Kubernetes Prow Robot
9af7096c89
Merge pull request #120968 from borg-land/remove-kubeup-tests-1
Remove an e2e test specific to kubeup clusters - Part One
2023-10-06 19:20:11 +02:00
Jefftree
89adbb4b4a Unit & integration tests for OpenAPIV2 AddUpdateAPIService 2023-10-06 12:57:57 -04:00
Kubernetes Prow Robot
57144165f7
Merge pull request #120977 from borg-land/remove-kubeup-tests-2
Remove an e2e test specific to kubeup clusters - Part Two
2023-10-06 17:11:20 +02:00
cpanato
619abfec70
[go] Bump images, dependencies and versions to go 1.21.2
Signed-off-by: cpanato <ctadeu@gmail.com>
2023-10-06 15:18:34 +02:00
Gunju Kim
8b5f30ef09
Don't reuse CPU set of a restartable init container 2023-10-06 22:16:15 +09:00
Jiahui Feng
4d157bb142 add single boolean to composition e2e test. 2023-10-04 16:43:38 -07:00
Kubernetes Prow Robot
f19b62fc09
Merge pull request #120959 from pohly/e2e-test-whitespace-cleanup
e2e: remove redundant spaces in test names
2023-10-05 00:41:59 +02:00
Lukasz Szaszkiewicz
9b32a47ab6 integration/apimachinery: add TestReflectorWatchListFallback integration test 2023-10-03 14:56:50 +02:00
upodroid
6d759aac91 delete addon-manager test 2023-10-03 10:30:48 +01:00
Kubernetes Prow Robot
6a84edb2ce
Merge pull request #120222 from tkashem/apf-queue-wait-ctx
apf: manage request queue wait with context in APF Filter
2023-10-03 11:16:18 +02:00
Kubernetes Prow Robot
67733c650b
Merge pull request #119485 from carlory/cleanup-e2e-apps-framework-equal
e2e_apps: stop using deprecated framework.ExpectEqual
2023-10-02 11:45:02 -07:00
upodroid
243ba5fb6b remove an e2e test specific to kubeup clusters 2023-10-02 16:21:41 +01:00
Kubernetes Prow Robot
44cfd556b3
Merge pull request #120339 from pohly/scheduler-perf-dra-driver-names
scheduler_perf: use different log names for different DRA drivers
2023-10-02 06:32:56 -07:00
Kubernetes Prow Robot
5cc92713d1
Merge pull request #120335 from pohly/scheduler-perf-pod-name
scheduler_perf: show name of one pending pod in log
2023-10-02 06:32:45 -07:00
carlory
98bad643cd e2e_apps: stop using deprecated framework.ExpectEqual 2023-10-01 10:30:00 +08:00
Kubernetes Prow Robot
3e9f2f4d65
Merge pull request #120948 from marosset/fix-mem-limit-windows-test
Fixing attempt to deploy past allocatable memory limits test on Windows
2023-09-29 14:33:57 -07:00
Mark Rossetti
0b76e7b69f
Fixing attempt to deploy past allocatable memory limits test on Windows
Signed-off-by: Mark Rossetti <marosset@microsoft.com>
2023-09-29 13:26:51 -07:00
Kubernetes Prow Robot
46728a8fa2
Merge pull request #120896 from aramase/aramase/f/kmsv2_pkcs11
[KMSv2] pkcs11 reference implementation using SoftHSM
2023-09-29 12:11:56 -07:00
Anish Ramasekar
2f5708833a
[KMSv2] pkcs11 reference implementation using SoftHSM
Signed-off-by: Anish Ramasekar <anish.ramasekar@gmail.com>
2023-09-29 16:57:26 +00:00
chansuke
74180aaf66 Fix scalability concerns in test runs 2023-09-30 00:17:51 +09:00
Patrick Ohly
0e8a1f1816 e2e: remove redundant spaces in test names
The spaces are redundant because Ginkgo will add them itself when concatenating
the different test name components. An upcoming change in the framework will
enforce that there are no such redundant spaces.
2023-09-29 08:30:57 +02:00
Kubernetes Prow Robot
f599a98205
Merge pull request #120834 from dzjiang91/fix-panic
Fix panic testing intree vSphere dynamic PV.
2023-09-28 18:00:40 -07:00
Davanum Srinivas
d900217664
fix missed branch - targets when building using arm64
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2023-09-27 15:52:37 -04:00
Kubernetes Prow Robot
d924775d78
Merge pull request #120750 from pohly/test-output-init-fix
e2e framework: adapt unit test to Go 1.22
2023-09-27 06:26:10 -07:00
Kubernetes Prow Robot
fe6d64d080
Merge pull request #120402 from Rei1010/storageClean
e2e_storage:stop using deprecated framework.ExpectNotEqual
2023-09-27 05:14:08 -07:00
Kubernetes Prow Robot
36b474b008
Merge pull request #120893 from dims/build-kubelet-with-cgo-for-sig-node-e2e-tests-not-ginkgo
Build kubelet with CGO for sig-node e2e tests (not ginkgo)
2023-09-26 21:05:21 -07:00
Kubernetes Prow Robot
bb024e1f91
Merge pull request #120347 from shijinye/e2eclean
e2e_network:stop using deprecated framework.ExpectError
2023-09-26 12:04:59 -07:00
Abu Kashem
11ef9514da
apf: remove RequestWaitLimit from queueset config 2023-09-26 08:55:23 -04:00
Davanum Srinivas
52f5093d77
Build kubelet with CGO for sig-node e2e tests (not ginkgo)
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2023-09-26 08:32:59 -04:00
Patrick Ohly
36146ad686 e2e dra: enhance test driver
Several enhancements:
- `--resource-config` is now listed under `controller` options instead of
  `leader election`: merely a cosmetic change
- The driver name can be configured as part of the resource config. The
  command line flag overrides the config, but only when set explicitly.
  This makes it possible to pre-define complete driver setups where the
  name is associated with certain resource availability. This will be
  used for testing cluster autoscaling.
- The set of nodes where resources are available can optionally be specified
  via node labels. This will be used for testing cluster autoscaling.
2023-09-25 19:50:33 +02:00
Kubernetes Prow Robot
1c651cf739
Merge pull request #120817 from bertinatto/fix-panic-vsphere
e2e: bootstrap vsphere tests earlier
2023-09-25 08:57:36 -07:00
Dezhou Jiang
06deb49d2c Fix panic testing intree vSphere dynamic PV.
Change-Id: I9d1ee7f49f01c3308b967d84865fa6bcfafc4b0d
2023-09-22 18:58:44 +00:00
Kubernetes Prow Robot
3ac83f528d
Merge pull request #119290 from carlory/add-logger
the scheduling queue logs the error and treats it as QueueAfterBackoff
2023-09-22 08:10:49 -07:00
fusida
9fb77db9b9 e2e: set liveness probe timeout seconds for conformance test 2023-09-22 11:05:00 +08:00
Fabio Bertinatto
eb109e3948 e2e: bootstrap vsphere tests earlier 2023-09-21 16:19:37 -03:00
Kubernetes Prow Robot
ed552363c3
Merge pull request #120405 from Rei1010/introduceNewMethod
using wait.Interrupted instead of deprecated wait.ErrWaitTimeout for apps
2023-09-21 07:24:44 -07:00
Kubernetes Prow Robot
ee73715dd9
Merge pull request #120774 from pacoxu/revert-120749-skip-kube-dns-tests-when-missing
Revert "Skip kube-dns tests if coredns is installed"
2023-09-20 21:22:14 -07:00
carlory
0105a002bc when the hint fn returns error, the scheduling queue logs the error and treats it as QueueAfterBackoff.
Co-authored-by: Kensei Nakada <handbomusic@gmail.com>

Co-authored-by: Kante Yin <kerthcet@gmail.com>

Co-authored-by: XsWack <xushiwei5@huawei.com>
2023-09-21 09:40:44 +08:00
Kubernetes Prow Robot
884bc96fec
Merge pull request #120773 from swatisehgal/tm-metrics-e2e-deflake
topology-mgr: metrics: Deflake Topology Manager metrics e2e tests
2023-09-20 11:26:26 -07:00
Kubernetes Prow Robot
7fb7e2625b
Merge pull request #120401 from shijinye/e2eclean-node-notequal
cleanup:e2e:stop using deprecated framework.ExpectNotEqual
2023-09-20 11:26:19 -07:00
Kubernetes Prow Robot
3191493cea
Merge pull request #119402 from Tal-or/e2e_podres_terminal_pods
e2e:podresources: verify count for terminal pods
2023-09-20 11:26:11 -07:00
Kubernetes Prow Robot
89b4153d4d
Merge pull request #116884 from mengjiao-liu/contextual-logging-scheduler-plugin-nodevolumelimits
Change the scheduler plugins FactoryAdapter function to use context parameter to pass logger
2023-09-20 11:26:00 -07:00
Kubernetes Prow Robot
7dcb412675
Merge pull request #120762 from Riaankl/remove-storageclass-endpoints
remove storageclass endpoints from pending_eligible_endpoints.yaml
2023-09-20 07:02:14 -07:00
Kubernetes Prow Robot
71e6923af5
Merge pull request #120761 from ii/promote-storageclass-lifecycle-test
Promote e2e test for StorageClass Endpoints + 7 Endpoints
2023-09-20 07:02:02 -07:00
Paco Xu
7cfdb792d6
Revert "Skip kube-dns tests if coredns is installed" 2023-09-20 21:04:54 +08:00
Swati Sehgal
f5d915b594 topology-mgr: metrics: Deflake Topology Manager metrics e2e tests
On local execution of Topology Manager metrics tests, the test pass rate was 100%.
Yet, we can see that the Topology Manager metrics tests are failing in upstream
CI consistently: https://testgrid.k8s.io/sig-node-presubmits#pr-kubelet-serial-gce-e2e-topology-manager.

From the logs, it was identified that these failures are because of timeouts,
so we are increasing the default timeout as well as polling interval frequency
of obtaining KubeletMetrics to deflake this test.

We have noticed a similar flake in case of CPU manager metrics tests as well:
https://prow.k8s.io/view/gs/kubernetes-jenkins/pr-logs/directory/pull-kubernetes-node-kubelet-serial-cpu-manager/1701615009836044288.
Once it is confirmed that the issue is resolved for the Topology Manager test,
we will fix this for CPU Manager as well in a follow-up PR.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2023-09-20 13:37:27 +01:00
Mengjiao Liu
a7466f44e0 Change the scheduler plugins PluginFactory function to use context parameter to pass logger
- Migrated pkg/scheduler/framework/plugins/nodevolumelimits to use contextual logging
- Fix golangci-lint validation failed
- Check for plugins creation err
2023-09-20 17:49:54 +08:00
Kubernetes Prow Robot
144baff189
Merge pull request #120598 from neolit123/1.29-kubeadm-ownders-cleanup
/cmd/kubeadm,/test/e2e*: update OWNERS for fabriziopandini & RA489
2023-09-19 21:39:59 -07:00
Kubernetes Prow Robot
963c9b3cb9
Merge pull request #119317 from mochizuki875/fix_ds_rolling_update_118823
Exclude nodes from rolling update depending on tolerations
2023-09-19 16:50:17 -07:00
Stephen Heywood
de244b2678 Promote StorageClass e2e test to Conformance 2023-09-20 09:44:55 +12:00
Riaan Kleinhans
b23575d0fc remove storageclass endpoints
Signed-off-by: Riaan Kleinhans <riaan@ii.coop>
2023-09-20 09:17:58 +12:00
upodroid
9e060590e6 skip kube-dns tests if kube-dns is missing 2023-09-19 12:48:54 +01:00
mochizuki875
2a82776745 change rolling update logic to exclude sunsetting nodes 2023-09-19 11:39:32 +00:00
Patrick Ohly
83c37acb21 e2e framework: adapt unit test to Go 1.22
Go 1.22 changed the name of init functions from "glob..func" to
"init.func". That difference is acceptable and has to be ignored when comparing
output.
2023-09-19 12:18:34 +02:00
Kubernetes Prow Robot
12dc19d46f
Merge pull request #119393 from saschagrunert/debian-base
Update debian-base for test images and others
2023-09-19 00:48:55 -07:00
Antonio Ojea
bbb905c794 remove gce healthcheck interval e2e lb test
Change-Id: Idcd8c4c6b39cbd419ca58b7354508ebe81501e91
2023-09-17 19:47:32 +00:00
Patrick Ohly
7cac1dcf67 dra scheduler: fall back to SSA for PodSchedulingContext updates
During scheduler_perf testing, roughly 10% of the PodSchedulingContext update
operations failed with a conflict error. Using SSA would avoid that, but
performance measurements showed that this causes a considerable
slowdown (primarily because of the slower encoding with JSON instead of
protobuf, but also because server-side processing is more expensive).

Therefore a normal update is tried first and SSA only gets used when there has
been a conflict. Using SSA in that case instead of giving up outright is better
because it avoids another scheduling attempt.
2023-09-15 15:05:38 +02:00
Kubernetes Prow Robot
d393d4e151
Merge pull request #120574 from logicalhan/cslis
promote component SLIs to GA; remove feature gates for component slis
2023-09-14 22:52:12 -07:00
Kubernetes Prow Robot
9bba6cd627
Merge pull request #120600 from wojtek-t/conformance_metadata
Fix missing conformance metadata for API Chunking tests
2023-09-14 18:50:11 -07:00
mochizuki875
e22f8ed553 add CAP_NET_RAW to netadmin profile and remove privileged 2023-09-14 08:55:58 +00:00
Kubernetes Prow Robot
4abf29c5c8
Merge pull request #120651 from humblec/test-rbd
test-e2e: Retrigger the RBD image post merge job
2023-09-14 01:00:29 -07:00
Kubernetes Prow Robot
8eaaf2f9b9
Merge pull request #120403 from wlq1212/cheanup/notwork/expectnotequal
e2e_network:stop using deprecated framework.ExpectNotEqual
2023-09-14 01:00:16 -07:00
Kubernetes Prow Robot
a68093a3ff
Merge pull request #120506 from alexzielenski/import-restrictions
Update e2e import restrictions
2023-09-13 21:56:22 -07:00
Humble Chirammal
0288367db8 test-e2e: Retrigger the RBD image post merge job
The post-merge job failed (#117103),
which causes the e2e tests to fail. Since we recently increased
the timeout, retrigger the job.

Signed-off-by: Humble Chirammal <humble.devassy@gmail.com>
2023-09-14 09:38:24 +05:30
Kubernetes Prow Robot
716b8b9d83
Merge pull request #120623 from aojea/service_status_Finalizer
sync Service API status rest storage
2023-09-13 17:56:11 -07:00
Antonio Ojea
21e26486ac sync Service API status rest storage
The Service API Rest implementation is complex and has to use different
hooks on the REST storage. The status store was making a shallow copy of
the storage before adding the hooks, so it was not inheriting the hooks.

The status store must have the same hooks as the rest store to be able
to handle correctly the allocation and deallocation of ClusterIPs and
nodePorts.
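
The ordering problem can be illustrated with a much simplified model. This is
only a sketch: the `store` type and both function names are hypothetical and do
not mirror the real Service REST storage.

```go
package main

import "fmt"

// store is a hypothetical, much simplified stand-in for a REST store.
type store struct {
	hooks []string
}

// statusBeforeHooks copies the store first and adds the hooks afterwards,
// so the copy never sees them (the problem described above).
func statusBeforeHooks() (rest, status store) {
	status = rest // shallow copy taken too early
	rest.hooks = append(rest.hooks, "allocate", "release")
	return rest, status
}

// statusAfterHooks adds the hooks first, so the copy inherits them.
func statusAfterHooks() (rest, status store) {
	rest.hooks = append(rest.hooks, "allocate", "release")
	status = rest
	return rest, status
}

func main() {
	_, early := statusBeforeHooks()
	_, late := statusAfterHooks()
	fmt.Println(len(early.hooks), len(late.hooks))
}
```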

Change-Id: I44be21468d36017f0ec41a8f912b8490f8f13f55
Signed-off-by: Antonio Ojea <aojea@google.com>
2023-09-13 11:35:42 +00:00
wlq1212
239ce7c52b e2e_network:stop using deprecated framework.ExpectNotEqual 2023-09-13 18:46:49 +08:00
Kubernetes Prow Robot
50578c4b58
Merge pull request #119902 from RomanBednar/pv-phase-transition-time-e2e
e2e: add test for verification of timestamp for new PVs
2023-09-13 03:00:11 -07:00
Kubernetes Prow Robot
160fe010f3
Merge pull request #120464 from gjkim42/deflake-container-lifecycle-e2e-test
e2e_node: Assign enough time to finish the postStart hook
2023-09-12 17:44:44 -07:00
Kubernetes Prow Robot
04e5914079
Merge pull request #120349 from ruquanzhao/fixTopologyManagerJobs
e2e-node: fix TopologyManager test jobs.
2023-09-12 17:44:37 -07:00
Kubernetes Prow Robot
8aeebda818
Merge pull request #120306 from Rei1010/nodeClean
e2e_node:stop using deprecated framework.ExpectError
2023-09-12 17:44:23 -07:00
Kubernetes Prow Robot
9afd83d9bd
Merge pull request #120614 from tzneal/add-check-for-pod-success-status
sidecar tests: specifically check that the pod was successful
2023-09-12 16:40:10 -07:00
Kubernetes Prow Robot
81ac3f9e2e
Merge pull request #120552 from ii/promote-pv-pvc-lifecycle-test
Promote e2e test for PersistentVolume & PersistentVolumeClaim Endpoints + 13 Endpoints
2023-09-12 14:42:22 -07:00
Kubernetes Prow Robot
d147da0304
Merge pull request #120597 from machine424/apiserver_storage_objects
clarify the description of the metric apiserver_storage_objects
2023-09-12 13:34:23 -07:00
Kubernetes Prow Robot
75ed5cc556
Merge pull request #120553 from Riaankl/remove-persistentvolume-endpoints
Remove persistentvolume endpoints from pending_eligible_endpoints.yaml
2023-09-12 13:34:11 -07:00
Kubernetes Prow Robot
e897483713
Merge pull request #120583 from ritazh/kms-livezchecks
kms: remove livez check
2023-09-12 12:18:12 -07:00
Todd Neal
af151eeba2 specifically check that the pod was successful 2023-09-12 13:40:20 -05:00
Rita Zhang
7710128636
kms: remove livez check
Signed-off-by: Rita Zhang <rita.z.zhang@gmail.com>
2023-09-12 08:48:26 -07:00
Kubernetes Prow Robot
24bc34b1bf
Merge pull request #120516 from togettoyou/cleanup-client-go
cleanup: Replace the deprecated NewMemCacheClient with memory.NewMemCacheClient
2023-09-12 07:48:24 -07:00
machine424
084deb7a7c
clarify the description of the metric apiserver_storage_objects 2023-09-12 16:36:41 +02:00
Wojciech Tyczyński
9da533ced6 API Chunking conformance metadata 2023-09-12 13:12:49 +02:00
Lubomir I. Ivanov
3d91ffc6f9 /cmd/kubeadm,/test/e2e*: update OWNERS for fabriziopandini & RA489
- move fabriziopandini to emeritus_approvers for /test/e2e*
and /cmd/kubeadm. fabriziopandini remains in /OWNERS_ALIASES
under sig-cluster-lifecycle-leads.
- remove RA489 as reviewer for /test/e2e* and /cmd/kubeadm
2023-09-12 12:35:40 +03:00
Fabian Fulga
868adeb3bd Update IPPVS e2e tests for containerd version above 1.6.9 2023-09-12 01:53:28 +03:00
Aldo Culquicondor
6b4ab616a2
Increase range of job_sync_duration_seconds
Change-Id: I7ed4b006faecf0a7e6e583c42b4d6bc4b786a164
2023-09-11 18:01:33 -04:00
Kubernetes Prow Robot
be968597e3
Merge pull request #120310 from gjkim42/use-container-log-instead-of-termination-log
Use container log instead of termination log
2023-09-11 11:52:23 -07:00
Han Kang
09d64ed7d4 promote sli metrics to stable 2023-09-11 10:17:42 -07:00
Kensei Nakada
0d3eafdfa3
fix(scheduling_queue): always put Pods with no unschedulable plugins into activeQ/backoffQ (#119105)
* always put Pods with no unschedulable plugins into activeQ/backoffQ

* address review comments
2023-09-11 09:30:11 -07:00
Gunju Kim
1fb4eee94e
Use container log instead of termination log
Since the termination log cannot be accessed until the container is
terminated, use the container log.
2023-09-11 22:55:09 +09:00
Patrick Ohly
fc3ee07b51 e2e pods: fix WaitForPodsResponding retry
The status error was embedded inside the new error constructed by
WaitForPodsResponding's get function, but not wrapped. Therefore
`apierrors.IsServiceUnavailable(err)` didn't find it and returned false -> no
retries.

Wrapping fixes this and Gomega formatting of the error remains useful:

	err := &errors.StatusError{}
	err.ErrStatus.Code = 503
	err.ErrStatus.Message = "temporary failure"

	err2 := fmt.Errorf("Controller %s: failed to Get from replica pod %s:\n%w\nPod status:\n%s",
		"foo", "bar",
		err, "some status")
	fmt.Println(format.Object(err2, 1))
	fmt.Println(errors.IsServiceUnavailable(err2))

=>

    <*fmt.wrapError | 0xc000139340>:
    Controller foo: failed to Get from replica pod bar:
    temporary failure
    Pod status:
    some status
    {
        msg: "Controller foo: failed to Get from replica pod bar:\ntemporary failure\nPod status:\nsome status",
        err: <*errors.StatusError | 0xc0001a01e0>{
            ErrStatus: {
                TypeMeta: {Kind: "", APIVersion: ""},
                ListMeta: {
                    SelfLink: "",
                    ResourceVersion: "",
                    Continue: "",
                    RemainingItemCount: nil,
                },
                Status: "",
                Message: "temporary failure",
                Reason: "",
                Details: nil,
                Code: 503,
            },
        },
    }

    true
2023-09-11 11:54:15 +02:00
Kubernetes Prow Robot
cc0a24d2e8
Merge pull request #120406 from wlq1212/cheanup/framework/timeout
e2e_framework:stop using deprecated wait.ErrWaitTimeout
2023-09-10 21:10:10 -07:00
Stephen Heywood
41b62c4dd7 Promote PV/PVC e2e test to Conformance 2023-09-11 10:25:07 +12:00
Riaan Kleinhans
0936c8de59
remove persistentvolume endpoints from pending_eligible_endpoints.yaml 2023-09-11 06:53:28 +12:00
Kubernetes Prow Robot
25c7a1439a
Merge pull request #120069 from aojea/service_conformance
promote to conformance Service multiprotocol tests
2023-09-10 07:26:09 -07:00
Kubernetes Prow Robot
b343878daa
Merge pull request #120438 from ritazh/kmsv2-metrics-apiserverid
kmsv2: add apiserver identity to metrics
2023-09-09 16:46:09 -07:00
Rita Zhang
43ccf6c4e8
kmsv2: add apiserver identity to metrics
Signed-off-by: Rita Zhang <rita.z.zhang@gmail.com>
2023-09-09 15:31:32 -07:00
Kubernetes Prow Robot
33c5bd631d
Merge pull request #120008 from skitt/drop-intstr-ptr-wrappers
Use ptr.To to retrieve intstr addresses
2023-09-09 07:24:09 -07:00
Kubernetes Prow Robot
817488e4fa
Merge pull request #120082 from aojea/hostnetwork_services_fallback
e2e network test for udp services with hostNetwork clients
2023-09-08 12:01:12 -07:00
cpanato
f499b4f891
Bump distroless-iptables to v0.3.2
Signed-off-by: cpanato <ctadeu@gmail.com>
2023-09-08 15:01:22 +02:00
Kubernetes Prow Robot
1773a162bf
Merge pull request #120484 from carlory/fix-120475
fix breaking pv creation when the validation webhook exists
2023-09-08 04:18:14 -07:00
Stephen Kitt
aa89e6dc97
Use ptr.To to retrieve intstr addresses
This uses the generic ptr.To in k8s.io/utils to replace functions and
code constructs which only serve to return pointers to intstr
values. Other uses of the deprecated pointer package are updated in
modified files.
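
The kind of replacement described here can be sketched without any
dependencies. The local `to` function only mirrors the shape of the generic
`ptr.To` helper, and `intOrString` is a hypothetical, simplified stand-in for
the real intstr type.

```go
package main

import "fmt"

// to mirrors the shape of a generic "pointer to value" helper: it returns
// a pointer to a copy of v. Defined locally so the sketch is self-contained.
func to[T any](v T) *T { return &v }

// intOrString is a hypothetical, simplified stand-in for an intstr value.
type intOrString struct {
	IntVal int32
	StrVal string
}

func main() {
	// One generic helper replaces per-type wrappers that only existed to
	// return the address of a value.
	port := to(intOrString{IntVal: 8080})
	name := to(intOrString{StrVal: "http"})
	fmt.Println(port.IntVal, name.StrVal)
}
```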

Signed-off-by: Stephen Kitt <skitt@redhat.com>
2023-09-08 11:10:50 +02:00
Sascha Grunert
5e0931336b
kubelet: fix metric container_start_time_seconds's timestamp
Adapting the tests and reverting https://github.com/kubernetes/kubernetes/pull/103429

Carry-over from https://github.com/kubernetes/kubernetes/pull/117881

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2023-09-08 09:13:37 +02:00
Junhao Zou
43c05e98ca
cleanup: Replace the deprecated NewMemCacheClient with memory.NewMemCacheClient 2023-09-08 11:57:46 +08:00
Kubernetes Prow Robot
b685f5204a
Merge pull request #120471 from ii/create-storageclass-lifecycle-test
Write e2e test for StorageClass Endpoints + 7 Endpoints
2023-09-07 14:40:57 -07:00
Alexander Zielenski
7a13b11af0 update e2e import restrictions 2023-09-07 12:20:29 -07:00
cpanato
98af8befe2
[go] Bump images, dependencies and versions to go 1.21.1
Signed-off-by: cpanato <ctadeu@gmail.com>
2023-09-07 14:56:56 +02:00
Kubernetes Prow Robot
1a1a3d0a87
Merge pull request #120482 from carlory/cleanup-1
HandleRetry is already called in GetObject
2023-09-07 03:53:08 -07:00
carlory
2d1877c97b fix breaking pv creation when the validation webhook exists 2023-09-07 16:45:53 +08:00
Kubernetes Prow Robot
163553bbe0
Merge pull request #120479 from carlory/cleanup-e2e-apimachinery-framework-equal
fix typecast missing
2023-09-07 01:26:58 -07:00
carlory
f33265cf5d HandleRetry is already called in GetObject 2023-09-07 15:48:18 +08:00
Kubernetes Prow Robot
2d5b6f16f5
Merge pull request #120213 from pohly/dra-scheduler-resourceclass-missing
dra: resourceclass missing
2023-09-06 23:47:09 -07:00
Kubernetes Prow Robot
b27670dfbd
Merge pull request #118740 from saschagrunert/kubelet-label-types
Make kubelet label types public
2023-09-06 23:46:57 -07:00
carlory
fc51a27c03 fix typecast missing 2023-09-07 14:25:29 +08:00
Francesco Romani
2ea47038b9 podresources: e2e: force eager connection
Add and use more facilities to the *internal* podresources client.
Checking e2e test runs, we see quite a few errors like:
```
rpc error: code = Unavailable desc = connection error: desc = "transport: Error while dialing: dial unix /var/lib/kubelet/pod-resources/kubelet.sock: connect: connection refused": rpc error: code = Unavailable desc = connection error: desc = "transport: Error while dialing: dial unix /var/lib/kubelet/pod-resources/kubelet.sock: connect: connection refused"
```

This is likely caused by kubelet restarts, which we do a lot in e2e tests,
combined with the fact that gRPC connects lazily AND we don't really
check the errors in client code - we just bubble them up.

While it's arguably bad that we don't check error codes properly, it's also
true that in the main case, e2e tests, the functions should never
fail except in a few well-known cases; we're connecting over a
super-reliable unix domain socket after all.

So, we centralize the fix by adding a function (along with minor
cleanups) which triggers the connection and ensures it happens,
localizing the changes just here. The main advantage is that this approach
is opt-in, composable, and doesn't leak gRPC details into the client
code.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2023-09-07 08:24:49 +02:00
Kensei Nakada
87d49a51be fix(queue_test): make sure the first bind failure via counter 2023-09-06 19:47:54 +00:00
Kubernetes Prow Robot
eb0724a699
Merge pull request #120404 from kerthcet/flaky-test/bind
Output the error message for better analysis
2023-09-06 12:35:08 -07:00
Kubernetes Prow Robot
cf8f9a8d07
Merge pull request #119334 from kmala/serverapply
update the dependency sigs.k8s.io/structured-merge-diff/v4 to latest tag
2023-09-06 12:34:55 -07:00
Kubernetes Prow Robot
3ed537ab46
Merge pull request #119988 from bzsuni/cleanup/e2e/app
[e2e_app] stop using deprecated framework.ExpectNotEqual
2023-09-06 11:08:51 -07:00
Todd Neal
94afd6e3a4 skip the reason check for OOM tests if it will fail
This is currently flaking badly due to a race between cgroup deletion
and the runtime detecting the OOM kill.
2023-09-06 12:20:02 -05:00
Gunju Kim
b468e4eb1c
e2e_node: Assign enough time to finish the postStart hook
This deflakes the "Containers Lifecycle should not launch second
container before PostStart of the first container completed" test by
assigning enough time to finish the postStart hook.
2023-09-07 00:42:54 +09:00
Kubernetes Prow Robot
56cc5e77a1
Merge pull request #120441 from tzneal/revert-npd-update
Revert "bump npd to v0.8.14"
2023-09-06 06:39:04 -07:00
Kubernetes Prow Robot
c0a38d01aa
Merge pull request #120433 from enj/enj/f/kmsv2_default_enable_kdf
kmsv2: enable KMSv2KDF feature gate by default
2023-09-06 06:38:52 -07:00
Kubernetes Prow Robot
a759bf2b1e
Merge pull request #119461 from carlory/cleanup-e2e-auth-framework-equal
e2e_auth: stop using deprecated framework.ExpectEqual
2023-09-06 05:35:19 -07:00
Kubernetes Prow Robot
ea5862ca4e
Merge pull request #119459 from carlory/cleanup-e2e-apimachinery-framework-equal
e2e_apimachinery: stop using deprecated framework.ExpectEqual
2023-09-06 05:35:12 -07:00
Kubernetes Prow Robot
8703ca2413
Merge pull request #118996 from cpanato/golang121
[go] Bump images, dependencies and versions to go 1.21.0
2023-09-06 05:34:54 -07:00
Kubernetes Prow Robot
a8aa3621f2
Merge pull request #120407 from shijinye/e2ecleanup-storage-errwaittimeout
cleanup:e2e:stop using deprecated wait.ErrWaitTimeout
2023-09-06 04:20:51 -07:00
carlory
36d3672249 e2e_auth: stop using deprecated framework.ExpectEqual 2023-09-06 18:29:28 +08:00
cpanato
67858e1ff0
Bump images, dependencies and versions to go 1.21.0
Signed-off-by: cpanato <ctadeu@gmail.com>
2023-09-06 12:06:51 +02:00
Patrick Ohly
c682d2b8c5 scheduler: add ResourceClass events
When filtering fails because a ResourceClass is missing, we can treat the pod
as "unschedulable" as long as we then also register a cluster event that wakes
up the pod. This is more efficient than periodically retrying.
2023-09-06 11:14:08 +02:00
carlory
15a402eefb e2e_apimachinery: stop using deprecated framework.ExpectEqual 2023-09-06 14:43:45 +08:00
Kubernetes Prow Robot
04292dd94b
Merge pull request #120369 from Gauravpadam/update-generated-html
Generated new documentation.md for kubernetes metrics reference
2023-09-05 23:02:50 -07:00
Kubernetes Prow Robot
debe30de70
Merge pull request #120281 from gjkim42/feature-gate-sidecar-containers-in-kuberuntime
Feature-gate SidecarContainers code in pkg/kubelet/kuberuntime
2023-09-05 18:34:54 -07:00
Todd Neal
355ae44a3c Revert "bump npd to v0.8.14"
This reverts commit 7b44d73f73.
2023-09-05 20:28:53 -05:00
Stephen Heywood
e7ee3ae3d9 Create e2e test for StorageClass endpoints
e2e test validates the following 7 endpoints
- createStorageV1StorageClass
- deleteStorageV1CollectionStorageClass
- deleteStorageV1StorageClass
- listStorageV1StorageClass
- patchStorageV1StorageClass
- readStorageV1StorageClass
- replaceStorageV1StorageClass
2023-09-06 12:09:10 +12:00
Kubernetes Prow Robot
3ad8fb5dcf
Merge pull request #120350 from Gauravpadam/metrics-change
Changed the generator codes for the kubernetes metrics reference table
2023-09-05 13:09:02 -07:00
Kubernetes Prow Robot
f68c66f96d
Merge pull request #119142 from aramase/aramase/f/kep_3331_add_feature_flag
[StructuredAuthenticationConfig] Add feature flag and wire up `--authentication-config` flag
2023-09-05 13:08:51 -07:00
Gauravpadam
2144852f7c Commits merged
Some label fixes

Wrapper class is now metric

merged commits
2023-09-06 01:32:09 +05:30
Monis Khan
657cc2045e
kmsv2: enable KMSv2KDF feature gate by default
Signed-off-by: Monis Khan <mok@microsoft.com>
2023-09-05 15:20:10 -04:00
Gauravpadam
0024c7e166 Changed the generator codes for kubernetes metrics table
Corrected the gotemplate range call

Modified the wrapper class

Delete test/instrumentation/documentation/documentation.md

Removed documentation.md change as we're changing it over the other PR

Restored the original doc.md ; PR is solely for the generator code now

Some label fixes

merge commits
2023-09-05 23:32:36 +05:30
AxeZhan
059e8353b7 check labels/annotations in the beginning 2023-09-05 19:23:01 +08:00
jinye
a774887262 cleanup:e2e:stop using deprecated framework.ExpectNotEqual 2023-09-05 18:16:57 +08:00
Arda Güçlü
e8564d6c5b KEP-3895: Promote interactive delete to beta 2023-09-05 11:16:30 +03:00
wlq1212
ef235c4eac e2e_framework:stop using deprecated wait.ErrWaitTimeout 2023-09-05 14:19:42 +08:00
tao.yang
b35357b6c0 cleanup: omit comparison with bool constants
Signed-off-by: tao.yang <tao.yang@daocloud.io>
2023-09-05 10:24:38 +08:00
wen.rui
622eac2347 e2e_storage:stop using deprecated framework.ExpectNotEqual 2023-09-05 09:44:43 +08:00
jinye
9795df456f cleanup:e2e:stop using deprecated wait.ErrWaitTimeout 2023-09-04 21:53:08 +08:00
Kubernetes Prow Robot
294bde0079
Merge pull request #120119 from Ithrael/fix/kubectl-events-doesnt-filter-events-by-GroupVersion-for-resource-with-full-name
fix(kubectl): kubectl events doesn't filter events by GroupVersion fo…
2023-09-04 06:45:48 -07:00
wen.rui
fc25af7f54 using wait.Interrupted instead of deprecated wait.ErrWaitTimeout for apps 2023-09-04 21:36:53 +08:00
kerthcet
e6dfdb240f Output the error message for better analysis
Signed-off-by: kerthcet <kerthcet@gmail.com>
2023-09-04 20:31:48 +08:00
Kubernetes Prow Robot
6586296afc
Merge pull request #120305 from Rei1010/cleanUp
e2e_scheduling:stop using deprecated framework.ExpectError
2023-09-04 04:49:59 -07:00
Kubernetes Prow Robot
a800e8534e
Merge pull request #119487 from carlory/cleanup-e2e-autoscaling-framework-equal
e2e_autoscaling: stop using deprecated framework.ExpectEqual
2023-09-04 04:49:48 -07:00
AxeZhan
ff3cb43c30 remove labeld-deployment.yaml 2023-09-04 18:34:28 +08:00
Patrick Ohly
c74d045c4b scheduler_perf: show name of one pending pod in error message
If pods get stuck, then giving the name of one makes it possible
to search for it in the log output. Without the name it's hard
to figure out which pods got stuck.
2023-09-04 09:54:26 +02:00
jinye
95118ec30c e2e_network:stop using deprecated framework.ExpectError 2023-09-04 11:04:29 +08:00
Kubernetes Prow Robot
751058c0a4
Merge pull request #120285 from wlq1212/cleanup/e2e/clouod
e2e_cloud&common:stop using deprecated framework.ExpectError
2023-09-03 01:21:45 -07:00
Kubernetes Prow Robot
84faedfbfe
Merge pull request #119989 from bzsuni/cleanup/e2e/auth
[e2e_auth] stop using deprecated framework.ExpectNotEqual
2023-09-02 07:33:11 -07:00
Monis Khan
95121fe846
kmsv2: add legacy data integration test
Signed-off-by: Monis Khan <mok@microsoft.com>
2023-09-01 15:33:28 -04:00
Kubernetes Prow Robot
a99e377a54
Merge pull request #120221 from enj/enj/i/kms_cache_metrics_lock
kmsv2: fix race in simpleCache.set when setting cache size metric
2023-09-01 10:00:31 -07:00
ithrael
39e433891b fix(test): delete cronjobs.v1.example.com pi for clean up 2023-09-01 23:13:26 +08:00
Kubernetes Prow Robot
c3aa3ac3bd
Merge pull request #119468 from Songjoy/cleanup-e2e-upgrades-framework-equal
e2e_upgrades: stop using deprecated framework.ExpectEqual
2023-09-01 07:54:32 -07:00
ithrael
36e33cd88e fix(test): Remove 'shortNames' Property from CronJob Fields in e2e-test 2023-09-01 22:16:06 +08:00
Kubernetes Prow Robot
9a0c389fd7
Merge pull request #120352 from saschagrunert/distroless-bump
Update distroless-iptables image version to v0.3.1
2023-09-01 06:52:43 -07:00
Kubernetes Prow Robot
b2499a1c28
Merge pull request #119558 from Songjoy/cleanup-e2e-storage-framework-equal
e2e_storage: stop using deprecated framework.ExpectEqual
2023-09-01 05:28:30 -07:00
Sascha Grunert
936d67f688
Update distroless-iptables image version to v0.3.1
Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2023-09-01 13:10:15 +02:00
wlq1212
a56745c706 e2e_cloud:stop using deprecated framework.ExpectError 2023-09-01 17:53:37 +08:00
RuquanZhao
bfc3c2110f e2e-node: fix TopologyManager test jobs.
Signed-off-by: Ruquan Zhao <ruquan.zhao@arm.com>
2023-09-01 17:53:16 +08:00
wen.rui
3d9b5d0577 e2e_node:stop using deprecated framework.ExpectError 2023-09-01 17:42:36 +08:00
Rei1010
f1051f34a7 Remove deprecated framework 2023-09-01 17:39:37 +08:00
Patrick Ohly
78f3b76390 scheduler_perf: use different log names for different DRA drivers
This helps when using -feature-gate=ContextualLogging=true and running the
SchedulingWithMultipleResourceClaims test case because then output from the two
driver instances is easy to distinguish.
2023-09-01 09:25:09 +02:00
Kubernetes Prow Robot
400059d025
Merge pull request #120194 from bzsuni/bz/bump/npd
bump npd to v0.8.14
2023-08-31 20:52:30 -07:00
Kubernetes Prow Robot
efadb94a74
Merge pull request #120259 from SataQiu/clean-apf-20230830
apf: remove v1alpha1 API
2023-08-31 19:44:42 -07:00
Monis Khan
b10697c788
kmsv2: fix race in simpleCache.set when setting cache size metric
Signed-off-by: Monis Khan <mok@microsoft.com>
2023-08-31 16:26:58 -04:00
Kubernetes Prow Robot
dd0c2d4eb1
Merge pull request #120108 from tnqn/fix-appgregator
Fix OpenAPI aggregation cleanup
2023-08-31 10:42:47 -07:00
ithrael
903156daa2 fix(test): add e2e-test for kubectl events filter by GroupVersion 2023-08-31 23:51:54 +08:00
Kubernetes Prow Robot
8777747c98
Merge pull request #119150 from tnqn/external-traffic-policy-external-ips
Allow specifying ExternalTrafficPolicy for Services with ExternalIPs
2023-08-31 08:24:48 -07:00
Gunju Kim
63177db32c
Add an e2e test for the pod sandbox changed scenario
This adds an e2e test to ensure that a pod should restart its containers
in right order after the pod sandbox is changed.
2023-09-01 00:13:47 +09:00
Kubernetes Prow Robot
76b06ee239
Merge pull request #120288 from tzneal/fix-pid-pressure-flake
fix a PidPressure test flake
2023-08-31 05:46:48 -07:00
Kubernetes Prow Robot
def694bbe0
Merge pull request #119801 from wojtek-t/refactor_watch_encoders
Refactor apiserver endpoint transformers to more natively use Encoders
2023-08-31 01:44:48 -07:00
Kubernetes Prow Robot
887ac275a2
Merge pull request #120250 from liyuerich/apimachinery
e2e_apimachinery:stop using deprecated framework.ExpectError
2023-08-30 23:40:47 -07:00
Todd Neal
ede524e1a6 fix a pidpressure test flake
With the new busybox, ash has a built-in sleep command. Prior to this
change we were creating half the pids expected since `sleep` wasn't
actually launching a new binary.  Use the full path to /bin/sleep which
avoids the built-in and actually launches a new process.
2023-08-30 22:44:36 -05:00
Anish Ramasekar
15c02f2a03
add integration tests
Signed-off-by: Anish Ramasekar <anish.ramasekar@gmail.com>
2023-08-30 23:14:59 +00:00
Quan Tian
bf41b0462c Fix OpenAPI aggregation cleanup
There were four issues in OpenAPI aggregation cleanup:
1. When removing an APIService, openAPIAggregationController was called
   twice while openAPIV3AggregationController was never called, so
   OpenAPI v3 for the APIService was not cleaned up.
2. When removing a local APIService, v2 specAggregator should not return
   ErrAPIServiceNotFound when it doesn't find the APIService because
   local APIServices were never added to its cache, otherwise confusing
   error logs would be generated. Besides, the method's comment
   indicates that the desired behavior is that no error is returned if
   the APIService does not exist.
3. When removing an APIService, v3 specProxier should update
   openapiv2converter's cache, like when updating an APIService,
   otherwise the API would not be removed from "/openapi/v3".
4. When v3 AggregationController reconciles an APIService, it should
   stop requeueing it if it fails with ErrAPIServiceNotFound as the
   APIService has been removed, like what v2 AggregationController does,
   otherwise it would keep reconciling the APIService forever.

Signed-off-by: Quan Tian <qtian@vmware.com>
2023-08-31 00:01:14 +08:00
Quan Tian
16a7b53602 Fix typos in comments
Signed-off-by: Quan Tian <qtian@vmware.com>
2023-08-30 23:56:47 +08:00
Quan Tian
2b69daa960 Allow specifying ExternalTrafficPolicy for ClusterIP Services with ExternalIPs
When defining a ClusterIP Service, we can specify externalIP, and the
traffic policy of externalIP is subject to externalTrafficPolicy.
However, the policy can't be set when type is not NodePort or
LoadBalancer, and will default to Cluster when kube-proxy processes the
Service.

This commit updates the defaulting and validation of Service to allow
specifying ExternalTrafficPolicy for ClusterIP Services with
ExternalIPs.

Signed-off-by: Quan Tian <qtian@vmware.com>
2023-08-30 23:56:47 +08:00
SataQiu
2825519da2 apf: remove v1alpha1 API 2023-08-30 20:48:42 +08:00
bzsuni
7b44d73f73
bump npd to v0.8.14
Signed-off-by: bzsuni <bingzhe.sun@daocloud.io>
2023-08-30 19:03:33 +08:00
Kubernetes Prow Robot
c723f46603
Merge pull request #119463 from Songjoy/cleanup-e2e-windows-framework-equal
e2e_windows: stop using deprecated framework.ExpectEqual
2023-08-30 00:36:47 -07:00
liyuerich
53cb35d6d4 e2e_apimachinery:stop using deprecated framework.ExpectError
Signed-off-by: liyuerich <yue.li@daocloud.io>
2023-08-30 14:31:58 +08:00
宋文杰
8a368afa27 e2e_storage: stop using deprecated framework.ExpectEqual 2023-08-30 14:00:34 +08:00
宋文杰
209f23ecd7 e2e_upgrades: stop using deprecated framework.ExpectEqual 2023-08-30 11:28:17 +08:00
Kubernetes Prow Robot
3dd93d2bd7
Merge pull request #114717 from kerthcet/fix/bind-skip-status
Add new testcases to integration test TestBindPlugin
2023-08-29 12:25:56 -07:00
Kubernetes Prow Robot
2a23eba15b
Merge pull request #120231 from MartinForReal/patch-1
Extend npd e2e timeout to fix npd e2e error
2023-08-29 11:21:33 -07:00
Kubernetes Prow Robot
f9e9d38a88
Merge pull request #119488 from carlory/cleanup-e2e-cloud-framework-equal
e2e_cloud: stop using deprecated framework.ExpectEqual
2023-08-29 04:53:21 -07:00
Fan Shang Xiang
8d9517318a
Extend npd e2e timeout to fix npd e2e error 2023-08-29 17:22:28 +08:00
Kubernetes Prow Robot
8428655308
Merge pull request #119963 from pohly/dra-scheduler-perf-multiple-claims
dra: scheduler_perf test case with multiple claims per pod
2023-08-29 00:25:34 -07:00
Wojciech Tyczyński
b2c0d80c37 Test watch converter encoder 2023-08-29 09:04:12 +02:00
Kubernetes Prow Robot
27c9d32d46
Merge pull request #120101 from lowang-bh/Hotfix
fix: concurrent map writes in e2e test
2023-08-28 23:19:20 -07:00
Kubernetes Prow Robot
f3d0392e07
Merge pull request #119695 from ii/create-pv-pvc-lifecycle-test
Write e2e test for PersistentVolume & PersistentVolumeClaim Endpoints + 13 Endpoints
2023-08-28 13:27:20 -07:00
Kubernetes Prow Robot
9c25ce6f3e
Merge pull request #119540 from SataQiu/clean-apiserver-20230724
Remove the deprecated kube-apiserver identity lease garbage collector for k8s.io/component=kube-apiserver
2023-08-28 10:49:42 -07:00
kerthcet
c29234d3e1 Add integration tests for all bind plugins skipped in TestBindPlugin
Signed-off-by: kerthcet <kerthcet@gmail.com>
2023-08-28 22:05:58 +08:00
Han Kang
dbbce2aaba
sort stability list of metrics in the shell command (#120197)
* sort stability list of metrics in the shell command

* remove used vars

* fix verify errors
2023-08-28 04:43:54 -07:00
Kubernetes Prow Robot
c7a04e10a6
Merge pull request #120153 from SataQiu/clean-scheduler-20230824
using wait.PollUntilContextTimeout instead of deprecated wait.Poll/PollWithContext/PollImmediate/PollImmediateWithContext methods for scheduler
2023-08-28 02:11:42 -07:00
宋文杰
484510863b e2e_windows: stop using deprecated framework.ExpectEqual 2023-08-28 17:04:03 +08:00
carlory
f5681f8817 e2e_autoscaling: stop using deprecated framework.ExpectEqual 2023-08-28 16:22:17 +08:00
carlory
05bb01d27a e2e_cloud: stop using deprecated framework.ExpectEqual 2023-08-28 16:17:25 +08:00
AxeZhan
ff10df107a remove fake deployment 2023-08-25 21:52:33 +08:00
Kubernetes Prow Robot
8b132ea40a
Merge pull request #120151 from Penguin-zlh/cleanup-e2e-framework-havekey
cleanup: e2e : stop using deprecated framework.ExpectHaveKey
2023-08-25 06:02:52 -07:00
AxeZhan
dec1c36125 add kubectl rollout e2e test 2023-08-25 20:17:16 +08:00
Kubernetes Prow Robot
10c622e99a
Merge pull request #119994 from SataQiu/remove-scheduler-v1beta3
scheduler: remove deprecated v1beta3 KubeSchedulerConfiguration component config
2023-08-24 15:31:17 -07:00
Kubernetes Prow Robot
46e8cfd7d6
Merge pull request #119508 from nayihz/fix_finalizer_validation
validate crd finalizer name
2023-08-24 05:22:20 -07:00
SataQiu
5524f1651a using wait.PollUntilContextTimeout instead of deprecated wait.Poll/PollWithContext/PollImmediate/PollImmediateWithContext methods for scheduler 2023-08-24 18:35:59 +08:00
penguin
c9c83d1af4 cleanup: e2e : stop using deprecated framework.ExpectHaveKey 2023-08-24 18:21:27 +08:00
nayihz
13b52a1848 fix crd finalizer validation 2023-08-24 13:43:57 +08:00
Kubernetes Prow Robot
232d343d58
Merge pull request #119969 from saschagrunert/cni-plugins
Update CNI plugins to v1.3.0
2023-08-23 12:41:57 -07:00
Kubernetes Prow Robot
2e6eafca3f
Merge pull request #119503 from wojtek-t/pagination_ga
Graduate APIListChunking to GA
2023-08-23 10:39:29 -07:00
Wojciech Tyczyński
f752ca2dc8 Make API chunking conformance test deterministic 2023-08-23 12:18:23 +02:00
Kubernetes Prow Robot
95bb02943d
Merge pull request #118950 from ndixita/node-allocatable-oom
Adding coverage for OOM Kill scenario due to node allocatable memory …
2023-08-22 14:41:54 -07:00
Kubernetes Prow Robot
c0691f3784
Merge pull request #119789 from thockin/deprecate_svc_lb_ingress_with_clusterip
Gate: disallow .status.loadBalancer on non-LB svc
2023-08-22 10:02:55 -07:00
lowang-bh
bd06c26d58 fix: concurrent map writes in e2e test
Signed-off-by: lowang-bh <lhui_wang@163.com>
2023-08-22 16:57:41 +08:00
Antonio Ojea
826f3a130a promote to conformance Service multiprotocol tests
Services can expose network applications that are running on
one or more Pods. Users need to specify the Port and Protocol of the
network application, and network implementations must forward only
the traffic indicated in the Service, as forwarding traffic to a
backend that the user didn't specify may present a security
problem.

Change-Id: I77fbb23c6415ed09dd81c4f2deb6df7a17de46f0
2023-08-22 08:36:17 +00:00
tao.yang
c6e54169fc e2e_apps: stop using deprecated framework.ExpectConsistOf
Signed-off-by: tao.yang <tao.yang@daocloud.io>
2023-08-22 11:17:26 +08:00
bzsuni
93832de624
stop using deprecated framework.ExpectNotEqual
Signed-off-by: bzsuni <bingzhe.sun@daocloud.io>
2023-08-22 11:12:39 +08:00
bzsuni
145dbb6acc
stop using deprecated framework.ExpectNotEqual
Signed-off-by: bzsuni <bingzhe.sun@daocloud.io>
2023-08-22 09:53:04 +08:00
Dixita Narang
d2dbc583a0 Adding coverage for OOM Kill scenario due to node allocatable memory limits, when pod level memory limits are not set 2023-08-22 00:45:17 +00:00
Kubernetes Prow Robot
f852d7fead
Merge pull request #118653 from pohly/volume-resource-requirements
Volume resource requirements
2023-08-21 14:08:05 -07:00
Davanum Srinivas
3e9a4c15a8
Restrict what imports get into code within test/e2e_node
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2023-08-21 15:04:23 -04:00
Kubernetes Prow Robot
70a28449a7
Merge pull request #120038 from liyuerich/expectempty
e2e: stop using deprecated framework.ExpectEmpty
2023-08-21 08:03:23 -07:00
Antonio Ojea
039859b9b7 e2e network test for udp services with hostNetwork clients
There are some implementations of service that use socket loadbalancing
instead of NAT. These implementations don't need to deal with the
conntrack cleanup, however, they need to cleanup the sockets that are
no longer needed, so the application does not get stuck forever.

This can happen with both TCP and UDP, but since UDP is stateless, the
situation is more complicated because UDP has no mechanism like TCP's
to detect that a socket is no longer needed.

Change-Id: Ic2cfbdf6c8b1f1335e8b5964825dd1fa716fef53
2023-08-21 13:39:39 +00:00
Patrick Ohly
2472291790 api: introduce separate VolumeResourceRequirements struct
PVC and containers shared the same ResourceRequirements struct to define their
API. When resource claims were added, that struct got extended, which
accidentally also changed the PVC API. To prevent such a mistake from happening
again, PVC now uses its own VolumeResourceRequirements struct.

The `Claims` field gets removed because the risk of breaking someone is low:
theoretically, YAML files which have a claims field for volumes now
get rejected when validating against the OpenAPI. Such files
have never made sense and should be fixed.

Code that uses the struct definitions needs to be updated.
2023-08-21 15:31:28 +02:00
Kubernetes Prow Robot
4dee8398ae
Merge pull request #120078 from tzneal/investigate-test-failure
expect the new resource_scrape_error metric
2023-08-21 04:13:34 -07:00
liyuerich
3546831ceb e2e_storage: stop using deprecated framework.ExpectEmpty 2023-08-21 09:18:18 +08:00
Tim Hockin
a930892769
Gate: disallow .status.loadBalancer on non-LB svc
The fact that the .status.loadBalancer field can be set while .spec.type
is not "LoadBalancer" is a flub.  Any spec update will already clear
.status.ingress, so it's hard to really rely on this.  After this
change, updates which try to set this combination will fail validation.

Existing cases of this will not be broken.  Any spec/metadata update
will clear it (no error) and this is the only stanza of status.

New gate "AllowServiceLBStatusOnNonLB" is off by default, but can be
enabled if this change actually breaks someone, which seems exceedingly
unlikely.
2023-08-20 16:40:41 -07:00
Todd Neal
b8512cfe24 expect the new resource_scrape_error metric 2023-08-20 14:17:54 -05:00
Todd Neal
905f07f1ac Revert "mark the OOM killer as serial to reduce flakes"
This reverts commit bd6f548746.

Running as serial didn't completely eliminate the flake so I think
there's something more going on here.  Reverting the change to serial
since it's not a solution.
2023-08-20 13:38:07 -05:00
Kubernetes Prow Robot
db64cc4a55
Merge pull request #119714 from ritazh/kmsv2-feature-enablement-ut
kmsv2 test feature enablement unit test
2023-08-18 20:27:31 -07:00
Kubernetes Prow Robot
c2bddad813
Merge pull request #119454 from pacoxu/fix-ip-mode
use LoadBalancer type service for e2e service test to patch ingress status
2023-08-18 20:27:19 -07:00
Kubernetes Prow Robot
8a8d63a732
Merge pull request #120059 from tzneal/mark-oomkiller-test-as-serial
mark the OOM killer as serial to reduce flakes
2023-08-18 18:19:31 -07:00
Paco Xu
30ff937aaf add a fake load balancer class to fix e2e test failure 2023-08-19 08:50:13 +08:00
Kubernetes Prow Robot
addc0391e7
Merge pull request #116897 from Richabanker/kubelete-resource-metrics-ga
Graduate kubelet resource metrics to GA
2023-08-18 16:03:37 -07:00
Rita Zhang
d86e72202c
kmsv2 test feature enablement unit test
Signed-off-by: Rita Zhang <rita.z.zhang@gmail.com>
2023-08-18 15:28:32 -07:00
Todd Neal
bd6f548746 mark the OOM killer as serial to reduce flakes
In testing I could only reproduce the flake by running stress-ng to load
the CPU. Running it as serial should reduce and hopefully eliminate the
flakiness.
2023-08-18 13:18:50 -05:00
Kubernetes Prow Robot
fbac8388bf
Merge pull request #120018 from liggitt/validating-admission-beta
Store validating admission as beta
2023-08-18 07:12:28 -07:00
Kubernetes Prow Robot
5929d49f87
Merge pull request #119494 from carlory/cleanup-e2e-network-framework-equal
e2e_network: stop using deprecated framework.ExpectEqual
2023-08-18 03:06:28 -07:00
carlory
e290b4cb3b e2e_network: stop using deprecated framework.ExpectEqual 2023-08-18 14:54:15 +08:00
Kubernetes Prow Robot
ee265c92fe
Merge pull request #119937 from RyanAoh/kep-1860-dev
Make Kubernetes aware of the LoadBalancer behaviour
2023-08-17 14:00:28 -07:00
Todd Neal
577197559a remove the legacy test dependency
This removes the import which added a bunch of apparently
old failing tests.
2023-08-17 12:54:20 -05:00
Richa Banker
4712025ea8 Graduate kubelet resource metrics to GA 2023-08-17 09:22:48 -07:00
Kubernetes Prow Robot
93bf570d46
Merge pull request #119746 from SataQiu/cleanup-scheduler-20230803
using wait.PollUntilContextTimeout instead of deprecated wait.Poll for scheduler
2023-08-17 09:07:21 -07:00
Jordan Liggitt
af9bf7b41e
Prefer non-alpha storage versions when available 2023-08-17 10:35:46 -04:00
Jordan Liggitt
09fa21ab87
Store validating admission policies and bindings as v1beta1 2023-08-17 10:35:45 -04:00
Kubernetes Prow Robot
d4d9f7deee
Merge pull request #119489 from carlory/cleanup-e2e-common-framework-equal
e2e_common: stop using deprecated framework.ExpectEqual
2023-08-17 03:25:22 -07:00
SataQiu
ef7d404702 using wait.PollUntilContextTimeout instead of deprecated wait.Poll for pkg/scheduler
using wait.PollUntilContextTimeout instead of deprecated wait.Poll for test/integration/scheduler

using wait.PollUntilContextTimeout instead of deprecated wait.Poll for test/e2e/scheduling

using wait.ConditionWithContextFunc for PodScheduled/PodIsGettingEvicted/PodScheduledIn/PodUnschedulable/PodSchedulingError
2023-08-17 17:25:09 +08:00
Sascha Grunert
7933368460
Update CNI plugins to v1.3.0
Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2023-08-17 09:50:53 +02:00
SataQiu
427b703c37 scheduler: remove deprecated v1beta2 KubeSchedulerConfiguration 2023-08-17 13:27:21 +08:00
Kubernetes Prow Robot
9ee7185be6
Merge pull request #119501 from Songjoy/cleanup-e2e-node-framework-equal
e2e_node: stop using deprecated framework.ExpectEqual
2023-08-16 16:36:30 -07:00
Kubernetes Prow Robot
4d166947cf
Merge pull request #119097 from pacoxu/fix-eviction-pid
PIDPressure condition is triggered slow on CRI-O with large PID pressure/heavy load
2023-08-16 16:36:19 -07:00
Kubernetes Prow Robot
88d14edc26
Merge pull request #119197 from saschagrunert/stop-container-runtime-err
Check dbus error on container runtime start/stop
2023-08-16 15:27:52 -07:00
Kubernetes Prow Robot
b1e35d5616
Merge pull request #119974 from tzneal/bump-busybox-test-version
bump the busybox test version to resolve test failures
2023-08-16 12:44:13 -07:00
Kubernetes Prow Robot
ad4418516f
Merge pull request #119939 from dims/kubectl-lookup-host-in-kubeconfig-when-needed
[kubectl] Lookup Host from kubeconfig when needed
2023-08-16 12:44:06 -07:00
Kubernetes Prow Robot
dd44792cec
Merge pull request #119880 from saschagrunert/seccomp-filter
Make seccomp status checks in e2e tests more robust
2023-08-16 12:43:54 -07:00
Kubernetes Prow Robot
18290bfdc8
Merge pull request #119966 from aojea/fixfix
e2e framework util subtle bug checking endpoints
2023-08-16 09:33:08 -07:00
git-jxj
a5b3a4b738
cleanup: Update deprecated FromInt to FromInt32 (#119858)
* redo commit

* apply suggestions from liggitt

* update Parse function based on suggestions
2023-08-16 09:33:01 -07:00
Kubernetes Prow Robot
c5a675808b
Merge pull request #119745 from tsmetana/fix-local-stress-flake
Local PV Stress test: don't fail on deleting missing PV
2023-08-16 09:32:53 -07:00
Kubernetes Prow Robot
dd177629ca
Merge pull request #119654 from p0lyn0mial/upstream-watch-list-e2e-panic
e2e/apimachinery/watchlist: stop panicking when run against unsupported cluster/environment
2023-08-16 09:32:46 -07:00
Todd Neal
b75c5d33e5 bump the busybox test version to resolve test failures
- bump busybox version
- specify the path to /bin/sleep to avoid calling a new shell
  builtin
2023-08-16 08:50:20 -05:00
Kubernetes Prow Robot
e298e92115
Merge pull request #119819 from pohly/dra-performance-test-driver
dra test: enhance performance of test driver controller
2023-08-16 04:32:26 -07:00
Antonio Ojea
8314122382 e2e framework util subtle bug checking endpoints
Change-Id: Ied14efcb75a45e3bbd5f76d4ee4c89703161df54
2023-08-16 08:18:11 +00:00
Kubernetes Prow Robot
b57b2dcfcf
Merge pull request #119928 from aojea/fixe2e
e2e WaitForServiceEndpointsNum take into account the address family
2023-08-16 00:24:27 -07:00
Patrick Ohly
1e961af858 scheduler_perf: test case for DRA with multiple claims
The new test case covers pods with multiple claims from multiple drivers. This
leads to different behavior (scheduler waits for information from all drivers
instead of optimistically selecting one node right away) and to more concurrent
updates of the PodSchedulingContext objects.

The test case is currently not enabled for unit testing or integration
testing. It can be used manually with:

   -bench=BenchmarkPerfScheduling/SchedulingWithMultipleResourceClaims/2000pods_100nodes
   ... -perf-scheduling-label-filter=
2023-08-16 08:32:36 +02:00
Patrick Ohly
0331e98957 scheduler_perf: fix installing DRA test driver multiple times
The driver name configuration option was ignored, so a second driver
would have used the same name.
2023-08-16 08:32:36 +02:00
Kubernetes Prow Robot
c41c448b80
Merge pull request #119890 from tzneal/containers-lifecycle-flake
crio: increase test buffer to eliminate test flakes
2023-08-15 23:13:45 -07:00
Kubernetes Prow Robot
130a5a423f
Merge pull request #119785 from sanposhiho/waitonpermit-fiterror
fix: register the plugin rejects Pods in WaitOnPermit to UnschedulablePlugins
2023-08-15 23:13:04 -07:00
Kubernetes Prow Robot
57212647e9
Merge pull request #119769 from Huang-Wei/bug/prefilter-preemption
Fix a bug that PostFilter plugin may not function if previous PreFilter plugins return Skip
2023-08-15 23:12:50 -07:00
Kubernetes Prow Robot
061ae8a68b
Merge pull request #119765 from tzneal/detect-nfsv3-and-change-mount-path
fix mirror pod nfs test failure due to differing NFS versions
2023-08-15 23:12:44 -07:00
Kubernetes Prow Robot
55c3cd9a30
Merge pull request #119761 from soltysh/fix-sset-test
Statefulset should wait for new replicas when removing .start.ordinal
2023-08-15 23:12:37 -07:00
Kubernetes Prow Robot
251c17113c
Merge pull request #119741 from humblec/cloudbuild
Update the container images used in cloudbuild to their latest version
2023-08-15 19:44:07 -07:00
Kubernetes Prow Robot
ef533decd7
Merge pull request #119715 from logicalhan/ExponentialBucketsRange
add support for ExponentialBucketsRange
2023-08-15 19:43:53 -07:00
Kubernetes Prow Robot
3111fee8bf
Merge pull request #119670 from lengrongfu/fix/oomkill-multi-target-container
fix OOM killer
2023-08-15 19:43:40 -07:00
Kubernetes Prow Robot
22d90ebde2
Merge pull request #119636 from tzneal/update-busybox-test-image
update the busybox test image to 1.36.1
2023-08-15 19:43:25 -07:00
Kubernetes Prow Robot
59f81980a1
Merge pull request #119579 from yt2985/cleanSA
Fix the flaky legacy_service_account_token_clean_up_test.
2023-08-15 19:43:12 -07:00
Kubernetes Prow Robot
de8c36cb56
Merge pull request #119502 from Songjoy/cleanup-e2e-scheduling-framework-equal
e2e_scheduling: stop using deprecated framework.ExpectEqual
2023-08-15 19:42:43 -07:00
Kubernetes Prow Robot
3455845b15
Merge pull request #119492 from carlory/cleanup-e2e-instrumentation-framework-equal
e2e_instrumentation: stop using deprecated framework.ExpectEqual
2023-08-15 19:42:36 -07:00
Kubernetes Prow Robot
e725d10960
Merge pull request #119490 from carlory/cleanup-e2e-dra-framework-equal
e2e_dra: stop using deprecated framework.ExpectEqual
2023-08-15 19:42:25 -07:00
Kubernetes Prow Robot
3525255622
Merge pull request #119212 from CoderSherlock/master
Added oomkill test for init container and fix typos
2023-08-15 15:17:48 -07:00
Heba Elayoty
224087abfa
Add Pod Scheduling SLI Duration metric (#119049)
Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com>
Co-authored-by: Aldo Culquicondor <1299064+alculquicondor@users.noreply.github.com>
2023-08-15 15:17:41 -07:00
Kubernetes Prow Robot
d486180eb0
Merge pull request #118748 from andreaskaris/kubectl-wait-for
Kubectl: Improve conditionFuncFor expression parsing for wait --for jsonpath
2023-08-15 15:17:26 -07:00
Kubernetes Prow Robot
67c33faddd
Merge pull request #117631 from skitt/intstr-fromint32-testing
Test: use new intstr functions
2023-08-15 15:16:27 -07:00
Antonio Ojea
a0f07f2b4d e2e WaitForServiceEndpointsNum take into account the address family
Since EndpointSlices can carry dual-stack families, but Endpoints can
only have a single family, the function must take this into account
and only compare addresses of the same family. Otherwise it will
always fail for Services with dual-stack endpoints, because the endpoint
slices will always have twice as many addresses as the Endpoints.

Change-Id: Id08cb22f8a2adc103a4f5a4fe3eec25f448cd21b
2023-08-14 17:02:50 +00:00
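The same-family comparison described in the commit above can be sketched as follows. This is a minimal illustration using only the Go standard library, not the e2e framework's actual code; `countByFamily` is a hypothetical helper introduced here for illustration.

```go
package main

import "net/netip"

// countByFamily is a hypothetical helper (not part of the e2e framework).
// It counts how many of the given address strings belong to the requested
// IP family. Strings that do not parse as IP addresses are ignored.
func countByFamily(addrs []string, wantIPv6 bool) int {
	n := 0
	for _, a := range addrs {
		ip, err := netip.ParseAddr(a)
		if err != nil {
			continue
		}
		// Unmap first so that IPv4-mapped IPv6 addresses count as IPv4.
		if ip.Unmap().Is6() == wantIPv6 {
			n++
		}
	}
	return n
}
```

With a helper like this, a check could compare the Endpoints address count against only the EndpointSlice addresses of the matching family, rather than against all slice addresses.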
Davanum Srinivas
a161fa8017
pull Host from kubeconfig when needed
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2023-08-14 10:16:24 -04:00
Aohan Yang
03415ebc1d Integration tests for IP mode field 2023-08-14 17:21:54 +08:00
Kensei Nakada
cf3f0bd778 fix: register the plugin rejects Pods in WaitOnPermit to UnschedulablePlugins 2023-08-12 07:18:01 +00:00
Todd Neal
e258228e4a use a buffer equivalent to grace period to eliminate test flakes
This modifies the test to wait up to 2x the grace period for the pod to
be removed.
2023-08-11 14:08:11 -05:00
Todd Neal
717c149a73 fix mirror pod nfs test failure due to differing NFS versions
/exports *(rw,fsid=0,insecure,no_root_squash)

can be mounted as `/exports` using NFSv3 and `/` using NFSv4

Mount as '/', since clients that support both can try both.
2023-08-11 07:27:05 -05:00
Roman Bednar
dd7b7073c3 e2e: add test for verification of timestamp for new PVs 2023-08-11 11:39:47 +02:00
Roman Bednar
8e66149870 e2e: remove extra spaces 2023-08-11 11:28:11 +02:00
Wei Huang
765f3916c2
Fix a bug that PostFilter plugin may not function if previous PreFilter plugins return Skip 2023-08-10 13:43:00 -07:00
Kubernetes Prow Robot
a287fd6aa6
Merge pull request #119869 from a-hilaly/e2e-tests/match-conditions/flake-fix
Remove `matchConditions` flaky test
2023-08-10 01:01:28 -07:00
Sascha Grunert
8ab6bee676
Make seccomp status checks in e2e tests more robust
The tests have been introduced in
ca7be7dc6d
and have checked for `ecc` in `/proc/self/status` since their creation.

We got a new field `Seccomp_filters:` with the Linux commit
c818c03b66,
which means that `ecc` would now match both fields and could interfere with
test results depending on the host.

The field `Seccomp:` got introduced in
2f4b3bf6b2
and has never changed since then, which means we can use it directly to make
the tests more strict.

Refers to https://github.com/kubernetes-sigs/cri-tools/pull/1236

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2023-08-10 09:51:03 +02:00
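To illustrate why matching the full `Seccomp:` field name is stricter than matching the substring `ecc`, here is a minimal sketch in standard-library Go. It is not the e2e test's code; `seccompMode` is a hypothetical helper, and the input is assumed to be the text of `/proc/self/status`.

```go
package main

import "strings"

// seccompMode is a hypothetical helper (not taken from the e2e tests).
// It returns the value of the `Seccomp:` field from the text of
// /proc/self/status, and false if the field is absent.
func seccompMode(status string) (string, bool) {
	for _, line := range strings.Split(status, "\n") {
		// Matching the full field name, colon included, excludes
		// `Seccomp_filters:`, which a substring match on "ecc" would hit.
		if strings.HasPrefix(line, "Seccomp:") {
			return strings.TrimSpace(strings.TrimPrefix(line, "Seccomp:")), true
		}
	}
	return "", false
}
```

Because the prefix includes the colon, the result does not depend on whether the host kernel also reports `Seccomp_filters:`.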
Kubernetes Prow Robot
e6218f3d3b
Merge pull request #119876 from marosset/revert-118895
Revert "Merge pull request #118895 from RyanAoh/kep-1860"
2023-08-09 19:27:31 -07:00
Mark Rossetti
0d90d1ffa5
Revert "Merge pull request #118895 from RyanAoh/kep-1860"
This reverts commit 890a6c8f70, reversing
changes made to 4f60a8d493.
2023-08-09 15:51:20 -07:00
Kubernetes Prow Robot
3d941afece
Merge pull request #119835 from liggitt/mitigate-aggregated-discovery-npe
Avoid returning nil responseKind in v1beta1 aggregated discovery
2023-08-09 15:37:30 -07:00
Kubernetes Prow Robot
201f807036
Merge pull request #119784 from sanposhiho/bug-fix
fix: broadcast when pod is pushed back to activeQ directly in AddUnschedulableIfNotPresent
2023-08-09 13:05:31 -07:00
Jordan Liggitt
1876ddf714
Avoid returning nil responseKind in v1beta1 aggregated discovery 2023-08-09 14:45:33 -04:00
Amine Hilaly
55ca1a6ee1 Remove leases test case from AdmissionWebhookMatchConditions suite
Signed-off-by: Amine Hilaly <hilalyamine@gmail.com>
2023-08-09 18:39:01 +02:00
Kensei Nakada
050c0437e6 fix: broadcast when pod is pushed back to activeQ directly in AddUnschedulableIfNotPresent 2023-08-09 03:32:14 +00:00
Stephen Heywood
74053e2151 Create e2e test for PV/PVC endpoints
e2e test validates the following 15 endpoints
- createCoreV1NamespacedPersistentVolumeClaim
- createCoreV1PersistentVolume
- deleteCoreV1CollectionNamespacedPersistentVolumeClaim
- deleteCoreV1CollectionPersistentVolume
- deleteCoreV1NamespacedPersistentVolumeClaim
- deleteCoreV1PersistentVolume
- listCoreV1NamespacedPersistentVolumeClaim
- listCoreV1PersistentVolume
- listCoreV1PersistentVolumeClaimForAllNamespaces
- patchCoreV1NamespacedPersistentVolumeClaim
- patchCoreV1PersistentVolume
- readCoreV1NamespacedPersistentVolumeClaim
- readCoreV1PersistentVolume
- replaceCoreV1NamespacedPersistentVolumeClaim
- replaceCoreV1PersistentVolume
2023-08-09 09:12:42 +12:00
Jeremy Rickard
38a81ca04f
Bump distroless-iptables to 0.2.7
Signed-off-by: Jeremy Rickard <jeremyrrickard@gmail.com>
2023-08-08 07:14:14 -06:00
Kubernetes Prow Robot
68d79b0d69
Merge pull request #119798 from aojea/endpoints_and_slices
Fix flaky test depending on EndpointSlices to be ready
2023-08-08 04:52:14 -07:00
Patrick Ohly
0e23840929 dra test: enhance performance of test driver controller
Analyzing the CPU profile of

    go test -timeout=0 -count=5 -cpuprofile profile.out -bench=BenchmarkPerfScheduling/.*Claim.* -benchtime=1ns -run=xxx ./test/integration/scheduler_perf

showed that a significant amount of time was spent iterating over allocated
claims to determine how many were allocated per node. That "naive" approach was
taken to avoid maintaining a redundant data structure, but now that performance
measurements show that this comes at a cost, it's not "premature optimization"
anymore to introduce such a second field.

The average scheduling throughput in
SchedulingWithResourceClaimTemplate/2000pods_100nodes increases from 16.4
pods/s to 19.2 pods/s.
2023-08-08 13:36:35 +02:00
Jeremy Rickard
72c8cf9fce
[go] Bump images, versions and deps to use Go 1.20.7
Signed-off-by: Jeremy Rickard <jeremyrrickard@gmail.com>
2023-08-07 13:25:59 -06:00
Antonio Ojea
2ceca1c78d e2e WaitForServiceEndpointsNum should watch EndpointSlices
EndpointSlices is the evolution of the Endpoint object and most
components use it for implementing Services. This means that
even when the Endpoint object is up to date, the EndpointSlices may
lag behind, so the test must ensure that both objects are in sync to
avoid race conditions.

Change-Id: I5d9bc7774c68f321537379d1f20b2a1fe0b39e6e
2023-08-07 18:34:06 +00:00
Maciej Szulik
ef51dbfa0c
Drop [Feature: tag to make sure these tests run on pre-submits
In v1.27 StatefulSetStartOrdinal became beta, which makes it on by
default, but we forgot to turn these tests on along with it. This makes
these tests run always.
2023-08-07 13:38:43 +02:00
carlory
3e908c5bde e2e_common: stop using deprecated framework.ExpectEqual 2023-08-07 01:15:33 +08:00
Maciej Szulik
d154ca9c00
Statefulset should wait for new replicas when removing .start.ordinal 2023-08-04 18:41:01 +02:00
Tomas Smetana
24f32d9d18 Local PV Stress test: don't fail on deleting missing PV 2023-08-03 12:51:31 +02:00
Humble Chirammal
85c385d410 Update the container images used in cloudbuild to their latest version
This carries the GOLANG_VERSION below, which includes many CVE fixes:

/workspace # echo $GOLANG_VERSION
1.20.5
/workspace #

CVE-2023-29403 and Go issue https://go.dev/issue/60272.
CVE-2023-29404 and CVE-2023-29405:
  Go issues https://go.dev/issue/60305 and https://go.dev/issue/60306
CVE-2023-29402 and Go issue https://go.dev/issue/60167

Signed-off-by: Humble Chirammal <humble.devassy@gmail.com>
2023-08-03 00:09:53 +05:30
Han Kang
d5d90b7b22 add support for ExponentialBucketsRange 2023-08-01 11:38:20 -07:00
Paco Xu
f0b38766a0 use LoadBalancer type service for e2e service test to patch ingress status 2023-07-31 17:13:12 +08:00
lengrongfu
c23cee1be3 fix OOM killer
Signed-off-by: lengrongfu <rongfu.leng@daocloud.io>
2023-07-30 11:16:12 +08:00
Davanum Srinivas
b4ef4015a2
Avoid pulling mounter.tar through the CDN
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2023-07-28 22:15:55 -04:00
tinatingyu
ca338e517b Fix the flaky legacy_service_account_token_clean_up_test.
2023-07-28 18:18:40 +00:00
Lukasz Szaszkiewicz
a44ce346d7 e2e/apimachinery/watchlist: stop panicking when run against unsupported cluster/environment 2023-07-28 12:25:13 +02:00
Todd Neal
2a767c9916 update the busybox test image to 1.36.1 2023-07-27 15:26:06 -05:00
HirazawaUi
133696725a implement secret empty string value integration test 2023-07-27 23:23:32 +08:00
upodroid
a65d207507 calculate the correct machine-type 2023-07-26 23:10:06 +00:00
upodroid
7d13c9b096 set map to nil if an empty string is passed 2023-07-26 10:32:27 +03:00
宋文杰
63cf568251 e2e_node: stop using deprecated framework.ExpectEqual 2023-07-26 10:20:30 +08:00
Kubernetes Prow Robot
dc6d8c6b58
Merge pull request #119571 from haircommander/node-log-query-on-node
test: fix NodeLogQuery tests to query the correct node for logs
2023-07-25 15:36:31 -07:00
Amine
eb4063f0ce
Add cases to tests AdmissionWebhook MatchConditions size limit (#119404) 2023-07-25 15:36:19 -07:00
Amine
ef4907eddd
Add e2e tests for admission webhooks MatchCondition fields (#118319)
* Add e2e tests for admission webhooks MatchCondition fields

Signed-off-by: Amine Hilaly <hilalyamine@gmail.com>

* improve naming to distinguish tests

* adding e2e for mutating webhooks and match conditions

* Use `ginkgo.It` instead of `framework.ConformanceIt` and clean up
resources after creation

* Enable AdmissionWebhookMatchConditions feature

* Tag only matchcondition tests

* Improve expected error message for denied requests.

* Rename `onlyAllowLeaseObjectMatchConditions` to
`excludeLeasesMatchConditions`

* remove [Alpha] tag from AdmissionWebhookMatchConditions tests

* Using `gomega.Expect` instead of `framework.Fail`

* Remove [Feature:AdmissionWebhookMatchConditions] tag

Signed-off-by: Amine <hilalyamine@gmail.com>

* Improve e2e names to specify whether it's using Validating or Mutating admission webhooks

---------

Signed-off-by: Amine Hilaly <hilalyamine@gmail.com>
Signed-off-by: Amine <hilalyamine@gmail.com>
2023-07-25 14:02:17 -07:00
Peter Hunt
a248c4de52 test: fix NodeLogQuery tests to query the correct node for logs
Currently, the test queries the local node, which is not correct for most kubernetes environments.
Instead, ssh to the target node and call journalctl there

Signed-off-by: Peter Hunt <pehunt@redhat.com>
2023-07-25 15:41:38 -04:00
Kubernetes Prow Robot
cacc6e751b
Merge pull request #119531 from ritazh/kmsv2-test-enablement-restart
kmsv2 test feature enablement disablement and restart
2023-07-25 12:03:55 -07:00
Kubernetes Prow Robot
86b44a3222
Merge pull request #119564 from logicalhan/update-docs
update documentation with descriptions
2023-07-25 10:00:07 -07:00
Rita Zhang
67769438e1
kmsv2 test feature enablement disablement and restart
Signed-off-by: Rita Zhang <rita.z.zhang@gmail.com>
2023-07-25 09:50:59 -07:00
Han Kang
18a7025383 update documentation with descriptions 2023-07-25 08:57:54 -07:00
SataQiu
213ed03c00 remove deprecated kube-apiserver identity lease garbage collector 2023-07-25 10:10:18 +08:00
carlory
57226fbd27 e2e_dra: stop using deprecated framework.ExpectEqual
Co-authored-by: Thomas Milox <thomasmilox@gmail.com>
2023-07-25 10:03:56 +08:00
Kubernetes Prow Robot
b53830590f
Merge pull request #119409 from alexzielenski/apiserver/policy/vap-tests
Add test cases for ValidatingAdmissionPolicy
2023-07-24 15:12:13 -07:00
Alexander Zielenski
3b9af47118 add admission policy integration test all resources
duplicates a lot of existing webhook integration test code
2023-07-24 10:54:54 -07:00
Jordan Liggitt
a164005cc0
Fix non-test code relying on test-code 2023-07-24 11:37:57 -04:00
Andreas Karis
4188998430 Improve conditionFuncFor expression parsing for wait --for jsonpath
Make it possible to parse jsonpath filter expressions: Split
jsonpath expressions on single '=' only and leave '==' as part of the
string.

Reported-at: https://github.com/kubernetes/kubernetes/issues/119206
Signed-off-by: Andreas Karis <ak.karis@gmail.com>
2023-07-23 15:54:28 +02:00
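The splitting rule described in the commit above (split on a single `=`, leave `==` intact) can be sketched as below. This is an illustration of the idea only, not kubectl's implementation; `splitOnSingleEquals` is a hypothetical name.

```go
package main

// splitOnSingleEquals is a hypothetical helper (not kubectl's code).
// It splits expr at the first '=' that is not part of "==", returning the
// part before it, the part after it, and whether such a '=' was found.
func splitOnSingleEquals(expr string) (path, value string, ok bool) {
	for i := 0; i < len(expr); i++ {
		if expr[i] != '=' {
			continue
		}
		if i+1 < len(expr) && expr[i+1] == '=' {
			i++ // skip the second '=' of "==" and keep scanning
			continue
		}
		return expr[:i], expr[i+1:], true
	}
	return expr, "", false
}
```

For an input such as `{.items[?(@.name=="x")].phase}=Running`, the `==` inside the filter is skipped and the split happens at the final `=`, so the filter expression stays part of the path.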
Talor Itzhak
3964f71fe0 e2e:podresources: verify count for terminal pods
PodResourcesAPI reports resources of pods in a terminal phase in the List call.
The internal managers reassign resources assigned to pods in a terminal phase, so podresources should ignore them.

Whether or not this behavior is intended (the docs are ambiguous),
this e2e test demonstrates and verifies the behavior described above.

Signed-off-by: Talor Itzhak <titzhak@redhat.com>
2023-07-23 12:46:41 +03:00
Kubernetes Prow Robot
d0814fa476
Merge pull request #119513 from borg-land/add-node-env
Add node-env and instance-type flags to node-e2e tests
2023-07-21 17:44:19 -07:00
Kubernetes Prow Robot
18f8cb8398
Merge pull request #118644 from alexzielenski/apiserver/policy/namespaceParamRef
KEP-3488: Promote ValidatingAdmissionPolicy to Beta
2023-07-21 17:44:08 -07:00
Kubernetes Prow Robot
773a6b1e46
Merge pull request #118828 from enj/enj/f/kms_v2_hkdf_expand
kmsv2: KDF based nonce extension
2023-07-21 16:10:19 -07:00
Kubernetes Prow Robot
f3a070f9c6
Merge pull request #116469 from RomanBednar/pv-phase-transition-time
PersistentVolume last phase transition time
2023-07-21 16:10:07 -07:00
upodroid
1c99f9591b add node-env and instance-type flags to node-e2e tests 2023-07-21 21:46:37 +00:00
Alexander Zielenski
d647958744 update codegen 2023-07-21 13:56:23 -07:00
Alexander Zielenski
ef8670c946 refactor: replace usage of v1alpha1 with v1beta1
v1alpha -> v1beta

fill in DenyAction where there is no ParameterNotFoundAction
2023-07-21 13:41:24 -07:00
Roman Bednar
ae185e9118 e2e: test LastPhaseTransitionTime field when binding PV 2023-07-21 21:39:23 +02:00
Monis Khan
bf49c727ba
kmsv2: KDF based nonce extension
Signed-off-by: Monis Khan <mok@microsoft.com>
2023-07-21 15:25:52 -04:00
Wojciech Tyczyński
4e2e059c7b Graduate RemainingItemCount to GA 2023-07-21 16:25:21 +02:00
Kubernetes Prow Robot
4e8908d566
Merge pull request #117015 from kannon92/job-fix-replacement-after-deletion
Job: create replacement pods only after terminated
2023-07-21 06:46:09 -07:00
Wojciech Tyczyński
6acfa3cb4a Graduate APIListChunking to GA 2023-07-21 14:09:40 +02:00
Kubernetes Prow Robot
f5130e4387
Merge pull request #119484 from logicalhan/update-docs
update documented metrics
2023-07-21 04:32:06 -07:00
宋文杰
e9b69d1958 e2e_scheduling: stop using deprecated framework.ExpectEqual 2023-07-21 17:35:16 +08:00
carlory
2222509cb4 e2e_instrumentation: stop using deprecated framework.ExpectEqual 2023-07-21 15:41:26 +08:00
Kubernetes Prow Robot
4457f85eb3
Merge pull request #119473 from gjkim42/fix-e2e-node-to-use-pod-restartpolicy-correctly
node_e2e: Fix `createStaticSystemNodeCriticalPod`'s invalid spec
2023-07-20 18:28:07 -07:00
kannon92
74fcf3e766 implementation of PodReplacementPolicy kep in the job controller 2023-07-21 00:44:53 +00:00
Han Kang
cb6d7fbe66 update documented metrics 2023-07-20 16:05:38 -07:00
Alexander Zielenski
b3b775baa5 test: adjust tests for v1beta1 validatingadmissionpolicy 2023-07-20 13:24:09 -07:00
Kubernetes Prow Robot
5766947ab8
Merge pull request #119437 from serathius/etcd-semantics
Fix the semantic meaning of etcd server within component statuses and metrics.
2023-07-20 13:18:22 -07:00
Kubernetes Prow Robot
8a053c700a
Merge pull request #119215 from alexzielenski/apiserver/policy/namespaceParamRef-alpha
KEP-3488: Per namespace policy params
2023-07-20 13:18:10 -07:00
Kubernetes Prow Robot
4b6c340c13
Merge pull request #119422 from saschagrunert/conformance-debian-base
Use debian-base instead of distroless for conformance image
2023-07-20 11:18:10 -07:00
Alexander Zielenski
b5e9e0168c feature: add multiple params capability to VAP controller 2023-07-20 09:30:10 -07:00
Kubernetes Prow Robot
f9744df78b
Merge pull request #119412 from marosset/mark-node-log-query-tests-linux-only
Mark existing NodeLogQuery e2e tests as [LinuxOnly] since they do not run on Windows
2023-07-20 09:12:09 -07:00
Marek Siarkowicz
03aad1f823 Fix the semantic meaning of etcd server within component statuses and metrics.
Instead of enumerating all the etcd endpoints known by apiserver, we will
group them by purpose. `etcd-0` will be the default etcd, `etcd-1` will
be the first resource override, `etcd-2` will be the second override and
so on.
2023-07-20 15:08:41 +02:00
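The naming scheme described in the commit above can be sketched as follows. This is a minimal illustration, not the apiserver's code; `etcdGroupNames` is a hypothetical helper that takes the number of resource overrides as its only input.

```go
package main

import "fmt"

// etcdGroupNames is a hypothetical helper (not the apiserver's code).
// Given the number of resource overrides, it returns one name per etcd
// group: "etcd-0" for the default etcd and "etcd-N" for the N-th override.
func etcdGroupNames(overrides int) []string {
	if overrides < 0 {
		overrides = 0 // treat a negative count as "no overrides"
	}
	names := make([]string, 0, overrides+1)
	for i := 0; i <= overrides; i++ {
		names = append(names, fmt.Sprintf("etcd-%d", i))
	}
	return names
}
```

The point of the sketch is that the index identifies a purpose (default or N-th override), not an individual endpoint, so the number of names does not grow with the number of endpoints in a group.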
Gunju Kim
e0a6eb93a1
node_e2e: Fix createStaticSystemNodeCriticalPod's invalid spec
This fixes `createStaticSystemNodeCriticalPod` to set pod's
restartPolicy instead of container's restartPolicy.
2023-07-20 20:18:05 +09:00
Itamar Holder
ee82654e39 Add pod_swap_usage_bytes as an expected metric in e2e test
Use haveKeys() matcher from previous commit to ensure
required keys exist.

Signed-off-by: Itamar Holder <iholder@redhat.com>
2023-07-19 14:44:05 +03:00
Itamar Holder
81abfca407 Add a haveKeys() helper function to match multiple keys
Signed-off-by: Itamar Holder <iholder@redhat.com>
2023-07-19 14:44:04 +03:00
Sascha Grunert
1000ed64e0
Use debian-base instead of distroless for conformance image
The `diff` binary (required by the `kubectl diff` e2e test) gets
statically or dynamically linked based on the used glibc version. We
cannot really predict that behavior for the various platforms of
debian-base and therefore cannot copy the binary around. This means that
distroless is not a great choice for the conformance image unless we
stop relying on `diff`.

This means we now switch back to `debian-base` for the conformance image
to simplify the build process and reduce the amount of moving parts.

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2023-07-19 10:14:37 +02:00
Kubernetes Prow Robot
66e99b3ff1
Merge pull request #117740 from Richabanker/uvip-impl
Unknown Version Interoperability Proxy Impl
2023-07-18 18:36:02 -07:00
Richa Banker
c1aef65640 Add integration test 2023-07-18 17:36:30 -07:00
Richa Banker
cd5f3d9f9d Add impl for uvip 2023-07-18 17:36:22 -07:00
Kubernetes Prow Robot
d1d86dafb7
Merge pull request #118772 from kannon92/terminating-pod-gc
KEP-3939: pod gc changes for pod replacement policy kep
2023-07-18 16:46:03 -07:00
Michał Woźniak
a15c27661e
Job controller implementation of backoff limit per index (#118009) 2023-07-18 13:44:11 -07:00
Kubernetes Prow Robot
f55f2785e2
Merge pull request #116254 from pohly/dra-node-authorizer
node authorizer: limit kubelet access to ResourceClaim objects
2023-07-18 13:44:04 -07:00
Mark Rossetti
b5012ba1e2
Mark existing NodeLogQuery e2e tests as [LinuxOnly] since they do not run on Windows
Signed-off-by: Mark Rossetti <marosset@microsoft.com>
2023-07-18 13:24:01 -07:00
Kubernetes Prow Robot
7aa4e089cd
Merge pull request #118808 from Jefftree/updated-lazy-crd-controller-v2
Fix lazy OpenAPI V2 CRD Controller
2023-07-18 12:33:20 -07:00
Keerthan Reddy Mala
106a931c29 fix integration test 2023-07-18 10:24:04 -07:00
Kubernetes Prow Robot
56b59c89f5
Merge pull request #117666 from carlory/fix-008
Remove ability to re-enable serving deprecated policyv1beta1 APIs
2023-07-18 08:41:12 -07:00
Sascha Grunert
a35608cc61
Update debian-base for test images and others
Bump all users of debian-base to the latest bookworm image.

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2023-07-18 16:39:03 +02:00
Kubernetes Prow Robot
9946ea9fd8
Merge pull request #119365 from xmudrii/go1206-iptables
Bump distroless-iptables to 0.2.6
2023-07-18 02:11:20 -07:00
Kubernetes Prow Robot
d43e6705f1
Merge pull request #119095 from saschagrunert/debian-base-bookworm
Update debian-base to bookworm
2023-07-18 02:11:08 -07:00
Jefftree
735be024cf Make CRDs built and aggregated lazily for oasv2 2023-07-18 04:49:56 +00:00
Kubernetes Prow Robot
b4d793c450
Merge pull request #118865 from iholder101/kubelet/add-swap-to-summary-stats
Add swap stats to Summary API and Prometheus endpoints (`/stats/summary` and `/metrics/resource`)
2023-07-17 19:49:18 -07:00
Kubernetes Prow Robot
da2fdf8cc3
Merge pull request #118764 from iholder101/Swap/burstableQoS-impl
Add full cgroup v2 swap support with automatically calculated swap limit for LimitedSwap and Burstable QoS Pods
2023-07-17 19:49:07 -07:00
carlory
850dc6123a Remove ability to re-enable serving deprecated policyv1beta1 APIs 2023-07-18 10:24:26 +08:00
Kubernetes Prow Robot
d17f3ba2cf
Merge pull request #119168 from gjkim42/sidecar-allow-probes-and-lifecycle-hooks
Allow all probes and lifecycle for restartable init containers
2023-07-17 18:11:07 -07:00
Itamar Holder
4cb5547f93 Adjust summary API e2e test
Signed-off-by: Itamar Holder <iholder@redhat.com>
2023-07-18 02:55:56 +03:00
Kubernetes Prow Robot
a9b3ca34b5
Merge pull request #119381 from gjkim42/fix-e2e-tests-overriding-grace-period-when-probe-is-set
Fix e2e tests for overriding timoutGracePeriodSeconds when probe is set
2023-07-17 16:55:18 -07:00
Gunju Kim
d029f0ef06
Add restartable init container lifecycle hook e2e tests 2023-07-18 08:12:25 +09:00
Gunju Kim
f17e911777
Add restartable init container probe e2e tests 2023-07-18 08:12:25 +09:00
Gunju Kim
3bf282652f
Allow restartable init containers to have lifecycle 2023-07-18 08:12:24 +09:00
Kensei Nakada
c7e7eee554
feature(scheduling_queue): track events per Pods (#118438)
* feature(scheduling_queue): track events per Pods

* fix typos

* record events in one slice and make each in-flight Pod to refer it

* fix: use Pop() in test before AddUnschedulableIfNotPresent to register in-flight Pods

* eliminate MakeNextPodFuncs

* call Done inside the scheduling queue

* fix comment

* implement done() not to require lock in it

* fix UTs

* improve the receivedEvents implementation based on suggestions

* call DonePod when we don't call AddUnschedulableIfNotPresent

* fix UT

* use queuehint to filter out events for in-flight Pods

* fix based on suggestion from aldo

* fix based on suggestion from Wei

* rename lastEventBefore → previousEvent

* fix based on suggestion

* address comments from aldo

* fix based on the suggestion from Abdullah

* gate in-flight Pods logic by the SchedulingQueueHints feature gate
2023-07-17 15:53:07 -07:00
Kubernetes Prow Robot
a776bf0462
Merge pull request #116335 from gnufied/update-api-recovery-apis
Update api recovery apis
2023-07-17 14:52:35 -07:00
Kubernetes Prow Robot
1da70b0736
Merge pull request #119264 from logicalhan/promote-metrics
promote beta metrics
2023-07-17 13:47:41 -07:00
Kubernetes Prow Robot
92856db662
Merge pull request #118973 from ffromani/kubelet-podresources-getallocatable-ga
node: podresources: getallocatable: move to GA
2023-07-17 13:47:33 -07:00
Kubernetes Prow Robot
8633adbb07
Merge pull request #119342 from A-Hilaly/api-server/webhooks/match-conditions-integration-tests
Add integration tests for `MatchConditions` feature gate enablement
2023-07-17 12:47:23 -07:00
Hemant Kumar
e011187114 Update code to use new generic allocatedResourceStatus field 2023-07-17 15:30:35 -04:00
Kubernetes Prow Robot
890a6c8f70
Merge pull request #118895 from RyanAoh/kep-1860
Make Kubernetes aware of the LoadBalancer behaviour
2023-07-17 11:41:10 -07:00
Han Kang
aa788219f4 fix metric names 2023-07-17 11:22:21 -07:00
Amine
00de051729 Make matchConditionsFeatureGateInitiallyEnabled a boolean instead 2023-07-17 18:34:42 +01:00
Gunju Kim
48eee4eed8
Fix e2e tests for overriding timoutGracePeriodSeconds of probes
Passing "/bin/sh" arguments to agnhost container has caused failure by
itself.

This fixes the container image, allowing it to properly test the restart
triggered by probe failure.
2023-07-18 02:31:36 +09:00
Kubernetes Prow Robot
4f60a8d493
Merge pull request #119110 from andrewsykim/apf-metrics-beta
Promote kube-apiserver flowcontrol metrics to Beta
2023-07-17 09:05:12 -07:00
Marko Mudrinić
ff62cca47b
Bump distroless-iptables to 0.2.6
Signed-off-by: Marko Mudrinić <mudrinic.mare@gmail.com>
2023-07-17 12:13:37 +02:00
Paco Xu
709eb6c030 eviction for pid trigger PIDPressure condition slowly on CRI-O
Signed-off-by: Paco Xu <paco.xu@daocloud.io>
2023-07-17 16:34:28 +08:00
Aohan Yang
b1850497b4 Integration tests for IP mode field 2023-07-17 16:03:02 +08:00
kannon92
e38ab6d367 Add PodGC changes for PodReplacementPolicy 2023-07-16 23:47:04 +00:00
Amine
6b3ce3004d Add integration tests for match conditions feature gate enablement 2023-07-16 01:06:08 +01:00
Kubernetes Prow Robot
900237fada
Merge pull request #118635 from ffromani/devmgr-check-pod-running
kubelet: devices: skip allocation for running pods
2023-07-15 05:43:16 -07:00
Kubernetes Prow Robot
5c96e5321e
Merge pull request #119324 from xmudrii/go1206
[go] Bump images, versions and deps to use Go 1.20.6
2023-07-15 03:07:15 -07:00
Kubernetes Prow Robot
8a0ea1bd58
Merge pull request #109616 from wzshiming/feat/pod-host-ips
Field `status.hostIPs` added for Pod
2023-07-15 00:31:04 -07:00
Cici Huang
13172cba5c
ValidatingAdmissionPolicy: support namespace access (#118267)
* Support namespace access from cel expression in validatingadmissionpolicy.

* Whitelist the exposed fields in namespace object and add test

* better handling of cluster-scoped resources.

* [API REVIEW] namespaceObject in Expression doc.

* compatibility with composition.

* generated: ./hack/update-codegen.sh && ./hack/update-openapi-spec.sh

* workaround namespace of namespace is unexpectedly set.

* basic test coverage for namespaceObject.

---------

Co-authored-by: Jiahui Feng <jhf@google.com>
2023-07-14 17:53:08 -07:00
Kubernetes Prow Robot
47aeec63a8
Merge pull request #119272 from deads2k/resources
add list of served versions to storage version
2023-07-14 13:22:41 -07:00
David Eads
90ab7580aa add list of served versions to storage version 2023-07-14 13:47:19 -04:00
Itamar Holder
619be9c153 Add a swap e2e test
Signed-off-by: Itamar Holder <iholder@redhat.com>
2023-07-14 14:52:28 +03:00
Marko Mudrinić
69c4bc29f5
[go] Bump images, versions and deps to use Go 1.20.6
Signed-off-by: Marko Mudrinić <mudrinic.mare@gmail.com>
2023-07-14 12:04:13 +02:00
Shiming Zhang
b2613dd381 Add e2e to check that hostIPs and Downward API works 2023-07-14 09:35:31 +08:00
Jiahui Feng
b635f2a401
ValidatingAdmissionPolicy: Variable Composition (#118642)
* [API REVIEW] Variable Composition

* lazy map.

* variable composition implementation.

* check variables during VAP validation.

* generated: ./hack/update-vendor.sh

* generated: UPDATE_COMPATIBILITY_FIXTURE_DATA

(cd staging/src/k8s.io/api/ && env UPDATE_COMPATIBILITY_FIXTURE_DATA=true go test)

* cost calculation.

* tests for cost calculations.

* e2e test for variables.

* fix doc for Validation.Expression.

* generated: ./hack/update-codegen.sh

* fix missing utilruntime import.

* generated: ./hack/update-openapi-spec.sh
2023-07-13 17:13:28 -07:00
Kubernetes Prow Robot
1e21da87b8
Merge pull request #118988 from nilekhc/hash-keyid
[KMSv2] chore: hashes keyID being logged
2023-07-13 15:47:48 -07:00
Kubernetes Prow Robot
be2cfc9697
Merge pull request #118228 from carlory/move-non-graceful-node-shutdown-to-GA
move non-graceful node shutdown to GA
2023-07-13 15:47:37 -07:00
Kubernetes Prow Robot
bea27f82d3
Merge pull request #118209 from pohly/dra-pre-scheduled-pods
dra: pre-scheduled pods
2023-07-13 14:43:37 -07:00
Kubernetes Prow Robot
1db4658614
Merge pull request #119295 from jsafrane/remove-serial-localvolume
Remove test Pods sharing a single local PV
2023-07-13 13:43:21 -07:00
Nilekh Chaudhari
131216fa8f
chore: hashes keyID
Signed-off-by: Nilekh Chaudhari <1626598+nilekhc@users.noreply.github.com>
2023-07-13 20:42:09 +00:00
Jiahui Feng
049614f884
ValidatingAdmissionPolicy controller for Type Checking (#117377)
* [API REVIEW] ValidatingAdmissionPolicyStatusController config.

worker count.

* ValidatingAdmissionPolicyStatus controller.

* remove CEL typechecking from API server.

* fix initializer tests.

* remove type checking integration tests

from API server integration tests.

* validatingadmissionpolicy-status options.

* grant access to VAP controller.

* add defaulting unit test.

* generated: ./hack/update-codegen.sh

* add OWNERS for VAP status controller.

* type checking test case.
2023-07-13 13:41:50 -07:00
Andrew Sy Kim
d25075f342 update generated list of stable metrics
Signed-off-by: Andrew Sy Kim <andrewsy@google.com>
2023-07-13 20:13:04 +00:00
Patrick Ohly
80ab8f0542 dra: handle scheduled pods in kube-controller-manager
When someone decides that a Pod should definitely run on a specific node, they
can create the Pod with spec.nodeName already set. Some custom scheduler might
do that. Then kubelet starts to check the pod and (if DRA is enabled) will
refuse to run it, either because the claims are still waiting for the first
consumer or the pod wasn't added to reservedFor. Both are things the scheduler
normally does.

Also, if a pod got scheduled while the DRA feature was off in the
kube-scheduler, a pod can reach the same state.

The resource claim controller can handle these two cases by taking over for the
kube-scheduler when nodeName is set. Triggering an allocation is simpler than
in the scheduler because all it takes is creating the right
PodSchedulingContext with spec.selectedNode set. There's no need to list nodes
because that choice was already made, permanently. Adding the pod to
reservedFor also isn't hard.

What's currently missing is triggering de-allocation of claims to re-allocate
them for the desired node. This is not important for claims that get created
for the pod from a template and then only get used once, but it might be
worthwhile to add de-allocation in the future.
2023-07-13 21:27:11 +02:00
Jordan Liggitt
39207dada2 Add integration test for node authorizer claim references 2023-07-13 20:42:21 +02:00
CoderSherlock
b7cbebcd03 Added oomkill test for init container and fix typos 2023-07-13 17:19:34 +00:00
Jan Safranek
052b06bdad Remove test Pods sharing a single local PV
The test runs two pods accessing the same local volume, which duplicates
the "Two pods mounting a local volume at the same time" test.
2023-07-13 18:33:18 +02:00
Rafael Fonseca
9f5b6db8be test: azure: check error for cloud detection.
If something goes wrong during the Azure cloud detection, trying to cast
the returned value will result in the following panic and give no clue
as to what the error was.

```
  panic: interface conversion: cloudprovider.Interface is nil, not *azure.Cloud

goroutine 1 [running]:
k8s.io/kubernetes/test/e2e/framework/providers/azure.newProvider()
	test/e2e/framework/providers/azure/azure.go:50 +0x2b5
k8s.io/kubernetes/test/e2e/framework.SetupProviderConfig({0xc0007966b8, 0x5})
	test/e2e/framework/provider.go:82 +0x1a6
```
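
A minimal sketch of the pattern described above: check the detection error first, then use a checked type assertion so a nil interface yields an error instead of a panic. `Interface`, `Cloud` and `detect` are hypothetical stand-ins for illustration, not the real cloudprovider or azure types.

```go
package main

import (
	"errors"
	"fmt"
)

// Interface is a hypothetical stand-in for cloudprovider.Interface.
type Interface interface{}

// Cloud is a hypothetical stand-in for *azure.Cloud.
type Cloud struct{ Name string }

// detect is a hypothetical detection function that may fail.
func detect(fail bool) (Interface, error) {
	if fail {
		return nil, errors.New("cloud detection failed")
	}
	return &Cloud{Name: "azure"}, nil
}

// newProvider reports the detection error instead of blindly casting.
func newProvider(fail bool) (*Cloud, error) {
	iface, err := detect(fail)
	if err != nil {
		return nil, fmt.Errorf("detecting cloud: %w", err)
	}
	// The two-value form of the assertion never panics.
	cloud, ok := iface.(*Cloud)
	if !ok {
		return nil, fmt.Errorf("unexpected cloud type %T", iface)
	}
	return cloud, nil
}

func main() {
	if _, err := newProvider(true); err != nil {
		fmt.Println("error:", err)
	}
	if c, err := newProvider(false); err == nil {
		fmt.Println("provider:", c.Name)
	}
}
```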
2023-07-13 09:04:24 +02:00
Kubernetes Prow Robot
406d2dfe61
Merge pull request #119250 from pohly/controller-contextual-logging
kube-controller-manager: finish conversion to contextual logging
2023-07-12 18:59:30 -07:00
Kubernetes Prow Robot
4af23c157c
Merge pull request #119242 from carlory/add-logger
change the QueueingHintFn to pass a logger
2023-07-12 13:03:31 -07:00
Kubernetes Prow Robot
047d040ce7
Merge pull request #119012 from pohly/dra-batch-node-prepare
kubelet: support batched prepare/unprepare in v1alpha3 DRA plugin API
2023-07-12 10:57:37 -07:00
Kubernetes Prow Robot
2ec4e14bfa
Merge pull request #118812 from serathius/storage-metric
Improve apiserver storage size metric
2023-07-12 10:57:26 -07:00
carlory
0599b3caa0 change the QueueingHintFn to pass a logger 2023-07-13 00:56:41 +08:00
Patrick Ohly
08d40f53a7 dra: test with and without immediate ReservedFor
The recommendation and default in the controller helper code is to set
ReservedFor to the pod which triggered delayed allocation. However, this
is neither required nor enforced. Therefore we should also test the fallback
path where kube-scheduler itself adds the pod to ReservedFor.
2023-07-12 16:57:17 +02:00
Patrick Ohly
7d064812bb kube-controller-manager: finish conversion to contextual logging
This removes all exceptions and fixes the remaining unconverted log calls.
2023-07-12 14:57:29 +02:00
Kubernetes Prow Robot
3cc729fc7f
Merge pull request #119195 from pohly/dra-reallocate-flake
dra e2e: fix "reallocation works" flake
2023-07-12 05:55:25 -07:00
Patrick Ohly
d743c50bb9 kubelet: support batched prepare/unprepare in v1alpha3 DRA plugin API
Combining all prepare/unprepare operations for a pod enables plugins to
optimize the execution. Plugins can continue to use the v1beta2 API for now,
but should switch. The new API is designed so that plugins which want to work
on each claim one-by-one can do so and then report errors for each claim
separately, i.e. partial success is supported.
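
A rough sketch of what per-claim error reporting in a batched call could look like. The `Claim` and `Result` types and the `prepareOne` helper are hypothetical and only illustrate the idea of partial success; they are not the v1alpha3 DRA plugin API messages.

```go
package main

import (
	"errors"
	"fmt"
)

// Claim and Result are hypothetical types for illustration only.
type Claim struct {
	UID  string
	Name string
}

type Result struct {
	Devices []string // set on success
	Err     error    // set on failure
}

// prepareOne is a placeholder for plugin-specific work on a single claim.
func prepareOne(c Claim) ([]string, error) {
	if c.Name == "" {
		return nil, errors.New("claim has no name")
	}
	return []string{"device-for-" + c.Name}, nil
}

// PrepareBatch handles all claims of a pod in one call and records the
// outcome per claim, so one failing claim does not hide the others.
func PrepareBatch(claims []Claim) map[string]Result {
	results := make(map[string]Result, len(claims))
	for _, c := range claims {
		devices, err := prepareOne(c)
		results[c.UID] = Result{Devices: devices, Err: err}
	}
	return results
}

func main() {
	res := PrepareBatch([]Claim{{UID: "a", Name: "gpu"}, {UID: "b"}})
	for _, uid := range []string{"a", "b"} {
		fmt.Printf("%s: devices=%v err=%v\n", uid, res[uid].Devices, res[uid].Err)
	}
}
```

A plugin that prefers to work claim by claim can keep doing so inside the loop; a plugin that can optimize across claims would replace the loop body while keeping the per-claim result map.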
2023-07-12 14:50:30 +02:00
Marek Siarkowicz
7a63997c8a Improve apiserver storage size metric to allow its graduation
Change name to make it compliant with prometheus guidelines.
Calculate it on demand instead of periodically to comply with prometheus standards.
Replace "endpoint" with "server" label to make it semantically consistent with storage factory
2023-07-12 14:33:10 +02:00
Francesco Romani
01c3a51a78 node: podresources: getallocatable: move to GA
lock the feature gate to GA, and remove the now-redundant code.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2023-07-12 14:11:22 +02:00
Francesco Romani
d78671447f e2e: node: add test to check device-requiring pods are cleaned up
Make sure orphaned pods (pods deleted while kubelet is down) are
handled correctly.
Outline:
1. create a pod (not static pod)
2. stop kubelet
3. while kubelet is down, force delete the pod on API server
4. restart kubelet
the pod becomes an orphaned pod and is expected to be killed by HandlePodCleanups.

There is a similar test already, but here we want to check device
assignment.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2023-07-12 13:25:36 +02:00
Francesco Romani
5cf50105a2 e2e: node: devices: improve the node reboot test
The recently added e2e device plugins test to cover node reboot
works fine if it runs every time on a CI environment but
does not correctly handle partial setup when run repeatedly on
the same instance (developer setup).

To accommodate both flows, we extend the error management, checking
more error conditions in the flow.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2023-07-12 13:25:36 +02:00
Francesco Romani
b926aba268 e2e: node: devicemanager: update tests
Fix e2e device manager tests.
Most notably, the workload pods need to survive a kubelet
restart. Update tests to reflect that.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2023-07-12 13:25:36 +02:00
Maciej Szulik
ab3a0b78ea
Match both old and new kubectl version for a while in e2e 2023-07-12 12:49:33 +02:00
Kubernetes Prow Robot
745cfa35bd
Merge pull request #119147 from mengjiao-liu/contextual-logging-controller-disruption
Migrate /pkg/controller/disruption to structured and contextual logging
2023-07-12 03:35:25 -07:00
Kubernetes Prow Robot
a8093823c3
Merge pull request #119042 from sttts/sttts-restcore-split
cmd/kube-apiserver: turn core (legacy) rest storage into standard RESTStorageProvider
2023-07-12 03:35:17 -07:00
Patrick Ohly
c143a875ed dra e2e: fix "reallocation works" flake
The main problem probably was that
https://github.com/kubernetes/kubernetes/pull/118862 moved creating the first
pod before setting up the callback which blocks allocating one claim for that
pod. This is racy because allocations happen in the background.

The test also was unnecessarily complex and hard to read:
- The intended effect can be achieved with three instead of four claims.
- It wasn't clear which claim has "external-claim-other" as name.
  Using the claim variable avoids that.
2023-07-12 11:20:47 +02:00
Mengjiao Liu
19869478c1 Migrate /pkg/controller/disruption to structured and contextual logging 2023-07-12 11:30:45 +08:00
Kubernetes Prow Robot
98e7c2a751
Merge pull request #119237 from jpbetz/jpbetz-apiserver-integration-owner
Add jpbetz as approver of apiserver integration tests
2023-07-11 20:03:18 -07:00
Kubernetes Prow Robot
2d9c951abe
Merge pull request #117011 from fabi200123/Add-Node-Log-Query-Tests-
Add e2e tests for feature NodeLogQuery
2023-07-11 20:03:11 -07:00
Kubernetes Prow Robot
6ffca50136
Merge pull request #116443 from benluddy/secondary-authz-decision-caching
Cache authz decisions within the scope of validating policy admission.
2023-07-11 12:41:11 -07:00
Joe Betz
6d6595d0f6 Add jpbetz as approver of apiserver integration tests 2023-07-11 14:36:45 -04:00
Kubernetes Prow Robot
da61644869
Merge pull request #119179 from gjkim42/add-prestop-e2e-test
node-e2e: Add container lifecycle e2e tests for preStop hook
2023-07-11 10:33:23 -07:00
Kubernetes Prow Robot
e0dafe57a3
Merge pull request #117351 from pohly/dra-generated-resource-claim-names
DRA: generated resource claim names
2023-07-11 10:33:11 -07:00
Dr. Stefan Schimanski
75e3576523
kube-apiserver: rewire service controllers: kubernetesservice + IP repair 2023-07-11 17:27:20 +02:00
Arda Güçlü
3267dd9d52
kubectl delete: Introduce new interactive flag for interactive deletion (#114530) 2023-07-11 06:05:11 -07:00
Patrick Ohly
ba810871ad dra e2e: check that not generating a ResourceClaim works
This is not something that normally happens, but the API supports it because it
might be needed at some point, so we have to test it.
2023-07-11 14:23:49 +02:00
Patrick Ohly
444d23bd2f dra: generated name for ResourceClaim from template
Generating the name avoids all potential name collisions. It's not clear how
much of a problem that was because users can avoid them and the deterministic
names for generic ephemeral volumes have not led to reports from users. But
using generated names is not too hard either.

What makes it relatively easy is that the new pod.status.resourceClaimStatus
map stores the generated name for kubelet and node authorizer, i.e. the
information in the pod is sufficient to determine the name of the
ResourceClaim.

The resource claim controller becomes a bit more complex and now needs
permission to modify the pod status. The new failure scenario of "ResourceClaim
created, updating pod status fails" is handled with the help of a new special
"resource.kubernetes.io/pod-claim-name" annotation that together with the owner
reference identifies exactly for what a ResourceClaim was generated, so
updating the pod status can be retried for existing ResourceClaims.

The transition from deterministic names is handled with a special case for that
recovery code path: a ResourceClaim with no annotation and a name that follows
the Kubernetes <= 1.27 naming pattern is assumed to be generated for that pod
claim and gets added to the pod status.

There's no immediate need for it, but just in case that it may become relevant,
the name of the generated ResourceClaim may also be left unset to record that
no claim was needed. Components processing such a pod can skip whatever they
normally would do for the claim. To ensure that they do and also cover other
cases properly ("no known field is set", "must check ownership"),
resourceclaim.Name gets extended.
2023-07-11 14:23:48 +02:00
Kubernetes Prow Robot
86038ae590
Merge pull request #116846 from moshe010/e2e--node-pod-resources
kubelet pod-resources: add e2e for KubeletPodResourcesGet feature
2023-07-11 04:53:24 -07:00
Kubernetes Prow Robot
8f1852bb44
Merge pull request #115295 from Namanl2001/pkg/controller/endpointslice
Migrated `pkg/controller/endpointslice` and `pkg/controller/endpointslicemirroring` to contextual logging
2023-07-11 03:19:12 -07:00
carlory
f443c458af move non-graceful node shutdown to GA 2023-07-11 13:51:51 +08:00
Kubernetes Prow Robot
ad72319ece
Merge pull request #115122 from r-erema/110782-oidc-test-coverage
add integration tests for OIDC authenticator
2023-07-10 15:29:10 -07:00
Naman
645cb90732 migrated pkg/controller/endpointslicemirroring to contextual logging
Signed-off-by: Naman <namanlakhwani@gmail.com>
2023-07-11 01:43:30 +05:30
Naman
09849b09cf migrated pkg/controller/endpointslice to contextual logging
Signed-off-by: Naman <namanlakhwani@gmail.com>
2023-07-11 01:28:22 +05:30
Sascha Grunert
3bae26ae58
Check dbus error on container runtime start/stop
We should evaluate the error, otherwise we risk hanging indefinitely while
waiting for the `reschan` in:

64939b66c6/test/e2e_node/util.go (L419)

We also increase the timeout, because it can take a bit longer for
runtimes to terminate, depending on the work that has to be done on
running containers.

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2023-07-10 13:45:40 +02:00
Kubernetes Prow Robot
80dab4127b
Merge pull request #116720 from soltysh/remove_short_version
Remove long/golang version information making short the default
2023-07-10 02:41:06 -07:00
Sascha Grunert
a6554b9d5d
Make kubelet label types public
We use the label definitions in CRI-O, which means we now make them public to
stop vendoring/copying this part of Kubernetes.

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2023-07-10 10:58:44 +02:00
Gunju Kim
8fb5b6eb4c
node-e2e: Add container lifecycle e2e tests for preStop hook
This ensures that the container's pre-stop hook is invoked if the
startup or liveness probe fails.
2023-07-10 08:55:48 +09:00
Kubernetes Prow Robot
d653dcab5a
Merge pull request #119048 from pohly/scheduler-perf-metrics-for-perfdash
scheduler-perf: metrics for perfdash
2023-07-09 09:27:04 -07:00
Kubernetes Prow Robot
19a25bac05
Merge pull request #119159 from alculquicondor/fix-job-uncounted
Only declare job as finished after removing all finalizers
2023-07-08 01:55:03 -07:00
kerthcet
47ef977ddd Direct reference to the packages
Signed-off-by: kerthcet <kerthcet@gmail.com>
2023-07-08 12:03:46 +08:00
Kubernetes Prow Robot
0e14098333
Merge pull request #116429 from SergeyKanzhelev/sidecar
Add SidecarContainers feature
2023-07-07 17:39:03 -07:00
Gunju Kim
03c2217687
Sidecar: Add e2e tests
Co-authored-by: Sergey Kanzhelev <S.Kanzhelev@live.com>
2023-07-08 07:26:12 +09:00
Kubernetes Prow Robot
16e3980c08
Merge pull request #119158 from dims/skip-GracefulNodeShutdown-tests-on-older-systemd-versions
Skip GracefulNodeShutdown on older systemd versions
2023-07-07 14:13:04 -07:00
Davanum Srinivas
10dc1ca084
Skip GracefulNodeShutdown on older systemd versions
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2023-07-07 16:08:42 -04:00
Kubernetes Prow Robot
c17601fa18
Merge pull request #118529 from kerthcet/feat/record-failed-plugins
Record failed plugins in Reserve Plugin
2023-07-07 11:09:03 -07:00
Aldo Culquicondor
f7a1fb76f4
Only declare job as finished after removing all finalizers
Change-Id: Id4b01b0e6fabe24134e57e687356e0fc613cead4
2023-07-07 14:08:19 -04:00
Maciej Szulik
3f07fc3acc
Remove long/golang version information making short the default 2023-07-07 18:46:34 +02:00
Kubernetes Prow Robot
ddb2013363
Merge pull request #119103 from pohly/e2e-unexpected-args
e2e: detect unexpected command line arguments
2023-07-07 04:37:04 -07:00
kerthcet
c0eb0caf4a Support fine-grained rescheduling in ReservePlugin
Signed-off-by: kerthcet <kerthcet@gmail.com>
2023-07-07 13:30:29 +08:00
Kubernetes Prow Robot
b07a843cb5
Merge pull request #119046 from kerthcet/fix/handle-unschedule-plugins
Fix fitError in Permit plugin not handled perfectly
2023-07-06 21:01:03 -07:00
kerthcet
278a8376e1 Fix: fiterror in permit plugin not handled perfectly
We only added failed plugins, but actually this will not work unless
we make the status with a fitError because we only copy the failed plugins
to podInfo if it is a fitError

Signed-off-by: kerthcet <kerthcet@gmail.com>
2023-07-07 10:35:59 +08:00
Kubernetes Prow Robot
d02d8ba635
Merge pull request #118862 from byako/batching-dra-calls
DRA controller: batch resource claims for Allocate
2023-07-06 11:33:03 -07:00
Kubernetes Prow Robot
6f9d1d38d8
Merge pull request #118817 from pohly/dra-delete-claims
DRA: improve handling of completed pods
2023-07-06 10:15:15 -07:00
Alexey Fomenko
b10cc642b5
DRA controller: batch resource claims for Allocate
Signed-off-by: Alexey Fomenko <alexey.fomenko@intel.com>
Co-authored-by: Patrick Ohly <patrick.ohly@intel.com>
2023-07-06 19:31:45 +03:00
Kubernetes Prow Robot
8c1bf4f461
Merge pull request #116930 from fatsheep9146/contextual-logging-cleanup
contextual logging cleanup
2023-07-06 07:39:03 -07:00
Kubernetes Prow Robot
d48fc2ad2d
Merge pull request #119035 from saschagrunert/critical-pod
Fix `should be able to create and delete a critical pod` test
2023-07-06 00:51:03 -07:00
Kubernetes Prow Robot
e5efa0a5ee
Merge pull request #117108 from pohly/test-integration-race-detection-component-base-logs
component-base/logs: improve handling of re-applying a configuration
2023-07-05 21:29:08 -07:00
Kubernetes Prow Robot
cd32adebd9
Merge pull request #118386 from Richabanker/enhance-storage-version
Add servedVersions info in StorageVersion API
2023-07-05 19:23:02 -07:00
Kubernetes Prow Robot
77ad8a516d
Merge pull request #119025 from ii/remove-csidriver-test
Remove conformance test for StorageV1CSIDriver Endpoints
2023-07-05 17:53:02 -07:00
Ziqi Zhao
dfc1838379 Migrated pkg/controller/volume|util|replicaset|nodeipam to contextual logging
Signed-off-by: Ziqi Zhao <zhaoziqi9146@gmail.com>
2023-07-06 07:39:52 +08:00
Kubernetes Prow Robot
a88defe09a
Merge pull request #119107 from soltysh/drop_deprecated_api
Stop using deprecated API
2023-07-05 12:01:02 -07:00
Kubernetes Prow Robot
88e42040b1
Merge pull request #118981 from ffromani/e2e-podres-deflake
e2e: node: podresources: cooldown the rate limit
2023-07-05 12:00:50 -07:00
Kubernetes Prow Robot
80af36cfff
Merge pull request #119104 from pohly/e2e-remove-node-gcp
e2e node: remove unused test/e2e_node/gcp
2023-07-05 10:41:08 -07:00
Patrick Ohly
02efe09abe component-base/logs: improve handling of re-applying a configuration
Normal binaries should never have to do this. It's not safe when there are
already some goroutines running which might do logging. Therefore the new
default is to return an error when a binary accidentally re-applies.

A few unit tests ensure that there are no goroutines and have to call the functions
more than once. The new ResetForTest API gets used by those to enable changing the
logging settings more than once in the same process.

Integration tests use the same code as the normal binaries. To make reuse of
that code safe, component-base/logs can be configured to silently ignore any
additional calls. This addresses data races that were found when enabling -race
for integration tests. To catch cases where the integration test does want
to modify the config, the old and new config get compared and an error is
raised when it's not the same.

To avoid having to modify all integration tests which start test servers,
reconfiguring component-base/logs is done by the test server packages.
2023-07-05 19:08:54 +02:00
Kubernetes Prow Robot
ce7fd466aa
Merge pull request #118134 from pohly/e2e-pod-security-levels
e2e: support admissionapi.LevelRestricted in test/e2e/framework/pod
2023-07-05 08:39:09 -07:00
Maciej Szulik
9dcc57ede6
Stop using deprecated API 2023-07-05 16:42:55 +02:00
Sascha Grunert
5e1777be52
Update debian-base to bookworm
Rolling out the latest bookworm base image.

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2023-07-05 16:13:03 +02:00
Patrick Ohly
a514f40131 dra resourceclaim controller: delete generated claims when pod is done
When a pod is done, but not getting removed yet for a while, then a claim that
got generated for that pod can be deleted already. This then also triggers
deallocation.
2023-07-05 16:10:20 +02:00
Patrick Ohly
e8a0c42212 dra resourceclaim controller: remove reservation for completed pods
When a pod is known to never run (again), the reservation for it also can be
removed. This is relevant in particular for the job controller.
2023-07-05 16:10:20 +02:00
Patrick Ohly
7f5a02fc7e dra resourceclaim controller: enhance logging
Adding logging to event handlers makes it more obvious why (or why not) claims
and pods need to be processed.
2023-07-05 16:10:20 +02:00
Patrick Ohly
16e9cc42c1 e2e node: remove unused test/e2e_node/gcp
The test package was not included anywhere and thus just dead code that doesn't
need to be maintained anymore.
2023-07-05 14:31:32 +02:00
Patrick Ohly
932d0337b8 e2e: detect unexpected command line arguments
Invalid flags are detected by flag parsing, but optional arguments are just
passed through to the E2E suites. None of them support any, so rejecting them
with an error message is useful because it helps catch typos (like a missing
hyphen before a flag).
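
A minimal sketch of this kind of check using the standard library `flag` package: after parsing, any leftover positional argument is treated as an error. The flag set name and the `verbose` flag are made up for the example and are not taken from the E2E framework.

```go
package main

import (
	"flag"
	"fmt"
	"io"
)

// parseArgs parses flags and rejects leftover positional arguments.
func parseArgs(args []string) error {
	fs := flag.NewFlagSet("e2e", flag.ContinueOnError)
	fs.SetOutput(io.Discard) // keep usage output out of the example
	fs.Bool("verbose", false, "hypothetical flag for illustration")
	if err := fs.Parse(args); err != nil {
		return err
	}
	// Parsing stops at the first non-flag argument; anything left over
	// is most likely a typo such as a missing hyphen.
	if fs.NArg() > 0 {
		return fmt.Errorf("unexpected command line arguments: %q", fs.Args())
	}
	return nil
}

func main() {
	fmt.Println(parseArgs([]string{"-verbose"}))
	fmt.Println(parseArgs([]string{"verbose"}))
}
```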
2023-07-05 13:34:09 +02:00
Kubernetes Prow Robot
a9a7a3730e
Merge pull request #118994 from pohly/test-integration-race-detection-grpc-logger
integration testing: configure gRPC logging during init
2023-07-05 02:58:55 -07:00
Kubernetes Prow Robot
00b8a0a95b
Merge pull request #118160 from minherz/master
Support JSONPath condition without value
2023-07-04 00:26:52 -07:00
roman
18f2e9055f Add OIDC integration tests 2023-07-04 08:04:53 +03:00
Patrick Ohly
6b01ece580 scheduler-perf: fix perfdash display problem
perfdash expects all data items to have the same set of labels.  It then
renders drop-down buttons for each label with all values found for each
label. Previously, data items that didn't have a label didn't match any label
filter in perfdash and couldn't get selected because perfdash doesn't have
"unset" in its drop-down menus.

To avoid that, scheduler-perf now collects all labels and then adds missing
labels with "not applicable" as value:

    {
      "data": {
        "Average": 939.7071223010004,
        "Perc50": 927.7987421383649,
        "Perc90": 2166.153846153846,
        "Perc95": 2363.076923076923,
        "Perc99": 2520.6153846153848
      },
      "unit": "ms",
      "labels": {
        "Metric": "scheduler_pod_scheduling_duration_seconds",
        "Name": "SchedulingBasic/5000Nodes/namespace-2",
        "extension_point": "not applicable",
        "result": "not applicable"
      }
    },
    ...
    {
      "data": {
        "Average": 1.1172570650000004,
        "Perc50": 1.1418367346938776,
        "Perc90": 1.5500000000000003,
        "Perc95": 1.6410256410256412,
        "Perc99": 3.7333333333333334
      },
      "unit": "ms",
      "labels": {
        "Metric": "scheduler_framework_extension_point_duration_seconds",
        "Name": "SchedulingBasic/5000Nodes/namespace-2",
        "extension_point": "Score",
        "result": "not applicable"
      }
    },
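
The label normalization described above can be sketched as two passes: collect the union of label keys, then fill in whatever is missing. `DataItem` here is a reduced, hypothetical type with only the labels field, not the scheduler-perf data structure.

```go
package main

import "fmt"

// DataItem is a reduced, hypothetical version of a perf data item.
type DataItem struct {
	Labels map[string]string
}

// fillMissingLabels makes every item carry the same set of label keys,
// using "not applicable" for labels an item did not have.
func fillMissingLabels(items []DataItem) {
	// First pass: collect every label key that occurs anywhere.
	keys := map[string]struct{}{}
	for _, item := range items {
		for k := range item.Labels {
			keys[k] = struct{}{}
		}
	}
	// Second pass: add the keys each item is missing.
	for i := range items {
		if items[i].Labels == nil {
			items[i].Labels = map[string]string{}
		}
		for k := range keys {
			if _, ok := items[i].Labels[k]; !ok {
				items[i].Labels[k] = "not applicable"
			}
		}
	}
}

func main() {
	items := []DataItem{
		{Labels: map[string]string{"Metric": "scheduler_pod_scheduling_duration_seconds"}},
		{Labels: map[string]string{"Metric": "scheduler_framework_extension_point_duration_seconds", "extension_point": "Score"}},
	}
	fillMissingLabels(items)
	fmt.Println(items[0].Labels["extension_point"])
}
```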
2023-07-03 21:16:53 +02:00
Patrick Ohly
29e5771aa4 scheduler-perf: shorten "Name" label in metrics
Because the JSON file gets written at the end of the top-level benchmark, all
data items had `BenchmarkPerfScheduling/` as prefix in the `Name` label. This
is redundant and makes it harder to see the actual name. Now that common prefix
gets removed.
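
A minimal sketch of the prefix removal, assuming the prefix is the fixed string quoted above; the `shortenName` helper is a hypothetical name, not the function used in scheduler-perf:

```go
package main

import (
	"fmt"
	"strings"
)

// benchmarkPrefix is the redundant prefix quoted in the commit message.
const benchmarkPrefix = "BenchmarkPerfScheduling/"

// shortenName drops the common benchmark prefix from a Name label.
// Names without the prefix are returned unchanged.
func shortenName(name string) string {
	return strings.TrimPrefix(name, benchmarkPrefix)
}

func main() {
	fmt.Println(shortenName("BenchmarkPerfScheduling/SchedulingBasic/5000Nodes/namespace-2"))
}
```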
2023-07-03 21:15:16 +02:00
Kubernetes Prow Robot
8f79a3d91e
Merge pull request #118915 from astoycos/remove-netpol-leg
remove legacy NetworkPolicy tests
2023-07-03 09:17:04 -07:00
Patrick Ohly
c903c29c3b e2e: support admissionapi.LevelRestricted in test/e2e/framework/pod
CreatePod and MakePod only accepted an `isPrivileged` boolean, which made it
impossible to write tests using those helpers which work in a default
framework.Framework, because the default there is LevelRestricted.

The simple boolean gets replaced with admissionapi.Level. Passing
LevelRestricted does the same as calling e2epod.MixinRestrictedPodSecurity.

Instead of explicitly passing a constant to these modified helpers, most tests
get updated to pass f.NamespacePodSecurityLevel. This has the advantage
that if that level gets lowered in the future, tests only need to be updated in
one place.

In some cases, helpers taking client+namespace+timeouts parameters get replaced
with passing the Framework instance to get access to
f.NamespacePodSecurityEnforceLevel. These helpers don't need separate
parameters because in practice all they ever used were the values from the
Framework instance.
2023-07-03 16:26:28 +02:00
Humble Chirammal
ede17eace4 test-e2e: Retrigger the RBD image post merge job
The post-merge job for https://github.com/kubernetes/kubernetes/pull/117103
failed, which causes the e2e tests to fail. This PR retriggers that job.

Signed-off-by: Humble Chirammal <humble.devassy@gmail.com>
2023-07-03 18:37:46 +05:30
Kubernetes Prow Robot
8c33d3ef7b
Merge pull request #118967 from tukwila/upgrade_npd_image_v0.8.13
upgrade npd image version to v0.8.13 for kubemark
2023-07-03 05:13:02 -07:00
Sascha Grunert
bcbc12cd79
Fix should be able to create and delete a critical pod test
The namespace the critical pod was referring to was wrong, because it
was using the generated one instead of `kube-system`. This and the
resulting test condition are now fixed.

The test seems to run only in `ci-crio-cgroupv1-node-e2e-flaky` for now.

Closes https://github.com/kubernetes/kubernetes/issues/109296

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2023-07-03 11:15:59 +02:00
guangli.bao
c143710ecf upgrade npd image version to v0.8.13 for kubemark
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>

fix error

Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
2023-07-03 13:46:58 +08:00
Stephen Heywood
96694514a7 Remove csiDriver Conformance test
This test is redundant as the csiDriver lifecycle test (118478)
includes the same functionality.
2023-07-03 08:50:16 +12:00
Kubernetes Prow Robot
ec87834bae
Merge pull request #118936 from pohly/dra-deallocate-when-unused
DRA: for delayed allocation, deallocate when no longer used
2023-07-01 12:56:48 -07:00
roman
59592ba463 Add OIDC test server
2023-07-01 10:53:53 +03:00