kubernetes

Author	SHA1	Message	Date
Kubernetes Prow Robot	84a999923f	Merge pull request #119335 from mimowo/use-final-diff-for-job-pod-creation Ensure final diff is used for setting expectations for Job pod creation	2023-07-14 15:20:54 -07:00
Kubernetes Prow Robot	6f3856f953	Merge pull request #118883 from danielvegamyhre/kep-4017-job Add completion index as pod label for indexed jobs	2023-07-14 12:23:50 -07:00
Michal Wozniak	9564bdc39d	Ensure final diff is used for setting expectations for Job pod creation	2023-07-14 19:09:39 +02:00
Kubernetes Prow Robot	5c72df7281	Merge pull request #118953 from mskrocki/escLib Convert EndpointSlice Reconciler to a library in staging.	2023-07-13 17:13:34 -07:00
Kubernetes Prow Robot	be2cfc9697	Merge pull request #118228 from carlory/move-non-graceful-node-shutdown-to-GA move non-graceful node shutdown to GA	2023-07-13 15:47:37 -07:00
Daniel Vega-Myhre	037091284e	fix unit test bug	2023-07-13 22:38:21 +00:00
Kubernetes Prow Robot	bea27f82d3	Merge pull request #118209 from pohly/dra-pre-scheduled-pods dra: pre-scheduled pods	2023-07-13 14:43:37 -07:00
Daniel Vega-Myhre	a1a5f49bb9	remove statefulset label added to wrong branch	2023-07-13 21:07:17 +00:00
Daniel Vega-Myhre	1ae60c0ed1	use job completion index annotation as label	2023-07-13 21:04:37 +00:00
Jiahui Feng	049614f884	ValidatingAdmissionPolicy controller for Type Checking (#117377) * [API REVIEW] ValidatingAdmissionPolicyStatusController config. worker count. * ValidatingAdmissionPolicyStatus controller. * remove CEL typechecking from API server. * fix initializer tests. * remove type checking integration tests from API server integration tests. * validatingadmissionpolicy-status options. * grant access to VAP controller. * add defaulting unit test. * generated: ./hack/update-codegen.sh * add OWNERS for VAP status controller. * type checking test case.	2023-07-13 13:41:50 -07:00
Patrick Ohly	80ab8f0542	dra: handle scheduled pods in kube-controller-manager When someone decides that a Pod should definitely run on a specific node, they can create the Pod with spec.nodeName already set. Some custom scheduler might do that. Then kubelet starts to check the pod and (if DRA is enabled) will refuse to run it, either because the claims are still waiting for the first consumer or the pod wasn't added to reservedFor. Both are things the scheduler normally does. Also, if a pod got scheduled while the DRA feature was off in the kube-scheduler, a pod can reach the same state. The resource claim controller can handle these two cases by taking over for the kube-scheduler when nodeName is set. Triggering an allocation is simpler than in the scheduler because all it takes is creating the right PodSchedulingContext with spec.selectedNode set. There's no need to list nodes because that choice was already made, permanently. Adding the pod to reservedFor also isn't hard. What's currently missing is triggering de-allocation of claims to re-allocate them for the desired node. This is not important for claims that get created for the pod from a template and then only get used once, but it might be worthwhile to add de-allocation in the future.	2023-07-13 21:27:11 +02:00
Patrick Ohly	cffbb1f1b2	dra controller: enhance testing The allocation mode is relevant when clearing the reservedFor: for delayed allocation, deallocation gets requested, for immediate allocation not. Both should get tested. All pre-defined claims now use delayed allocation, just as they would if created normally.	2023-07-13 21:27:11 +02:00
Patrick Ohly	5cec6d798c	dra: revamp event handlers in kube-controller-manager Enabling logging is useful to track what the code is doing. There are some functional changes: - The pod handler checks for existence of claims. This avoids adding pods to the work queue in more cases when nothing needs to be done, at the cost of making the event handlers a bit slower. This will become more important when adding more work to the controller. - The handler for deleted ResourceClaim did not check for cache.DeletedFinalStateUnknown.	2023-07-13 21:27:11 +02:00
Kubernetes Prow Robot	4fa97eae1b	Merge pull request #119291 from mimowo/use-jobctx-for-first-pending Pass Job context down to firstPendingIndexes	2023-07-13 09:57:05 -07:00
Michal Wozniak	7e3b53042b	Pass Job context down to firstPendingIndexes	2023-07-13 16:11:06 +02:00
Kubernetes Prow Robot	cd9215915b	Merge pull request #118480 from carlory/gc_metrics podgc metrics should count all pod deletion behaviors	2023-07-13 06:52:05 -07:00
Patrick Ohly	98ba89d31d	resourceclaim controller: avoid caching deleted pod unnecessarily We don't need to remember that a pod got deleted when it had no resource claims because the code which checks the cached UIDs only checks for pods which have resource claims.	2023-07-12 16:57:17 +02:00
Patrick Ohly	7d064812bb	kube-controller-manager: finish conversion to contextual logging This removes all exceptions and fixes the remaining unconverted log calls.	2023-07-12 14:57:29 +02:00
Patrick Ohly	1b8ddf6b79	podgc controller: convert to contextual logging	2023-07-12 13:45:10 +02:00
Mengjiao Liu	19869478c1	Migrate /pkg/controller/disruption to structured and contextual logging	2023-07-12 11:30:45 +08:00
Maciej Skrocki	7c873327b6	Convert controller name to reconciler variable.	2023-07-11 18:08:25 +00:00
Maciej Skrocki	29fad383da	move endpointslice reconciler to staging endpointslice repo	2023-07-11 18:08:12 +00:00
Kubernetes Prow Robot	a6890b361d	Merge pull request #119193 from mimowo/sync-job-context Introduce syncJobContext to limit the number of function parameters	2023-07-11 10:33:30 -07:00
Kubernetes Prow Robot	e0dafe57a3	Merge pull request #117351 from pohly/dra-generated-resource-claim-names DRA: generated resource claim names	2023-07-11 10:33:11 -07:00
Patrick Ohly	fec25785ee	dra: store generated ResourceClaims in cache This addresses the following bad sequence of events: - controller creates ResourceClaim - updating pod status fails - pod gets retried before the informer receives the created ResourceClaim - another ResourceClaim gets created Storing the generated ResourceClaim in a MutationCache ensures that the controller knows about it during the retry. A positive side effect is that ResourceClaims now get indexed by pod owner and thus iterating over existing ones becomes a bit more efficient.	2023-07-11 14:23:49 +02:00
Patrick Ohly	444d23bd2f	dra: generated name for ResourceClaim from template Generating the name avoids all potential name collisions. It's not clear how much of a problem that was because users can avoid them and the deterministic names for generic ephemeral volumes have not led to reports from users. But using generated names is not too hard either. What makes it relatively easy is that the new pod.status.resourceClaimStatus map stores the generated name for kubelet and node authorizer, i.e. the information in the pod is sufficient to determine the name of the ResourceClaim. The resource claim controller becomes a bit more complex and now needs permission to modify the pod status. The new failure scenario of "ResourceClaim created, updating pod status fails" is handled with the help of a new special "resource.kubernetes.io/pod-claim-name" annotation that together with the owner reference identifies exactly for what a ResourceClaim was generated, so updating the pod status can be retried for existing ResourceClaims. The transition from deterministic names is handled with a special case for that recovery code path: a ResourceClaim with no annotation and a name that follows the Kubernetes <= 1.27 naming pattern is assumed to be generated for that pod claim and gets added to the pod status. There's no immediate need for it, but just in case that it may become relevant, the name of the generated ResourceClaim may also be left unset to record that no claim was needed. Components processing such a pod can skip whatever they normally would do for the claim. To ensure that they do and also cover other cases properly ("no known field is set", "must check ownership"), resourceclaim.Name gets extended.	2023-07-11 14:23:48 +02:00
Kubernetes Prow Robot	8f1852bb44	Merge pull request #115295 from Namanl2001/pkg/controller/endpointslice Migrated `pkg/controller/endpointslice` and `pkg/controller/endpointslicemirroring` to contextual logging	2023-07-11 03:19:12 -07:00
Michal Wozniak	bf48165232	Remarks to syncJobCtx	2023-07-11 09:44:08 +02:00
Michal Wozniak	990339d4c3	Introduce syncJobContext to limit the number of function parameters	2023-07-11 09:27:21 +02:00
carlory	f443c458af	move non-graceful node shutdown to GA	2023-07-11 13:51:51 +08:00
Kubernetes Prow Robot	986171d388	Merge pull request #119185 from xing-yang/metrics_attach Add reason to force detach metric	2023-07-10 14:03:18 -07:00
Naman	645cb90732	migrated pkg/controller/endpointslicemirroring to contextual logging Signed-off-by: Naman <namanlakhwani@gmail.com>	2023-07-11 01:43:30 +05:30
Daniel Vega-Myhre	98c6e25c37	update name of pod index label	2023-07-10 20:11:52 +00:00
Naman	09849b09cf	migrated pkg/controller/endpointslice to contextual logging Signed-off-by: Naman <namanlakhwani@gmail.com>	2023-07-11 01:28:22 +05:30
Kubernetes Prow Robot	10a12165de	Merge pull request #116755 from my-git9/feat/endpoint/logging Migrated `pkg/controller/endpoint` to contextual logging	2023-07-10 05:37:05 -07:00
Kubernetes Prow Robot	64939b66c6	Merge pull request #119146 from xuexu6666/xuexu6666/ControllerUtilUseCmpDiff Use cmp diff in controller_util_test.go	2023-07-10 02:41:18 -07:00
xing-yang	cca6601106	Add reason to force detach metric	2023-07-10 06:30:05 +00:00
Aldo Culquicondor	f7a1fb76f4	Only declare job as finished after removing all finalizers Change-Id: Id4b01b0e6fabe24134e57e687356e0fc613cead4	2023-07-07 14:08:19 -04:00
xuexu6666	d7708e79d3	Use cmp diff	2023-07-06 23:01:06 -05:00
Heba Elayoty	2fe38f93e5	feat: Append job creation timestamp to cronjob annotations (#118137) * Append job name to job annotations Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com> * Update annotation description, remove timezone, and fix time Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com> * Remove unused ctx Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com> * code review comments Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com> * code review comments Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com> * Add timezone back Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com> --------- Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com>	2023-07-06 14:39:04 -07:00
Daniel Vega-Myhre	3a02ecb341	check test case param instead of feature flag in unit test code	2023-07-06 17:30:40 +00:00
Kubernetes Prow Robot	6f9d1d38d8	Merge pull request #118817 from pohly/dra-delete-claims DRA: improve handling of completed pods	2023-07-06 10:15:15 -07:00
Kubernetes Prow Robot	7e5506de8d	Merge pull request #119111 from kannon92/remove-equal-ready-job remove equalReady and replace with k8 util function	2023-07-06 09:13:16 -07:00
Ziqi Zhao	dfc1838379	Migrated pkg/controller/volume\|util\|replicaset\|nodeipam to contextual logging Signed-off-by: Ziqi Zhao <zhaoziqi9146@gmail.com>	2023-07-06 07:39:52 +08:00
xin.li	6c0387d004	Migrated pkg/controller/endpoint to contextual logging Signed-off-by: xin.li <xin.li@daocloud.io>	2023-07-06 07:36:51 +08:00
xin.li	3cf2822bc5	Migrated pkg/controller/garbagecollector to contextual logging Signed-off-by: xin.li <xin.li@daocloud.io>	2023-07-06 07:36:51 +08:00
Kubernetes Prow Robot	916c3466b9	Merge pull request #118940 from soltysh/drop_missedschedules Hide numberOfMissedSchedules as an algorithm internal number	2023-07-05 16:27:02 -07:00
kannon92	921b7e6e8f	remove equalReady and replace with k8 util function	2023-07-05 20:11:48 +00:00
Daniel Vega-Myhre	a647f9febb	default enabled pod index for test cases, add test case disabling it	2023-07-05 18:47:45 +00:00
carlory	322da7c1aa	fix gc metrics	2023-07-06 02:06:03 +08:00
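
Commit fec25785ee above describes a retry that runs before the informer has seen a freshly created ResourceClaim, which leads to a second claim being created. The sketch below illustrates that idea only, using a local overlay of created objects that is consulted before the informer cache. All names here (`claimKey`, `controller`, `ensureClaim`, the counter-based name suffix) are hypothetical and are not the client-go MutationCache API or the actual controller code.

```go
package main

import "fmt"

// claimKey identifies a generated claim by owning pod UID and pod claim name.
// Hypothetical type, for illustration only.
type claimKey struct {
	podUID    string
	claimName string
}

// controller is a simplified stand-in for a controller that creates claims.
type controller struct {
	informer map[claimKey]string // what the (possibly stale) informer cache has seen
	created  map[claimKey]string // local overlay of claims this controller created
	nextID   int                 // counts create calls; stands in for a generated suffix
}

func newController() *controller {
	return &controller{
		informer: map[claimKey]string{},
		created:  map[claimKey]string{},
	}
}

// lookup checks the local overlay first, then falls back to the informer cache.
func (c *controller) lookup(k claimKey) (string, bool) {
	if name, ok := c.created[k]; ok {
		return name, true
	}
	name, ok := c.informer[k]
	return name, ok
}

// ensureClaim returns the existing generated claim for k, or creates one and
// records it in the overlay so that a retry finds it even if the informer
// has not caught up yet.
func (c *controller) ensureClaim(k claimKey) string {
	if name, ok := c.lookup(k); ok {
		return name
	}
	c.nextID++
	name := fmt.Sprintf("%s-%d", k.claimName, c.nextID)
	c.created[k] = name
	return name
}

func main() {
	c := newController()
	k := claimKey{podUID: "pod-uid-1", claimName: "gpu"}

	first := c.ensureClaim(k)
	// Assume the pod status update failed here and the pod is retried
	// before the informer cache contains the new claim.
	second := c.ensureClaim(k)

	fmt.Println(first == second, c.nextID)
}
```

Without the overlay, the second call would find nothing in the empty informer map and create another claim, which is the bad sequence the commit message lists.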

6240 Commits