kubernetes

Author	SHA1	Message	Date
Kubernetes Prow Robot	8e2b12a220	Merge pull request #119068 from lauchokyip/podgc-unit-test added podgc orphaned pod unit tests	2023-09-05 03:19:49 -07:00
Patrick Ohly	3c2cfd9a4f	resource claim controller: separate generated suffix from base When the resource claim name inside the pod had some suffix like "1a" in "resource-1a", the generated name suffix got added directly after that, leading to "my-pod-resource-1ax6zgt". Adding another hyphen makes the result more readable: "my-pod-resource-1a-x6zgt".	2023-09-04 09:45:25 +02:00
Kubernetes Prow Robot	1cadbd5887	Merge pull request #120172 from DrAuYueng/fix-log-in-deployment-controller Fix pod deletion log in deployment controller	2023-09-01 11:28:31 -07:00
Albert Sverdlov	a46bab6930	Fix a job quota related deadlock (#119776 ) * Fix a job quota related deadlock In case ResourceQuota is used and sets a max # of jobs, a CronJob may get trapped in a deadlock: 1. Job quota for a namespace is reached. 2. CronJob controller can't create a new job, because quota is reached. 3. Cleanup of jobs owned by a cronjob doesn't happen, because a control loop iteration is finished because of an error to create a job. To fix this we stop early quitting from a control loop iteration when cronjob reconciliation failed and always let old jobs to be cleaned up. * Dont reorder imports * Don't stop requeuing on reconciliation error Previous code only logged the reconciliation error inside jm.sync() and didn't return the reconciliation error to it's invoker processNextWorkItem(). Adding a copy-paste back to avoid this issue. * Remove copy-pasted cleanupFinishedJobs() Now we always call jm.cleanupFinishedJobs() first and then jm.syncCronJob(). We also extract cronJobCopy and updateStatus outside jm.syncCronJob function and pass pointers to them in both jm.syncCronJob and jm.cleanupFinishedJobs to make delayed updates handling more explicit and not dependent on the order in which cleanupFinishedJobs and syncCronJob are invoked. * Return updateStatus bool instead of changing the reference * Explicitly ignore err in tests to fix linter	2023-08-31 08:25:00 -07:00
Sharpz7	e9be1d7438	Test now has coverage!	2023-08-27 05:06:53 +00:00
DrAuYueng	a4ce32769f	fix pod delete log in deployment controller Signed-off-by: DrAuYueng <ouyang1204@gmail.com>	2023-08-25 22:20:51 +08:00
Adam McArthur	0bc0256093	Update job_controller_test.go	2023-08-25 08:15:53 -06:00
Sharpz7	22f4b1c56a	Static check fix	2023-08-25 11:35:05 +00:00
Sharpz7	70e2deb32f	Fixing lint problem	2023-08-25 11:08:59 +00:00
Sharpz7	6ded53ce4d	Added back test changes	2023-08-25 10:35:58 +00:00
Sharpz7	5fb049ff47	Added create job & cleanup	2023-08-25 10:35:58 +00:00
Sharpz7	ff1659cb79	Added syncjob	2023-08-25 10:35:58 +00:00
Sharpz7	f87cc43cdb	Review Changes	2023-08-25 10:35:58 +00:00
Sharpz7	d08fc3a4d0	Another one creeped in	2023-08-25 10:35:58 +00:00
Sharpz7	ef6a0eb6d8	Final Lint Fix	2023-08-25 10:35:58 +00:00
Sharpz7	aa9f38c36d	More Lint Fixes	2023-08-25 10:35:58 +00:00
Sharpz7	601679446a	Lint fixes	2023-08-25 10:35:58 +00:00
Sharpz7	cf32ae9453	Initial Commit	2023-08-25 10:35:58 +00:00
Sharpz7	297f04b74a	Added function to remove finalizers as backup	2023-08-25 10:35:57 +00:00
Kubernetes Prow Robot	f852d7fead	Merge pull request #118653 from pohly/volume-resource-requirements Volume resource requirements	2023-08-21 14:08:05 -07:00
Kubernetes Prow Robot	6cbc5dfac6	Merge pull request #114095 from aimuz/fix-114083 scheduler: Fix field apiVersion is missing from events reported from taint manager	2023-08-21 07:03:23 -07:00
Patrick Ohly	2472291790	api: introduce separate VolumeResourceRequirements struct PVC and containers shared the same ResourceRequirements struct to define their API. When resource claims were added, that struct got extended, which accidentally also changed the PVC API. To avoid such a mistake from happening again, PVC now uses its own VolumeResourceRequirements struct. The `Claims` field gets removed because risk of breaking someone is low: theoretically, YAML files which have a claims field for volumes now get rejected when validating against the OpenAPI. Such files have never made sense and should be fixed. Code that uses the struct definitions needs to be updated.	2023-08-21 15:31:28 +02:00
lowang-bh	2f576969ee	reducae function calling to once Signed-off-by: lowang-bh <lhui_wang@163.com>	2023-08-20 19:36:50 +08:00
qingwave	7f805dc935	fix test name	2023-08-19 03:31:34 +00:00
Kubernetes Prow Robot	df493712e4	Merge pull request #119874 from kannon92/pod-replacement-policy-typos fix typos for pod replacement policy	2023-08-17 11:21:34 -07:00
qingwave	f07a3a3f26	calculate sidecar container resource in pod autoscaler Signed-off-by: qingwave <isguory@gmail.com>	2023-08-17 07:47:59 +00:00
git-jxj	a5b3a4b738	cleanup: Update deprecated FromInt to FromInt32 (#119858 ) * redo commit * apply suggestions from liggitt * update Parse function based on suggestions	2023-08-16 09:33:01 -07:00
Kubernetes Prow Robot	d5f2420309	Merge pull request #119914 from luohaha3123/job-feature Job: Change job controller methods receiver to pointer	2023-08-15 23:14:05 -07:00
Kubernetes Prow Robot	fa1fc7a9cb	Merge pull request #119904 from tenzen-y/replace-deprecated-workqueue-lib Job: Replace deprecated workqueue function with supported one	2023-08-15 23:13:52 -07:00
Kubernetes Prow Robot	5638fe5f33	Merge pull request #119214 from kaisoz/refactor-controller-utils-test Rewrite the tests to be table driven	2023-08-15 15:17:55 -07:00
Kubernetes Prow Robot	7407f36b4b	Merge pull request #117992 from liggitt/gc-discovery-flutter Fix duplicate GC event handlers getting added if discovery flutters	2023-08-15 15:16:50 -07:00
lhaha	947c9376f6	change struct methods receiver to pointer	2023-08-12 10:21:14 +08:00
Yuki Iwai	6f27733af8	Job: Replace deprecated workqueue library with supported one Signed-off-by: Yuki Iwai <yuki.iwai.tz@gmail.com>	2023-08-11 20:35:36 +09:00
kannon92	f73c253acc	fix typos for pod replacement policy	2023-08-09 20:34:48 +00:00
Tomas Tormo	074d5b5329	Rewrite the tests to be table driven	2023-08-03 08:39:46 +00:00
Kubernetes Prow Robot	18f8cb8398	Merge pull request #118644 from alexzielenski/apiserver/policy/namespaceParamRef KEP-3488: Promote ValidatingAdmissionPolicy to Beta	2023-07-21 17:44:08 -07:00
Alexander Zielenski	ef8670c946	refactor: replace usage of v1alpha1 with v1beta1 v1alpha -> v1beta fill in DenyAction where there is no ParameterNotFoundAction	2023-07-21 13:41:24 -07:00
Kubernetes Prow Robot	a30f6b7922	Merge pull request #119506 from mimowo/fix-job-controller-flaky-test Fix the flaky TestJobApiBackoffReset test	2023-07-21 09:30:07 -07:00
Michal Wozniak	dbea279112	Fix the flaky TestJobApiBackoffReset test	2023-07-21 14:45:04 +02:00
jackcui	9d8959224c	add explanation for large-cluster-size-threshold arg about multiple zones cluster	2023-07-21 17:25:51 +08:00
kannon92	74fcf3e766	implementation of PodReplacementPolicy kep in the job controller	2023-07-21 00:44:53 +00:00
Michal Wozniak	35d0af9243	Include ignored pods when computing backoff delay for Job pod failures	2023-07-19 17:39:58 +02:00
Kubernetes Prow Robot	88c8bcbb4a	Merge pull request #115952 from pacoxu/cleanup-cronjob cronjob: return immediately when failed to create job for the namespace is terminating	2023-07-18 21:28:02 -07:00
Chok Yip Lau	dbaa9fe6b4	added podgc orphaned pod unit tests	2023-07-18 23:06:19 -04:00
Kubernetes Prow Robot	d1d86dafb7	Merge pull request #118772 from kannon92/terminating-pod-gc KEP-3939: pod gc changes for pod replacement policy kep	2023-07-18 16:46:03 -07:00
Michał Woźniak	a15c27661e	Job controller implementation of backoff limit per index (#118009 )	2023-07-18 13:44:11 -07:00
carlory	ad0b671b56	getPersistentVolume remove reduant deep copy	2023-07-18 18:55:42 +08:00
Daniel Vega-Myhre	7698fe7639	Add StatefulSet pod index as pod label (#119232 ) * add statefulset pod index as pod label * change statefulset pod index label name * check 3 pods * change label variable name	2023-07-17 12:47:10 -07:00
kannon92	e38ab6d367	Add PodGC changes for PodReplacementPolicy	2023-07-16 23:47:04 +00:00
Kubernetes Prow Robot	84a999923f	Merge pull request #119335 from mimowo/use-final-diff-for-job-pod-creation Ensure final diff is used for setting expectations for Job pod creation	2023-07-14 15:20:54 -07:00
Kubernetes Prow Robot	6f3856f953	Merge pull request #118883 from danielvegamyhre/kep-4017-job Add completion index as pod label for indexed jobs	2023-07-14 12:23:50 -07:00
Michal Wozniak	9564bdc39d	Ensure final diff is used for setting expectations for Job pod creation	2023-07-14 19:09:39 +02:00
Kubernetes Prow Robot	5c72df7281	Merge pull request #118953 from mskrocki/escLib Convert EndpointSlice Reconciler to a library in staging.	2023-07-13 17:13:34 -07:00
Kubernetes Prow Robot	be2cfc9697	Merge pull request #118228 from carlory/move-non-graceful-node-shutdown-to-GA move non-graceful node shutdown to GA	2023-07-13 15:47:37 -07:00
Daniel Vega-Myhre	037091284e	fix unit test bug	2023-07-13 22:38:21 +00:00
Kubernetes Prow Robot	bea27f82d3	Merge pull request #118209 from pohly/dra-pre-scheduled-pods dra: pre-scheduled pods	2023-07-13 14:43:37 -07:00
Daniel Vega-Myhre	a1a5f49bb9	remove statefulset label added to wrong branch	2023-07-13 21:07:17 +00:00
Daniel Vega-Myhre	1ae60c0ed1	use job completion index annotation as label	2023-07-13 21:04:37 +00:00
Jiahui Feng	049614f884	ValidatingAdmissionPolicy controller for Type Checking (#117377 ) * [API REVIEW] ValidatingAdmissionPolicyStatucController config. worker count. * ValidatingAdmissionPolicyStatus controller. * remove CEL typechecking from API server. * fix initializer tests. * remove type checking integration tests from API server integration tests. * validatingadmissionpolicy-status options. * grant access to VAP controller. * add defaulting unit test. * generated: ./hack/update-codegen.sh * add OWNERS for VAP status controller. * type checking test case.	2023-07-13 13:41:50 -07:00
Patrick Ohly	80ab8f0542	dra: handle scheduled pods in kube-controller-manager When someone decides that a Pod should definitely run on a specific node, they can create the Pod with spec.nodeName already set. Some custom scheduler might do that. Then kubelet starts to check the pod and (if DRA is enabled) will refuse to run it, either because the claims are still waiting for the first consumer or the pod wasn't added to reservedFor. Both are things the scheduler normally does. Also, if a pod got scheduled while the DRA feature was off in the kube-scheduler, a pod can reach the same state. The resource claim controller can handle these two cases by taking over for the kube-scheduler when nodeName is set. Triggering an allocation is simpler than in the scheduler because all it takes is creating the right PodSchedulingContext with spec.selectedNode set. There's no need to list nodes because that choice was already made, permanently. Adding the pod to reservedFor also isn't hard. What's currently missing is triggering de-allocation of claims to re-allocate them for the desired node. This is not important for claims that get created for the pod from a template and then only get used once, but it might be worthwhile to add de-allocation in the future.	2023-07-13 21:27:11 +02:00
Patrick Ohly	cffbb1f1b2	dra controller: enhance testing The allocation mode is relevant when clearing the reservedFor: for delayed allocation, deallocation gets requested, for immediate allocation not. Both should get tested. All pre-defined claims now use delayed allocation, just as they would if created normally.	2023-07-13 21:27:11 +02:00
Patrick Ohly	5cec6d798c	dra: revamp event handlers in kube-controller-manager Enabling logging is useful to track what the code is doing. There are some functional changes: - The pod handler checks for existence of claims. This avoids adding pods to the work queue in more cases when nothing needs to be done, at the cost of making the event handlers a bit slower. This will become more important when adding more work to the controller - The handler for deleted ResourceClaim did not check for cache.DeletedFinalStateUnknown.	2023-07-13 21:27:11 +02:00
Kubernetes Prow Robot	4fa97eae1b	Merge pull request #119291 from mimowo/use-jobctx-for-first-pending Pass Job context down to firstPendingIndexes	2023-07-13 09:57:05 -07:00
Michal Wozniak	7e3b53042b	Pass Job context down to firstPendingIndexes	2023-07-13 16:11:06 +02:00
Kubernetes Prow Robot	cd9215915b	Merge pull request #118480 from carlory/gc_metrics podgc metrics should count all pod deletion behaviors	2023-07-13 06:52:05 -07:00
Jordan Liggitt	12a874d227	Preserve resourcequota informers for groups with discovery resolution errors only	2023-07-12 12:29:33 -04:00
Jordan Liggitt	c9a084d59c	Fix duplicate GC event handlers getting added if discovery flutters	2023-07-12 12:29:31 -04:00
Patrick Ohly	98ba89d31d	resourceclaim controller: avoid caching deleted pod unnecessarily We don't need to remember that a pod got deleted when it had no resource claims because the code which checks the cached UIDs only checks for pods which have resource claims.	2023-07-12 16:57:17 +02:00
Patrick Ohly	7d064812bb	kube-controller-manager: finish conversion to contextual logging This removes all exceptions and fixes the remaining unconverted log calls.	2023-07-12 14:57:29 +02:00
Patrick Ohly	1b8ddf6b79	podgc controller: convert to contextual logging	2023-07-12 13:45:10 +02:00
Mengjiao Liu	19869478c1	Migrate /pkg/controller/disruption to structured and contextual logging	2023-07-12 11:30:45 +08:00
Maciej Skrocki	7c873327b6	Convert controller name to reconciler variable.	2023-07-11 18:08:25 +00:00
Maciej Skrocki	29fad383da	move endpointslice reconciler to staging endpointslice repo	2023-07-11 18:08:12 +00:00
Kubernetes Prow Robot	a6890b361d	Merge pull request #119193 from mimowo/sync-job-context Introduce syncJobContext to limit the number of function parameters	2023-07-11 10:33:30 -07:00
Kubernetes Prow Robot	e0dafe57a3	Merge pull request #117351 from pohly/dra-generated-resource-claim-names DRA: generated resource claim names	2023-07-11 10:33:11 -07:00
Patrick Ohly	fec25785ee	dra: store generated ResourceClaims in cache This addresses the following bad sequence of events: - controller creates ResourceClaim - updating pod status fails - pod gets retried before the informer receives the created ResourceClaim - another ResourceClaim gets created Storing the generated ResourceClaim in a MutationCache ensures that the controller knows about it during the retry. A positive side effect is that ResourceClaims now get index by pod owner and thus iterating over existing ones becomes a bit more efficient.	2023-07-11 14:23:49 +02:00
Patrick Ohly	444d23bd2f	dra: generated name for ResourceClaim from template Generating the name avoids all potential name collisions. It's not clear how much of a problem that was because users can avoid them and the deterministic names for generic ephemeral volumes have not led to reports from users. But using generated names is not too hard either. What makes it relatively easy is that the new pod.status.resourceClaimStatus map stores the generated name for kubelet and node authorizer, i.e. the information in the pod is sufficient to determine the name of the ResourceClaim. The resource claim controller becomes a bit more complex and now needs permission to modify the pod status. The new failure scenario of "ResourceClaim created, updating pod status fails" is handled with the help of a new special "resource.kubernetes.io/pod-claim-name" annotation that together with the owner reference identifies exactly for what a ResourceClaim was generated, so updating the pod status can be retried for existing ResourceClaims. The transition from deterministic names is handled with a special case for that recovery code path: a ResourceClaim with no annotation and a name that follows the Kubernetes <= 1.27 naming pattern is assumed to be generated for that pod claim and gets added to the pod status. There's no immediate need for it, but just in case that it may become relevant, the name of the generated ResourceClaim may also be left unset to record that no claim was needed. Components processing such a pod can skip whatever they normally would do for the claim. To ensure that they do and also cover other cases properly ("no known field is set", "must check ownership"), resourceclaim.Name gets extended.	2023-07-11 14:23:48 +02:00
Kubernetes Prow Robot	8f1852bb44	Merge pull request #115295 from Namanl2001/pkg/controller/endpointslice Migrated `pkg/controller/endpointslice` and `pkg/controller/endpointslicemirroring` to contextual logging	2023-07-11 03:19:12 -07:00
Michal Wozniak	bf48165232	Remarks to syncJobCtx	2023-07-11 09:44:08 +02:00
Michal Wozniak	990339d4c3	Introduce syncJobContext to limit the number of function parameters	2023-07-11 09:27:21 +02:00
carlory	f443c458af	move non-graceful node shutdown to GA	2023-07-11 13:51:51 +08:00
Kubernetes Prow Robot	986171d388	Merge pull request #119185 from xing-yang/metrics_attach Add reason to force detach metric	2023-07-10 14:03:18 -07:00
Naman	645cb90732	migrated pkg/controller/endpointslicemirroring to contextual logging Signed-off-by: Naman <namanlakhwani@gmail.com>	2023-07-11 01:43:30 +05:30
Daniel Vega-Myhre	98c6e25c37	update name of pod index label	2023-07-10 20:11:52 +00:00
Naman	09849b09cf	migrated pkg/controller/endpointslice to contextual logging Signed-off-by: Naman <namanlakhwani@gmail.com>	2023-07-11 01:28:22 +05:30
Kubernetes Prow Robot	10a12165de	Merge pull request #116755 from my-git9/feat/endpoint/logging Migrated `pkg/controller/endpoint` to contextual logging	2023-07-10 05:37:05 -07:00
Kubernetes Prow Robot	64939b66c6	Merge pull request #119146 from xuexu6666/xuexu6666/ControllerUtilUseCmpDiff Use cmp diff in controller_util_test.go	2023-07-10 02:41:18 -07:00
xing-yang	cca6601106	Add reason to force detach metric	2023-07-10 06:30:05 +00:00
Aldo Culquicondor	f7a1fb76f4	Only declare job as finished after removing all finalizers Change-Id: Id4b01b0e6fabe24134e57e687356e0fc613cead4	2023-07-07 14:08:19 -04:00
xuexu6666	d7708e79d3	Use cmp diff	2023-07-06 23:01:06 -05:00
Heba Elayoty	2fe38f93e5	feat: Append job creation timestamp to cronjob annotations (#118137 ) * Append job name to job annotations Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com> * Update annotation description, remove timezone, and fix time Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com> * Remove unused ctx Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com> * code review comments Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com> * code review comments Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com> * Add timezone back Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com> --------- Signed-off-by: Heba Elayoty <hebaelayoty@gmail.com>	2023-07-06 14:39:04 -07:00
Daniel Vega-Myhre	3a02ecb341	check test case param instead of feature flag in unit test code	2023-07-06 17:30:40 +00:00
Kubernetes Prow Robot	6f9d1d38d8	Merge pull request #118817 from pohly/dra-delete-claims DRA: improve handling of completed pods	2023-07-06 10:15:15 -07:00
Kubernetes Prow Robot	7e5506de8d	Merge pull request #119111 from kannon92/remove-equal-ready-job remove equalReady and replace with k8 util function	2023-07-06 09:13:16 -07:00
Ziqi Zhao	dfc1838379	Migrated pkg/controller/volume\|util\|replicaset\|nodeipam to contextual logging Signed-off-by: Ziqi Zhao <zhaoziqi9146@gmail.com>	2023-07-06 07:39:52 +08:00
xin.li	6c0387d004	Migrated pkg/controller/endpoint to contextual logging Signed-off-by: xin.li <xin.li@daocloud.io>	2023-07-06 07:36:51 +08:00
xin.li	3cf2822bc5	Migrated pkg/controller/garbagecollector to contextual logging Signed-off-by: xin.li <xin.li@daocloud.io>	2023-07-06 07:36:51 +08:00
Kubernetes Prow Robot	916c3466b9	Merge pull request #118940 from soltysh/drop_missedschedules Hide numberOfMissedSchedules as an algorithm internal number	2023-07-05 16:27:02 -07:00
kannon92	921b7e6e8f	remove equalReady and replace with k8 util function	2023-07-05 20:11:48 +00:00
Daniel Vega-Myhre	a647f9febb	default enabled pod index for test cases, add test case disabling it	2023-07-05 18:47:45 +00:00
carlory	322da7c1aa	fix gc metrics	2023-07-06 02:06:03 +08:00
Kubernetes Prow Robot	91698fe900	Merge pull request #114061 from Octopusjust/k8s-pr15 testutil: use contextual logging	2023-07-05 08:38:57 -07:00
Maciej Szulik	1240a29af9	Hide numberOfMissedSchedules as an algorithm internal number	2023-07-05 16:58:28 +02:00
Patrick Ohly	a514f40131	dra resourceclaim controller: delete generated claims when pod is done When a pod is done, but not getting removed yet for while, then a claim that got generated for that pod can be deleted already. This then also triggers deallocation.	2023-07-05 16:10:20 +02:00
Patrick Ohly	e8a0c42212	dra resourceclaim controller: remove reservation for completed pods When a pod is known to never run (again), the reservation for it also can be removed. This is relevant in particular for the job controller.	2023-07-05 16:10:20 +02:00
Patrick Ohly	7f5a02fc7e	dra resourceclaim controller: enhance logging Adding logging to event handlers makes it more obvious why (or why not) claims and pods need to be processed.	2023-07-05 16:10:20 +02:00
Patrick Ohly	d1ba893ad8	dra resourceclaim controller: refactor isPodDone This covers pods that get deleted before running and will be used more than once soon.	2023-07-05 16:09:41 +02:00
Kubernetes Prow Robot	229dd79efd	Merge pull request #117865 from aleksandra-malinowska/parallel-sts-3 Parallel StatefulSet pod create & delete	2023-07-03 10:16:51 -07:00
Kubernetes Prow Robot	0a82bdbfdb	Merge pull request #118173 from huiwq1990/feat-autoscale-variable hpa: cleanup `currentReplicas` code	2023-07-02 23:00:50 -07:00
Kubernetes Prow Robot	ec87834bae	Merge pull request #118936 from pohly/dra-deallocate-when-unused DRA: for delayed allocation, deallocate when no longer used	2023-07-01 12:56:48 -07:00
Kubernetes Prow Robot	52b1247b28	Merge pull request #118232 from luckymrwang/style style: correct the sentence	2023-06-30 01:51:59 -07:00
Kubernetes Prow Robot	9af93df9b0	Merge pull request #117845 from ctripcloud/fix-hpa-plain-calc fix HPA plain metric calculate	2023-06-30 01:51:47 -07:00
Kubernetes Prow Robot	68b9ccc511	Merge pull request #117554 from yanggangtony/clean-endpoint-controller clean endpoint controller typo logs	2023-06-29 16:23:44 -07:00
Daniel Vega-Myhre	e0af0a5a45	add test case param for feature flag	2023-06-29 21:51:15 +00:00
Patrick Ohly	1b47e6433b	dra delayed allocation: deallocate when a pod is done This releases the underlying resource sooner and ensures that another consumer can get scheduled without being influenced by a decision that was made for the previous consumer. An alternative would have been to have the apiserver trigger the deallocation whenever it sees the `status.reservedFor` getting reduced to zero. But that then also triggers deallocation when kube-scheduler removes the last reservation after a failed scheduling cycle. In that case we want to keep the claim allocated and let the kube-scheduler decide on a case-by-case basis which claim should get deallocated.	2023-06-29 09:47:30 +02:00
Aleksandra Malinowska	d616cf72a3	Add unit tests for parallel StatefulSet create & delete	2023-06-28 16:55:38 +02:00
Kubernetes Prow Robot	74bd77a9df	Merge pull request #118917 from kmala/daemonsetfix increase the log level for the GetTargetNodeName error message	2023-06-28 00:08:32 -07:00
Kubernetes Prow Robot	960830bc66	Merge pull request #118102 from RomanBednar/retro-sc-assignment-ga graduate RetroactiveDefaultStorageClass feature to GA in 1.28	2023-06-27 20:46:32 -07:00
yanggang	860aab842d	fix a reference to the wrong variable name Signed-off-by: yanggang <gang.yang@daocloud.io>	2023-06-28 06:12:44 +08:00
Daniel Vega-Myhre	a9afaa1eee	add feature gate	2023-06-27 18:07:17 +00:00
Aldo Culquicondor	a4519665fe	Skip terminal Pods with a deletion timestamp from the Daemonset sync (#118716 ) * Skip terminal Pods with a deletion timestamp from the Daemonset sync Change-Id: I64a347a87c02ee2bd48be10e6fff380c8c81f742 * Review comments and fix integration test Change-Id: I3eb5ec62bce8b4b150726a1e9b2b517c4e993713 * Include deleted terminal pods in history Change-Id: I8b921157e6be1c809dd59f8035ec259ea4d96301	2023-06-27 08:56:33 -07:00
Maciej Szulik	af1c9e49c4	Update schedule logic to properly calculate missed schedules Before this change we've assumed a constant time between schedule runs, which is not true for cases like "30 6-16/4 * * 1-5". The fix is to calculate the potential next run using the fixed schedule as the baseline, and then go back one schedule back and allow the cron library to calculate the correct time. This approach saves us from iterating multiple times between last schedule time and now, if the cronjob for any reason wasn't running for significant amount of time.	2023-06-27 11:29:30 +02:00
Keerthan Reddy Mala	0033f65808	increase the log level for the GetTargetNodeName error message	2023-06-26 17:31:50 -07:00
Daniel Vega-Myhre	2176053415	add completion index as pod label	2023-06-26 19:53:14 +00:00
Kubernetes Prow Robot	a48bcc06fe	Merge pull request #117064 from RomanBednar/multiple-defaults storage class assignment should not ignore errors	2023-06-23 03:09:23 -07:00
Kubernetes Prow Robot	9325a57125	Merge pull request #118765 from mimowo/job-controller-unit-tests Do not set jm.syncJobBatchPeriod=0 if not needed	2023-06-22 10:25:41 -07:00
Michal Wozniak	8ed23558b4	Do not set jm.syncJobBatchPeriod=0 if not needed	2023-06-22 11:10:53 +02:00
Kubernetes Prow Robot	28296ba59e	Merge pull request #113994 from mengjiao-liu/contextual-logging-controller-certificates certificate controller: use contextual logging	2023-06-21 09:03:42 -07:00
Roman Bednar	1bd3f072fa	stop ignoring storage class selection errors The GetDefaultClass() was fixed in scope of this issue: https://github.com/kubernetes/kubernetes/issues/110514 Before this change assignDefaultStorageClass() was ignoring errors from this function since it could mean there are multiple defaults - assign could safely continue and do nothing. This is no longer true because we always choose one from multiple defaults - any errors returned from GetDefaultClass() are real errors and should not be ignored.	2023-06-21 13:09:05 +02:00
Aleksandra Malinowska	63e5b6bdb2	Parallel StatefulSet pod create & delete	2023-06-20 12:15:21 +02:00
Aleksandra Malinowska	2fb472c83c	Refactor StatefulSet controller update logic	2023-06-20 12:14:25 +02:00
Michal Wozniak	784a309b91	Do not error in Job controller sync when there are pod failures	2023-06-20 11:31:24 +02:00
wangxiang	12b370cbbb	bugfix(namespace-controller): Correctly cascade ctx when making API calls	2023-06-20 15:34:23 +08:00
Aldo Culquicondor	c9b01dd977	Fix race in logging expectations Change-Id: I27da7cae741935da6f0815639a54bfd597a2a6c6	2023-06-19 13:26:39 -04:00
ialidzhikov	958c8fb695	Make use of `k8s.io/utils/pointer.Duration` Signed-off-by: ialidzhikov <i.alidjikov@gmail.com>	2023-06-18 21:46:26 +03:00
Michal Wozniak	74c5ff97f1	Lower the constants for the rate limiter in Job controller	2023-06-16 17:00:04 +02:00
Michal Wozniak	c51a422d78	Cleanup job controller handling of backoff	2023-06-16 14:53:27 +02:00
aimuz	396c8a6783	test: TestPodDeletionEvent Signed-off-by: aimuz <mr.imuz@gmail.com>	2023-06-16 10:31:33 +08:00
aimuz	975b2c6611	scheduler: Fix field apiVersion is missing from events reported from taint manager Signed-off-by: aimuz <mr.imuz@gmail.com>	2023-06-16 09:51:25 +08:00
Kubernetes Prow Robot	58d7a794d2	Merge pull request #113504 from pacoxu/taint-unit-test AddOrUpdateTaintOnNode: if node does not exists, return an error	2023-06-15 17:48:19 -07:00
Kubernetes Prow Robot	47e79b8156	Merge pull request #116910 from fatsheep9146/job-controller-contextual-logging Migrated pkg/controller/job to contextual logging	2023-06-14 08:00:18 -07:00
Ziqi Zhao	7bc449d7e0	add contextual logging to job-controller Signed-off-by: Ziqi Zhao <zhaoziqi9146@gmail.com>	2023-06-14 13:40:02 +08:00
carlory	5e048041e4	remove helper function for unused storage feature in pkg/proxy/util	2023-06-13 09:22:59 +08:00
Kubernetes Prow Robot	8a0b1bbf24	Merge pull request #118095 from carlory/patch-002 remove unused pv informer from expand_controller	2023-06-12 11:43:58 -07:00
xin.li	325205efb7	Migrated pkg/controller/endpoint to contextual logging Signed-off-by: xin.li <xin.li@daocloud.io>	2023-06-11 20:12:07 +08:00
Kubernetes Prow Robot	a5332a839d	Merge pull request #118470 from mimowo/job-controller-fix-delay Ensure Job sync invocations are batched by 1s periods	2023-06-07 12:02:13 -07:00
Michal Wozniak	2f6b1d3c0f	Ensure Job sync invocations are batched by 1s periods	2023-06-07 17:32:46 +02:00
Kubernetes Prow Robot	3fbf67a403	Merge pull request #116732 from czybjtu/cleanup_map_to_sets Cleanup: use Set instead of map in endpointSlice utils	2023-06-07 07:38:13 -07:00
Roman Bednar	ac15d69757	remove RetroactiveDefaultStorageClass feature gate checks	2023-06-07 14:31:16 +02:00
Roman Bednar	6afb363ca1	test: remove RetroactiveDefaultStorageClass feature gate Since the feature is GA and locked to true, tests can no longer set it to false. Cleaning up by removing all references to this feature gate from tests. Feature gate will be removed in v1.29.	2023-06-07 14:31:16 +02:00
Michal Wozniak	71ab7dc791	Remarks	2023-06-05 10:48:32 +02:00
Michal Wozniak	70d3bb43e5	Adjust the algorithm for computing the pod finish time Change-Id: Ic282a57169cab8dc498574f08b081914218a1039	2023-06-05 10:06:56 +02:00
SataQiu	01861282b1	controller: fix the help information format of sorting_deletion_age_ratio metric	2023-06-01 23:28:42 +08:00
ZhangYu	f9ae3e774d	testutil: use contextual logging	2023-05-30 10:07:04 +08:00
Kubernetes Prow Robot	c35a2775b2	Merge pull request #115554 from yt2985/cleanSA LegacyServiceAccountTokenCleanUp alpha	2023-05-26 08:54:53 -07:00
Kubernetes Prow Robot	1c9f08a1c5	Merge pull request #118262 from torredil/pv-controller Improve Clarity of Provisioner Waiting Message	2023-05-26 07:57:04 -07:00
Kubernetes Prow Robot	a874f587b4	Merge pull request #118071 from kerthcet/cleanup/use-contextual-logging-in-statefulset Chore: Apply to use contextual logging for all loggers in statefulSet	2023-05-26 07:56:53 -07:00
torredil	5d18f47149	Improve clarity and guidance in provisioner waiting message Signed-off-by: torredil <torredil@amazon.com>	2023-05-25 17:26:06 +00:00
tinatingyu	133eff3df4	implement LegacyServiceAccountTokenCleanUp alpha	2023-05-24 23:20:17 +00:00
Cookie Wang	e2ed8f8c3c	style: correct the sentence	2023-05-24 20:24:36 +08:00
guangli.bao	931cc96b8d	remote windows condition when #116693 is merged Signed-off-by: guangli.bao <guangli.bao@daocloud.io>	2023-05-24 14:54:22 +08:00
Kubernetes Prow Robot	484645e817	Merge pull request #116659 from claudiubelu/skip-flaky-tests-2 unit tests: Skip flaky tests on Windows (part 2)	2023-05-23 20:04:48 -07:00
huiwq1990	048938f1d8	hpa: cleanup `currentReplicas` code Signed-off-by: huiwq1990 <huiwq1990@163.com>	2023-05-22 20:37:49 +08:00
Michal Wozniak	0fe27a06f9	Cleanup the Job controller handling of terminating pods	2023-05-19 09:52:08 +02:00
kerthcet	3f1d8e7aeb	Apply to use contextual logging for all loggers in statefulSet Signed-off-by: kerthcet <kerthcet@gmail.com>	2023-05-18 16:55:19 +08:00
carlory	bea71a3e4e	remove unused pv informer from expand_controller	2023-05-18 15:58:38 +08:00
Joe Betz	f0f92853ad	Add api-machinery TL owners permissions for jpbetz	2023-05-15 11:09:54 -04:00
Kubernetes Prow Robot	8479db5876	Merge pull request #117946 from lavalamp/lavalamp-taking-a-break lavalamp is taking a long break	2023-05-12 14:34:47 -07:00
Kubernetes Prow Robot	02659772cb	Merge pull request #117931 from tklauser/cidrset-rm-getindexforcidr cidrset, multicidrset: directly use getIndexForIP method	2023-05-11 10:59:36 -07:00
Daniel Smith	1ffe3f467e	lavalamp is taking a long break	2023-05-11 16:43:38 +00:00
SataQiu	3fa55d469c	fix a bug where the AttachedVolume is not printed correctly in the log	2023-05-11 22:04:30 +08:00
Tobias Klauser	e7c679c362	cidrset, multicidrset: directly use getIndexForIP method Instead of constructing a net.IPnet whose .Mask field is never used, call the getIndexForIP method of CidrSet and *MultiCIDRSet directly.	2023-05-11 10:53:03 +02:00
zach593	e3d5feb76c	fix HPA plain metric calculate fix inconsistency in plain metric replica calculator when usageRatio == 1.0, keep same with resource replicas	2023-05-10 11:14:29 +08:00
Yuki Iwai	e4340f0d9b	Job: Use generic Set in controller Signed-off-by: Yuki Iwai <yuki.iwai.tz@gmail.com>	2023-05-08 15:02:23 +09:00
Kante Yin	a7035f5459	Pass Context to StartTestServer Signed-off-by: Kante Yin <kerthcet@gmail.com>	2023-05-04 10:25:09 +08:00
Akhil Velagapudi	69eccd41b8	endpointslice: reduce visibility of vars/funcs Reduce the visibility of various public identifiers that are only used within the scope of a package. This was originally motivated by KEP-3685 in order to reduce the public API surface and improve supportability.	2023-05-02 15:58:07 -07:00
Kubernetes Prow Robot	42799903af	Merge pull request #117626 from skitt/intstr-fromint32-cloud-network Network: use new intstr functions	2023-05-02 08:18:59 -07:00
Stephen Kitt	1c4b97ea27	network: replace intstr.FromInt with intstr.FromInt32 This touches cases where FromInt() is used on numeric constants, or values which are already int32s, or int variables which are defined close by and can be changed to int32s with little impact. Signed-off-by: Stephen Kitt <skitt@redhat.com>	2023-05-01 09:17:30 +02:00
Stephen Kitt	5bfe738af2	apps: replace intstr.FromInt with intstr.FromInt32 This touches cases where FromInt() is used on numeric constants, or values which are already int32s, or int variables which are defined close by and can be changed to int32s with little impact. Signed-off-by: Stephen Kitt <skitt@redhat.com>	2023-05-01 09:17:05 +02:00
Sheng Zhan	892ebf2d25	Ensure version "" is passed instead of "" for all authz checks (#116937 ) ensure version * is passed instead of for all authz checks * unexport match function * remove allversion constant	2023-04-25 09:06:18 -07:00
Kubernetes Prow Robot	29fe2c70b1	Merge pull request #117252 from alculquicondor/node-lifecycle-owner Add SIG ownership to controller/nodelifecycle	2023-04-17 14:10:57 -07:00
Hao Ruan	c4e1b01416	replace spew methods with dump methods	2023-04-14 08:05:53 +08:00
Claudiu Belu	0979d55443	unit tests: Skip flaky tests on Windows (part 2) Some of the unit tests are currently flaky on Windows. This commit skips them until they are resolved.	2023-04-13 12:07:18 +00:00
Kubernetes Prow Robot	c3e3ff989a	Merge pull request #116896 from thockin/apimachinery_util_diff_cleanup apimachinery util/diff cleanups	2023-04-13 02:26:37 -07:00
Kubernetes Prow Robot	5550bd5eac	Merge pull request #117249 from tnqn/fix-data-race Fix a data race in TopologyCache	2023-04-12 16:28:30 -07:00
Kubernetes Prow Robot	9082903f94	Merge pull request #117245 from tnqn/fix-topology-hint Fix TopologyAwareHint not working when zone label is added after Node creation	2023-04-12 13:24:44 -07:00
Aldo Culquicondor	b23ab389b4	Add SIG ownership to controller/nodelifecycle Change-Id: I31a329d9ca08bdf12a428cae44a5f061afa01e73	2023-04-12 15:42:06 -04:00
Quan Tian	d567d61aa5	Fix a data race in TopologyCache The member variable `cpuRatiosByZone` should be accessed with the lock acquired as it could be be updated by `SetNodes` concurrently. Signed-off-by: Quan Tian <qtian@vmware.com> Co-authored-by: Antonio Ojea <aojea@google.com>	2023-04-13 02:03:16 +08:00
Quan Tian	a62563f130	Fix TopologyAwareHint not working when zone label is added after Node creation The topology.kubernetes.io/zone label may be added by could provider asynchronously after the Node is created. The previous code didn't update the topology cache after receiving the Node update event, causing TopologyAwareHint to not work until kube-controller-manager restarts or other Node events trigger the update. Signed-off-by: Quan Tian <qtian@vmware.com>	2023-04-13 01:47:55 +08:00
Tim Hockin	bc302fa414	Replace uses of ObjectReflectDiff with cmp.Diff ObjectReflectDiff is already a shim over cmp.Diff, so no actual output or behavior changes	2023-04-12 08:48:03 -07:00
Tim Hockin	9627c50ef3	Replace uses of ObjectGoPrintDiff with cmp.Diff ObjectGoPrintDiff is already a shim over cmp.Diff, so no actual output or behavior changes	2023-04-12 08:46:16 -07:00
Tim Hockin	29c0b73d64	Replace uses of diff.ObjectDiff with cmp.Diff ObjectDiff is already a shim over cmp.Diff, so no actual output or behavior changes	2023-04-12 08:46:12 -07:00
Andrea Tosatto	d09842e0ad	node-lifecycle-controller: improve monitorNodeHealth test-coverage (#116687 ) * node-lifecycle-controller: refactor monitorNodeHealth tests to improve test-coverage * address PR review comments * dedupe test logic	2023-04-12 07:02:33 -07:00
Kubernetes Prow Robot	4e2977d462	Merge pull request #117178 from gjtempleton/SIG-Autoscaling-Cleanup-HPA-OWNERS HPA Controller - Update Owners	2023-04-11 21:17:42 -07:00
Kubernetes Prow Robot	8cdc7fa542	Merge pull request #116675 from pacoxu/volume-flake deflake: Add retry with timeout to wait for final conditions	2023-04-11 18:19:09 -07:00
Kubernetes Prow Robot	32cb4a6dc5	Merge pull request #115100 from HirazawaUi/delte-pkg-controller-unused-functions delete unused functions in pkg/controller directory	2023-04-11 15:35:12 -07:00
Kubernetes Prow Robot	4e6da50c56	Merge pull request #113800 from pacoxu/patch-4 add sig/storage label for pkg/controller/volume package PR	2023-04-11 15:34:31 -07:00
Guy Templeton	d102a6fea5	HPA Controller - Update Owners Move josephburnett to emeritus status	2023-04-09 22:44:20 +01:00
Harshal Patil	1972dd1005	Do not log entire pod struct while attaching the volume Signed-off-by: Harshal Patil <harpatil@redhat.com>	2023-04-05 20:24:12 -04:00
Michal Wozniak	b5dd5f1f3a	Investigate and fix the handling of Succeeded pods in DaemonSet	2023-04-04 19:21:15 +02:00

... 2 3 4 5 6 ...

6449 Commits