kubernetes

Author	SHA1	Message	Date
Maciej Kwiek	0bec588202	PetSet returns valid replica count in status If the first pod is not healthy and next pods are not yet created, do not provide the status with incorrect replica count	2016-09-27 10:58:26 +02:00
Kubernetes Submit Queue	7309d34873	Merge pull request #33492 from kargakis/stop-retrying-selector-overlaps Automatic merge from submit-queue controller: don't retry deployments with overlapping selectors Returning an error will cause the deployment to be requeued. We should just emit an event for deployments with overlapping selectors and silently drop then out of the queue. This should be transitioned to a Condition once we have them. @kubernetes/deployment ptal	2016-09-26 23:50:40 -07:00
Kubernetes Submit Queue	6c1c0b9842	Merge pull request #32027 from m1093782566/m109-petset-fix-test-err Automatic merge from submit-queue [BUG FIX] Fix bug of UT in Pet Set <!-- Thanks for sending a pull request! Here are some tips for you: 1. If this is your first time, read our contributor guidelines https://github.com/kubernetes/kubernetes/blob/master/CONTRIBUTING.md and developer guide https://github.com/kubernetes/kubernetes/blob/master/docs/devel/development.md 2. If you want faster PR reviews, read how: https://github.com/kubernetes/kubernetes/blob/master/docs/devel/faster_reviews.md 3. Follow the instructions for writing a release note: https://github.com/kubernetes/kubernetes/blob/master/docs/devel/pull-requests.md#release-notes --> What this PR does / why we need it: Fix bug of UT in Pet Set. [1] https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/petset/pet_set_test.go#L74-L75, I think` len(pl)` is not equal to `len(fc.pets)`, see [here](https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/petset/fakes.go#L229-L233) [2] https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/petset/fakes.go#L249 I think should change to ``` if len(f.pets) <= index { ``` because when `len(f.pets)==index`, then [here](https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/petset/fakes.go#L252-L254) will cause `index out of range` panic! [3] https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/petset/fakes.go#L271 same reason with [2] [4] https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/petset/pet_set_test.go#L79 which doesn't make use of the error returned by [setHealthy](https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/petset/fakes.go#L248) and has a risk of letting the error out. Should we catch the error and use `t.Errorf()` to stop the test?	2016-09-26 21:13:14 -07:00
Chao Xu	7249c9bd8a	fix TestCreateWithNonExistentOwner remove the use of gc.QueuesDrained	2016-09-26 16:51:56 -07:00
Ivan Shvedunov	5651f822fd	Fix DaemonSet namespace handling for predicates In order to determine whether a node should run its daemon pod, DaemonController creates a dummy pod based on DaemonSet's template and then uses scheduler predicates (currently GeneralPredicates) to test whether such pod can be run by the node. The problem was that DaemonController was not setting Namespace for the dummy pod. This was not affecting currently used GeneralPredicates but this problem could bite later when some namespace-dependent predicates are added to GeneralPredicates or directly to DaemonController's node checks (e.g. pod affinity). Stumbled upon it while working on e2e test for #31136	2016-09-26 22:14:28 +03:00
Michail Kargakis	0a843a50ba	controller: don't retry deployments with overlapping selectors Returning an error will cause the deployment to be requeued. We should just emit an event for deployments with overlapping selectors and silently drop then out of the queue. This should be transitioned to a Condition once we have them.	2016-09-26 17:59:51 +02:00
Kubernetes Submit Queue	eed1e02346	Merge pull request #33012 from wojtek-t/informer_in_route_controller Automatic merge from submit-queue Use Informer framework in route controller	2016-09-26 06:56:06 -07:00
Jan Safranek	a54c9e2887	Refactor volume controller parameters into a structure persistentvolumecontroller.NewPersistentVolumeController has 11 arguments now, put them into a structure. Also, rename NewPersistentVolumeController to NewController, persistentvolume is already name of the package. Fixes #30219	2016-09-26 14:15:25 +02:00
Jan Safranek	5ff1597cf9	Rename controller.go to pv_controller.go To make log filtering easier. controller.go is used by several controllers and matching logs for "pv_controller.*" is much better.	2016-09-26 12:26:58 +02:00
Kubernetes Submit Queue	4785f6f517	Merge pull request #31978 from jsafrane/detach-before-delete Automatic merge from submit-queue Do not report error when deleting an attached volume Persistent volume controller should not send warning events to a PV and mark the PV as failed when the volume is still attached. This happens when a user quickly deletes a pod and associated PVC - PV is slowly detaching, while the PVC is already deleted and the PV enters Failed phase. `Deleter.Deleter` can now return `tryAgainError`, which is sent as INFO to the PV to let the user know we did not forget to delete the PV, however the PV stays in Released state. The controller tries again in the next sync (15 seconds by default). Fixes #31511	2016-09-25 18:55:32 -07:00
Kubernetes Submit Queue	64777d37b6	Merge pull request #33268 from deads2k/client-14-rc-svc-lister Automatic merge from submit-queue simplify RC listers Make the RC and SVC listers use the common list functions that more closely match client APIs, are consistent with other listers, and avoid unnecessary copies.	2016-09-23 23:37:15 -07:00
Kubernetes Submit Queue	071927a59d	Merge pull request #32549 from smarterclayton/gc_non_kube_legacy Automatic merge from submit-queue Allow garbage collection to work against different API prefixes The GC needs to build clients based only on Resource or Kind. Hoist the restmapper out of the controller and the clientpool, support a new ClientForGroupVersionKind and ClientForGroupVersionResource, and use the appropriate one in both places. Allows OpenShift to use the GC	2016-09-23 14:06:35 -07:00
Kubernetes Submit Queue	8152cfb9c3	Merge pull request #32670 from soltysh/cron_update Automatic merge from submit-queue Remove hacks from ScheduledJobs cron spec parsing Previusly `github.com/robfig/cron` library did not allow passing cron spec without seconds. First commit updates the library, which has additional method ParseStandard which follows the standard cron spec, iow. minute, hour, day of month, month, day of week. @janetkuo @erictune as promised in #30227 I've updated the library and now I'm updating it in k8s	2016-09-23 13:27:16 -07:00
Kubernetes Submit Queue	331eb83585	Merge pull request #33376 from luxas/fix_arm_atomics_2 Automatic merge from submit-queue Move HighWaterMark to the top of the struct in order to fix arm, second time ref: #33117 Sorry for not fixing everyone at once, but I seriously wasn't prepared for that quick LGTM 😄, so here's the other half. @lavalamp > lgtm, but seriously, this is terrible, we probably have this bug all over. And what if someone embeds the etcdWatcher struct in something else not at the top? We need the compiler to enforce things like this, it just can't be done manually. Can you file or link a golang issue for this? I totally agree! There isn't currently a way of programmatically detecting this unfortunately. I guess @davecheney or @minux can explain better to you why it's so hard. This is noted in https://github.com/kubernetes/kubernetes/blob/master/docs/proposals/multi-platform.md as a corner case indeed. @pwittrock This should be cherrypicked toghether with #33117	2016-09-23 12:05:09 -07:00
Kubernetes Submit Queue	0a4316f11e	Merge pull request #32807 from jingxu97/stateupdateNeeded-9-15 Automatic merge from submit-queue Fix race condition in setting node statusUpdateNeeded flag This PR fixes the race condition in setting node statusUpdateNeeded flag in master's attachdetach controller. This flag is used to indicate whether a node status has been updated by the node_status_updater or not. When updater finishes update a node status, it is set to false. When the node status is changed such as volume is detached or new volume is attached to the node, the flag is set to true so that updater can update the status again. The previous workflow has a race condition as follows 1. updater gets the currently attached volume list from the node which needs to be updated. 2. A new volume A is attached to the same node right after 1 and set the flag to TRUE 3. updater updates the node attached volume list (which does not include volume A) and then set the flag to FALSE. The result is that volume A will be never added to the attached volume list so at node side, this volume is never attached. So in this PR, the flag is set to FALSE when updater tries to get the attached volume list (as in an atomic operation). So in the above example, after step 2, the flag will be TRUE again, in step 3, updater does not set the flag if updates is sucessful. So after that, flag is still TRUE and in next round of update, the node status will be updated.	2016-09-23 11:25:16 -07:00
Lucas Käldström	06917531b3	Move HighWaterMark to the top of the struct in order to fix arm, second time	2016-09-23 20:58:28 +03:00
deads2k	500959b70c	fix RC lister	2016-09-23 08:12:03 -04:00
Kubernetes Submit Queue	b2aed32578	Merge pull request #33269 from deads2k/client-15-svc-lister Automatic merge from submit-queue simplify svc lister trying to track down what killed the e2e tests.	2016-09-23 03:10:57 -07:00
Jing Xu	14cad206f5	Fix race conditino in setting node statusUpdateNeeded flag This PR fixes the race condition in setting node statusUpdateNeeded flag in master's attachdetach controller. This flag is used to indicate whether a node status has been updated by the node_status_updater or not. When updater finishes update a node status, it is set to false. When the node status is changed such as volume is detached or new volume is attached to the node, the flag is set to true so that updater can update the status again. The previous workflow has a race condition as follows 1. updater gets the currently attached volume list from the node which needs to be updated. 2. A new volume A is attached to the same node right after 1 and set the flag to TRUE 3. updater updates the node attached volume list (which does not include volume A) and then set the flag to FALSE. The result is that volume A will be never added to the attached volume list so at node side, this volume is never attached. So in this PR, the flag is set to FALSE when updater tries to get the attached volume list (as in an atomic operation). So in the above example, after step 2, the flag will be TRUE again, in step 3, updater does not set the flag if updates is sucessful. So after that, flag is still TRUE and in next round of update, the node status will be updated. This PR also changes a unit test due to the workflow changes	2016-09-22 14:02:30 -07:00
Kubernetes Submit Queue	e9f4db2748	Merge pull request #27714 from jsafrane/event-recycle Automatic merge from submit-queue Send recycle events from pod to pv. This allows users to diagnose what's wrong with recycler. Recycler pods are started automatically with a cryptic name and they are deleted immediately when they finish. e.g, `kubectl describe pv` could show that NFS cannot be mounted (and how many pods have tried it): ``` FirstSeen LastSeen Count From SubobjectPath Type Reason Message --------- -------- ----- ---- ------------- -------- ------ ------- 59m 59m 1 {persistentvolume-controller } Warning RecyclerPod Recycler pod: Unable to mount volumes for pod "recycler-for-nfs_default(5421800e-347b-11e6-a79b-3c970e965218)": timeout expired waiting for volumes to attach/mount for pod "recycler-for-nfs"/"default". list of unattached/unmounted volumes=[vol] 53m 53m 1 {persistentvolume-controller } Warning RecyclerPod Recycler pod: Unable to mount volumes for pod "recycler-for-nfs_default(3c9809e5-347c-11e6-a79b-3c970e965218)": timeout expired waiting for volumes to attach/mount for pod "recycler-for-nfs"/"default". list of unattached/unmounted volumes=[vol] 46m 46m 1 {persistentvolume-controller } Warning RecyclerPod Recycler pod: Unable to mount volumes for pod "recycler-for-nfs_default(250dd2a2-347d-11e6-a79b-3c970e965218)": timeout expired waiting for volumes to attach/mount for pod "recycler-for-nfs"/"default". list of unattached/unmounted volumes=[vol] 40m 40m 1 {persistentvolume-controller } Warning RecyclerPod Recycler pod: Unable to mount volumes for pod "recycler-for-nfs_default(0d84ea33-347e-11e6-a79b-3c970e965218)": timeout expired waiting for volumes to attach/mount for pod "recycler-for-nfs"/"default". list of unattached/unmounted volumes=[vol] 33m 33m 1 {persistentvolume-controller } Warning RecyclerPod Recycler pod: Unable to mount volumes for pod "recycler-for-nfs_default(f5fb63bf-347e-11e6-a79b-3c970e965218)": timeout expired waiting for volumes to attach/mount for pod "recycler-for-nfs"/"default". list of unattached/unmounted volumes=[vol] 27m 27m 1 {persistentvolume-controller } Warning RecyclerPod Recycler pod: Unable to mount volumes for pod "recycler-for-nfs_default(de7128fd-347f-11e6-a79b-3c970e965218)": timeout expired waiting for volumes to attach/mount for pod "recycler-for-nfs"/"default". list of unattached/unmounted volumes=[vol] 1h 3m 75 {persistentvolume-controller } Normal RecyclerPod Recycler pod: Successfully assigned recycler-for-nfs to 127.0.0.1 1h 3m 76 {persistentvolume-controller } Normal RecyclerPod Recycler pod: Pod was active on the node longer than specified deadline 1h 1m 12 {persistentvolume-controller } Warning RecyclerPod Recycler pod: Error syncing pod, skipping: timeout expired waiting for volumes to attach/mount for pod "recycler-for-nfs"/"default". list of unattached/unmounted volumes=[vol] 20m 1m 4 {persistentvolume-controller } Warning RecyclerPod (events with common reason combined) ``` These steps were necessary: - added event watcher to volume.RecycleVolumeByWatchingPodUntilCompletion - pass all these events through volume plugins to volume controller - rework volume.RecycleVolumeByWatchingPodUntilCompletion unit tests to a table (too much copy-paste) - fix all unit tests along the way	2016-09-22 12:18:53 -07:00
Clayton Coleman	97c35fcc67	Allow garbage collection to work against different API prefixes The GC needs to build clients based only on Resource or Kind. Hoist the restmapper out of the controller and the clientpool, support a new ClientForGroupVersionKind and ClientForGroupVersionResource, and use the appropriate one in both places.	2016-09-22 15:00:58 -04:00
Kubernetes Submit Queue	4ab5a76338	Merge pull request #33103 from deads2k/controller-03-kill-non-generatedclient Automatic merge from submit-queue switch controller manager to generated clients Switches the controller manager to generated clients. @ncdc ptal	2016-09-22 11:37:01 -07:00
Kubernetes Submit Queue	6e25117891	Merge pull request #32655 from dshulyak/fix_node_fake_update Automatic merge from submit-queue Fix FakeNodeHandler Update behaviour Two problems: 1. Get is always using Existing nodes slice, and you will for sure miss any updated data 2. Each Update adds a duplicate node entry to UpdatedNodes slice For the 1st, we will try to find a node in UpdatedNodes slice (same as for the List). 2nd - append only if there is no node with same name as updated, if there is we will replace object in UpdatedNodes slice.	2016-09-22 07:43:18 -07:00
deads2k	7ee5b26ad1	incorrect key determination	2016-09-22 09:55:24 -04:00
deads2k	483af28944	fix up service lister	2016-09-22 09:12:37 -04:00
Kubernetes Submit Queue	5af04d1dd1	Merge pull request #32876 from errordeveloper/more-cert-utils Automatic merge from submit-queue Refactor cert utils into one pkg, add funcs from bootkube for kubeadm to use What this PR does / why we need it: We have ended-up with rather incomplete and fragmented collection of utils for handling certificates. It may be worse to consider using `cfssl` for doing all of these things, but for now there is some functionality that we need in `kubeadm` that we can borrow from bootkube. It makes sense to move the utils from bookube into core, as discussed in #31221. Special notes for your reviewer: I've taken the opportunity to review names of existing funcs and tried to make some improvements in that area (with help from @peterbourgon). Release note: ```release-note NONE ```	2016-09-22 01:29:46 -07:00
Kubernetes Submit Queue	e115a4282d	Merge pull request #33169 from deads2k/api-12-move-groups Automatic merge from submit-queue move registry packages for all API groups This continues the pattern of `registry/<group>/resource` for our backing storage. This entire pull is nothing but moves. I'll reswizzle the actual storage next, but these are cargo-culted everywhere, so I want to lay this down early. @sttts @ncdc	2016-09-22 00:51:59 -07:00
Antoine Pelisse	938872582e	Revert "simplify RC and SVC listers"	2016-09-21 15:49:38 -07:00
deads2k	561f8d75a5	move core resource registry packages	2016-09-21 10:11:50 -04:00
Kubernetes Submit Queue	2d9d84dc64	Merge pull request #32888 from deads2k/client-10-fixup-remaining-listers Automatic merge from submit-queue simplify RC and SVC listers Make the RC and SVC listers use the common list functions that more closely match client APIs, are consistent with other listers, and avoid unnecessary copies.	2016-09-21 04:13:56 -07:00
Dmitry Shulyak	c0981963d9	Verify evicted pods managed by petset controller will be recreated Spawn pet set with single replica and simple pod. They will have conflicting hostPort definitions, and spawned on the same node. As the result pet set pod, it will be created after simple pod, will be in Failed state. Pet set controller will try to re-create it. After verifying that pet set pod failed and was recreated atleast once, we will remove pod with conflicting hostPort and wait until pet set pod will be in running state. Change-Id: I5903f5881f8606c696bd390df58b06ece33be88a	2016-09-21 12:03:11 +03:00
Kubernetes Submit Queue	02605106a6	Merge pull request #29505 from kargakis/debug-recreate-flake Automatic merge from submit-queue controller: enhance timeout error message for Recreate deployments Makes the error message from https://github.com/kubernetes/kubernetes/issues/29197 more obvious @kubernetes/deployment	2016-09-21 01:45:47 -07:00
Matt Liggett	ce0e7586a8	Only approve evictions when budgets would stay enforced after. Prior to this, we would approve eviction as long as the current state of the pods matched the budget. The new version requires that after the eviction, the pods would still match the budget. Also update tests to match.	2016-09-20 18:23:50 -07:00
Kubernetes Submit Queue	ad7ba62b24	Merge pull request #32785 from m1093782566/m109-job-controller-hot-loop Automatic merge from submit-queue [Controller Manager] Fix job controller hot loop <!-- Thanks for sending a pull request! Here are some tips for you: 1. If this is your first time, read our contributor guidelines https://github.com/kubernetes/kubernetes/blob/master/CONTRIBUTING.md and developer guide https://github.com/kubernetes/kubernetes/blob/master/docs/devel/development.md 2. If you want faster PR reviews, read how: https://github.com/kubernetes/kubernetes/blob/master/docs/devel/faster_reviews.md 3. Follow the instructions for writing a release note: https://github.com/kubernetes/kubernetes/blob/master/docs/devel/pull-requests.md#release-notes --> What this PR does / why we need it: Fix Job controller hotloop on unexpected API server rejections. Which issue this PR fixes Related issue is #30629 Special notes for your reviewer: @deads2k @derekwaynecarr PTAL.	2016-09-20 13:52:45 -07:00
deads2k	b83a317003	switch controller manager to generated clientset	2016-09-20 12:53:47 -04:00
m1093782566	27cc90cebb	fix job controller hot loop Change-Id: I55ce706381f1494e5cd2064177b938f56d9c356a	2016-09-20 22:25:11 +08:00
Michail Kargakis	59da5385e0	controller: enhance timeout error message for Recreate deployments	2016-09-20 15:53:24 +02:00
deads2k	16fbb47189	fix up service lister	2016-09-20 08:24:33 -04:00
deads2k	185a7adf84	fix RC lister	2016-09-20 08:24:32 -04:00
Kubernetes Submit Queue	4a176600fc	Merge pull request #32482 from m1093782566/m109-pet-set-fix-update-bug Automatic merge from submit-queue [Pet Set] Fix losing pet updated information between update retries <!-- Thanks for sending a pull request! Here are some tips for you: 1. If this is your first time, read our contributor guidelines https://github.com/kubernetes/kubernetes/blob/master/CONTRIBUTING.md and developer guide https://github.com/kubernetes/kubernetes/blob/master/docs/devel/development.md 2. If you want faster PR reviews, read how: https://github.com/kubernetes/kubernetes/blob/master/docs/devel/faster_reviews.md 3. Follow the instructions for writing a release note: https://github.com/kubernetes/kubernetes/blob/master/docs/devel/pull-requests.md#release-notes --> What this PR does / why we need it: Address #32481 @bprashanth	2016-09-20 05:16:04 -07:00
d00369826	3de4695057	fix petset update(pet) retries bug Change-Id: I92e2b653ab78fca72ae41cf87945d90fbbc67f44	2016-09-20 11:35:58 +08:00
m1093782566	2a117798b6	fix disruption hot loop Change-Id: Ib8eb56cb87f688fe9b2016f574f3fb9b685ce796	2016-09-19 20:50:48 +08:00
Wojciech Tyczynski	a6ef37ece9	Use Informer framework in route controller	2016-09-19 11:53:30 +02:00
Ilya Dmitrichenko	386fae4592	Refactor utils that deal with certs - merge `pkg/util/{crypto,certificates}` - add funcs from `github.com/kubernetes-incubator/bootkube/pkg/tlsutil` - ensure naming of funcs is fairly consistent	2016-09-19 09:03:42 +01:00
Michail Kargakis	2fd3c490df	controller: a couple of fixes for csr Fixes: * delete resource handler wasn't taking into account tombstones * csr would requeue twice on update failure	2016-09-18 22:48:46 +02:00
Wojciech Tyczynski	27d75054b3	Avoid unnecessary API calls from GC	2016-09-18 07:09:11 +02:00
Kubernetes Submit Queue	920581d964	Merge pull request #32664 from m1093782566/m109-certificates-hot-loop Automatic merge from submit-queue [Controller Manager] Fix certificates controller hotloop and use utilruntime.HandleError to replace glog.Errorf <!-- Thanks for sending a pull request! Here are some tips for you: 1. If this is your first time, read our contributor guidelines https://github.com/kubernetes/kubernetes/blob/master/CONTRIBUTING.md and developer guide https://github.com/kubernetes/kubernetes/blob/master/docs/devel/development.md 2. If you want faster PR reviews, read how: https://github.com/kubernetes/kubernetes/blob/master/docs/devel/faster_reviews.md 3. Follow the instructions for writing a release note: https://github.com/kubernetes/kubernetes/blob/master/docs/devel/pull-requests.md#release-notes --> What this PR does / why we need it: Fix certificates controller hotloop on unexpected API server rejections. Which issue this PR fixes Related issue is #30629 Special notes for your reviewer: @deads2k @derekwaynecarr PTAL. I find there is no unit test for certificates controller, and I will implement unit tests for it later.	2016-09-17 21:00:59 -07:00
Kubernetes Submit Queue	41fc0a4506	Merge pull request #32776 from m1093782566/m109-fix-endpoint-controller-hotloop Automatic merge from submit-queue [Controller Manager] Fix endpoint controller hot loop and use utilruntime.HandleError to replace glog.Errorf <!-- Thanks for sending a pull request! Here are some tips for you: 1. If this is your first time, read our contributor guidelines https://github.com/kubernetes/kubernetes/blob/master/CONTRIBUTING.md and developer guide https://github.com/kubernetes/kubernetes/blob/master/docs/devel/development.md 2. If you want faster PR reviews, read how: https://github.com/kubernetes/kubernetes/blob/master/docs/devel/faster_reviews.md 3. Follow the instructions for writing a release note: https://github.com/kubernetes/kubernetes/blob/master/docs/devel/pull-requests.md#release-notes --> Why: Fix endpoint controller hot loop and use `utilruntime.HandleError` to replace `glog.Errorf` What 1. Fix endpoint controller hot loop in `pkg/controller/endpoint` 2. Fix endpoint controller hot loop in `contrib/mesos/pkg/service` 3. Sweep cases of `glog.Errorf` and use `utilruntime.HandleError` instead. Which issue this PR fixes Fixes #32843 Related issue is #30629 Special notes for your reviewer: @deads2k @derekwaynecarr The changes on `pkg/controller/endpoints_controller.go` and `contrib/mesos/pkg/service/endpoints_controller.go` are almost the same except `contrib/mesos/pkg/service/endpoints_controller.go` does not pass `podInformer` as the parameter of `NewEndpointController()`. So, I didn't wait `podStoreSynced` before `syncService()`(Just leave it as it was). Will it lead to a problem?	2016-09-17 20:01:41 -07:00
Kubernetes Submit Queue	9fc8ebdafa	Merge pull request #32891 from wojtek-t/route_controller_fix Automatic merge from submit-queue Don't update NodeNetworkUnavailable condition if it's already set cor… Ref #32571	2016-09-17 00:39:34 -07:00
Kubernetes Submit Queue	cd051703b3	Merge pull request #32877 from deads2k/client-09-fixup-lister Automatic merge from submit-queue change factorization of listers to make them easier to add `Listers` have a tremendous amount of duplicate code. This factors that out. @smarterclayton ptal.	2016-09-16 22:39:37 -07:00

1 2 3 4 5 ...

1649 Commits