Commit Graph

496 Commits

Author SHA1 Message Date
Colin Hom
acd7f5045d gce/util: $replica-pd --> $replica_pd
fixes #32997
2016-09-19 12:00:08 -07:00
Kubernetes Submit Queue
87c2650038 Merge pull request #32873 from jszczepkowski/ha-delete-nodes2
Automatic merge from submit-queue

Implemented KUBE_DELETE_NODES flag in kube-down.

Implemented KUBE_DELETE_NODES flag in kube-down script.
It prevents removal of nodes when shutting down a HA master replica.
2016-09-19 01:08:18 -07:00
Jerzy Szczepkowski
58c8992590 Implemented KUBE_DELETE_NODES flag in kube-down.
Implemented KUBE_DELETE_NODES flag in kube-down script.
It prevents removal of nodes when shutting down a HA master replica.
2016-09-16 16:51:52 +02:00
Kubernetes Submit Queue
60840140ab Merge pull request #31437 from jszczepkowski/ha-poc-debian2
Automatic merge from submit-queue

Implemented creation of HA master for GCE on debian.
2016-09-16 05:44:18 -07:00
Random-Liu
bb233e2249 Change the upgrade script to keep os distro during upgrade. 2016-09-15 21:14:40 -07:00
Jerzy Szczepkowski
529818c9c1 Fixed #32366: wrong master pd name during kube-down.
Fixed #32366: wrong master pd name during kube-down.
2016-09-14 14:21:19 +02:00
Jerzy Szczepkowski
196a442e26 Implemented creation of HA master for GCE on debain.
Implemented creation of HA master for GCE on debain.
2016-09-12 16:42:20 +02:00
Kubernetes Submit Queue
8a77e3fdb7 Merge pull request #32155 from euank/update-root-size
Automatic merge from submit-queue

cluster/gce: Update master root disk size

As part of #29213, the hyperkube image will be deployed alongside
existing dependencies.

This ends up just running over the root disk size of 10 during
extraction.

cc @yifan-gu @aaronlevy
2016-09-11 23:28:08 -07:00
Jerzy Szczepkowski
8541ea6c29 Updated port for HA master etcd.
Updated port for HA master etcd to 2379 (needed after #30514).
2016-09-07 16:10:55 +02:00
Euan Kemp
e023ec90f3 cluster/gce: Update master root disk size
As part of #29213, the hyperkube image will be deployed alongside
existing dependencies.

This ends up just running over the root disk size of 10 during
extraction.
2016-09-06 13:54:39 -07:00
Kubernetes Submit Queue
09e5fa62b5 Merge pull request #31294 from mml/federated-oops
Automatic merge from submit-queue

Fix e2e logic when FEDERATION=false.

Also add debugging output to GCE detect-master.

Fixes #31285.
2016-09-05 13:01:12 -07:00
Amey Deshpande
6a2201f410 Pick a specific GCI version by default on GCE.
Prior to this change, a K8s branch (master as well as release) was
pinned to a GCI milestone.  It would pick up the latest GCI release on
that milestone at the time of cluster creation.  The rationale was the
K8s users would automatically get the bug fixes in newer versions of
GCI.  However in practice, it makes the runtime environment
non-deterministic, and lack of continuous e2e tests mean we would run
into breakages sooner or later.

With this change, each K8s release will pick a specific version
of GCI by default (similar to how the Debian-based container-vm gets used).
Users can override the default version through KUBE_GCE_MASTER_IMAGE and
KUBE_GCE_NODE_IMAGE environment variables.

We expect the default GCI version will be updated relatively frequently stay
updated with newer GCI releases.  We can also automate the process to
automatically bump the hard-coded GCI version in future.
2016-08-31 17:26:00 -07:00
Jerzy Szczepkowski
c5f18d517d Marked HA master feautre experimental.
Marked HA master feautre experimental.
2016-08-30 11:50:22 +02:00
Matt Liggett
2ddc08e567 Add debugging output when trying to find the name of the master. 2016-08-23 13:18:22 -07:00
Jerzy Szczepkowski
05a41623b0 Support for creation/removes of master replicas.
HA master: initial support for creation/removal of masters replicas by
kube-up/kube-down scripts for GCE on gci.
2016-08-15 22:36:49 +02:00
Vishnu kannan
2f6514bd63 Node e2e can run tests against GCE images based on a regex.
It can run tests against multiple existing images that match a regex.
GCI images will be using a regex.

Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-08-10 11:24:54 -07:00
Ryan Hitchman
5508e49ef5 Use bash ranges "{1..3}" instead of "$(seq 1 3)". 2016-08-08 16:23:39 -07:00
Jerzy Szczepkowski
c8d920ea6c Reverted conversion of influx-db to PetSet.
Reverted conversion of influx-db to PetSet.
2016-08-04 17:08:46 +02:00
Bryan Boreham
f5159b49c2 Correct gcloud list arg from '--zone' to '--zones'
`--zone` was accepted as an abbreviation but is now deprecated
and will be disabled entirely in gcloud 132.0.0.
2016-08-01 14:49:38 +01:00
Vishnu kannan
bb4e1e62fa Mention updating node e2e config before updating gci image milestone
Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-07-29 14:26:15 -07:00
Filip Grzadkowski
e9585fba91 1. Precompute REGION variable in config
2. Add timeout for waiting for loadbalancer
3. Fix kube-down so that it doesn't delete some resources if there are still masters/nodes in other zones
2016-07-20 17:25:25 +02:00
Filip Grzadkowski
70bb57a3e1 Revert "Merge pull request #29278 from kubernetes/revert-29201-ha_master"
This reverts commit ecebdb5707, reversing
changes made to 976ca09d71.
2016-07-20 16:37:31 +02:00
Filip Grzadkowski
194895740c Revert "Add and delete load balancer in front of apiserver." 2016-07-20 14:02:59 +02:00
Piotr Szczesniak
976ca09d71 Merge pull request #29201 from fgrzadkowski/ha_master
Add and delete load balancer in front of apiserver.
2016-07-20 09:38:19 +02:00
Filip Grzadkowski
5c0f0daae0 Add and delete load balancer in front of apiserver. 2016-07-19 16:44:21 +02:00
k8s-merge-robot
c31882a5f2 Merge pull request #29140 from maisem/gce_on_cvm
Automatic merge from submit-queue

Fixing kube-up for CVM masters.

@roberthbailey @zmerlynn
2016-07-18 20:06:56 -07:00
Maisem Ali
fe14340051 Fixing kube-up for CVM masters. 2016-07-18 14:21:41 -07:00
Zach Loafman
4c667c38c9 GCE bring-up: Differentiate NODE_TAGS from NODE_INSTANCE_PREFIX
Kubernetes-side fix to #29074 (GKE-side fix necessary as well)
2016-07-18 14:20:45 -07:00
Maisem Ali
97f3f80833 Splitting OS_DISTRIBUTION into NODE_OS_DISTRIBUTION and
MASTER_OS_DISTRIBUTION.
2016-07-15 12:02:31 -07:00
David McMahon
ef0c9f0c5b Remove "All rights reserved" from all the headers. 2016-06-29 17:47:36 -07:00
Jerzy Szczepkowski
d00cdf75e8 Influxdb migrated to PetSet and PersistentVolumes.
Influxdb migrated to PetSet and PersistentVolumes.
2016-06-27 15:39:09 +02:00
Jerzy Szczepkowski
a745bb17e3 Cleanup in kube-down: removed deletion of old autoscaler.
Cleanup in kube-down: removed deletion of old autoscaler.
2016-06-24 13:32:37 +02:00
gmarek
6d201c9c57 kube-down deletes instance templates created by the cluster upgrade 2016-06-22 14:09:37 +02:00
nikhiljindal
864b267a8d Simplifying the code to create kubeconfig 2016-06-20 16:10:47 -07:00
k8s-merge-robot
b442ba72e5 Merge pull request #27522 from adityakali/gci53
Automatic merge from submit-queue

make GCI image detection robust

This change makes sure that in case we roll back a released GCI image, the image detection logic picks a correct active image. 

@kubernetes/goog-image @Amey-D @wonderfly @dchen1107
2016-06-20 04:05:45 -07:00
Aditya Kali
4a46c948f2 make GCI image detection robust 2016-06-17 09:56:53 -07:00
Zach Loafman
2d77dc639b gce/kube-down: Parallelize IGM deletion, batch more
Batch up to 200 instance / route deletions, handle
IGM deletion in parallel.
2016-06-13 14:05:43 -07:00
Aditya Kali
4dde9a66f4 version bump for gci to milestone 53
Fixes #26455

GCI release 53 includes kubernetes v1.3.0-alpha.5 with docker-1.11.2.
2016-06-10 15:28:52 -07:00
Marcin Wielgus
edf21902b5 Rename ENABLE_NODE_AUTOSCALER to ENABLE_CLUSTER_AUTOSCALER - part 1 2016-06-08 13:35:38 +02:00
Quinton Hoole
c578678820 Merge pull request #26914 from nikhiljindal/FedAPIServerSecret
Updating federation up script to create secrets with federation-apiserver and k8s apiservers kubeconfigs
2016-06-07 22:29:39 -07:00
k8s-merge-robot
bae931204c Merge pull request #26990 from mwielgus/ca-0.1.2
Automatic merge from submit-queue

Bump cluster autoscaler version and enable scale down by default

Follow up of https://github.com/kubernetes/contrib/pull/1148. 

cc: @piosz @fgrzadkowski @jszczepkowski
2016-06-07 19:42:19 -07:00
nikhiljindal
9443bf0f9c Adding secrets for kubernetes clusters as well 2016-06-07 17:30:15 -07:00
Marcin Wielgus
c191c72063 Bump cluster autoscaler version and enable scale down by default 2016-06-07 21:42:56 +02:00
k8s-merge-robot
98c432a943 Merge pull request #25841 from ojarjur/ojarjur/registry-mirror
Automatic merge from submit-queue

GCE: Enable using gcr.io as a Docker registry mirror.

```release-note
Use gcr.io as a Docker registry mirror when setting up a cluster in GCE.
```
[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()

This only affects clusters running under GCE.
2016-06-06 14:50:05 -07:00
Aditya Kali
f7c2624849 pin GCI version to milestone 52
This is mainly for pinning the 1.2 branch to GCI milestone 52
which contains correct docker and kubelet built in.
Doing this allows us to upgrade docker to v1.11 (issue #26455)
in GCI 53 without breaking the 1.2 release branch.
2016-06-02 13:52:38 -07:00
Daniel Wang
a8701baf17 Move the defaults setting of GCI to util.sh
This change recovers some of the side effects of
https://github.com/kubernetes/kubernetes/pull/26197, i.e., keeps the defaults of
`NODE_IMAGE` and `NODE_IMAGE_PROJECT` to `MASTER_IMAGE` and
`MASTER_IMAGE_PROJECT`, for backward compatibility. Although it keeps
`OS_DISTRIBUTION` defaulting to `gci`, the default settings of these vars are
moved to `cluster/gce/util.sh` and conditioned on `OS_DISTRIBUTION==gci`.
2016-06-01 15:37:02 -07:00
Marcin
74ad0def81 Add missing $ in util.sh 2016-05-31 19:41:54 +02:00
Marcin
0b7c2fa6fe Allow to enable scale down in kube-up for gce 2016-05-30 17:07:54 +02:00
Daniel Wang
3d95151971 Replace containervm with GCI as default master image for GCE clusters 2016-05-24 17:28:17 -05:00
Omar Jarjur
338b33f3f4 Enable using gcr.io as a Docker registry mirror.
This only affects clusters running under GCE.
2016-05-24 08:12:48 -07:00
k8s-merge-robot
04d6162e0f Merge pull request #26071 from gmarek/kube-up
Automatic merge from submit-queue

Make node-instance-group base names unique to prevent collisions

We create multiple IGMs for >1000 Node clusters. When we have a conflict on base name IGMs will fight over ownership of the VM that happen to have the name belonging to multiple IGMs.

This change will increase reliability of starting big clusters.

cc @wojtek-t @alex-mohr @roberthbailey @mikedanese
2016-05-24 07:31:22 -07:00
gmarek
7b629a778c Make node-instance-group base names unique to prevent collisions 2016-05-23 14:36:08 +02:00
Andy Zheng
f31c4f6d69 Revert "Revert "Add support for running GCI on the GCE cloud provider""
This reverts commit 40f53b1765.
2016-05-23 00:52:08 -07:00
Filip Grzadkowski
2f2b7c8c97 Merge pull request #25734 from jszczepkowski/kac-test
Added enforcing of setting nodes numbers for cluster autoscaler.
2016-05-20 10:28:56 +02:00
Daniel Smith
40f53b1765 Revert "Add support for running GCI on the GCE cloud provider" 2016-05-18 21:31:28 -07:00
Andy Zheng
a737e1eba1 Add support for running GCI on the GCE cloud provider 2016-05-18 15:15:05 -07:00
k8s-merge-robot
51203dad7a Merge pull request #25763 from andyzheng0831/gci-image
Automatic merge from submit-queue

GCI: Fix the condition for using the default image

This PR revises the condition for using the default GCI image. The old logic is not convenient for manually run e2e tests in some cases (mainly for GCI team to test custom images). The new logic by this PR is very similar to the logic in using ContainerVM. When setting distro to "gci", if master or node image is unset, we use gci-dev for it. If either is set, we respect it.

@roberthbailey @zmerlynn @dchen1107 please review it, and we should cherry pick it in release-1.2 branch. Thanks!

cc/ @kubernetes/goog-image @adityakali FYI
2016-05-17 22:55:41 -07:00
Zach Loafman
87a8905b2c Fix gce/util.sh:get-master-root-disk-size 2016-05-17 13:43:17 -07:00
Andy Zheng
e47f45d1a3 GCI: Fix the condition for using the default image 2016-05-17 10:53:14 -07:00
Jerzy Szczepkowski
858eb0388e Added enforcing of setting nodes numbers for cluster autoscaler.
Added enforcing of setting nodes numbers for cluster autoscaler in kube-up script.
2016-05-17 13:15:49 +02:00
gmarek
64618c3311 Make bigger master root disks in GCE for large clusters 2016-05-16 15:57:56 +02:00
k8s-merge-robot
aada051b20 Merge pull request #25357 from mwielgus/ca-salt
Automatic merge from submit-queue

Salt configuration for the new Cluster Autoscaler for GCE

Adds support for cloud autoscaler from contrib/cloud-autoscaler in kube-up.sh GCE script.

cc: @fgrzadkowski @piosz
2016-05-15 11:35:19 -07:00
Marcin Wielgus
ac4b380453 Salt configuration for the new Cluster Autoscaler for GCE 2016-05-13 17:04:24 +02:00
Jeff Grafton
bc7922c374 Replace awkward bash pipelines with gcloud's --format='value()'
Also remove old `gcloud compute instance-groups managed get-operation`
code. This command doesn't even exist in gcloud anymore.
2016-05-06 15:49:57 -07:00
Jeff Grafton
6cd9dcba60 Use --format='value()' instead of --fields with gcloud 2016-05-04 12:32:13 -07:00
k8s-merge-robot
f8196d9048 Merge pull request #24834 from ixdy/node-names
Automatic merge from submit-queue

Fix detect-node-names to not error out if there are no nodes

Fixes #21564.

Teardown was not working correctly in rare cases because `detect-node-names` was failing before any of the actual cleanup was run. I'm pretty sure the issue was that there was an instance group, but no instances in the instance group, so we bailed out when we tried to expand the bash array.

This PR adds a guard so we don't bail if the array is empty.

cc @jlowdermilk @spxtr
2016-04-29 16:37:41 -07:00
Andy Zheng
967fd495e0 GCI: Update the command to get the image 2016-04-29 09:15:27 -07:00
k8s-merge-robot
19408e5575 Merge pull request #24893 from andyzheng0831/gci
Automatic merge from submit-queue

Add support for running clusters on GCI

Google Container-VM Image (GCI) is the next revision of Container-VM. See documentation at https://cloud.google.com/compute/docs/containers/vm-image/. This change adds support for starting a Kubernetes cluster using GCI.

With this change, users can start a kubernetes cluster using the latest kubelet and kubectl release binary built in the GCI image by running:

	$ KUBE_OS_DISTRIBUTION="gci" cluster/kube-up.sh

Or run a testing cluster on GCI by running:

	$ KUBE_OS_DISTRIBUTION="gci" go run hack/e2e.go -v --up

The commands above will choose the latest GCI image by default.
2016-04-27 23:46:26 -07:00
Andy Zheng
93929c3853 Add support for running clusters on GCI 2016-04-27 15:24:21 -07:00
CJ Cullen
3253739505 Use ABAC authz instead of AllowAll. 2016-04-27 11:38:15 -07:00
Jeff Grafton
5487a3f8f6 Fix detect-node-names to not error out if there are no nodes 2016-04-26 17:48:25 -07:00
Jeff Grafton
b69b94103c Make detect-node-names less brittle for G{C,K}E 2016-04-25 12:10:25 -07:00
k8s-merge-robot
19a106f61e Merge pull request #24274 from euank/configurable-release-bucket
Automatic merge from submit-queue

jenkins: Allow configuration of release bucket

This allows others to leverage the existing E2E code to test some
patched kube binary by simply overriding the bucket and reusing many of
the existing scripts
2016-04-21 23:09:05 -07:00
Euan Kemp
9f9c54cdd2 jenkins: Allow configuration of release bucket
This allows others to leverage the existing E2E code to test some
patched kube binary by simply overriding the bucket and reusing many of
the existing scripts
2016-04-14 12:26:43 -07:00
CJ Cullen
5a785698aa Strip comments from configure-vm.sh for gce 2016-04-11 15:13:02 -07:00
k8s-merge-robot
c6e995a824 Merge pull request #23558 from andyzheng0831/enhance
Automatic merge from submit-queue

Trusty: Regional release .tar.gz support

@zmerlynn and @roberthbailey please review it. This change is to support the feature added in PR #22234. The entire logic is pretty much the same as in #22234, with only few minor changes in implementation.

I had manually run e2e tests with "export RELEASE_REGION_FALLBACK=true" on two clusters: (1) Trusty on master nodes on ContainerVM; (2) Master and nodes all on trusty. All tests are green. I don't figure out a way to simulate regional fallback. But I did test the function download_or_bust() out-of-box.

cc/ @wonderfly @dchen1107 @fabioy FYI.
2016-04-01 03:39:08 -07:00
Andy Zheng
a83f11c484 Trusty: Regional release .tar.gz support 2016-03-28 15:21:14 -07:00
Matt Liggett
300855c5bb Stop swallowing stdout and stderr when we check for the staging bucket.
Should help debug #23461
2016-03-25 15:01:21 -07:00
k8s-merge-robot
f2d14990a5 Merge pull request #23180 from roberthbailey/dead-code
Auto commit by PR queue bot
2016-03-21 17:30:22 -07:00
Andy Zheng
2ea9ef53f3 Trusty: Support hybrid cluster with nodes on ContainerVM 2016-03-19 22:47:35 -07:00
Robert Bailey
c5ca042ede Remove the restart-kube-proxy and restart-apiserver functions
since neither are used any longer.
2016-03-18 01:49:01 -07:00
gmarek
c325319ccb Default MIG size to 1000 2016-03-14 12:21:14 +01:00
k8s-merge-robot
31903c8bb1 Merge pull request #22594 from justinsb/aws_ulite
Auto commit by PR queue bot
2016-03-10 10:15:28 -08:00
Joe Finney
e660adbad8 Add connect timeout to ssh-to-node. 2016-03-07 13:29:04 -08:00
k8s-merge-robot
bc96422e23 Merge pull request #22571 from zmerlynn/regional-registry
Auto commit by PR queue bot
2016-03-06 13:21:19 -08:00
k8s-merge-robot
e5a908c898 Merge pull request #22476 from ixdy/cluster-logs
Auto commit by PR queue bot
2016-03-05 23:06:29 -08:00
Justin Santa Barbara
523e1d0d8c Ubernetes-Lite: reuse existing configuration when reusing master
In particular, we need to share the kubelet cert & key, otherwise we
can't connect to the kube-api.

Fix #22593
2016-03-05 17:00:02 -05:00
Zach Loafman
60e1fccc7f Regional registries: Hack the manifests!
Allow the gcr.io/google_containers registry to be overridden
regionally by just blasting a new KUBE_ADDON_REGISTRY out. Instead of
adding every addon to Salt and asking all of the other consumers
(Trusty, Juju, Mesos, etc) to change, just script the sed ourselves.

This is probably the 9th grossest thing I've ever done, but it works
well, and it works quickly. I kind of wish it didn't.
2016-03-04 20:00:54 -08:00
Jeff Grafton
d70c516d4f Remove log collection code in cluster/gce/util.sh.
Also update some docs to mention cluster/log-dump.sh.
2016-03-03 15:28:33 -08:00
Robert Bailey
30b78e4d7f Add the GCE cluster name to instance metadata. 2016-03-03 13:57:31 -08:00
Jeff Lowdermilk
73479d5209 Merge pull request #22441 from thockin/flake-22244-gce-static-ip
Check static IP postcondition if op fails
2016-03-03 11:38:50 -08:00
Tim Hockin
6dca871693 Check static IP postcondition if op fails 2016-03-03 08:20:12 -08:00
k8s-merge-robot
c9accdfd57 Merge pull request #22234 from zmerlynn/regional-buckets
Auto commit by PR queue bot
2016-03-02 03:01:54 -08:00
Joe Finney
74515a6b23 Fix detect-nodes in cluster/gce/util.sh. 2016-03-01 17:12:43 -08:00
Zach Loafman
5a031bf52c GCE startup: Regional release .tar.gz support
* In kube-up.sh, create a staging bucket with a location nearest the
zone being created. If new variable RELEASE_REGION_FALLBACK is set
(default false), create multiple buckets and stage to fallback
URLs. (In open source, this path is primarily for testing.)

* In configure-vm.sh, split the URL env variables by comma (if any
extra are present) and retry on the fallback URLs. Also factor the
hash checking into this path rather than outside, since a corrupt
release in a particular geo can be retried in a different geo.

* Remove the local already-staged .tar.gz checks. They've caused
several issues along the way, and with this code path become virtually
unmaintainable. (I could add a sentinel for each bucket it's possibly
staged to, but ew.)
2016-03-01 09:59:39 -08:00
Justin Santa Barbara
ee58dc22d7 Merge pull request #21673 from gmarek/kube-down
Auto commit by PR queue bot
2016-02-27 19:21:18 -05:00
gmarek
053e882622 kube-down looks for instance templates even if instance group does not exists 2016-02-24 11:15:45 +01:00
Zach Loafman
e85cf1bb97 Attempt to delete node-template before retry
I don't think we've seen this commonly, so I'm proposing a simplistic
fix for now.

Fixes #20878
2016-02-22 16:09:15 -08:00
k8s-merge-robot
1ce188e557 Merge pull request #21430 from spxtr/e2e-internal
Auto commit by PR queue bot
2016-02-21 12:08:22 -08:00
k8s-merge-robot
d6192dd152 Merge pull request #21342 from spxtr/remove-e2e-test
Auto commit by PR queue bot
2016-02-21 00:39:38 -08:00