Commit Graph

2217 Commits

Author SHA1 Message Date
Kubernetes Submit Queue
e6cdfb300e Merge pull request #62172 from shyamjvs/ip-alias-subnet-fix
Automatic merge from submit-queue (batch tested with PRs 62162, 60628, 62172). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

When using custom network with IP-alias, use the former's subnet for the latter too

Currently, when we're using custom subnet and ip-alias simultaneously, the cluster fails to come up.
The reason is because we're creating a subnet in the former with one name, but expecting a differently named subnet for the latter.

This is causing [continuous failures in our 100-node job](https://k8s-testgrid.appspot.com/sig-scalability-gce#gce) where I recently turned both of them on.

cc @kubernetes/sig-network-bugs 

```release-note
NONE
```
2018-04-05 12:33:08 -07:00
Shyam Jeedigunta
1961e21098 When using custom network with IP-alias, use the former's subnet for the latter too 2018-04-05 17:57:43 +02:00
Supriya Garg
e350c46116 Update the stackdriver agents yaml to include a deployment for cluster level resources 2018-04-05 10:09:11 -04:00
Filipe Brandenburger
7bf7e8991a Update COS version in Kubernetes GCE default and test
Set the default to cos-stable-65 (which is what we are using on GKE for
latest 1.9 and 1.8) and set config-test to use cos-beta-66, so that we
can get more exposure to it.

The testgrid seems to be fairly happy with these images. (both
e2e-gce-cosdev-k8sdev-default and e2e-gce-cosbeta-k8sdev-default are
generally green.)
2018-04-04 10:02:55 -07:00
Nick Sardo
f32f0fb337 Update GLBC manifest to v1.0.1 2018-04-03 11:43:43 -07:00
Kubernetes Submit Queue
399b835f54 Merge pull request #61432 from filbranden/killrkt1
Automatic merge from submit-queue (batch tested with PRs 61904, 61565, 61401, 61432, 61772). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Remove rktnetes code

**What this PR does / why we need it**:
rktnetes is scheduled to be deprecated in 1.10 (#53601). According to the deprecation policy for beta CLI and flags, we can remove the feature in 1.11.

**Which issue(s) this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged)*:
Fixes #58721

**Special notes for your reviewer**:

**Release note**:

```release-note
Removed rknetes code, which was deprecated in 1.10.
```

/assign @yujuhong 
/hold

Hold until the end of the freeze.
2018-03-29 11:46:19 -07:00
Kubernetes Submit Queue
4685df26dd Merge pull request #60590 from immutableT/enc_config_automation
Automatic merge from submit-queue (batch tested with PRs 60420, 60590). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Enable AESGCM encryption of secrets in etcd by default.

**What this PR does / why we need it**:
Enable encryption of secrets in etcd via AESGCM transform (as described here https://kubernetes.io/docs/tasks/administer-cluster/encrypt-data/) during kube-up.sh build of a cluster.
**Which issue(s) this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged)*:
Fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
NONE
```
2018-03-28 23:53:06 -07:00
Kubernetes Submit Queue
789a27be9d Merge pull request #60166 from rramkumar1/hairpin-mode-default-change
Automatic merge from submit-queue (batch tested with PRs 60166, 61706, 61769). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Change HAIRPIN_MODE to hairpin-veth as default

**What this PR does / why we need it**:
Change the default HAIRPIN_MODE back to "hairpin-veth".

It was previously "promiscuous-bridge" in order to workaround a kernel bug which deadlocked the machine when hairpin-veth was used. (#27498)

After some thorough manual testing on ubuntu clusters, we feel confident now that the kernel bug is fixed so we should switch back to using hairpin-veth. This will allow us to clean up some ebtables rules that were put in place to make "promiscuous-bridge" work properly.

Once this change goes in, we need to carefully monitor our e2e tests to make sure the bug has not resurfaced.

**Release note**:
```release-note
In a GCE cluster, the default HAIRPIN_MODE is now "hairpin-veth".
```

/cc @freehan @prameshj 
/assign @roberthbailey
2018-03-28 10:46:04 -07:00
Filipe Brandenburger
366ad30ab6 Remove references to rkt from shell scripts in cluster/ and hack/. 2018-03-27 09:29:35 -07:00
Kubernetes Submit Queue
839fe27dc8 Merge pull request #61567 from davidz627/fix/localSSDPermissions
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Fixes permissions error for Local SSD when created with NODE_LOCAL_SSDS flag

**What this PR does / why we need it**:
The PR fixes a permissions error introduced in 1.9 whereby users are unable to write to their Local SSD if it is created with the `NODE_LOCAL_SSDS` flag.

This will need to be cherrypicked to 1.9 and 1.10.

/sig storage
/kind bug
/assign @msau42 

```release-note
NONE
```
2018-03-26 05:40:18 -07:00
Kubernetes Submit Queue
566e3445c4 Merge pull request #61561 from mwielgus/ca-1.2.0-head
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Cluster Autoscaler 1.2.0

```release-note
Cluster Autoscaler 1.2.0 - release notes available here: https://github.com/kubernetes/autoscaler/releases
```
2018-03-25 23:27:44 -07:00
Kubernetes Submit Queue
620f9a3a5a Merge pull request #61475 from anjensan/e2e-hamaster
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Fix e2e test lifecycle/ha_master.go

Minor fixes for gce-ha-master e2e test
https://k8s-testgrid.appspot.com/google-gce#gce-ha-master

```release-note
NONE
```
2018-03-25 08:54:33 -07:00
Kubernetes Submit Queue
4cc30e8c0c Merge pull request #61529 from wojtek-t/remove_useless_param
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Remove max-pods param from config-test.sh

This is set to default value anyway.
2018-03-25 05:20:09 -07:00
Kubernetes Submit Queue
e3af2374a6 Merge pull request #60801 from jingax10/gce_util_branch
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Suppress error message from grep when checking whether a subnet has a secondary range or not.

**What this PR does / why we need it**:

Get rid of stdrr caused by grep command when running cluster/kube-up.sh for GCE.

**Which issue(s) this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged)*:
Fixes #

N/A

**Special notes for your reviewer**:

No behavior change.

**Release note**:

```release-note
"NONE"
```
2018-03-25 02:40:33 -07:00
Kubernetes Submit Queue
053a12aee9 Merge pull request #60107 from wangzhen127/cos-audit-placeholder
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Update GCP fluentd configmap for COS audit logging on GKE node

**What this PR does / why we need it**:
This PR adds a placeholder in fluentd configmap for COS audit logging on GKE node.

**Which issue(s) this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged)*:
Fixes #

**Special notes for your reviewer**:
NONE

**Release note**:

```release-note
NONE
```
2018-03-25 00:51:52 -07:00
Ismo Puustinen
6372bb2f28 cluster/gce: fix checks for empty strings.
In order to use -n, the value needs either be quoted or [[ .. ]] block
has to be used. Fix the comparisons that way.

To verify, consider this (analogous) script:

  #!/bin/bash

  subnetwork_url=""

  if [ -n ${subnetwork_url} ]; then
    echo "foo"
  fi

  if [[ -n ${subnetwork_url} ]]; then
    echo "bar"
  fi

Here "foo" is echoed by the script, even though the variable
subnetwork_url has a zero-length value.
2018-03-23 15:37:03 +02:00
Ismo Puustinen
2226b1de09 cluster/gce: fix shell return value comparison.
In shell scripts inside [[ .. ]] blocks, ">" is a string comparison operator.
The "attempt" number comparison works (most likely by accident) because the max
number of attempts is below 10.  Change to -gt operator.
2018-03-23 15:37:03 +02:00
David Zhu
d3670856dd Added chmod a+x for local SSD when disk is created with NODE_LOCAL_SSDS 2018-03-22 16:03:03 -07:00
Kubernetes Submit Queue
4dd4cbe8e3 Merge pull request #60188 from jingax10/gce_owner_branch
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Add jingax10 as both reviewer and approver in cluster/gce.

**What this PR does / why we need it**:

Want to become both reviewer and owner for cluster/gce.

**Which issue(s) this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged)*:
Fixes #

N/A

**Special notes for your reviewer**:

**Release note**:

```release-note

"NONE"
```
2018-03-22 15:25:23 -07:00
immutablet
d08799ca09 Enable AESGCM encryption of secrets in etcd by default. 2018-03-22 13:51:09 -07:00
Marcin Wielgus
1d6725cad5 Cluster Autoscaler 1.2.0 2018-03-22 21:31:25 +01:00
Kubernetes Submit Queue
e81965d456 Merge pull request #61065 from freehan/fix-gcloud-dev
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

fix validation for dev gcloud

```release-note
NONE
```
2018-03-22 13:15:12 -07:00
Zhen Wang
d5c2cdcbbb Update GCP fluentd configmap for GKE node journal logging 2018-03-22 12:04:11 -07:00
Andrei Zhlobich
426c0dc1bc Fix master replication util for gce clusters - populate cluster-location.txt 2018-03-22 11:57:06 +01:00
wojtekt
85df34eaa1 Remove max-pods param from config-test.sh 2018-03-22 09:46:29 +01:00
Kubernetes Submit Queue
0c4ed53f7d Merge pull request #61359 from aleksandra-malinowska/cluster-autoscaler-1.2.0-beta1
Automatic merge from submit-queue (batch tested with PRs 60373, 61098, 61352, 61359, 61362). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Update Cluster Autoscaler version to 1.2.0-beta1

Update Cluster Autoscaler version to 1.2.0-beta1

```release-note:
NONE
```
2018-03-21 22:39:18 -07:00
Kubernetes Submit Queue
130caab7d5 Merge pull request #61235 from yguo0905/client-2
Automatic merge from submit-queue (batch tested with PRs 61124, 59537, 61235, 61258, 61114). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Support new NODE_OS_DISTRIBUTION 'custom' on GCE

**What this PR does / why we need it**:

This PR allows us to run e2e tests against arbitrary OS images on GCE.

It will be cherry picked into 1.8, 1.9 and 1.10.

**Release note**:

```
Support new NODE_OS_DISTRIBUTION 'custom' on GCE.
```

/assign @dashpole
2018-03-21 08:39:23 -07:00
Kubernetes Submit Queue
dcbfdfc279 Merge pull request #61302 from rramkumar1/glbc-manifest-patch
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Patch glbc manifest to use version 1.0.0. Also add rate limiting flags

Will also add a release note to the 1.10 google doc as well.

Fixes: #61305

/assign @bowei 
/cc @nicksardo 

Release Note:
```release-note
Bump ingress-gce image in glbc.manifest to 1.0.0
```
2018-03-19 11:18:43 -07:00
Aleksandra Malinowska
76c6210076 Update Cluster Autoscaler version to 1.2.0-beta1 2018-03-19 17:09:15 +01:00
Jing Ai
384868e570 Suppress error message from grep by removing in the end as it is wrongly interpreted as a file. 2018-03-16 18:12:39 -07:00
Kubernetes Submit Queue
c6d77ee656 Merge pull request #61119 from mtaufen/fix-cluster-autoscaler
Automatic merge from submit-queue (batch tested with PRs 61284, 61119, 61201). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Add AUTOSCALER_ENV_VARS to kube-env to hotfix cluster autoscaler

This provides a temporary way for the cluster autoscaler to get at
values that were removed from kube-env in #60020. Ideally this
information will eventually be available via e.g. the Cluster API,
because kube-env is an internal interface that carries no stability
guarantees.

This is the first half of the fix; the other half is that cluster autoscaler
needs to be modified to read from AUTOSCALER_ENV_VARS, if it is
available.

Since cluster autoscaler was also reading KUBELET_TEST_ARGS for the
kube-reserved flag, and we don't want to resurrect KUBELET_TEST_ARGS in kube-env,
we opted to create AUTOSCALER_ENV_VARS instead of just adding back
the old env vars. This also makes it clear that we have an ugly dependency
on kube-env.

```release-note
NONE
```
2018-03-16 16:56:00 -07:00
Rohit Ramkumar
31611a3e4b Patch glbc manifest to use version 1.0.0. Also add rate limiting flags 2018-03-16 15:05:17 -07:00
Michael Taufen
8cf3dc103e Add AUTOSCALER_ENV_VARS to kube-env to hotfix cluster autoscaler
This provides a temporary way for the cluster autoscaler to get at
values that were removed from kube-env in #60020. Ideally this
information will eventually be available via e.g. the Cluster API,
because kube-env is an internal interface that carries no stability
guarantees.
2018-03-16 11:43:41 -07:00
Yang Guo
518c6c1a37 Support new NODE_OS_DISTRIBUTION 'custom' on GCE 2018-03-15 14:05:15 -07:00
Ryan Hitchman
68f5d44865 Fix deprecated gcloud compute networks --mode switches.
"create --mode" becomes "create --subnet-mode", and switch-mode has been
folded into "update".

Create --mode was deprecated in October and will be removed in the next
gcloud release. It is already failing in staging tests.
2018-03-14 15:00:59 -07:00
Kubernetes Submit Queue
b651ed5ea7 Merge pull request #60998 from jpbetz/etcd-3.1.12
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Bump to etcd 3.1.12 to pick up critical fix

etcd [3.1.12](https://github.com/coreos/etcd/releases/tag/v3.1.12) (as well as 3.2.17 and 3.3.2) was released yesterday to fix a bug critical to kubernetes:

Fix [mvcc "unsynced" watcher restore operation](https://github.com/coreos/etcd/pull/9297).
- "unsynced" watcher is watcher that needs to be in sync with events that have happened.
- That is, "unsynced" watcher is the slow watcher that was requested on old revision.
- "unsynced" watcher restore operation was not correctly populating its underlying watcher group.
- Which possibly causes [missing events from "unsynced" watchers](https://github.com/coreos/etcd/issues/9086).

This will be backported to 1.9 as well.

Release note:
```release-note
Upgrade the default etcd server version to 3.1.12 to pick up critical etcd "mvcc "unsynced" watcher restore operation" fix.
```

cc @gyuho @wojtek-t @shyamjvs @timothysc @jdumars
2018-03-13 09:11:10 -07:00
Minhan Xia
ec77fe97ec fix validation for dev gcloud 2018-03-12 14:10:35 -07:00
Kubernetes Submit Queue
31b4719066 Merge pull request #60859 from verult/remount-kube-env
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Setting REMOUNT_VOLUME_PLUGIN_DIR for COS images in kube-env

**Which issue(s) this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged)*:
Fixes #60725

**Special notes for your reviewer**: Not sure if it's the best place to set `REMOUNT_VOLUME_PLUGIN_DIR`.

/sig storage
/sig cluster-lifecycle
2018-03-12 10:54:31 -07:00
Kubernetes Submit Queue
c6ae1e51ad Merge pull request #60926 from crassirostris/audit-log-gce-config
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Make log audit backend configurable in GCE

This PR will allow to enable audit logging batching by default in e2e tests, after https://github.com/kubernetes/kubernetes/pull/60739 is merged. This is an important step to prevent a regression in scale tests.

/cc @tallclair @sttts 

/assign @roberthbailey 

Robert, please approve

```release-note
NONE
```
2018-03-09 20:00:17 -08:00
Joe Betz
e2a25f9b54 Bump to etcd 3.1.12 to pick up critical fix 2018-03-09 14:28:23 -08:00
Mik Vyatskov
07905d6ee8 Make log audit backend configurable in GCE
Signed-off-by: Mik Vyatskov <vmik@google.com>
2018-03-08 14:09:32 +01:00
Shyam Jeedigunta
21f5e69f08 Rollback etcd server version to 3.1.11 due to #60589 2018-03-08 13:07:15 +01:00
Kubernetes Submit Queue
891b471064 Merge pull request #60808 from jpbetz/etcd-migrate-tls
Automatic merge from submit-queue (batch tested with PRs 60872, 60808). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Pass in etcd TLS credentials during migrate and rollback

For HA etcd clusters, the `etcd-creds` settings, specifically the `--peer-*` TLS flags that may be included, must be passed in to `start-stop-etcd.sh` in order for the etcd server mode started during migrate and rollback steps to be able to communicate between peers.

The `etcd.manifest` change needs to be backported all the way back to 1.7.

**Release note**:
```release-note
NONE
```
2018-03-07 00:12:49 -08:00
Cheng Xing
16ecc14017 Setting REMOUNT_VOLUME_PLUGIN_DIR for COS images in kube-env 2018-03-06 14:22:41 -08:00
Kubernetes Submit Queue
9aae9b58a5 Merge pull request #59836 from jpbetz/etcd-3.2.16-patch-upgrade
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Bump etcd server patch version to 3.2.16

etcd 3.2.16 contains a critical fix for HA clusters: https://github.com/coreos/etcd/pull/9281

Also, update newly added tests to use `REGISTRY` make variable.

Release note:
```release-note
Upgrade the default etcd server version to 3.2.16
```
2018-03-06 10:00:53 -08:00
Marcin Wielgus
16f050bd99 Bump Cluster Autoscaler to 1.1.2 2018-03-06 12:34:51 +01:00
Joe Betz
279ce92e59 Pass in etcd TLS credentials during migrate and rollback 2018-03-05 14:02:58 -08:00
Joe Betz
04c6d0ab26 Bump etcd server patch version to 3.2.16 2018-03-05 13:58:51 -08:00
Jing Ai
977252d4b2 Suppress error message from grep when checking whether a subnet has a secondary range or not. 2018-03-05 09:54:11 -08:00
Kubernetes Submit Queue
cca92f67aa Merge pull request #60639 from konryd/dashboardmesg
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Added dashboard banner passthrough to GCE kube-up.

**What this PR does / why we need it**: This PR adds GCE-specific provider flag for setting system-banner in the dashboard ui.

```release-note
NONE
```
2018-03-05 07:52:42 -08:00