6110 Commits

Author SHA1 Message Date
Kubernetes Submit Queue
0a981d4921 Merge pull request #50391 from pipejakob/get-kube-breakglass
Automatic merge from submit-queue (batch tested with PRs 49725, 50367, 50391, 48857, 50181)

New get-kube.sh option: KUBERNETES_SKIP_RELEASE_VALIDATION

**What this PR does / why we need it**:
This is an alternative solution to https://github.com/kubernetes/kubernetes/pull/49884. The goal is to be able to pull releases that were built by bazel jobs (both presubmit and postsubmit builds), which currently fail our regex validation against the version string.

This implementation is a simple "I know what I'm doing" breakglass option to turn regex validation off, whereas https://github.com/kubernetes/kubernetes/pull/49884 was to extend our validation to support the new formats of bazel build jobs. I'm testing the waters to see if this is a more palatable solution.
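
The breakglass behaviour described above can be sketched as follows. This is a minimal Python illustration, not the actual `get-kube.sh` code: the regex `RELEASE_PATTERN`, the function name `validate_release`, and the "any non-empty value skips validation" semantics are assumptions made for the example. Only the `KUBERNETES_SKIP_RELEASE_VALIDATION` variable name comes from this PR.

```python
import re

# Hypothetical pattern for illustration only; the real script's regex may differ.
RELEASE_PATTERN = re.compile(r"^v\d+\.\d+\.\d+(-(alpha|beta|rc)\.\d+)?$")


def validate_release(version, env):
    """Return True if `version` is acceptable.

    `env` is a mapping such as os.environ. When the breakglass variable is
    set to a non-empty value (assumed semantics), regex validation is
    skipped entirely.
    """
    if env.get("KUBERNETES_SKIP_RELEASE_VALIDATION"):
        return True
    return RELEASE_PATTERN.match(version) is not None
```

With this shape, a version string that the pattern rejects is still accepted once the caller opts out explicitly, which is the "I know what I'm doing" trade-off the PR describes.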

**Release note**:

```release-note
New get-kube.sh option: KUBERNETES_SKIP_RELEASE_VALIDATION
```

CC @BenTheElder @fejta @ixdy
2017-08-10 01:47:17 -07:00
Kubernetes Submit Queue
4e1d1dedab Merge pull request #50372 from x13n/fluentd
Automatic merge from submit-queue (batch tested with PRs 50300, 50328, 50368, 50370, 50372)

Bugfix: set resources only for fluentd-gcp container.

There is more than one container in fluentd-gcp deployment. Previous
implementation was setting resources for all containers, not just
the fluentd-gcp one.

**What this PR does / why we need it**:

Bugfix: without this change, https://github.com/kubernetes/kubernetes/pull/49009 consumes more resources than intended.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #50366

**Special notes for your reviewer**:

**Release note**:
```release-note
NONE
```
2017-08-09 22:58:30 -07:00
Kubernetes Submit Queue
c653437377 Merge pull request #50338 from zmerlynn/die-gke-die
Automatic merge from submit-queue

GKE deployment: Kill cluster/gke

kubernetes/test-infra#3983 migrated the remaining GKE jobs using the bash deployment (cluster/gke). All jobs are now on the gke `deployer` in `kubetest`.

Fixes kubernetes/test-infra#3307

```release-note
`cluster/gke` has been removed. GKE end-to-end testing should be done using `kubetest --deployment=gke`
```
2017-08-09 17:14:26 -07:00
Jacob Beacham
e77830c71d New get-kube.sh option: KUBERNETES_SKIP_RELEASE_VALIDATION 2017-08-09 11:42:24 -07:00
Daniel Kłobuszewski
b7935b9699 Bugfix: set resources only for fluentd-gcp container.
There is more than one container in fluentd-gcp deployment. Previous
implementation was setting resources for multiple containers, not just
the fluentd-gcp one.
2017-08-09 15:44:06 +02:00
Kubernetes Submit Queue
3655685d64 Merge pull request #50330 from mwielgus/default-pricing
Automatic merge from submit-queue

Ensure that pricing expander is used by default in Cluster Autoscaler

The pricing expander was set as the default for GCP; however, on some occasions the AUTOSCALER_EXPANDER_CONFIG variable could be left unset, resulting in the random expander being used.
2017-08-09 04:30:09 -07:00
Kubernetes Submit Queue
65574809cd Merge pull request #50333 from shyamjvs/add-debug-to-logexporter
Automatic merge from submit-queue (batch tested with PRs 50173, 50324, 50288, 50263, 50333)

Add debug logs to log-dump

ref https://github.com/kubernetes/test-infra/issues/3981

cc @fejta @krzyzacy
2017-08-09 02:22:00 -07:00
Kubernetes Submit Queue
5793be779b Merge pull request #50290 from piosz/mon-heapster
Automatic merge from submit-queue

Added monitoring sidecar for Heapster
2017-08-08 16:31:06 -07:00
Zach Loafman
0f12159ccd GKE deployment: Kill cluster/gke
kubernetes/test-infra#3983 migrated the remaining GKE jobs using the
bash deployment (cluster/gke).

Fixes kubernetes/test-infra#3307
2017-08-08 13:47:16 -07:00
Shyam Jeedigunta
7456716120 Add debug logs to log-dump 2017-08-08 21:43:09 +02:00
Marcin Wielgus
d0bedba9e1 Ensure that pricing expander is used by default in Cluster Autoscaler 2017-08-08 20:52:07 +02:00
Shyam Jeedigunta
2eafc562fa Block on master-creation step for large clusters (>50 nodes) in kube-up 2017-08-08 14:09:15 +02:00
Piotr Szczesniak
bad549d725 Added monitoring sidecar for Heapster 2017-08-08 13:15:43 +02:00
Kubernetes Submit Queue
e2baa8e11c Merge pull request #49855 from zouyee/kiq
Automatic merge from submit-queue (batch tested with PRs 49855, 49915)

update dashboard image version

fix https://github.com/kubernetes/kubernetes/issues/50195
**What this PR does / why we need it:**
Updates dashboard addon to latest version. Changelog can be found [here](https://github.com/kubernetes/dashboard/releases/tag/v1.6.3).

**Release note:**
```
Update dashboard version to v1.6.3
```
2017-08-05 19:07:50 -07:00
Kubernetes Submit Queue
70b4db2023 Merge pull request #50082 from crassirostris/cleanup-fluentd-es
Automatic merge from submit-queue (batch tested with PRs 48532, 50054, 50082)

Refactored the fluentd-es addon

Refactor fluentd-elasticsearch addon:

- Decrease the number of files by moving RBAC-related objects in the same files where they're used
- Move the fluentd configuration out of the image
- Don't use PVC to avoid leaking resources in e2e tests
- Fluentd now ingests docker and kubelet logs that are written to journald
- Disable X-Pack, because it's not free

Fixes https://github.com/kubernetes/kubernetes/issues/41462
Fixes https://github.com/kubernetes/kubernetes/issues/49816
Fixes https://github.com/kubernetes/kubernetes/issues/48973
Fixes https://github.com/kubernetes/kubernetes/issues/49450

@aknuds1 @coffeepac Could you please take a look?

```release-note
Fluentd DaemonSet in the fluentd-elasticsearch addon is configured via ConfigMap and includes journald plugin
Elasticsearch StatefulSet in the fluentd-elasticsearch addon uses local storage instead of PVC by default
```
2017-08-05 05:14:03 -07:00
Kubernetes Submit Queue
361079af43 Merge pull request #49009 from x13n/fluentd
Automatic merge from submit-queue (batch tested with PRs 48487, 49009, 49862, 49843, 49700)

Enable overriding fluentd resources in GCP

**What this PR does / why we need it**: This enables overriding fluentd resources in GCP, when there is a need for custom ones.

**Release note**:

```release-note
NONE
```
2017-08-04 23:40:02 -07:00
Kubernetes Submit Queue
b070227e89 Merge pull request #48366 from wongma7/flex-e2e
Automatic merge from submit-queue (batch tested with PRs 50119, 48366, 47181, 41611, 49547)

Add basic install and mount flexvolumes e2e tests

fixes https://github.com/kubernetes/kubernetes/issues/47010

These two tests install a skeleton "dummy" flex driver, attachable and non-attachable respectively, then test that a pod can successfully use the flex driver. They are labeled disruptive because kubelet and controller-manager get restarted as part of the flex install. IMO it's important to keep this install procedure as part of the test to isolate any bugs with the startup plugin probe code.

There is a bit of an ugly dependency on cluster/gce/config-test.sh because --flex-volume-plugin-dir must be set to a dir that's readable from controller-manager container and writable by the flex e2e test. The default path is not writable on GCE masters with read-only root so I picked a location that looks okay.

In the "dummy" drivers I trick kubelet into thinking there is a mount point by doing "mount -t tmpfs none ${MNTPATH} >/dev/null 2>&1", hope that is okay.

I have only tested on GCE and theoretically they may work on AWS but I don't think there is a need to test on multiple cloudproviders.

```release-note
NONE
```
2017-08-04 14:29:34 -07:00
Mik Vyatskov
46f53daef0 Refactored the fluentd-es addon files, moved the fluentd configuration to ConfigMap 2017-08-04 20:05:15 +02:00
Kubernetes Submit Queue
4a8d041c4c Merge pull request #50088 from wojtek-t/ha_etcd_migration
Automatic merge from submit-queue

Fix etcd migration for HA clusters

@jpbetz 

Fix #43600
2017-08-04 09:44:14 -07:00
Matthew Wong
55a417e84f Add basic install and mount flexvolumes e2e tests 2017-08-04 10:39:24 -04:00
Kubernetes Submit Queue
68a722155b Merge pull request #50096 from nicksardo/bump-glbc-0.9.6
Automatic merge from submit-queue (batch tested with PRs 46685, 49863, 50098, 50070, 50096)

GCE: Bump GLBC version to 0.9.6

Closes #50095

**Release note**:
```release-note
GCE: Bump GLBC version to 0.9.6
```
2017-08-04 00:21:00 -07:00
Kubernetes Submit Queue
d15baf69e1 Merge pull request #48969 from ixdy/update-kazel
Automatic merge from submit-queue (batch tested with PRs 50103, 49677, 49449, 43586, 48969)

Run kazel on the entire tree

**What this PR does / why we need it**: part of #47558: auto-generate `BUILD` files on the entire tree, since this is what `gazelle` does, and it'll make subsequent reviews easier if less is changing.

**Release note**:

```release-note
NONE
```
/assign
/release-note-none
2017-08-03 16:43:41 -07:00
Nick Sardo
97114d7343 Bump GLBC version to 0.9.6 2017-08-03 08:39:12 -07:00
Wojciech Tyczynski
e050e7a867 Fix etcd migration for HA clusters 2017-08-03 14:43:52 +02:00
Daniel Kłobuszewski
b48b1b80bc Enable overriding fluentd resources in GCP 2017-08-03 11:58:43 +02:00
Kubernetes Submit Queue
ae0ca36695 Merge pull request #48722 from aknuds1/upgrade-fluentd-elasticsearch
Automatic merge from submit-queue (batch tested with PRs 48365, 49902, 49808, 48722, 47045)

Upgrade fluentd-elasticsearch addon to Elasticsearch/Kibana 5.5

This is a patch to upgrade the fluentd-elasticsearch addon to Elasticsearch/Kibana 5.5. Please provide feedback!

```release-notes
* Upgrade Elasticsearch/Kibana to 5.5.1 in fluentd-elasticsearch addon
* Switch to basing our image of Elasticsearch in fluentd-elasticsearch addon off the official one
* Switch to the official image of Kibana in fluentd-elasticsearch addon
* Use StatefulSet for Elasticsearch instead of ReplicationController, with persistent volume claims
* Require authenticating towards Elasticsearch, as Elasticsearch 5.5 by default requires basic authentication
```
2017-08-02 22:32:59 -07:00
Kubernetes Submit Queue
efe3951ac0 Merge pull request #48365 from ixdy/hyperkube-base-image
Automatic merge from submit-queue (batch tested with PRs 48365, 49902, 49808, 48722, 47045)

Rebase hyperkube image on debian-hyperkube-base, based on debian-base.

**What this PR does / why we need it**: saves all of the hyperkube image dependencies in a cacheable base image, rather than downloading them for every build (which is slow and flaky).

This way, at build time, we only need to pull down the hyperkube base image and add the hyperkube binary.

I've additionally based the base image on `debian-base` instead of `debian`, though we amusingly end up reinstalling a bunch of the things we removed in `debian-base`.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #35058, at least partially

**Special notes for your reviewer**: I'm increasingly convinced that the hyperkube image is a bad pattern, as this image carries the superset of dependencies anyone might need, rather than the limited set of dependencies one needs. hyperkube really needs a proper owner.

**Release note**:

```release-note
```

/assign @timstclair @luxas @philips @nikhiljindal 
cc @kubernetes/sig-release-pr-reviews
2017-08-02 22:32:49 -07:00
Jeff Grafton
3afeb7cd3c Fix BUILD files 2017-08-02 18:33:32 -07:00
Jeff Grafton
3579017b86 Run hack/update-bazel.sh to generate BUILD files 2017-08-02 18:33:25 -07:00
Kubernetes Submit Queue
968ebc6603 Merge pull request #49649 from shiliangxue/master
Automatic merge from submit-queue (batch tested with PRs 49989, 49806, 49649, 49412, 49512)

Use existing k8s binaries and images on disk when they are preloaded to gce cos image.

**What this PR does / why we need it**:
This change is to accelerate K8S startup time on gce when k8s tarballs and images are already preloaded in VM image, by skipping the downloading, extracting and file transfer steps.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
NONE
```
2017-08-02 17:06:00 -07:00
Kubernetes Submit Queue
4c9e880627 Merge pull request #49866 from mmaquevice/patch-1
Automatic merge from submit-queue (batch tested with PRs 50029, 48517, 49739, 49866, 49782)

fix spelling

**What this PR does / why we need it**:

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
```
2017-08-02 12:46:55 -07:00
Arve Knudsen
0ed0f02c76 Upgrade Elasticsearch/Kibana to 5.5.1 and use official Kibana image 2017-08-02 19:40:19 +02:00
Kubernetes Submit Queue
a2f2a220c5 Merge pull request #49953 from gmarek/logrotate
Automatic merge from submit-queue

Allow configuration of logrotate in GCE

Fix #49941
2017-08-02 07:53:15 -07:00
Kubernetes Submit Queue
e9617b694e Merge pull request #48429 from shyamjvs/parallel-upgrade
Automatic merge from submit-queue

Add parallelism to GCE cluster upgrade

Fixes https://github.com/kubernetes/kubernetes/issues/48373
Should allow upgrading a 500-node cluster (1.6->1.7) in < 1 hr. It currently takes ~1.5 days.
However, it is the upgrader's responsibility to choose a parallelism that avoids disrupting too many pods at once.
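
Bounded parallelism of this kind could look like the following Python sketch. It is not the upgrade script's implementation: `upgrade_all`, `upgrade_node`, and the `parallelism` parameter are hypothetical names, and the per-node work is a stand-in callable.

```python
from concurrent.futures import ThreadPoolExecutor


def upgrade_all(nodes, upgrade_node, parallelism=1):
    """Run `upgrade_node` on every node, at most `parallelism` at a time.

    Results are returned in the same order as `nodes`. parallelism=1
    reproduces a fully sequential upgrade.
    """
    if parallelism < 1:
        raise ValueError("parallelism must be at least 1")
    with ThreadPoolExecutor(max_workers=parallelism) as pool:
        return list(pool.map(upgrade_node, nodes))
```

The cap on workers is what bounds how many nodes are out of service at the same moment, so the caller picks it based on how much pod disruption is tolerable.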

/cc @kubernetes/sig-cluster-lifecycle-pr-reviews @kubernetes/sig-scalability-misc @mikedanese @gmarek
2017-08-02 01:26:57 -07:00
Kubernetes Submit Queue
0a5ac4e787 Merge pull request #49940 from shyamjvs/logdump-rocks
Automatic merge from submit-queue

Don't stop log-dumping if logexporter fails

Just noticed.

Ref https://github.com/kubernetes/kubernetes/issues/48513
2017-08-02 00:32:28 -07:00
Kubernetes Submit Queue
a422196142 Merge pull request #49888 from MrHohn/addon-manager-remove-old-codes
Automatic merge from submit-queue

[addon-manager] Remove unneeded annotation codes

**What this PR does / why we need it**:
Clean up the addon-manager code to make it less confusing. The annotation logic is only needed for the 1.4->1.5 upgrade.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
NONE
```
2017-08-01 20:48:45 -07:00
Kubernetes Submit Queue
49955b1594 Merge pull request #49897 from mikedanese/atomic-append
Automatic merge from submit-queue (batch tested with PRs 49898, 49897, 49919, 48860, 49491)

gce: make append_or_replace.. atomic

Before this change,
* the final echo is not atomically written to the target file
* two concurrent callers will use the same tempfile
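
A common way to address both points is to write to a uniquely named temporary file in the target's directory and then rename it over the target. The sketch below shows that pattern in Python. It is not the script's actual implementation, and the function name `append_or_replace_line` and its `prefix`/`suffix` parameters are hypothetical.

```python
import os
import tempfile


def append_or_replace_line(path, prefix, suffix):
    """Drop any lines starting with `prefix`, append `prefix + suffix`,
    and swap the result into place with a single rename."""
    directory = os.path.dirname(os.path.abspath(path))
    try:
        with open(path) as f:
            kept = [line for line in f.read().splitlines()
                    if not line.startswith(prefix)]
    except FileNotFoundError:
        kept = []
    # mkstemp gives each caller its own temp file, created in the same
    # directory so the final rename stays on one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "w") as tmp:
            tmp.write("\n".join(kept + [prefix + suffix]) + "\n")
        os.replace(tmp_path, path)  # atomic rename on POSIX
    except BaseException:
        os.unlink(tmp_path)
        raise
```

Readers of the target file see either the old contents or the new contents, never a partial write. Note that `mkstemp` creates the file with restrictive permissions, so a real implementation may need to restore the original file mode.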

Helps with https://github.com/kubernetes/kubernetes/issues/49895

cc @miekg
2017-08-01 17:57:14 -07:00
Kubernetes Submit Queue
d79c8fe4ef Merge pull request #49898 from mikedanese/clober-tokens
Automatic merge from submit-queue (batch tested with PRs 49898, 49897, 49919, 48860, 49491)

gce: extend CLOBBER_CONFIG to support known_tokens.csv

Helps with #49895
2017-08-01 17:57:12 -07:00
Shyam Jeedigunta
73b419447f Don't stop log-dumping if logexporter fails 2017-08-01 17:39:50 +02:00
gmarek
54f43f5f56 Allow configuration of logrotate in GCE 2017-08-01 17:07:44 +02:00
shiliang
f561a299ac refactor function is-preloaded in configure.sh 2017-07-31 15:57:02 -07:00
Mike Danese
a120906756 gce: extend CLOBBER_CONFIG to support known_tokens.csv 2017-07-31 13:39:18 -07:00
Kubernetes Submit Queue
aba3339f85 Merge pull request #49823 from shyamjvs/optimize-logdump
Automatic merge from submit-queue

Reduce kubectl calls from O(#nodes) to O(1) in cluster logdump

Ref https://github.com/kubernetes/kubernetes/issues/48513

Each node's logexporter is made to write a file to a GCS directory on success (https://github.com/kubernetes/test-infra/pull/3782).
We now use that directory as a registry of successful nodes and get it through a single "gsutil ls" call. This:
- reduces the current waiting time for logexporter in a 5k-node cluster from >1hr to <10s.
- eliminates the dependency on `kubectl logs` calls, which seem to be unreliable sometimes (e.g. when kubelet (or apiserver) is down)
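
The registry idea can be sketched as a set difference between the expected nodes and the marker files returned by one listing call. This Python sketch is illustrative only: the marker naming (one object per node, named after the node), the example bucket path, and the function `nodes_missing_logs` are assumptions, and the listing is passed in as a plain list rather than obtained from `gsutil`.

```python
def nodes_missing_logs(expected_nodes, registry_listing):
    """Return the sorted nodes that have no success marker in the registry.

    `registry_listing` is the result of a single directory listing: a list
    of object paths whose final path component is assumed to be the node
    name.
    """
    succeeded = {path.rstrip("/").rsplit("/", 1)[-1] for path in registry_listing}
    return sorted(set(expected_nodes) - succeeded)
```

For example, with markers for `node-1` and `node-3` under a hypothetical `gs://bucket/registry/` prefix and three expected nodes, the function returns `["node-2"]`, and only those nodes would need a fallback log dump.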

/cc @kubernetes/sig-scalability-misc @wojtek-t @gmarek @fejta
2017-07-31 13:30:37 -07:00
Mike Danese
1c2feb02fa gce: make append_or_replace.. atomic
Before this change,
* the final echo is not atomically written to the target file
* two concurrent callers will use the same tempfile
2017-07-31 13:14:51 -07:00
Zihong Zheng
1ef7854c14 [addon-manager] Remove unneeded annotation codes 2017-07-31 11:19:23 -07:00
Shyam Jeedigunta
5b5d3b9410 Add parallelism to GCE cluster upgrade 2017-07-31 13:39:01 +02:00
Shyam Jeedigunta
80084f0621 Reduce kubectl calls from O(#nodes) to O(1) in cluster logdump 2017-07-31 13:20:53 +02:00
Matthieu Maquevice
93acc4c276 fix spelling 2017-07-31 10:59:29 +02:00
zouyee
a57e3eab1f update dashboard image version 2017-07-31 11:08:08 +08:00
Kubernetes Submit Queue
ba8b26d47a Merge pull request #49708 from mborsz/regional
Automatic merge from submit-queue (batch tested with PRs 49538, 49708, 47665, 49750, 49528)

Add support for GKE regional clusters in e2e tests.

**What this PR does / why we need it**:
Add support for GKE regional clusters in e2e tests.
**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:
```release-note
NONE
```
2017-07-29 05:28:51 -07:00