Commit Graph

6166 Commits

Author SHA1 Message Date
Shyam Jeedigunta
5b5d3b9410 Add parallelism to GCE cluster upgrade 2017-07-31 13:39:01 +02:00
Shyam Jeedigunta
80084f0621 Reduce kubectl calls from O(#nodes) to O(1) in cluster logdump 2017-07-31 13:20:53 +02:00
Matthieu Maquevice
93acc4c276 fix spelling 2017-07-31 10:59:29 +02:00
zouyee
a57e3eab1f update dashboard image version 2017-07-31 11:08:08 +08:00
Kubernetes Submit Queue
ba8b26d47a Merge pull request #49708 from mborsz/regional
Automatic merge from submit-queue (batch tested with PRs 49538, 49708, 47665, 49750, 49528)

Add a support for GKE regional clusters in e2e tests.

**What this PR does / why we need it**:
Add a support for GKE regional clusters in e2e tests.
**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:
```release-note
NONE
```
2017-07-29 05:28:51 -07:00
Shyam Jeedigunta
6c8ae3d5d7 Fix bug and add log statements to log-dump script 2017-07-28 22:33:30 +02:00
Kubernetes Submit Queue
a9a9241db2 Merge pull request #49717 from wwwtyro/rye/block-on-failed-services
Automatic merge from submit-queue (batch tested with PRs 49712, 49694, 49714, 49670, 49717)

set juju master charm state to blocked if the services appear to be failing

**What this PR does / why we need it**: set the juju master charm state to blocked if the services appear to be failing

**Release note**:
```release-note
set the juju master charm state to blocked if the services appear to be failing
```
2017-07-28 08:15:09 -07:00
Kubernetes Submit Queue
a2a6fe8217 Merge pull request #49714 from juju-solutions/bug/add-maintainers
Automatic merge from submit-queue (batch tested with PRs 49712, 49694, 49714, 49670, 49717)

Adding old Juju charm maintainers

**What this PR does / why we need it**: Update email addresses of past Juju charm maintainers 

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: 

**Special notes for your reviewer**:

**Release note**:

```NONE
```
2017-07-28 08:15:04 -07:00
Kubernetes Submit Queue
a2d2ddf719 Merge pull request #48533 from superbrothers/fix-dead-link
Automatic merge from submit-queue (batch tested with PRs 47738, 49196, 48907, 48533, 48822)

Fix a dead link in cluster/update-storage-objects.sh

**What this PR does / why we need it**: This PR fixes a dead link in cluster/update-storage-objects.sh.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
NONE
```
2017-07-28 03:10:35 -07:00
Maciej Borsz
7822b5d9fd Add a support for GKE regional clusters in e2e tests. 2017-07-28 11:42:54 +02:00
Bobby (Babak) Salamat
de8689c99b Add priority admission controller 2017-07-27 18:44:40 -07:00
Kubernetes Submit Queue
e4551d50e5 Merge pull request #49665 from shyamjvs/logdump-fix
Automatic merge from submit-queue

Fix log-dump script wrt logexporter
2017-07-27 18:02:18 -07:00
Rye Terrell
809a0abb31 set k8s master charm state to blocked if the services appear to be failing 2017-07-27 08:13:15 -05:00
Konstantinos Tsakalozos
86a036feb1 Adding old juju maintainers 2017-07-27 15:42:36 +03:00
Shyam Jeedigunta
3ffc9ca34b Fix log-dump script wrt logexporter 2017-07-27 00:09:59 +02:00
Casey Davenport
a0bf3f7f90 Add OWNERS file for Calico add-on 2017-07-26 14:05:56 -07:00
Kubernetes Submit Queue
2c624e590f Merge pull request #49558 from juju-solutions/bug/maintainers
Automatic merge from submit-queue (batch tested with PRs 48360, 48469, 49576, 49516, 49558)

Update maintainers for Juju charm layers

**What this PR does / why we need it**: Update maintainers of harm layers to reflect ... reality

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: 

**Special notes for your reviewer**:

**Release note**:

```NONE
```
2017-07-26 13:02:46 -07:00
Kubernetes Submit Queue
4a65675840 Merge pull request #48469 from fasaxc/rev-typha
Automatic merge from submit-queue (batch tested with PRs 48360, 48469, 49576, 49516, 49558)

Rev Calico's Typha daemon to v0.2.3 in add-on deployment.

**What this PR does / why we need it**:

This PR revs the version of Calico's Typha daemon used in the calico-policy-controller add-on to the latest bug-fix release, which incorporates a [critical bug fix](https://github.com/projectcalico/typha/issues/28).

**Which issue this PR fixes**

fixes #49473

**Release note**:

```release-note
Rev version of Calico's Typha daemon used in add-on to v0.2.3 to pull in bug-fixes.
```
2017-07-26 13:02:37 -07:00
Kubernetes Submit Queue
762ef32acb Merge pull request #48360 from gunjan5/remove-noSnat
Automatic merge from submit-queue

Set snat to false

**What this PR does / why we need it**:
- the [version](e8bea554c5) of the portmap plugin included with calico CNI version `v1.9.1` doesn't have `noSnat` config option, it has `snat` which is not specified (which is the case without this PR), [will be set to true by default](https://github.com/containernetworking/plugins/tree/master/plugins/meta/portmap#usage) , so we need to explicitly set it to `false` 

CC @caseydavenport
2017-07-26 12:53:06 -07:00
shiliang
994275aa55 skip downloading and extracting tarballs and docker images when they are preloaded. 2017-07-25 15:40:03 -07:00
Jeff Grafton
66b9ae7cd8 Rebase hyperkube image on debian-hyperkube-base, based on debian-base. 2017-07-25 15:03:38 -07:00
Kubernetes Submit Queue
0d6d025c45 Merge pull request #45040 from aaronlevy/cephcommon
Automatic merge from submit-queue (batch tested with PRs 45040, 48960)

Add ceph-common to hyperkube image

**What this PR does / why we need it**:

Adds the ceph-common package to the hyperkube image
2017-07-25 14:40:41 -07:00
Kubernetes Submit Queue
d197ff0595 Merge pull request #49426 from shyamjvs/logdump-fix
Automatic merge from submit-queue (batch tested with PRs 48976, 49474, 40050, 49426, 49430)

Fix bug in cluster/log-dump

We're breaking in case KUBECTL is set as "./cluster/kubectl.sh --match-server-version". Moreover we always are using cluster/kubectl.sh as the default and don't want to do match-server-version for the purpose of logexporter.

Also adding owners file so I'm not blocked for approves while making fixes in log-dump. Besides I'll be able to review fixes sent by others.

/cc @wojtek-t
2017-07-25 12:14:46 -07:00
Kubernetes Submit Queue
2189314895 Merge pull request #40050 from mtaufen/standalone-mode
Automatic merge from submit-queue (batch tested with PRs 48976, 49474, 40050, 49426, 49430)

Use presence of kubeconfig file to toggle standalone mode

Fixes #40049 

```release-note
The deprecated --api-servers flag has been removed. Use --kubeconfig to provide API server connection information instead. The --require-kubeconfig flag is now deprecated. The default kubeconfig path is also deprecated. Both --require-kubeconfig and the default kubeconfig path will be removed in Kubernetes v1.10.0.
```

/cc @kubernetes/sig-cluster-lifecycle-misc @kubernetes/sig-node-misc
2017-07-25 12:14:43 -07:00
Shyam Jeedigunta
bf33e2ac7e Fix bug in cluster/log-dump and add OWNERS file 2017-07-25 16:11:24 +02:00
Konstantinos Tsakalozos
68b42c3b88 Update maintainers for Juju charm layers 2017-07-25 13:09:09 +03:00
Kubernetes Submit Queue
7f1d9382ec Merge pull request #48846 from dashpole/remove_ood
Automatic merge from submit-queue

Remove flags low-diskspace-threshold-mb and outofdisk-transition-frequency

issue: #48843

This removes two flags replaced by the eviction manager.  These have been depreciated for two releases, which I believe correctly follows the kubernetes depreciation guidelines.

```release-note
Remove depreciated flags: --low-diskspace-threshold-mb and --outofdisk-transition-frequency, which are replaced by --eviction-hard
```

cc @mtaufen since I am changing kubelet flags
cc @vishh @derekwaynecarr 
/sig node
2017-07-24 23:05:50 -07:00
Michael Taufen
38aee0464d Providing kubeconfig file is now the switch for standalone mode
Replaces use of --api-servers with --kubeconfig in Kubelet args across
the turnup scripts. In many cases this involves generating a kubeconfig
file for the Kubelet and placing it in the correct location on the node.
2017-07-24 11:03:00 -07:00
Wojciech Tyczynski
5740c7afbf Fix ha_master tests 2017-07-24 12:10:22 +02:00
Kubernetes Submit Queue
9712a80505 Merge pull request #49394 from wwwtyro/rye/rbd-stats-improvement
Automatic merge from submit-queue (batch tested with PRs 49326, 49394, 49346, 49379, 49399)

more robust stat handling from ceph df output in the kubernetes-master charm create-rbd-pv action

**What this PR does / why we need it**: more robust stat handling from ceph df output in the kubernetes-master charm create-rbd-pv action

**Release note**:
```release-note
more robust stat handling from ceph df output in the kubernetes-master charm create-rbd-pv action
```
2017-07-21 23:23:32 -07:00
Kubernetes Submit Queue
b9e549e7cf Merge pull request #49371 from juju-solutions/bug/pv-metric
Automatic merge from submit-queue (batch tested with PRs 49420, 49296, 49299, 49371, 46514)

Fix: PV metric is not namespaced

**What this PR does / why we need it**: The PV metric of juju deployments is not namespaced. This PR fixes this bug. 

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/348

**Special notes for your reviewer**:

**Release note**:

```NONE
```
2017-07-21 22:23:21 -07:00
Kubernetes Submit Queue
0af4306695 Merge pull request #49296 from juju-solutions/feature/report-failing-services
Automatic merge from submit-queue (batch tested with PRs 49420, 49296, 49299, 49371, 46514)

Update status to show failing services.

**What this PR does / why we need it**: Report on charm status any services that are not running.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/341

**Special notes for your reviewer**:

**Release note**:

```Report failing services in Juju deployed clusters.
```
2017-07-21 22:23:17 -07:00
Rye Terrell
1c21e8ab31 more robust stat handling from ceph df output 2017-07-21 11:46:32 -05:00
Kubernetes Submit Queue
ab40f526fd Merge pull request #49344 from shyamjvs/master-disk-size
Automatic merge from submit-queue

Auto-calculate master disk and root disk sizes in GCE

@gmarek PR https://github.com/kubernetes/kubernetes/pull/49282 didn't fix the issue because MASTER_DISK_SIZE was defaulting to 20GB in config-test.sh before being calculated inside get-master-disk-size() where you use pre-existing value if any.

It should be fixed by this now.
2017-07-21 06:24:39 -07:00
Konstantinos Tsakalozos
0faf5bf4b0 Fix: PV metric is not namespaced 2017-07-21 13:55:17 +03:00
Konstantinos Tsakalozos
53f00722e2 Do not try to restart services 2017-07-21 12:39:42 +03:00
Kubernetes Submit Queue
ef98b80bd0 Merge pull request #49172 from ianchakeres/gce-local-ssd-fix
Automatic merge from submit-queue (batch tested with PRs 48565, 49172)

On GCE check whether NODE_LOCAL_SSDS=0 and handle this case appropriately

**What this PR does / why we need it**: Presently if you are using a mac and GCE and specify NODE_LOCAL_SSDS=0, or use the default, you end up with 2 local SSDs.

**Which issue this PR fixes** : fixes https://github.com/kubernetes/kubernetes/issues/49171

**Special notes for your reviewer**:

I've discovered that this issue is due to b353792f9c/cluster/gce/util.sh (L579)

If NODE_LOCAL_SSDS=0, this evaluates to $(seq 0)

```
$ for i in $(seq 0); do echo $i; done
1
0
```

From man seq on mac osx

```
The seq utility prints a sequence of numbers, one per line (default), from first (default 1), 
to near last as possible, in increments of incr (default 1).When first is larger than last the 
default incr is -1.
```

This was run on mac with the seq manpage indicating it comes from BSD Feb 19 2010.

**Release note**:

```release-note
NONE
```
2017-07-21 01:13:19 -07:00
Kubernetes Submit Queue
80582826aa Merge pull request #49337 from jingxu97/July/mounterfix
Automatic merge from submit-queue (batch tested with PRs 49222, 49333, 48708, 49337)

Fix issue in installing containerized mounter

Fix PR #49335

PR #49157 causes failure when installing containerized mounter. This
PR is a fix for it
2017-07-20 23:01:23 -07:00
Kubernetes Submit Queue
d2c5dd64ee Merge pull request #49222 from yujuhong/fix-l7-backend
Automatic merge from submit-queue (batch tested with PRs 49222, 49333, 48708, 49337)

glbc: change the label of the l7-lb-controller pod

This ensures that the default http backend service doesn't include this
pod as its endpoint. This fixes #49159
2017-07-20 23:01:16 -07:00
Kubernetes Submit Queue
0e1744bb92 Merge pull request #49278 from juju-solutions/bug/master-msg
Automatic merge from submit-queue (batch tested with PRs 49330, 49252, 49262, 49278, 49334)

Simplify master-worker relation missing message

**What this PR does / why we need it**: Simplify messaging of missing relation in Juju deployments

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/309

**Special notes for your reviewer**:

**Release note**:

```NONE
```
2017-07-20 21:59:34 -07:00
Shyam Jeedigunta
94b3cc21f9 Auto-calculate master disk and root disk sizes in GCE 2017-07-21 03:35:37 +02:00
Jing Xu
04e5243db4 Fix issue in installing containerized mounter
PR #49157 causes failure when installing containerized mounter. THis
PR is a fix for it
2017-07-20 16:30:55 -07:00
Kubernetes Submit Queue
32bd2abd44 Merge pull request #49316 from ajitak/npd-port
Automatic merge from submit-queue

Use custom port for node-problem-detector

It fixes https://github.com/kubernetes/kubernetes/issues/49263

```release-note
Use port 20256 for node-problem-detector in standalone mode.
```
2017-07-20 16:28:57 -07:00
David Ashpole
7a23f8b018 remove deprecated flags LowDiskSpaceThresholdMB and OutOfDiskTransitionFrequency 2017-07-20 13:23:13 -07:00
Kubernetes Submit Queue
d9bf88ceeb Merge pull request #49229 from shyamjvs/remove-old-logdump
Automatic merge from submit-queue

Remove old log-dump.sh script

Ref https://github.com/kubernetes/kubernetes/issues/48513

Since we moved to new kubetest image which no longer needs the old one - https://github.com/kubernetes/test-infra/pull/3585

/cc @wojtek-t @fejta
2017-07-20 12:55:44 -07:00
Ajit Kumar
f45cc00440 Use custom port for node-problem-detector 2017-07-20 11:25:44 -07:00
Kubernetes Submit Queue
088141ca3a Merge pull request #49298 from shyamjvs/fix-log-dump
Automatic merge from submit-queue

Fix bug with sed in log-dump script

/cc @wojtek-t
2017-07-20 09:54:20 -07:00
Kubernetes Submit Queue
c2dd291ecb Merge pull request #49265 from jianglingxia/jlx72014
Automatic merge from submit-queue

fix the typo of Kubernetes Worker

**What this PR does / why we need it**:
fix the typo of Kubernetes Worker that Kubernetes spell error
**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
```NONE
2017-07-20 09:08:01 -07:00
Kubernetes Submit Queue
c35bf2aa7b Merge pull request #49282 from gmarek/disk_size
Automatic merge from submit-queue

Fix master disk size variable usage
2017-07-20 06:04:01 -07:00
Shyam Jeedigunta
ba7167963d Fix bug with sed in log-dump script 2017-07-20 14:34:18 +02:00
gmarek
83d3018085 Fix master disk size variable usage 2017-07-20 14:15:06 +02:00
Konstantinos Tsakalozos
685dff99ab Update status to show failing services. 2017-07-20 14:57:02 +03:00
Kubernetes Submit Queue
35d6cf086c Merge pull request #49212 from kawych/master
Automatic merge from submit-queue

Bump rescheduler version to v0.3.1

**What this PR does / why we need it**:
Bump Rescheduler version to v0.3.1 to log to STDERR.

**Which issue this PR fixes**
Fixes https://github.com/kubernetes/contrib/issues/2518

**Release note**:
```release-note
NONE
```
2017-07-20 02:15:23 -07:00
Konstantinos Tsakalozos
d5a71eb2d1 Simplify master-worker relation missing message 2017-07-20 11:47:50 +03:00
jianglingxia
5de3464755 fix the typo of Kubernetes Worker 2017-07-20 14:08:12 +08:00
Kubernetes Submit Queue
2eb6b95a5d Merge pull request #49148 from mikedanese/deprecate-cluster
Automatic merge from submit-queue (batch tested with PRs 48377, 48940, 49144, 49062, 49148)

add some more deprecation warnings to cluster

Part of https://github.com/kubernetes/kubernetes/issues/49213
@kubernetes/sig-cluster-lifecycle-misc
2017-07-19 19:10:18 -07:00
Ian Chakeres
a18a18367f Check whether NODE_LOCAL_SSDS=0 and handle this case appropriately.
Some versions of seq will count down from 1 if "seq 0" is specified
2017-07-19 17:06:21 -07:00
Kubernetes Submit Queue
c5b47482ae Merge pull request #49121 from mikedanese/remove-kubelet-bearer-token
Automatic merge from submit-queue (batch tested with PRs 47509, 46821, 45319, 49121, 49125)

gce: don't add kubelet bearer token to known tokens
2017-07-19 16:16:31 -07:00
Kubernetes Submit Queue
042913d898 Merge pull request #47509 from billy2180/bump-mounttest-image-version-to-0.8
Automatic merge from submit-queue

Bump e2e mounttest image version to 0.8

Reduce the number of image files required for e2e test run
```release-note
NONE
```
2017-07-19 15:53:31 -07:00
Yu-Ju Hong
780af313d6 glbc: change the label of the l7-lb-controller pod
This ensures that the default http backend service doesn't include this
pod as its endpoint.
2017-07-19 15:15:54 -07:00
Shyam Jeedigunta
d2bf533938 Remove deprecated cluster/log-dump.sh 2017-07-19 21:50:09 +02:00
Kubernetes Submit Queue
7dc0322b0c Merge pull request #49182 from juju-solutions/feature/increase-cidr
Automatic merge from submit-queue (batch tested with PRs 49058, 49072, 49137, 49182, 49045)

Set default CIDR to /16 for Juju deployments

**What this PR does / why we need it**: Increase the number of IPs on a deployment

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/272

**Special notes for your reviewer**:

**Release note**:

```Set default CIDR to /16 for Juju deployments
```
2017-07-19 10:27:27 -07:00
Kubernetes Submit Queue
6af05149aa Merge pull request #49058 from shyamjvs/logexporter-support
Automatic merge from submit-queue

Pass logexporter config through e2e framework

Ref https://github.com/kubernetes/kubernetes/issues/48513

/cc @wojtek-t @fejta
2017-07-19 09:57:47 -07:00
Mike Danese
89bf771e51 add some more deprecation warnings to cluster 2017-07-19 09:43:05 -07:00
Karol Wychowaniec
3a5f4c2a78 Bump rescheduler version to v0.3.1 2017-07-19 17:51:20 +02:00
Shaun Crampton
d2882a8b89 Rev Calico's Typha daemon to v0.2.3 in add-on deployment. 2017-07-19 10:22:51 +01:00
Kubernetes Submit Queue
c3f4e7e59a Merge pull request #49165 from mikedanese/cleanup2
Automatic merge from submit-queue (batch tested with PRs 49120, 46755, 49157, 49165, 48950)

gce: don't print every file in mounter to stdout

This is printing ~3000 lines.
2017-07-19 00:06:27 -07:00
Kubernetes Submit Queue
5e50097c57 Merge pull request #49157 from mikedanese/cleanup1
Automatic merge from submit-queue (batch tested with PRs 49120, 46755, 49157, 49165, 48950)

gce: make some global variables local

/approve no-issue
2017-07-19 00:06:25 -07:00
Konstantinos Tsakalozos
280ea7f485 Set default CIDR to /16 2017-07-19 09:34:23 +03:00
Kubernetes Submit Queue
384c1d285d Merge pull request #48914 from MrHohn/gke-kube-down-log
Automatic merge from submit-queue (batch tested with PRs 48914, 48535, 49099, 48935, 48871)

Log error when fail to execute command in with-retry()

**What this PR does / why we need it**: Enhance gke/util.sh logging.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #48913

**Special notes for your reviewer**:
/cc @krzyzacy 

**Release note**:

```release-note
NONE
```
2017-07-18 21:04:20 -07:00
Mike Danese
7a6917f552 gce: don't print every file in mounter to stdout 2017-07-18 17:04:03 -07:00
Mike Danese
0d662e8845 gce: make some global variables local 2017-07-18 15:55:37 -07:00
Mike Danese
9ee240dfcf aggr: don't write empty CA files
write now if the metadata is not supplied, we write an empty file for
these certs. we should just fail
2017-07-18 14:40:24 -07:00
Shyam Jeedigunta
5cdedd22cf Pass logexporter config through e2e framework 2017-07-18 23:28:06 +02:00
Mike Danese
65cc002877 gce: don't add kubelet bearer token to known tokens
since it is no longer used after kubelet TLS bootstrap.
2017-07-18 08:52:08 -07:00
Kubernetes Submit Queue
013d37d881 Merge pull request #48874 from juju-solutions/feature/sync-channels
Automatic merge from submit-queue (batch tested with PRs 49019, 48919, 49040, 49018, 48874)

Set default snap channel on charms to 1.7 stable

**What this PR does / why we need it**: This PR sets the default snap channel on charms to 1.7/stable. 
This addresses problems where the the user might want to deploy the charm and get the same kubernetes version found on the bundles.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/305

**Special notes for your reviewer**:

**Release note**:

```Set default snap channel on charms to 1.7/stable
```
2017-07-17 21:25:13 -07:00
Kubernetes Submit Queue
3548031fa9 Merge pull request #49040 from shyamjvs/fix-logdump
Automatic merge from submit-queue (batch tested with PRs 49019, 48919, 49040, 49018, 48874)

Fix if condition in cluster/log-dump/log-dump.sh

Fixes https://github.com/kubernetes/kubernetes/issues/49039

/cc @yujuhong
2017-07-17 21:25:09 -07:00
Kubernetes Submit Queue
50ec4384ff Merge pull request #49033 from wwwtyro/rye/handle-prev-port-none
Automatic merge from submit-queue (batch tested with PRs 48231, 47377, 48797, 49020, 49033)

prevent unsetting of nonexistent previous port in kubeapi-load-balancer charm

**What this PR does / why we need it**: prevent unsetting of nonexistent previous port in kubeapi-load-balancer charm

**Release note**:
```release-note
prevent unsetting of nonexistent previous port in kubeapi-load-balancer charm
```
2017-07-17 19:37:48 -07:00
Shyam Jeedigunta
de09f8e569 Fix if condition in cluster/log-dump/log-dump.sh 2017-07-17 20:21:43 +02:00
Rye Terrell
7c06da5b0e prevent unsetting of nonexistent previous port in kubeapi-load-balancer charm 2017-07-17 12:22:54 -05:00
gmarek
af6480210e Create 64-core masters for huge clusters 2017-07-17 15:03:39 +02:00
icereval
60872db873
log node-problem-detector 2017-07-16 16:46:25 -04:00
Kubernetes Submit Queue
60c3dd1bf7 Merge pull request #48958 from wwwtyro/rye/charm-load-balancer-port
Automatic merge from submit-queue (batch tested with PRs 48578, 48895, 48958)

use port configuration

**What this PR does / why we need it**: Uses the `port` config option in the kubeapi-load-balancer charm.

**Release note**:
```release-note
Uses the port config option in the kubeapi-load-balancer charm.
```
2017-07-14 13:45:55 -07:00
Kubernetes Submit Queue
8532cdfd69 Merge pull request #48886 from mikedanese/cleanup
Automatic merge from submit-queue

remove some people from OWNERS so they don't get reviews anymore

These are googlers who don't work on the project anymore but are still
getting reviews assigned to them:
- @bprashanth
- @rjnagal
- @vmarmol
2017-07-14 11:46:10 -07:00
Rye Terrell
ece0461cce use port configuration 2017-07-14 13:31:47 -05:00
Kubernetes Submit Queue
455e44b616 Merge pull request #48812 from crassirostris/change-fluentd-monitoring
Automatic merge from submit-queue (batch tested with PRs 48812, 48276)

Change fluentd-gcp monitoring to use metrics exposed by SD plugin

Following https://github.com/GoogleCloudPlatform/fluent-plugin-google-cloud/pull/135, make fluentd-gcp expose metrics in Prometheus registry and use them instead of counting records in the pipeline.

/cc @piosz @igorpeshansky

```release-note
Fluentd-gcp DaemonSet exposes different set of metrics.
```
2017-07-14 04:43:42 -07:00
Kubernetes Submit Queue
ca0a868823 Merge pull request #48651 from shyamjvs/logexporter-supporter
Automatic merge from submit-queue (batch tested with PRs 48864, 48651, 47703)

Enable logexporter mechanism to dump logs from k8s nodes to GCS directly

Ref https://github.com/kubernetes/kubernetes/issues/48513

This adds support for logexporter from k8s side. Next I'll send a PR adding support from test-infra side.

/cc @kubernetes/sig-scalability-misc @kubernetes/test-infra-maintainers @fejta @wojtek-t @gmarek
2017-07-14 03:10:45 -07:00
Kubernetes Submit Queue
ca1b408bf5 Merge pull request #48864 from jszczepkowski/ha-e2e-fix
Automatic merge from submit-queue

Fixed cluster validation for multizonal clusters.

Fixed cluster validation for multizonal clusters.
This should fix HA master e2e tests.

```release-note
```
2017-07-14 02:48:06 -07:00
Kubernetes Submit Queue
d623bd4c44 Merge pull request #46748 from gyliu513/critical
Automatic merge from submit-queue (batch tested with PRs 46748, 48826)

Added `CriticalAddonsOnly` toleration for npd.

**What this PR does / why we need it**:
We should add `CriticalAddonsOnly` toleration to make sure the daemonset can be scheduled on the node even if already planned to run critical pod.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #47015

**Special notes for your reviewer**:

**Release note**:

```release-note
none
```
2017-07-14 02:03:41 -07:00
Kubernetes Submit Queue
cc35759318 Merge pull request #48475 from gmile/patch-1
Automatic merge from submit-queue

Properly nest code blocks

**What this PR does / why we need it**:

Markdown code blocks are adjusted to better display on GitHub. See [rendered](c3fbec7663/cluster/addons/cluster-loadbalancing/glbc/README.md) version.

**Release note**:

```release-note
Adjust markdown code block in README for Google Load Balancer addon.
```
2017-07-13 18:50:46 -07:00
Kubernetes Submit Queue
b507254bb0 Merge pull request #46825 from zjj2wry/fix_doc_spell
Automatic merge from submit-queue

Update docs for user-guide

**What this PR does / why we need it**:

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
NONE
```
2017-07-13 18:50:36 -07:00
Zihong Zheng
722c3e348c Log error when fail to execute command in with-retry() 2017-07-13 14:56:16 -07:00
Mike Danese
c201553f27 remove some people from OWNERS so they don't get reviews anymore
These are googlers who don't work on the project anymore but are still
getting reviews assigned to them:
- bprashanth
- rjnagal
- vmarmol
2017-07-13 10:02:21 -07:00
gmarek
afe1a2c71b Revert "Merge pull request #48560 from nicksardo/gce-network-project"
This reverts commit d4881dd491, reversing
changes made to b5c4346130.
2017-07-13 18:34:24 +02:00
Guangya Liu
d874548eed Added CriticalAddonsOnly toleration for npd. 2017-07-13 23:44:15 +08:00
Konstantinos Tsakalozos
ca0f042854 Set default snap channel on charms to 1.7 stable 2017-07-13 16:24:48 +03:00
Jerzy Szczepkowski
1cef818cd0 Fixed cluster validation for multizonal clusters.
Fixed cluster validation for multizonal clusters.
This should fix HA master e2e tests.
2017-07-13 11:03:57 +02:00
Kubernetes Submit Queue
3c080e83c7 Merge pull request #48642 from freehan/gce-api-endpint
Automatic merge from submit-queue

Support GCE alpha/beta api endpoint override

fixes: https://github.com/kubernetes/kubernetes/issues/48568
2017-07-12 18:23:37 -07:00
Kubernetes Submit Queue
6e3bcd9b02 Merge pull request #48830 from csbell/patch-conversion-typo
Automatic merge from submit-queue (batch tested with PRs 48781, 48817, 48830, 48829, 48053)

Fix yaml-quote typo

Caught this looking through CI logs.

/assign wojtek-t
2017-07-12 15:11:13 -07:00
Minhan Xia
3e8b4a27c4 use overrided api endpoint in gce cloud provider 2017-07-12 15:10:13 -07:00
Kubernetes Submit Queue
d7d4346464 Merge pull request #48566 from vasartori/fluentd-prometheus
Automatic merge from submit-queue (batch tested with PRs 48279, 48566, 48319, 48794, 47952)

Add prometheus plugin on fluentd image.

**What this PR does / why we need it**:
This PR adds the prometheus plugin on Fluentd.

**Special notes for your reviewer**:
The plugin used was: https://github.com/kazegusuri/fluent-plugin-prometheus, on the latest stable version.
All configs used are default.

**Release note**:

```release-note
Fluentd-es addon now exposes a /metrics endpoint for monitoring on port 24231. 
```
2017-07-12 11:52:17 -07:00
Kubernetes Submit Queue
6396b9ddc3 Merge pull request #48279 from abgworrall/cos-nodes-by-default
Automatic merge from submit-queue

Use Container-optimzed OS images for nodes by default

Part of the deprecation of the debian-based ContainerVM images.

```release-note
kube-up and kubemark will default to using cos (GCI) images for nodes.
The previous default was container-vm (CVM, "debian"), which is deprecated.
If you need to explicitly use container-vm for some reason, you should set
KUBE_NODE_OS_DISTRIBUTION=debian
```
2017-07-12 11:46:19 -07:00
Christian Bell
ad365b3479 Fix typo 2017-07-12 11:38:14 -07:00
Kubernetes Submit Queue
7001b9d9d1 Merge pull request #48425 from kawych/master
Automatic merge from submit-queue

Pass cluster name to Heapster with Stackdriver sink.

**What this PR does / why we need it**:
Passes cluster name as argument to Heapster when it's used with Stackdriver sink to allow setting resource label 'cluster_name' in exported metrics.

**Release note**:
```release-note
NONE
```
2017-07-12 10:51:09 -07:00
Mik Vyatskov
125f759907 Change fluentd-gcp monitoring to use metrics exposed by SD plugin 2017-07-12 17:07:15 +02:00
Kubernetes Submit Queue
93344ea18c Merge pull request #48571 from verult/DefaultCOSFlexDir
Automatic merge from submit-queue (batch tested with PRs 48405, 48742, 48748, 48571, 48482)

Setting default FlexVolume driver directory on COS images.

**What this PR does / why we need it**: The original default FlexVolume driver directory is not writable on COS. A new location is necessary to make FlexVolume work.

This directory doesn't exist by default. FlexVolume users need to create this directory, bind mount it, and remount with the executable permission. The other candidate is /home/kubernetes/bin, but the directory is already getting cluttered. I will submit a different PR for a script that automates this step.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #48570
2017-07-12 08:05:23 -07:00
Shyam Jeedigunta
5f8cb3d9ff Enable logexporter mechanism to dump logs from k8s nodes to GCS directly 2017-07-12 14:39:49 +02:00
Kubernetes Submit Queue
787f5e0fe5 Merge pull request #48735 from nicksardo/gce-empty-network-proj
Automatic merge from submit-queue (batch tested with PRs 48698, 48712, 48516, 48734, 48735)

GCE: Allow empty NETWORK_PROJECT_ID env var

Changes:
1. Adds `GCE_API_ENDPOINT` logic to container-linux as it was added to GCI in #47881.
1. Apply `NETWORK_PROJECT_ID` value to gce.conf only if the env var is set.

/sig network
/area platform/gce

**Release note**:
```release-note
NONE
```
2017-07-12 04:56:33 -07:00
Adam Worrall
2196be7c7f Leave testing on CVM by default
Avoid breaking tests that assume they're testing CVM.
2017-07-11 14:53:49 -07:00
Adam Worrall
72f58e09c1 Leave the test jobs running on CVM after all. 2017-07-11 14:47:35 -07:00
Adam Worrall
ddeab2a69a Update NODE_OS_DISTRIBUTION from debian to gci
This in turn drives how NODE_IMAGE is set
2017-07-11 14:47:35 -07:00
Adam Worrall
ba2dd9ea4c Use Container-optimzed OS images for node 2017-07-11 14:47:35 -07:00
Nick Sardo
ebce7d2497 Allow missing NETWORK_PROJECT_ID env var 2017-07-10 14:26:47 -07:00
Cheng Xing
01fd6219cf Setting default FlexVolume driver directory on COS images. 2017-07-10 11:17:23 -07:00
Mik Vyatskov
b11084e76c Bump event-exporter version 2017-07-10 17:32:40 +02:00
Victor Sartori
de50947f30 Bump image version on makefile and DS. 2017-07-07 14:55:54 -03:00
Nick Sardo
06e328627c Use network project id for firewall/route mgmt and zone listing 2017-07-06 16:58:27 -07:00
Victor Sartori
0e52082673 Add prometheus plugin on fluentd image. 2017-07-06 18:24:04 -03:00
Kubernetes Submit Queue
20e629b1c6 Merge pull request #44394 from rthallisey/pre-existing-provider
Automatic merge from submit-queue

Launch kubemark with an existing Kubemark master

In order to expand the use of kubemark, allow developers to use kubemark with a pre-existing Kubernetes cluster.

Ref issue  #44393
2017-07-06 04:41:53 -07:00
Kazuki Suda
63f6198b4e Fix a dead link in cluster/update-storage-objects.sh 2017-07-06 18:37:52 +09:00
Kubernetes Submit Queue
40a21312d1 Merge pull request #48144 from juju-solutions/bug/worker-termination
Automatic merge from submit-queue (batch tested with PRs 48399, 48450, 48144)

Skip errors when unregistering juju kubernetes-workers

**What this PR does / why we need it**: When removing a kubernetes node from using Juju and for some reason kubernetes master fails we should not error the node, instead we should proceed with the removal of the node and the master will recognise that node as unavailable because it will fail heartbeats.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/300

**Special notes for your reviewer**:

**Release note**:

```
Clean decommission of Juju kubernetes worker units 
```
2017-07-05 08:58:39 -07:00
Kubernetes Submit Queue
7b13208d61 Merge pull request #48450 from wwwtyro/rye/lxc-disable-conntrack-max
Automatic merge from submit-queue (batch tested with PRs 48399, 48450, 48144)

configure kube-proxy to run with unset conntrack param when in lxc

**What this PR does / why we need it**: Configures the Juju Charm code to run kube-proxy with `conntrack-max-per-core` set to `0` when in an lxc as a workaround for issues when mounting `/sys/module/nf_conntrack/parameters/hashsize`

**Release note**:

```release-note
Configures the Juju Charm code to run kube-proxy with conntrack-max-per-core set to 0 when in an lxc as a workaround for issues when mounting /sys/module/nf_conntrack/parameters/hashsize
```
2017-07-05 08:58:37 -07:00
Ryan Hallisey
82e1d208f6 Launch kubemark with an existing Kubemark Master
In order to expand the use of kubemark, allow developers to
use kubemark with a pre-existing Kubemark master.
2017-07-05 09:14:53 -04:00
Konstantinos Tsakalozos
90a57931af Skip errors when unregistering juju kubernetes-workers 2017-07-05 16:00:37 +03:00
Kubernetes Submit Queue
5d21390561 Merge pull request #41790 from wojtek-t/allow_for_enabling_conversion_mismatch_detecto
Automatic merge from submit-queue

Add ability to enable patch conversion detector

Will rebase and fix once #41326 is merged.
2017-07-04 13:18:22 -07:00
Eugene Pirogov
c3fbec7663 Properly nest code blocks
Markdown code blocks are adjusted to better display on GitHub
2017-07-04 19:49:48 +03:00
Kubernetes Submit Queue
3823270b9e Merge pull request #48446 from Cynerva/gkk/stop-snaps
Automatic merge from submit-queue (batch tested with PRs 47043, 48448, 47515, 48446)

Fix charms leaving services running after remove-unit

**What this PR does / why we need it**:

This fixes a case where removed charm units can sometimes leave behind running services that interfere with the rest of the cluster.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
Fix charms leaving services running after remove-unit
```
2017-07-04 09:12:57 -07:00
Wojciech Tyczynski
37b5a214bc Add ability to enable patch conversion detector 2017-07-04 14:33:24 +02:00
Rye Terrell
05fbc7a7f8 configure kube-proxy to run with unset conntrack param when in lxc 2017-07-03 21:58:54 -05:00
Kubernetes Submit Queue
e1d9ab205a Merge pull request #48440 from Cynerva/gkk/snap-upgrades-restart-services
Automatic merge from submit-queue (batch tested with PRs 48439, 48440, 48394)

Fix kubernetes charms not restarting services after snap upgrades

**What this PR does / why we need it**:

This fixes a problem where the Kubernetes charms don't restart services after upgrading snaps. This can cause certain fixes not to be picked up (for example https://github.com/juju-solutions/release/pull/10)

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
Fixed kubernetes charms not restarting services after snap upgrades
```
2017-07-03 13:05:28 -07:00
Kubernetes Submit Queue
937369bc21 Merge pull request #48439 from juju-solutions/bug/namespaces-path
Automatic merge from submit-queue (batch tested with PRs 48439, 48440, 48394)

Fix: namespace-create have kubectl in path

**What this PR does / why we need it**: In juju deployed clusters namespace-create action is failing

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/326

**Special notes for your reviewer**:

**Release note**:

```Fix: namespace-create action on Juju deployed clusters
```
2017-07-03 13:05:27 -07:00
George Kraft
f0e08818d8 Fix charms leaving services running after unit removal 2017-07-03 14:55:07 -05:00
Konstantinos Tsakalozos
cea934bcbc Fix: namespace-create have kubectl in path 2017-07-03 18:22:20 +03:00
George Kraft
c21b305fe9 Fix kubernetes charms not restarting services after snap upgrades 2017-07-03 09:47:08 -05:00
Konstantinos Tsakalozos
e2571a853a Non leaders should overwrite any local copies of keys they have with what the leader has. 2017-07-03 17:45:43 +03:00
Karol Wychowaniec
ca5581edcc Pass cluster name to Heapster with Stackdriver sink. 2017-07-03 11:55:57 +02:00
Kubernetes Submit Queue
9848cdb3ac Merge pull request #48281 from hogepodge/configure-swift-store
Automatic merge from submit-queue

Add configuration for swift container name

**What this PR does / why we need it:**
This review updates the OpenStack Heat provider to allow for configuring the name of the Swift object store.

**Which issue this PR fixes:**
fixes #47966

**Special notes for your reviewer**:
Note that the terminology for OpenStack Swift conflicts with K8S terminology. In this instance, container is referring to the organization structure of Swift storage objects.

**Release note**:
```release-note
Adds configuration option for Swift object store container name to OpenStack Heat provider.
```
2017-07-02 08:02:42 -07:00
Kubernetes Submit Queue
dc597291c1 Merge pull request #48351 from juju-solutions/bug/get-pass
Automatic merge from submit-queue (batch tested with PRs 48317, 48313, 48351, 48357, 48115)

Ensure get_password is accessing a file that exists.

**What this PR does / why we need it**: get_password will throw an exception instead of returning None in case the basic_auth.csv file is missing but /root/cdk/ is there in a juju deployment.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/324

**Special notes for your reviewer**:

**Release note**:

```
Fix race condition where /root/cdk is not yet initialised in kubernetes-master setup by Juju  
```
2017-06-30 19:54:27 -07:00
Kubernetes Submit Queue
c0337c92cc Merge pull request #47881 from cadmuxe/endpoint
Automatic merge from submit-queue (batch tested with PRs 47918, 47964, 48151, 47881, 48299)

Add ApiEndpoint support to GCE config.

**What this PR does / why we need it**:
Add the ability to change ApiEndpoint  for GCE.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:
```release-note
None
```
2017-06-30 18:42:40 -07:00
Kubernetes Submit Queue
87c6fb5de2 Merge pull request #42376 from jingxu97/Feb/mounter
Automatic merge from submit-queue (batch tested with PRs 43558, 48261, 42376, 46803, 47058)

Add bind mount /etc/resolv.conf from host to containerized mounter

Currently, in containerized mounter rootfs, there is no DNS setup. If client
try to set up volume with host name instead of IP address, it will fail to resolve
the host name. 
By bind mount the host's /etc/resolv.conf to mounter rootfs, VM hosts name
could be resolved when using host name during mount. 

```release-note
Fixes issue where you could not mount NFS or glusterFS volumes using hostnames on GCI/GKE with COS images.
```
2017-06-30 16:28:46 -07:00
gunjan5
cc162a7f15 set snat to false 2017-06-30 12:53:55 -07:00
Konstantinos Tsakalozos
cd34d8f80d Ensure get_password is accessing a file that exists. 2017-06-30 20:24:35 +03:00
Kubernetes Submit Queue
d19773d855 Merge pull request #47835 from juju-solutions/feature/security
Automatic merge from submit-queue (batch tested with PRs 47850, 47835, 46197, 47250, 48284)

Securing the cluster created by Juju

**What this PR does / why we need it**: This PR secures the deployments done with Juju master. Works around certain security issues inherent to kubernetes (see for example dashboard access)

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```
Securing Juju kubernetes dashboard
```
2017-06-29 15:16:39 -07:00
Chris Hoge
c0621061c8 Add configuration for swift container name
Fixes Issue #47966
2017-06-29 10:15:55 -07:00
Koonwah Chen
c4e84e263c Change KUBE_GCE_API_ENDPOINT to GCE_API_ENDPOINT 2017-06-29 10:04:53 -07:00
Kubernetes Submit Queue
d3aa0d5a8a Merge pull request #46850 from x13n/nanny-version
Automatic merge from submit-queue (batch tested with PRs 46850, 47984)

Update addon-resizer version

Update addon-resizer version and remove the flags that have been deprecated in the new version.

**What this PR does / why we need it**:
ref kubernetes/contrib#2623

**Special notes for your reviewer**:
Need to wait for merging kubernetes/contrib#2623 first.

**Release note**:
```release-note
addon-resizer flapping behavior was removed.
```
2017-06-29 07:18:32 -07:00
Kubernetes Submit Queue
7018479968 Merge pull request #48204 from shyamjvs/logdump-only-n-nodes
Automatic merge from submit-queue

Allow log-dumping only N randomly-chosen nodes in the cluster

This should let us save "lots" (~3-4 hours) of time in our 5000-node cluster scale tests as we copy logs from all the nodes to jenkins worker and then upload all of them to gcs (while we don't need too many).
This will also prevent the jenkins container facing "No space left on device" error while dumping logs, that we saw in runs 12-13 of gce-enormous-cluster.

The longterm fix will be to enable [logexporter](https://github.com/kubernetes/test-infra/tree/master/logexporter) for our tests.

cc @kubernetes/sig-scalability-misc @kubernetes/test-infra-maintainers @gmarek @fejta
2017-06-29 04:23:58 -07:00
Daniel Kłobuszewski
63ccedcfa7 Update addon-resizer version
Also, remove the flags that have been deprecated in the new version.
2017-06-29 11:03:43 +02:00
Koonwah Chen
b3956a689e Add KUBE_GCE_API_ENDPOINT for GCE API endpoint config. 2017-06-28 16:03:18 -07:00
Shyam Jeedigunta
b960a0da12 Allow log-dumping only N randomly-chosen nodes in the cluster 2017-06-28 23:01:08 +02:00