226 Commits

Author SHA1 Message Date
Derek Carr
440c720805 Merge pull request #5330 from zmerlynn/try_addons_again
Retry object creation with --validate in kube-addons
2015-03-16 10:43:39 -04:00
Victor Marmol
1a7f7245e7 Remove cAdvisor manifest from cluster startup.
cAdvisor is now integrated into the Kubelet and runs inside of it.
2015-03-13 16:06:42 -07:00
Brendan Burns
966e8301a0 Add kubelet to the master. 2015-03-13 12:09:24 -07:00
Satnam Singh
e16bcceceb Merge pull request #5390 from brendandburns/shell_sucks
Configure docker on the master like we do on workers.
2015-03-12 11:42:03 -07:00
Brendan Burns
17ff8fb421 Configure docker on the master like we do on workers. 2015-03-12 10:37:30 -07:00
Saad Ali
7629b616a7 Merge pull request #5348 from dchen1107/docker
Check docker unixsocket periodically through monit. If failed, restart d...
2015-03-11 17:42:58 -07:00
Dawn Chen
fe9e0ca5ce Check docker unixsocket periodically through monit. If failed, restart docker. 2015-03-11 17:20:03 -07:00
Zach Loafman
018b11bc3a Retry object creation with --validate in kube-addons
The better solution is some fence with Salt, but the actual logs
provided in the bug don't support any race condition here, plus the
ordering in the Salt configuration seems correct.

We haven't seen this again in a while, but given the results of the
situation (a borked cluster), I'm proposing a relatively simple
workaround.

Fixes #4357 (dubiously)
2015-03-11 14:40:37 -07:00
derekwaynecarr
f71a91a131 Fix regression introduced when moving to standalone salt 2015-03-11 11:06:00 -04:00
derekwaynecarr
468bf1da75 Enable common set of admission controllers across salt providers 2015-03-11 11:06:00 -04:00
Brian Grant
b3d3db08ef Merge pull request #5064 from justinsb/ignore_stopped_aws_instances
Ignore EC2 instances that are stopped
2015-03-10 11:03:34 -07:00
Alex Robinson
6700a7f230 Merge pull request #5253 from satnam6502/fluentd
Modify Fluentd to ES setup to pass flags
2015-03-10 10:46:26 -07:00
Brendan Burns
00f3779c6f Merge pull request #5119 from zmerlynn/salt_drones
Change GCE to use standalone Saltstack config
2015-03-10 10:29:59 -07:00
Satnam Singh
3ce0ede78e Modify Fluentd to ES setup to pass flags 2015-03-10 10:11:26 -07:00
Alex Robinson
8fd33476f0 Merge pull request #5218 from satnam6502/fluentd-es
Change Fluentd config to read from head of Docker container logs
2015-03-10 09:57:02 -07:00
Zach Loafman
120dba474e Change GCE to use standalone Saltstack config:
Change provisioning to pass all variables to both master and node. Run
Salt in a masterless setup on all nodes à la
http://docs.saltstack.com/en/latest/topics/tutorials/quickstart.html,
which involves ensuring Salt daemon is NOT running after install. Kill
Salt master install. And fix push to actually work in this new flow.

As part of this, the GCE Salt config no longer has access to the Salt
mine, which is primarily obnoxious for two reasons:
- The minions can't use Salt to see the master: this is easily fixed by
  static config.
- The master can't see the list of all the minions: this is fixed
  temporarily by static config in util.sh, but later, by other means (see
  https://github.com/GoogleCloudPlatform/kubernetes/issues/156, which
  should eventually remove this direction).

As part of it, flatten all of cluster/gce/templates/* into
configure-vm.sh, using a single, separate piece of YAML to drive the
environment variables, rather than constantly rewriting the startup
script.
2015-03-10 09:04:29 -07:00
derekwaynecarr
22e559d769 Provision network before kubelet 2015-03-10 10:30:50 -04:00
Satnam Singh
2486de1287 Change Fluentd config to read from head of Docker container logs 2015-03-09 23:25:52 -07:00
Justin Santa Barbara
a3ae480acc Ignore EC2 instances that are stopped
Otherwise we pick up previous cluster instances (in EC2, stopped instances hang around for
a while - maybe 30 minutes?)
2015-03-09 19:03:46 -07:00
Dawn Chen
7a47ea8913 Ask monit health check kubelet; if failed, restart. 2015-03-05 17:29:38 -08:00
derekwaynecarr
35ab4ccc6d Further vagrant reliability improvements. 2015-03-03 06:55:34 -08:00
Tim Hockin
d46146da7d Stop using Docker's -r flag
It has been deprecated since 1.3, replaced by restart policies.  We are
not supporting docker < 1.3 any more.
2015-02-28 13:36:42 -08:00
Satnam Singh
16f49f6d09 Update Fluentd to Cloud Logging image 2015-02-26 15:18:22 -08:00
Eric Tune
e77745672c Merge pull request #4577 from satnam6502/cluster-name
Qualify the name of forwarding rules with INSTANCE_PREFIX
2015-02-26 13:17:49 -08:00
Daniel Smith
3038a5624e Merge pull request #4859 from dchen1107/master
Move salt based clusters to Docker 1.5.0
2015-02-26 11:32:07 -08:00
Dawn Chen
59a9030413 Move salt based clusters to Docker 1.5.0 2015-02-26 11:06:53 -08:00
Satnam Singh
19b927ea57 Name a cluster and use it to make forwarding rules for GCE 2015-02-23 17:04:33 -08:00
derekwaynecarr
87a41b0934 Improve vagrant reliability, fix race condition with openvswitch and docker 2015-02-21 13:31:50 -05:00
Victor Marmol
ff7726d1ef Merge pull request #4655 from saad-ali/fix2884quick
Add monit process monitoring for kubelet and docker
2015-02-20 11:07:06 -08:00
David Oppenheimer
ba749284d0 Have etcd listen on all interfaces so that monit probes succeed. Closes #3852. 2015-02-19 23:14:14 -08:00
saadali
4dea0488e6 Add monit process monitoring for kubelet and docker 2015-02-19 22:22:05 -08:00
Brendan Burns
e4e68e2fa5 Merge pull request #4138 from justinsb/fix_duplicate_master_arg
Remove duplicate --master arg from kube-scheduler
2015-02-19 11:58:23 -08:00
David Oppenheimer
c99a37d3a3 Merge pull request #4587 from gmarek/client2
Fix monit config for etcd
2015-02-19 11:37:47 -08:00
gmarek
19b7c01084 Fix monit config for etcd 2015-02-19 13:11:43 +01:00
Brendan Burns
664de16c81 Remove kubecfg from the client tools we attempt to distribute. 2015-02-18 22:26:26 -08:00
Dawn Chen
23b8b5ea6a Configure docker with larger open files. 2015-02-11 23:52:58 -08:00
Eric Tune
59a06b5073 Don't set --etcd_servers argument for kubelet.
This is ignored by all recent kubelet versions when api_servers is
set.  And it is set in this salt file.
2015-02-11 15:53:26 -08:00
Zach Loafman
6b3bb763f0 Merge pull request #4287 from derekwaynecarr/docker_group_gone
Docker 1.4.1 on Fedora no longer provisions a docker group
2015-02-10 14:28:02 -08:00
Zach Loafman
2fb4bd4724 Merge pull request #4286 from brendandburns/etcd
Add restart to the kube apiserver.
2015-02-10 12:09:34 -08:00
derekwaynecarr
46eb657722 docker does not provision a docker group anymore, so do not add vagrant user to it 2015-02-10 11:15:22 -05:00
Zach Loafman
2f231e0a2a Merge pull request #4268 from rajatchopra/master
Fix vagrant networking round 2
2015-02-10 07:39:48 -08:00
Brendan Burns
c50e650d8b Add restart to the kube apiserver. 2015-02-10 07:35:00 -08:00
Rajat Chopra
1d12cad456 Fix vagrant networking. Include the master as part of overlay. And remove STP for efficiency.
fix e2e services for vagrant
Kubelet should wait on SDN to finish
Do not require docker on master for default clouds
2015-02-09 14:37:27 -08:00
Ravi Sankar Penta
3a5ef05f64 Make periodic sync nodes from cloud provider optional.
--sync_nodes=false gives user flexibility to add/remove nodes in the
cluster using REST api/kubectl cli and at the same time can use
cloud provider for other resources like persistent disks, etc.
2015-02-09 13:42:44 -08:00
Zach Loafman
6b8c332cd3 Revert "Fix vagrant networking" 2015-02-09 13:40:11 -08:00
Rajat Chopra
9f10108758 Kubelet should wait on SDN to finish 2015-02-09 11:22:14 -08:00
Rajat Chopra
b61ea1bbf1 Fix vagrant networking. Include the master as part of overlay. And remove STP for efficiency. 2015-02-05 17:41:27 -08:00
Justin Santa Barbara
e72d3392cc Remove duplicate --master arg from kube-scheduler 2015-02-04 19:05:05 -05:00
Justin Santa Barbara
7790435915 Fix for bool vs string comparisons in salt templates
Fixes #4118
2015-02-04 14:09:10 -05:00
Justin SB
9a29d9f92a Etcd data dir must be empty
Otherwise we get the error "unknown wal version in data dir /var/etcd"

When this is the homedir for etcd, e.g. .bashrc confuses it
2015-02-02 08:09:32 -08:00