Commit Graph

94 Commits

Author SHA1 Message Date
Alex Robinson
84f8f01d45 Merge pull request #14432 from mesosphere/sur-k8sm-492-launch-grace-period
MESOS make launchGracePeriod configurable
2015-10-05 13:27:12 -07:00
Sergiusz Urbaniak
42c84954fd executor,scheduler: make launchGracePeriod configurable 2015-10-02 07:43:49 +02:00
Dr. Stefan Schimanski
39942952cd Fix hostname check logic in scheduler 2015-10-01 06:50:43 +02:00
Brendan Burns
3474324d6a Merge pull request #14271 from mesosphere/sur-k8sm-441-flaky
MESOS: fix flaky TestPlugin_LifeCycle
2015-09-30 11:32:00 -07:00
Sergiusz Urbaniak
56fc0f5900 scheduler: reenable TestPlugin_LifeCycle, increase timeouts 2015-09-30 16:14:13 +02:00
eulerzgy
71b96422f4 set capital in some files 2015-09-30 14:46:20 +08:00
k8s-merge-robot
c807bea089 Merge pull request #13857 from mesosphere/node-labels
Auto commit by PR queue bot
2015-09-28 22:10:51 -07:00
Dr. Stefan Schimanski
67746908e5 Deleting gracefully terminating, not-scheduled pre-scheduled pods
In upstream the kubelet is responsible for all pods which have the spec.NodeName
set. In Mesos we have a two-stage scheduling process:

1. pods with a pre-set spec.NodeName are still scheduled by the scheduler.
2. The kubelet will only see them when a Mesos task was started and the executor
   passes the pod to the kubelet.

With this PR a pod with spec.NodeName which is gracefully terminated, but not
yet scheduled, e.g.

- because the termination happened just after creation and the scheduler was
  not fast enough
- because the NodeSelector does not match

is deleted by the Mesos scheduler.
2015-09-26 23:42:08 +02:00
Dr. Stefan Schimanski
112f80fa4a Simplify FitPredicate for NodeSelector
We can assume that n != nil holds, because otherwise offers are skipped.
2015-09-26 10:36:49 +02:00
Dr. Stefan Schimanski
4d4ebe9f18 Add Mesos slave attributes as node labels
- pre-create node api objects from the scheduler when offers arrive
- decline offers until nodes a registered
- turn slave attributes as k8s.mesosphere.io/attribute-* labels
- update labels from executor Register/Reregister
- watch nodes in scheduler to make non-Mesos labels available for NodeSelector matching
- add unit tests for label predicate
- add e2e test to check that slave attributes really end up as node labels
2015-09-26 09:46:56 +02:00
Dr. Stefan Schimanski
e4dcd97ac3 Dequeue pods in scheduler which are terminating 2015-09-22 16:41:43 +02:00
Dr. Stefan Schimanski
132f8892ed Move task-lost message prefix into messages module 2015-09-21 09:51:05 +02:00
Daniel Martí
586cb9126a Move pkg/util.Time to pkg/api/unversioned.Time
Along with our time.Duration wrapper, as suggested by @lavalamp.
2015-09-17 17:51:27 -07:00
Alex Mohr
bdfd469a85 Merge pull request #14137 from caesarxuchao/13823
rebase 13823
2015-09-17 15:25:52 -07:00
k8s-merge-robot
654dd38b1c Merge pull request #14120 from mesosphere/sttts-cleanup-hostname-override
Auto commit by PR queue bot
2015-09-17 14:23:36 -07:00
Kris
f4ad00d9ae Moving Status object to a common package 2015-09-17 14:09:53 -07:00
Dr. Stefan Schimanski
88466dd21e Cleanup --hostname-override logic in procurment 2015-09-17 18:41:16 +02:00
k8s-merge-robot
a204f31516 Merge pull request #13918 from mesosphere/reconcile-ignore-mirror-pods
Auto commit by PR queue bot
2015-09-17 02:36:31 -07:00
k8s-merge-robot
367e5571bf Merge pull request #13965 from mesosphere/healthz
Auto commit by PR queue bot
2015-09-17 02:13:31 -07:00
k8s-merge-robot
4009021cf4 Merge pull request #14037 from mesosphere/scheduler-slave-registry
Auto commit by PR queue bot
2015-09-16 16:54:40 -07:00
k8s-merge-robot
22f698f5b0 Merge pull request #13953 from liguangbo/change_iff_to_if_and_only_if
Auto commit by PR queue bot
2015-09-16 08:35:20 -07:00
Dr. Stefan Schimanski
eb5a5ffc28 Extract slave hostname registry code in its own module
- remove bleeding of registry-internal objects, without any locking
- rename from SlaveStorage to Registry which fits much better to what
  it actually does
2015-09-16 14:50:31 +02:00
liguangbo
4df68f03a3 change iff to if and only if to improve annotation readability 2015-09-16 08:29:36 +00:00
Sergiusz Urbaniak
d97ccea8c2 scheduler: add healthz endpoint 2015-09-16 09:15:20 +02:00
Sergiusz Urbaniak
0f0b87600e podtask: reduce log verbosity for pods w/o limits 2015-09-15 16:48:17 +02:00
James DeFelice
a5307305f1 avoid recovery for mirrored pods on failover 2015-09-14 20:21:19 +00:00
Wojciech Tyczynski
53ae56f205 Replace "minion" with "node" in bunch of places. 2015-09-14 11:07:11 +02:00
Daniel Smith
b225c1d47a Run gofmt (separate commit for easy rebases) 2015-09-10 17:17:59 -07:00
Daniel Smith
15b30b8b09 Move version agnostic parts of client
pkg/client/unversioned/cache -> pkg/client/cache
pkg/client/unversioned/record -> pkg/client/record
2015-09-10 17:17:59 -07:00
tummychow
78ce5da988 Move util.StringSet into its own package
A lot of packages use StringSet, but they don't use anything else from
the util package. Moving StringSet into another package will shrink
their dependency trees significantly.
2015-09-10 12:04:15 -07:00
James DeFelice
cf908df89c add flags for initial executor cpu and memory resources 2015-09-06 02:16:09 +00:00
Chao Xu
9fc79e9d99 refactor testapi and test scripts to prepare for multiple API groups. 2015-09-04 18:01:32 -07:00
James DeFelice
a1cea8dd87 Flexible resource accounting and pod resource containment:
- new: introduce AllocationStrategy, Predicate, and Procurement to scheduler pkg
- new: --contain-pod-resources flag (workaround for docker+systemd+mesos problems)
- new: --account-for-pod-resources flag (for testing overcommitment)
- bugfix: forward -v flag from minion controller to executor
2015-09-04 00:49:13 +00:00
Abhi Shah
49523500c3 Merge pull request #13566 from mesosphere/reconcile-all-pods
MESOS: task reconciliation must look at NamespaceAll, not NamespaceDefault
2015-09-03 15:35:17 -07:00
k8s-merge-robot
d1c7a5f04e Merge pull request #13362 from mesosphere/proxy-runtime
Auto commit by PR queue bot
2015-09-03 15:26:56 -07:00
k8s-merge-robot
5d8a6049be Merge pull request #12848 from wojtek-t/private_watch_cache
Auto commit by PR queue bot
2015-09-03 13:06:18 -07:00
James DeFelice
d1589b9571 task reconciliation must look at NamespaceAll, not NamespaceDefault 2015-09-03 18:01:45 +00:00
James DeFelice
7fbd290167 moved cgroup-root detection to minion service; kube proxy now configured to run in mesos container 2015-09-02 06:13:26 +00:00
Edward Muller
69da77c6b0 s:code.google.com/p:github.com/scalingdata:
Because code.google.com is going away. Probably #1 of several.
2015-08-31 11:29:33 -07:00
Wojciech Tyczynski
e202f9c797 Add resource version to Store Replace params. 2015-08-31 09:49:12 +02:00
k8s-merge-robot
de60651cc0 Merge pull request #12858 from MatMaul/master
Auto commit by PR queue bot
2015-08-30 21:46:50 -07:00
Dr. Stefan Schimanski
cac58f6db7 Add --path-override to minion to change the PATH env var of subprocesses;
pass hostname-override through to kube-proxy (if defined)
2015-08-27 22:57:04 +00:00
Mathieu Velten
57f18a0e04 Add a dockercfg_path parameter to the Mesos scheduler (the main purpose is to handle private repos auth).
This is implemented by copying the dockercfg file in the executor work directory.
2015-08-26 11:21:08 +02:00
Karl Isenberg
158c42f4be Log Mesos task message on status update
- Log error when task state is unknown
2015-08-24 15:31:31 -07:00
Dr. Stefan Schimanski
f9635d5778 [mesos] Handle TASK_ERROR
- The TASK_ERROR task status was introduced with Mesos 0.21 and is actually used since 0.22.
  It was not handled at all before this patch, leaving errored task in the registry in phase
  "Pending". This will lead to task status updates from the Mesos Master on reconciliation with empty
  slaveId fields, leading to scheduler crashes eventually.
- Handle terminal task with empty slaveId.
  The slave id can be empty for TASK_ERROR.
  The modified code path does not use the slaveId.
2015-08-24 15:31:31 -07:00
Dr. Stefan Schimanski
10bb0424c4 Add NodeSelector support to scheduler 2015-08-20 20:59:10 +02:00
Kris Rousey
ae6c64d9bb Moving everyone to unversioned client 2015-08-18 10:23:03 -07:00
Karl Isenberg
61c9dd876e Improve readability of scheduling failure logs 2015-08-17 12:17:48 +02:00
jiangyaoguo
5d3522dc7a Keep event reason consistant in scheduler and controller 2015-08-13 11:33:32 +08:00
Kris Rousey
2a17dbff13 Moving clientauth package to be under client 2015-08-12 14:43:17 -07:00