containerd

Author	SHA1	Message	Date
Brian Goff	d7b9cb0019	shim: move event context timeout to publsher Before this change, if an event fails to send on the first attempt, subsequent attempts will fail with context.Cancelled because the the caller of publish passes a cancellable timeout, which the publisher uses to send the event. The publisher returns immediately if the send fails, but adds the event to an async queue to try again. Meanwhile the caller will return cancelling the context. Additionally, subsequent attempts may fail to send because the timeout was expected to be for a single request but the queue sleeps for `attempt*time.Second`. In the shim service, the timeout was set to 5s, which means the send will fail with context.DeadlineExceeded before it reaches `maxRequeue` (which is currently 5). This change moves the timeout to the publisher so each send attempt gets its own timeout. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2020-07-20 17:51:10 -07:00
Akihiro Suda	fd99b6566b	decrease log level of cgroup2 ToggleController error when running in UserNS Fix #4312 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-06-24 18:15:16 +09:00
Akihiro Suda	f1a469a035	shim v2 runc: propagate options.Root to Cleanup Previously shim v2 (`io.containerd.runc.{v1,v2}`) always used `/run/containerd/runc` as the runc root. Fix #4326 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-06-17 19:06:36 +09:00
Akihiro Suda	2f601013e6	cgroup2: implement `containerd.events.TaskOOM` event How to test (from https://github.com/opencontainers/runc/pull/2352#issuecomment-620834524): (host)$ sudo swapoff -a (host)$ sudo ctr run -t --rm --memory-limit $((1024102432)) docker.io/library/alpine:latest foo (container)$ sh -c 'VAR=$(seq 1 100000000)' An event `/tasks/oom {"container_id":"foo"}` will be displayed in `ctr events`. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-06-01 14:00:13 +09:00
Tobias Klauser	a9bd451ab4	Avoid duplicate imports of github.com/gogo/protobuf/types Re-use the import aliased as `ptypes`. Signed-off-by: Tobias Klauser <tklauser@distanz.ch>	2020-03-10 09:41:03 +01:00
Ted Yu	a687d3a36d	Check error return from json.Unmarshal Signed-off-by: Ted Yu <yuzhihong@gmail.com> Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2020-03-05 13:38:08 -05:00
Ted Yu	f8ade8debd	Use named error return for service#StartShim Signed-off-by: Ted Yu <yuzhihong@gmail.com>	2020-02-27 06:18:05 -08:00
Seth Pellegrino	66508589d3	fix: eventfd leak for v2 runtime with v1 cgroups There's no OOM monitoring for the v2 cgroups yet, so it seems unlikely that there was a leak in that case. Signed-off-by: Seth Pellegrino <spellegrino@newrelic.com>	2020-01-13 10:49:11 -08:00
Erik Sipsma	fbd46d7094	runtime v2: Close platform in runc shim's Shutdown method. Previously, the platform was closed as part of the Delete method when the process was an init for a task and there were no more tasks after its deletion. This can create problems if another task is created within the shim right after the delete runs, which results in the platform being closed but the shim continuing to run. This change moves closing the platform to the Shutdown method after the shim's context is canceled, which ensures the platform is only closed once the shim is sure its done servicing containers. Signed-off-by: Erik Sipsma <sipsma@amazon.com>	2019-12-19 09:47:40 -05:00
Akihiro Suda	b02e20f12e	cgroup2: enable controllers automatically Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-12-12 02:56:51 +09:00
Akihiro Suda	8f870c233f	support cgroup2 * only shim v2 runc v2 ("io.containerd.runc.v2") is supported * only PID metrics is implemented. Others should be implemented in separate PRs. * lots of code duplication in v1 metrics and v2 metrics. Dedupe should be separate PR. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-12-12 02:56:51 +09:00
Michael Crosby	6cf031e1e4	Pass ttrpc address to shim via env Because of the way go handles flags, passing a flag that is not defined will cause an error. In our case, if we kept this as a flag, then third-party shims would break when they see this new flag. To fix this, I moved this new configuration option to an env var. We should use env vars from here on out to avoid breaking shim compat. Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-08-22 20:37:49 +00:00
Kevin Parsons	d7e1b25384	Allow explicit configuration of TTRPC address Previously the TTRPC address was generated as "<GRPC address>.ttrpc". This change now allows explicit configuration of the TTRPC address, with the default still being the old format if no value is specified. As part of this change, a new configuration section is added for TTRPC listener options. Signed-off-by: Kevin Parsons <kevpar@microsoft.com>	2019-08-22 00:56:27 -07:00
Phil Estes	640860a042	Merge pull request #3559 from fuweid/avoid-read-config runtime: only check killall for init process	2019-08-20 13:08:55 -04:00
Wei Fu	1073868e5e	runtime: only check killall for init process When containerd-shim does reaper, the most processes are not init process. Since json.Decode consumes more CPU resource, we should check killall option for init process only. Signed-off-by: Wei Fu <fuweid89@gmail.com>	2019-08-20 19:18:34 +08:00
Michael Crosby	0d27d8f4f2	Unifi reaper logic into package Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-08-16 13:55:05 +00:00
Michael Crosby	6601b406b7	Refactor runtime code for code sharing Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-07-08 11:47:53 -04:00
Michael Crosby	7dfc605fc6	Set shim OOM scores to +1 containerd daemon score This changes the shim's OOM score from a static max killable of -999 to be +1 of the containerd daemon's score. This should allow the shim's to be killed first in an OOM condition but leave the daemon alone for a bit to help cleanup and manage the containers during this situation. Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-06-27 11:14:14 -04:00
Michael Crosby	1a8df3f237	Reserve exec id to prevent race ref #2820 Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-06-21 14:52:44 -04:00
Michael Crosby	fe6a2b03ed	Add shim cgroup support for v2 runtimes Closes #3198 Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-05-20 16:04:06 +00:00
Michael Crosby	57fbb16234	Merge pull request #3149 from lifubang/pidnamespace fix killall when use pidnamespace	2019-05-09 14:28:44 -04:00
Michael Crosby	ae87730ad2	Improve shim shutdown logic Shims no longer call `os.Exit` but close the context on shutdown so that events and other resources have hit the `defer`s. Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-04-10 18:17:07 -04:00
Michael Crosby	a6f587e4c4	Use ttrpc to publish runtime v2 events Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-04-09 14:38:50 -04:00
Sebastiaan van Stijn	01310eaebc	do not use unkeyed fields in compose literals Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2019-04-03 22:20:39 +02:00
Lifubang	872296642a	fix shouldKillAllOnExit check for v2 Signed-off-by: Lifubang <lifubang@acmcoder.com>	2019-03-30 11:37:14 +08:00
Michael Crosby	84a24711e8	Add runc.v2 multi-shim Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-02-21 11:09:46 -05:00

26 Commits