containerd

Author	SHA1	Message	Date
Seth Pellegrino	66508589d3	fix: eventfd leak for v2 runtime with v1 cgroups There's no OOM monitoring for the v2 cgroups yet, so it seems unlikely that there was a leak in that case. Signed-off-by: Seth Pellegrino <spellegrino@newrelic.com>	2020-01-13 10:49:11 -08:00
Seth Pellegrino	9456040acb	fix: eventfd leak Only start watching the cgroup for OOMs when the first process starts instead of on every process. Signed-off-by: Seth Pellegrino <spellegrino@newrelic.com>	2020-01-13 10:39:54 -08:00
Li Yuxuan	1fb1d93212	v2: Fix missing ns when openShimLog on windows Related to https://github.com/containerd/containerd/pull/3921#discussion_r363046745 Signed-off-by: Li Yuxuan <liyuxuan04@baidu.com>	2020-01-05 19:42:33 +08:00
Li Yuxuan	d82fa43193	v2: Call shim.Delete at first when create is failed If the context is cancelled during `shim.Create()`, such as the client disconnects unexpectedly. The created shim will never be deleted. What's more, if the context is cancelled during `openShimLog()`, the fifo will be closed and block the shim output. Signed-off-by: Li Yuxuan <liyuxuan04@baidu.com>	2019-12-28 00:02:11 +08:00
Erik Sipsma	fbd46d7094	runtime v2: Close platform in runc shim's Shutdown method. Previously, the platform was closed as part of the Delete method when the process was an init for a task and there were no more tasks after its deletion. This can create problems if another task is created within the shim right after the delete runs, which results in the platform being closed but the shim continuing to run. This change moves closing the platform to the Shutdown method after the shim's context is canceled, which ensures the platform is only closed once the shim is sure its done servicing containers. Signed-off-by: Erik Sipsma <sipsma@amazon.com>	2019-12-19 09:47:40 -05:00
Akihiro Suda	b02e20f12e	cgroup2: enable controllers automatically Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-12-12 02:56:51 +09:00
Akihiro Suda	8f870c233f	support cgroup2 * only shim v2 runc v2 ("io.containerd.runc.v2") is supported * only PID metrics is implemented. Others should be implemented in separate PRs. * lots of code duplication in v1 metrics and v2 metrics. Dedupe should be separate PR. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-12-12 02:56:51 +09:00
Michael Crosby	f8cca26f3c	Handle large output in v2 shim with TTY Reized the I/O buffers to align with the size of the kernel buffers with fifos and move the close aspect of the console to key off of the stdin closing. Fixes #3738 Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-10-11 15:42:05 -04:00
Lantao Liu	ffcb1cc9be	Fix delete error code on the containerd daemon side. Signed-off-by: Lantao Liu <lantaol@google.com>	2019-10-09 00:28:51 -07:00
Lantao Liu	06be794cb2	Fix shim delete error code. Signed-off-by: Lantao Liu <lantaol@google.com>	2019-10-07 23:21:57 -07:00
Derek McGowan	0b224ac7d6	Update metadata interfaces for containers and leases Add more thorough dirty checking across all types which may be deleted and hold references. Signed-off-by: Derek McGowan <derek@mcgstyle.net>	2019-09-23 15:27:39 -07:00
Kathryn Baldauf	b4211d94e2	fail on file not found for shim reconnect on containerd restart Signed-off-by: Kathryn Baldauf <kabaldau@microsoft.com>	2019-09-17 14:49:29 -07:00
Derek McGowan	b039c39186	Merge pull request #3564 from tiborvass/move-cgroups-dep-to-namespaces-pkg runtime/opts: move WithNamespaceCgroupDeletion from containerd to its own package	2019-09-03 10:38:53 -07:00
Kathryn Baldauf	2d8a65b1b2	Export shim publisher functions - Our out of tree shim would like to publish events with ttrpc. These functions should be exposed so our shim doesn't need to reimplement publisher logic. Signed-off-by: Kathryn Baldauf <kabaldau@microsoft.com>	2019-08-27 17:15:15 -07:00
Tibor Vass	6624a70d92	runtime/opts: move WithNamespaceCgroupDeletion from containerd to its own package The cgroup dependency brings in quite a lot only for WithNamespaceCgroupDeletion, which is a namespaces.DeleteOpt. Signed-off-by: Tibor Vass <tibor@docker.com>	2019-08-27 19:02:55 +00:00
chentanjun	8266a3c5e7	fix-up spelling mistake Signed-off-by: chentanjun <2799194073@qq.com>	2019-08-27 13:45:41 +08:00
Michael Crosby	6cf031e1e4	Pass ttrpc address to shim via env Because of the way go handles flags, passing a flag that is not defined will cause an error. In our case, if we kept this as a flag, then third-party shims would break when they see this new flag. To fix this, I moved this new configuration option to an env var. We should use env vars from here on out to avoid breaking shim compat. Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-08-22 20:37:49 +00:00
Kevin Parsons	d7e1b25384	Allow explicit configuration of TTRPC address Previously the TTRPC address was generated as "<GRPC address>.ttrpc". This change now allows explicit configuration of the TTRPC address, with the default still being the old format if no value is specified. As part of this change, a new configuration section is added for TTRPC listener options. Signed-off-by: Kevin Parsons <kevpar@microsoft.com>	2019-08-22 00:56:27 -07:00
Phil Estes	640860a042	Merge pull request #3559 from fuweid/avoid-read-config runtime: only check killall for init process	2019-08-20 13:08:55 -04:00
Michael Crosby	08061c7c3c	Merge pull request #3540 from crosbymichael/shim-hang Use non-blocking send and retry for exit events	2019-08-20 09:31:21 -04:00
Wei Fu	1073868e5e	runtime: only check killall for init process When containerd-shim does reaper, the most processes are not init process. Since json.Decode consumes more CPU resource, we should check killall option for init process only. Signed-off-by: Wei Fu <fuweid89@gmail.com>	2019-08-20 19:18:34 +08:00
Phil Estes	fc9335d75c	Merge pull request #3459 from crosbymichael/timeout-config Allow timeouts to be configured in config	2019-08-19 13:16:43 -04:00
Li Yuxuan	04caf1fc4e	Ignore fifo error when using v2 multi-container shim When using a multi-container shim, the fifo of the 2nd to Nth container will not be opened when the ctx is done. This will cause an `ErrReadClosed` that can be ignored. Signed-off-by: Li Yuxuan <liyuxuan04@baidu.com>	2019-08-17 09:40:08 +08:00
Michael Crosby	0d27d8f4f2	Unifi reaper logic into package Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-08-16 13:55:05 +00:00
Shukui Yang	bb4c92c773	Fix shim hung shim.Reap and shim.Default.Wait may deadlock, use Monitor.Notify to fix this issue. Signed-off-by: Shukui Yang <keloyangsk@gmail.com>	2019-08-16 13:55:05 +00:00
Michael Crosby	2e8ea9fd6b	Allow timeouts to be configured in config This adds a singleton `timeout` package that will allow services and user to configure timeouts in the daemon. When a service wants to use a timeout, it should declare a const and register it's default value inside an `init()` function for that package. When the default config is generated, we can use the `timeout` package to provide the available timeout keys so that a user knows that they can configure. These show up in the config as follows: ```toml [timeouts] "io.containerd.timeout.shim.cleanup" = 5 "io.containerd.timeout.shim.load" = 5 "io.containerd.timeout.shim.shutdown" = 3 "io.containerd.timeout.task.state" = 2 ``` Timeouts in the config are specified in seconds. Timeouts are very hard to get right and giving this power to the user to configure things is a huge improvement. Machines can be faster and slower and depending on the CPU or load of the machine, a timeout may need to be adjusted. Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-08-13 17:36:32 +00:00
Akihiro Suda	225cc7d5bd	Merge pull request #3494 from jterry75/remove_v2 Completely remove Windows v2 in-tree shim	2019-08-07 02:19:12 +09:00
Li Yuxuan	08483d18ad	v2: Close ttrpc connection when `Delete()` This avoids potential socket leak when the connected v2 shim of runtime serving multiple containers. Signed-off-by: Li Yuxuan <liyuxuan04@baidu.com>	2019-08-06 20:35:59 +08:00
Justin Terry (VM)	4b5dfaee13	Completely remove Windows v2 in-tree shim Signed-off-by: Justin Terry (VM) <juterry@microsoft.com>	2019-08-05 16:49:56 -07:00
Derek McGowan	ac1cb6d5d4	Merge pull request #3467 from kevpar/dial-pipe-err Improve error return from AnonDialer on Windows	2019-08-01 15:41:54 -07:00
Kevin Parsons	daf12cd194	Improve error return from AnonDialer on Windows AnonDialer will now return a "not found" error if the pipe is not found before the timeout is reached. If the pipe exists but the timeout is reached while attempting to connect, the timeout error will still be returned. This will allow the error handling logic to work properly when connecting to the shim log pipe. An error message is only logged if the error is not "not found", so now log noise from log pipes that were never intended to be created by the shim will be hidden. This change also cleans up the control flow for AnonDialer on Windows. The new code should be more easily readable, but the only semantic change is the error return value change. Signed-off-by: Kevin Parsons <kevpar@microsoft.com>	2019-07-30 17:20:37 -07:00
Michael Crosby	eb4b3e8772	Fast path getting pid from task Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-07-26 17:48:00 +00:00
dzzg	c27e48d666	fix mis-spelling in client.go Signed-off-by: dzzg <zhengguang.zhu@daocloud.io>	2019-07-26 13:33:04 +08:00
Akihiro Suda	fab016c7a1	runtime/v1/linux: ignore ErrCgroupDeleted in Task.Start Fix a Rootless Docker-in-Docker issue on Fedora 30: https://github.com/docker-library/docker/pull/165#issuecomment-511717143 Related: #1598 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-07-17 12:19:15 +09:00
Maksym Pavlenko	ef7f46eb7b	Fix linter errors Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-07-14 20:49:40 -07:00
Michael Crosby	6601b406b7	Refactor runtime code for code sharing Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-07-08 11:47:53 -04:00
Phil Estes	2aa8780ce6	Merge pull request #3393 from lifupan/fix_deadshim shimv2: remove the dead task from runtime task list	2019-07-08 11:42:55 -04:00
lifupan	ec8d9d3d7a	shimv2: remove the dead task from runtime task list When shimv2 dead, the container would be cleanup, but the corresponding runtime task still existed in runtime task lists, it should be deleted too. Signed-off-by: lifupan <lifupan@gmail.com>	2019-07-04 15:51:03 +08:00
Derek McGowan	041d8d7051	Merge pull request #3366 from crosbymichael/exec-pid Robust pid locking for shim processes	2019-06-29 15:36:51 +08:00
Michael Crosby	7dfc605fc6	Set shim OOM scores to +1 containerd daemon score This changes the shim's OOM score from a static max killable of -999 to be +1 of the containerd daemon's score. This should allow the shim's to be killed first in an OOM condition but leave the daemon alone for a bit to help cleanup and manage the containers during this situation. Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-06-27 11:14:14 -04:00
Michael Crosby	719a2c594e	Robust pid locking for shim processes Closes #2832 Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-06-26 11:43:57 -04:00
Phil Estes	287582585f	Merge pull request #3365 from crosbymichael/exec-lk Reserve exec id to prevent race	2019-06-25 08:59:41 +08:00
Maksym Pavlenko	174c4907d0	Fix shim's file IO logging Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-06-24 13:21:41 -07:00
Michael Crosby	1a8df3f237	Reserve exec id to prevent race ref #2820 Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-06-21 14:52:44 -04:00
Michael Crosby	245052243d	Add timeout for I/O waitgroups Closes #3286 This and a combination of a couple Docker changes are needed to fully resolve the issue on the Docker side. However, this ensures that after processes exit, we still leave some time for the I/O to fully flush before closing. Without this timeout, the delete methods would block forever. Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-06-20 16:13:51 -04:00
Wei Fu	111b082e20	Merge pull request #3356 from mxpv/binary-io-path BinaryIO/LogFile creator bug fixing	2019-06-20 10:25:47 +08:00
Maksym Pavlenko	fbf96d302a	Fix path in LogFile creator Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-06-19 16:53:33 -07:00
Maksym Pavlenko	5e0d793801	Fix bugs in BinaryIO creator Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-06-19 11:15:17 -07:00
Ace-Tang	95f9bbf18b	Add timeout in load shim v2 add timeout in connect shim v2 avoid starting containerd hang Signed-off-by: Ace-Tang <aceapril@126.com>	2019-06-19 13:10:18 +08:00
Maksym Pavlenko	bca5667362	Make newBinaryIO public Allow third-party runtime implementations to reuse NewBinaryIO in order to support pluggable shim logging binary protocol. Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-06-12 16:22:10 -07:00

1 2 3 4 5 ...

371 Commits