containerd

Author	SHA1	Message	Date
Akihiro Suda	b23dc1131e	restart: parallelize reconcile() The only shared variable `m.client` is thread-safe, so we can safely parallelize the loops. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-02-25 13:30:00 +09:00
Shiming Zhang	05ef2fe2fb	Fix missing close Signed-off-by: Shiming Zhang <wzshiming@foxmail.com>	2021-02-18 13:21:42 +08:00
Fabiano Fidêncio	d80dbdae68	v2, util: Take the full binary path when starting the shimv2 process The current code simply ignores the full binary path when starting the shimv2 process, and instead fallbacks to a binary in the path, and this is problematic (and confusing) for those using CRI-O, which has this bits vendored. The reason it's problematic with CRI-O is because the user can simply set the full binary path and, instead of having that executed, CRI-O will simply fail to create the container unless that binary is part of the path, which may not be case in a few different scenarios (testing being the most common one). Fixes: #5006 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-02-05 13:35:22 +01:00
IceberGu	b458583b76	runtime: fix shutdown runc v2 service Signed-off-by: IceberGu <wei.cai-nat@daocloud.io>	2021-02-02 15:36:49 +08:00
Phil Estes	49c5c14879	Merge pull request #4906 from payall4u/bugfix/fix-open-shim-fifo bugfix: change the flag of open log fifo to avoid containerd hang on syscall open	2021-02-01 09:01:38 -05:00
payall4u	957fa3379d	change flag from RDONLY to RDWR and close the fifo correct Signed-off-by: Zhiyu Li <payall4u@qq.com>	2021-01-31 19:00:42 +08:00
Aditi Sharma	1423e9199d	Update gogo/protobuf to v1.3.2 bump version 1.3.2 for gogo/protobuf due to CVE-2021-3121 discovered in gogo/protobuf version 1.3.1, CVE has been fixed in 1.3.2 Signed-off-by: Aditi Sharma <adi.sky17@gmail.com>	2021-01-28 12:57:50 +00:00
Maksim An	ddb5e1651a	Enhance logging driver and ctr tasks to support windows Signed-off-by: Maksim An <maksiman@microsoft.com>	2021-01-21 12:17:32 -08:00
Wei Fu	846cb963cc	runtime/v2: should use defer ctx to cleanup Signed-off-by: Wei Fu <fuweid89@gmail.com>	2021-01-11 23:22:38 +08:00
Maksym Pavlenko	c1b01eabc0	Add copyright header to proto files Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-01-05 10:44:07 -08:00
Michael Crosby	dc207b654d	Merge pull request #4860 from masters-of-cats/pr-process-not-found-err Return GRPC not found error instead of plain one	2020-12-21 10:25:11 -05:00
Georgi Sabev	7451dd1ed1	Return GRPC not found error instead of plain one When the shim returns a plain error when a process does not exist, the server is unable to recognise its GRPC status code and assumes UnknownError. This is awkward for containerd client users as they are unable to recognise the actual reason for the error. When the shim returns a NotFound GRPC error, it is properly translated by the server and clients receive a proper NotFound error instead of Unknown Please note that we (CF Garden) would like to have the eventual fix backported to 1.4 as well. Co-authored-by: Danail Branekov <danailster@gmail.com> Signed-off-by: Danail Branekov <danailster@gmail.com> Signed-off-by: Georgi Sabev <georgethebeatle@gmail.com>	2020-12-18 15:33:48 +02:00
Phil Estes	070b698449	Merge pull request #4845 from skaegi/oom_score-max Add bounds on max oom_score_adj value for AdjustOOMScore	2020-12-17 16:22:46 -05:00
Simon Kaegi	da2fd657ab	Add bounds on max oom_score_adj value for AdjustOOMScore oom_score_adj must be in the range -1000 to 1000. In AdjustOOMScore if containerd's score is already at the maximum value we should set that value for the shim instead of trying to set 1001 which is invalid. Signed-off-by: Simon Kaegi <simon_kaegi@ca.ibm.com>	2020-12-14 15:09:24 -05:00
Akihiro Suda	0356d5d4b2	restart: allow passing existing log URI object The new function `WithLogURI(uri *url.URL)` replaces `WithBinaryLogURI(binary string, args map[string]string)` so as to allow passing an existring URI object. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-12-12 05:11:03 +09:00
Akihiro Suda	7126310a09	Merge pull request #4784 from fuweid/fix-4769 runtime: should not send duplicate task exit event	2020-12-02 15:26:57 +09:00
Wei Fu	faec5d4ffd	runtime: should not send duplicate task exit event If the shim has been killed and ttrpc connection has been closed, the shimErr will not be nil. For this case, the event subscriber, like moby/moby, might have received the exit or delete events. Just in case, we should allow ttrpc-callback-on-close to send the exit and delete events again. And the exit status will depend on result of shimV2.Delete. If not, the shim has been delivered the exit and delete events. So we should remove the task record and prevent duplicate events from ttrpc-callback-on-close. Fix: #4769 Signed-off-by: Wei Fu <fuweid89@gmail.com>	2020-12-01 21:54:04 +08:00
Derek McGowan	4a4bb851f5	Merge pull request from GHSA-36xw-fx78-c5r4 Use path based unix socket for shims	2020-11-30 10:32:18 -08:00
Maksym Pavlenko	0d4734655f	Merge pull request #4647 from katiewasnothere/task_update_annotations_upstream Add annotations to task update request api	2020-11-18 14:44:19 -08:00
Samuel Karp	126b35ca43	containerd-shim: use path-based unix socket This allows filesystem-based ACLs for configuring access to the socket of a shim. Ported from Michael Crosby's similar patch for v2 shims. Signed-off-by: Samuel Karp <skarp@amazon.com>	2020-11-11 11:47:47 -08:00
Michael Crosby	bd908acabd	Use path based unix socket for shims This allows filesystem based ACLs for configuring access to the socket of a shim. Co-authored-by: Samuel Karp <skarp@amazon.com> Signed-off-by: Samuel Karp <skarp@amazon.com> Signed-off-by: Michael Crosby <michael@thepasture.io> Signed-off-by: Michael Crosby <michael.crosby@apple.com>	2020-11-11 11:47:46 -08:00
Kathryn Baldauf	95ba6e9f75	Add annotations to task update request api Signed-off-by: Kathryn Baldauf <kabaldau@microsoft.com>	2020-11-09 14:13:33 -08:00
Maksym Pavlenko	4da306e1e9	Fix panic in shim not logged Fix #4274 Carry #4298 Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2020-10-26 09:05:47 -07:00
Giuseppe Capizzi	8eda32e107	Check if a process exists before returning it Fixes #4632. Signed-off-by: Giuseppe Capizzi <gcapizzi@pivotal.io> Co-authored-by: Danail Branekov <danailster@gmail.com>	2020-10-22 16:50:14 +03:00
Akihiro Suda	915263f269	Merge pull request #4502 from akshat-kmr/master Add logging binary support when terminal is true	2020-10-08 12:14:39 +09:00
Maksym Pavlenko	c59d1cd5b0	Fix linter issues Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2020-10-07 15:42:01 -07:00
Phil Estes	68d97331be	Merge pull request #4538 from fuweid/update-shim-cleanup runtime/v2: cleanup dead shim before delete bundle	2020-09-21 13:32:40 -04:00
Wei Fu	4b05d03903	runtime/v2: cleanup dead shim before delete bundle The shim delete action needs bundle information to cleanup resources created by shim. If the cleanup dead shim is called after delete bundle, the part of resources maybe leaky. The ttrpc client UserOnCloseWait() can make sure that resources are cleanup before delete bundle, which synchronizes task deletion and cleanup deadshim. It might slow down the task deletion, but it can make sure that resources can be cleanup and avoid EBUSY umount case. For example, the sandbox container like Kata/Firecracker might have mount points over the rootfs. If containerd handles task deletion and cleanup deadshim parallelly, the task deletion will meet EBUSY during umount and fail to cleanup bundle, which makes case worse. And also update cleanupAfterDeadshim, which makes sure that cleanupAfterDeadshim must be called after shim disconnected. In some case, shim fails to call runc-create for some reason, but the runc-create already makes runc-init into ready state. If containerd doesn't call shim deletion, the runc-init process will be leaky and hold the cgroup, which makes pod terminating :(. Signed-off-by: Wei Fu <fuweid89@gmail.com>	2020-09-20 11:24:31 +08:00
Akshat Kumar	61da6986c0	Cleanup open pipes if logging binary fails to start Signed-off-by: Akshat Kumar <kshtku@amazon.com>	2020-09-10 20:06:51 -07:00
Brian Goff	dab7bd0c45	Always consume shim logs These fifos fill up if unconsumed, so always consume them. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2020-09-10 10:23:29 -07:00
Brian Goff	5f9d15eaac	shimv1: downgrade poroccess missing log to debug This `Info` log shows up for all exec processes that use the v1 shim with Docker because Docker deletes the process once it receives the exit event from containerd. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2020-09-01 10:31:41 -07:00
Akshat Kumar	4cc99e57a7	Remove unnecessary logging binary helpers and add godoc Signed-off-by: Akshat Kumar <kshtku@amazon.com>	2020-08-26 09:15:02 -07:00
Akshat Kumar	7a9fbec5fb	Add logging binary support when terminal is true Currently the shims only support starting the logging binary process if the io.Creator Config does not specify Terminal: true. This means that the program using containerd will only be able to specify FIFO io when Terminal: true, rather than allowing the shim to fork the logging binary process. Hence, containerd consumers face an inconsistent behavior regarding logging binary management depending on the Terminal option. Allowing the shim to fork the logging binary process will introduce consistency between the running container and the logging process. Otherwise, the logging process may die if its parent process dies whereas the container will keep running, resulting in the loss of container logs. Signed-off-by: Akshat Kumar <kshtku@amazon.com>	2020-08-25 17:28:29 -07:00
Wei Fu	73b1449278	runtime: ignore ErrNotExist when remove rootfs Signed-off-by: Wei Fu <fuweid89@gmail.com>	2020-08-12 20:04:50 +08:00
Brian Goff	d7b9cb0019	shim: move event context timeout to publsher Before this change, if an event fails to send on the first attempt, subsequent attempts will fail with context.Cancelled because the the caller of publish passes a cancellable timeout, which the publisher uses to send the event. The publisher returns immediately if the send fails, but adds the event to an async queue to try again. Meanwhile the caller will return cancelling the context. Additionally, subsequent attempts may fail to send because the timeout was expected to be for a single request but the queue sleeps for `attempt*time.Second`. In the shim service, the timeout was set to 5s, which means the send will fail with context.DeadlineExceeded before it reaches `maxRequeue` (which is currently 5). This change moves the timeout to the publisher so each send attempt gets its own timeout. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2020-07-20 17:51:10 -07:00
Akihiro Suda	fd99b6566b	decrease log level of cgroup2 ToggleController error when running in UserNS Fix #4312 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-06-24 18:15:16 +09:00
Phil Estes	fb80a49ec1	Merge pull request #4327 from AkihiroSuda/fix-4326 shim v2 runc: propagate options.Root to Cleanup	2020-06-17 09:23:53 -04:00
Akihiro Suda	f1a469a035	shim v2 runc: propagate options.Root to Cleanup Previously shim v2 (`io.containerd.runc.{v1,v2}`) always used `/run/containerd/runc` as the runc root. Fix #4326 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-06-17 19:06:36 +09:00
Wei Fu	d656fa38ca	restart plugin: support binary log uri Introduce LogURIGenerator helper function in cio package. It is used in the restart options, like WithBinaryLogURI and WithFileLogURI. And restart.LogPathLabel might be used in production and work well. In order to reduce breaking change, the LogPathLabel is still recognized if new LogURILabel is not set. In next release 1.5, the LogPathLabel will be removed. Signed-off-by: Wei Fu <fuweid89@gmail.com>	2020-06-10 00:09:24 +08:00
Michael Crosby	7ce8a9d7d3	Merge pull request #4204 from ashrayjain/aj/add-kill-retry Make killing shims more resilient	2020-06-03 11:10:43 -04:00
Ashray Jain	3e95727f39	Make killing shims more resilient Currently, we send a single SIGKILL to the shim process once and then we spin in a loop where we use kill(pid, 0) to detect when the pid has disappeared completely. Unfortunately, this has a race condition since pids can be reused causing us to spin in an infinite loop when that happens. This adds a timeout to this loop which logs a warning and exits the infinite loop. Signed-off-by: Ashray Jain <ashrayj@palantir.com>	2020-06-03 12:57:08 +01:00
Akihiro Suda	2f601013e6	cgroup2: implement `containerd.events.TaskOOM` event How to test (from https://github.com/opencontainers/runc/pull/2352#issuecomment-620834524): (host)$ sudo swapoff -a (host)$ sudo ctr run -t --rm --memory-limit $((1024102432)) docker.io/library/alpine:latest foo (container)$ sh -c 'VAR=$(seq 1 100000000)' An event `/tasks/oom {"container_id":"foo"}` will be displayed in `ctr events`. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-06-01 14:00:13 +09:00
Sebastiaan van Stijn	dc92ad6520	Replace errors.Cause() with errors.Is() Dependencies may be switching to use the new `%w` formatting option to wrap errors; switching to use `errors.Is()` makes sure that we are still able to unwrap the error and detect the underlying cause. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-05-08 14:36:45 +02:00
Sebastiaan van Stijn	1b66fecad3	Integrate sys.SetSubreaper, sys.GetSubreaper in sys/reaper package Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-05-04 08:44:02 +02:00
Wei Fu	9687ba6315	test: TestRuntimeWithEmptyMaxEnvProcs should cleanup TestRuntimeWithEmptyMaxEnvProcs should restore the GoMaxProcs after test so that the temporary change of GoMaxProcs will not impact other case, like TestRuntimeWithNonEmptyMaxEnvProcs. Signed-off-by: Wei Fu <fuweid89@gmail.com>	2020-04-23 22:09:10 +08:00
Wei Fu	0116352e1b	runtime: ignore ttrpc.ErrClosed when delete task For some reason, shimv2 process doesn't exist. The ttrpc doesn't detect the connection closed by server until delete task. For this case, we should ignore the ttrpc.ErrClosed and let task manager handle the cleanup. Signed-off-by: Wei Fu <fuweid89@gmail.com>	2020-04-20 23:34:49 +08:00
Michael Crosby	2ed8d12bb0	Merge pull request #3845 from fahedouch/v2_shim_test v2 runtime shim test	2020-04-13 12:26:05 -04:00
Maksym Pavlenko	0caa233158	Rework shim logger shutdown process Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2020-04-07 12:42:04 -07:00
Michael Crosby	649f2aac66	add -v to shim binaries Request came from a slack message that shims do not output their versions making it hard for users and operators to know what version of a shim they have on the system. This adds a `-v` flag to the shims so that users can see if a shim is in sync with containerd or what versions of shims that they are running. Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2020-03-17 13:23:06 -04:00
Maksym Pavlenko	2532bdf43f	Merge pull request #4100 from lifubang/publisher fix dial error when clean up a dead shim	2020-03-14 15:19:48 -07:00

... 2 3 4 5 6 ...

587 Commits