containerd

Author	SHA1	Message	Date
Michael Crosby	82af36e59b	Merge pull request #5828 from cpuguy83/shimv2_exit_on_signals shimv2: handle sigint/sigterm	2022-01-31 10:47:39 -05:00
Brian Goff	3ffb6a6113	shimv2: handle sigint/sigterm This causes sigint/sigterm to trigger a shutdown of the shim. It is needed because otherwise the v2 shim hangs system shutdown. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2022-01-25 17:57:28 +00:00
Wei Fu	31a710c492	fix: should not send 137 code event if cmd is notfound ShimV2 has shim.Delete command to cleanup task's temporary resource, like bundle folder. Since the shim server exits and no persistent store is for task's exit code, the result of shim.Delete is always 137 exit code, like the task has been killed. And the result of shim.Delete can be used as task event only when the shim server is killed somehow after container is running. Therefore, dockerd, which watches task exit event to update status of container, can report correct status. Back to the issue #6429, the container is not running because the entrypoint is not found. Based on this design, we should not send 137 exitcode event to subscriber. This commit is aimed to remove shim instance first and then the `cleanupAfterDeadShim` should not send event. Similar Issue: #4769 Fix #6429 Signed-off-by: Wei Fu <fuweid89@gmail.com>	2022-01-22 00:58:33 +08:00
Jeff Zvier	356ca75757	containerd-shim-runc-v2: return init pid when clean dead shim If containerd-shim-runc-v2 process dead abnormally, such as received kill 9 signal, panic or other unkown reasons, the containerd-shim-runc-v2 server can not reap runc container and forward init process exit event. This will lead the container leaked in dockerd. When shim dead, containerd will clean dead shim, here read init process pid and forward exit event with pid at the same time. Signed-off-by: Jeff Zvier <zvier20@gmail.com>	2022-01-20 17:06:55 +08:00
宁明晓10296073	b35fb7d447	remove io/ioutil Signed-off-by: ningmingxiao <ning.mingxiao@zte.com.cn>	2022-01-11 16:07:23 +08:00
haoyun	bbe46b8c43	feat: replace github.com/pkg/errors to errors Signed-off-by: haoyun <yun.hao@daocloud.io> Co-authored-by: zounengren <zouyee1989@gmail.com>	2022-01-07 10:27:03 +08:00
Phil Estes	330961c2d5	Merge pull request #6358 from jonyhy96/feat-error refactor: functions for error log and error return	2021-12-14 10:16:54 -05:00
Fu Wei	d47fa40d1b	Merge pull request #6021 from dmcgowan/runc-shim-plugin	2021-12-14 10:19:23 +08:00
Derek McGowan	f83ab813d2	Use task plugin for runc shim Signed-off-by: Derek McGowan <derek@mcg.dev>	2021-12-13 10:37:13 -08:00
Derek McGowan	04e57d71b2	Seperate shim manager and task service Create new shim manager interface and deprecate older shim manager interface. Signed-off-by: Derek McGowan <derek@mcg.dev>	2021-12-13 10:37:12 -08:00
haoyun	c0d07094be	feat: Errorf usage Signed-off-by: haoyun <yun.hao@daocloud.io>	2021-12-13 14:31:53 +08:00
Maksym Pavlenko	6bccd67e84	Revert shim plugin migration Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-12-02 10:35:15 -08:00
Maksym Pavlenko	5015130f7a	Fix executable file not found when restoring shims Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-22 17:46:03 -08:00
Derek McGowan	6835a94707	Split runc shim into plugin components Signed-off-by: Derek McGowan <derek@mcg.dev>	2021-11-15 20:16:45 -08:00
Maksym Pavlenko	e17fe37e01	Fix package alias Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-10 14:29:41 -08:00
Maksym Pavlenko	6870f3b1b8	Support custom runtime path when launching tasks Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-09 13:31:46 -08:00
Maksym Pavlenko	d022fbe789	Address PR comments Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-02 11:19:43 -07:00
Maksym Pavlenko	2cec3a34b1	Migrate task directory Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-01 07:37:01 -07:00
Maksym Pavlenko	8b788d9dfe	Expose shim process interface Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-01 07:37:01 -07:00
Maksym Pavlenko	733519677f	Fix after rebase Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-01 07:37:01 -07:00
Maksym Pavlenko	df8c206a92	Cleanup shim loading Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-01 07:37:01 -07:00
Maksym Pavlenko	b554b577b0	Move shim restore to a separate file Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-01 07:37:01 -07:00
Maksym Pavlenko	a3d298193c	Fix backward compatibility with old task shims Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-01 07:37:01 -07:00
Maksym Pavlenko	33786ee4d2	Add plugin dependency between shim and shim services Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-01 07:37:00 -07:00
Maksym Pavlenko	fb5f6ce3c9	Rework task create and cleanup flow Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-01 07:37:00 -07:00
Maksym Pavlenko	7c4ead285d	Add task manager Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-01 07:36:58 -07:00
Maksym Pavlenko	2d5d3541e6	Rename task manager to shim manager Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-11-01 07:36:34 -07:00
zounengren	1f1cad3912	io/ioutil package has been deprecated in Go 1.16 that replaces io/ioutil functions Signed-off-by: Zou Nengren <zouyee1989@gmail.com>	2021-10-13 09:18:31 +08:00
Michael Crosby	e48bbe8394	add runc shim support for sched core In linux 5.14 and hopefully some backports, core scheduling allows processes to be co scheduled within the same domain on SMT enabled systems. The containerd impl sets the core sched domain when launching a shim. This allows a clean way for each shim(container/pod) to be in its own domain and any additional containers, (v2 pods) be be launched with the same domain as well as any exec'd process added to the container. kernel docs: https://www.kernel.org/doc/html/latest/admin-guide/hw-vuln/core-scheduling.html Signed-off-by: Michael Crosby <michael@thepasture.io>	2021-10-08 16:18:09 +00:00
Derek McGowan	2d48b6a864	Merge pull request #6031 from fuweid/carry-5648 runtime: should fail fast if dial error on shim	2021-10-07 09:43:10 -07:00
Derek McGowan	3f00006f72	Merge pull request from GHSA-c2h3-6mxw-7mvq v1 & v2 runtimes: reduce permissions for bundle dir	2021-10-04 08:24:47 -07:00
Samuel Karp	7d56b24f1a	v2 runtime: reduce permissions for bundle dir Bundle directory permissions should be 0700 by default. On Linux with user namespaces enabled, the remapped root also needs access to the bundle directory. In this case, the bundle directory is modified to 0710 and group ownership is changed to the remapped root group. Signed-off-by: Samuel Karp <skarp@amazon.com>	2021-09-22 16:13:09 -07:00
Wei Fu	f7658e37d9	runtime: should fail fast if dial error on shim In linux platform, the shim server always listens on the socket before the containerd task manager dial it. It is unlikely that containerd task manager should handle reconnect because the shim can't restart. For this case, the containerd task manager should fail fast if there is ENOENT or ECONNREFUSED error. And if the socket file is deleted during cleanup the exited task, it maybe cause that containerd task manager takes long time to reload the dead shim. For that task.v2 manager, the race case is like: ``` TaskService.Delete TaskManager.Delete(runtime/v2/manager.go) shim.delete(runtime/v2/shim.go) shimv2api.Shutdown(runtime/v2/task/shim.pb.go) <- containerd has been killed or restarted somehow bundle.Delete ``` The shimv2api.Shutdown will cause that the shim deletes socket file (containerd-shim-runc-v2 does). But the bundle is still there. During reloading, the containerd will wait for the socket file appears again in 100 seconds. It is not reasonable. The Reconnect should prevent this case by fast fail. Closes: #5648. Fixes: #5597. Signed-off-by: Wei Fu <fuweid89@gmail.com>	2021-09-23 00:00:28 +08:00
Eng Zer Jun	50da673592	refactor: move from io/ioutil to io and os package The io/ioutil package has been deprecated as of Go 1.16, see https://golang.org/doc/go1.16#ioutil. This commit replaces the existing io/ioutil functions with their new definitions in io and os packages. Signed-off-by: Eng Zer Jun <engzerjun@gmail.com>	2021-09-21 09:50:38 +08:00
Fu Wei	e1ad779107	Merge pull request #5817 from dmcgowan/shim-plugins Add support for shim plugins	2021-09-12 18:18:20 +08:00
Fu Wei	d9f921e4f0	Merge pull request #5906 from thaJeztah/replace_os_exec	2021-09-11 10:38:53 +08:00
Phil Estes	99987f2a5e	Merge pull request #5936 from ukontainer/feature-darwin-runtime-shim darwin: runtime support	2021-09-08 09:34:27 -04:00
zounengren	8e850bc0fe	replace deprecated Dail with DailContext Signed-off-by: Zou Nengren <zouyee1989@gmail.com>	2021-09-08 06:41:33 +08:00
Hajime Tazaki	5dd38792a8	darwin: use the default values for socketRoot variable Since the /run directory on macOS is read-only, darwin containerd should use a different directory. Use the pre-defined default values instead to avoid this issue. Fixes: `bd908acab` ("Use path based unix socket for shims") Signed-off-by: Hajime Tazaki <thehajime@gmail.com>	2021-09-03 09:48:21 +09:00
Sebastiaan van Stijn	2ac9968401	replace uses of os/exec with golang.org/x/sys/execabs Go 1.15.7 contained a security fix for CVE-2021-3115, which allowed arbitrary code to be executed at build time when using cgo on Windows. This issue also affects Unix users who have “.” listed explicitly in their PATH and are running “go get” outside of a module or with module mode disabled. This issue is not limited to the go command itself, and can also affect binaries that use `os.Command`, `os.LookPath`, etc. From the related blogpost (ttps://blog.golang.org/path-security): > Are your own programs affected? > > If you use exec.LookPath or exec.Command in your own programs, you only need to > be concerned if you (or your users) run your program in a directory with untrusted > contents. If so, then a subprocess could be started using an executable from dot > instead of from a system directory. (Again, using an executable from dot happens > always on Windows and only with uncommon PATH settings on Unix.) > > If you are concerned, then we’ve published the more restricted variant of os/exec > as golang.org/x/sys/execabs. You can use it in your program by simply replacing This patch replaces all uses of `os/exec` with `golang.org/x/sys/execabs`. While some uses of `os/exec` should not be problematic (e.g. part of tests), it is probably good to be consistent, in case code gets moved around. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-08-25 18:11:09 +02:00
Akihiro Suda	d3aa7ee9f0	Run `go fmt` with Go 1.17 The new `go fmt` adds `//go:build` lines (https://golang.org/doc/go1.17#tools). Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-08-22 09:31:50 +09:00
Derek McGowan	8d135d2842	Add support for shim plugins Refactor shim v2 to load and register plugins. Update init shim interface to not require task service implementation on returned service, but register as plugin if it is. Signed-off-by: Derek McGowan <derek@mcg.dev>	2021-08-17 11:06:09 -07:00
Phil Estes	7d4c95ff04	Merge pull request #5813 from mxpv/shim_cleanup Cleanup v2 shim	2021-08-11 11:47:47 -04:00
Michael Crosby	218db0f9af	Merge pull request #5835 from dmcgowan/plugin-events-cleanup Move plugin context events into separate plugin	2021-08-07 21:47:11 -04:00
Derek McGowan	0a0621bb47	Move plugin context events into separate plugin Signed-off-by: Derek McGowan <derek@mcg.dev>	2021-08-05 22:59:20 -07:00
Derek McGowan	6f027e38a8	Remove redundant build tags Remove build tags which are already implied by the name of the file. Ensures build tags are used consistently Signed-off-by: Derek McGowan <derek@mcg.dev>	2021-08-05 22:27:46 -07:00
Maksym Pavlenko	d30d897ef9	Cleanup v2 shim Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-08-04 10:38:05 -07:00
Maksym Pavlenko	fcd9c41991	Merge pull request #5746 from lifupan/main runtime: fix the issue of create new socket with abstract address	2021-07-29 15:40:28 -07:00
fupan.lfp	4ab3e7a53a	runtime: fix the issue of create new socket with abstract address For the abstract socket adress there's no need to chmod the address's file, cause the file didn't exist actually. Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2021-07-27 23:24:26 +08:00
jerryzhuang	7a10fd4fcc	respect context timeout in shim binary call Signed-off-by: jerryzhuang <zhuangqhc@gmail.com>	2021-07-27 22:28:05 +08:00
yylt	0d45ac14e9	interface about shim build check Signed-off-by: Yang Yang <yang8518296@163.com>	2021-07-22 09:03:12 +08:00
Sebastiaan van Stijn	dbef1d56d7	runtime: runc v2: remove redundant validation cgroupsv2.LoadManager() already performs VerifyGroupPath(), and returns an error if the path is invalid, so this check is redundant. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-07-14 23:24:31 +02:00
Maksym Pavlenko	f0a32c66da	Merge pull request #5617 from fidencio/wip/shimv2-debug runtime,v2: Enable debug when containerd is on debug+ log level	2021-06-17 10:08:51 -07:00
Fabiano Fidêncio	87a2e0b2a2	runtime,v2: Enable debug when containerd is on debug+ log level Currently the shimv2 debug is only enabled when containerd is, specifically, on debug mode. However, it should be enabled whenever the CRI runtime is on debug or any other lower mode, as in trace mode. Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-06-17 12:43:02 +02:00
Shiming Zhang	7966a6652a	Cleanup code Signed-off-by: Shiming Zhang <wzshiming@foxmail.com>	2021-04-19 16:59:45 +08:00
Maksym Pavlenko	69a30ad581	Merge pull request #5378 from Iceber/check-flag runtime/shim: check the namespace flag first	2021-04-18 09:10:46 -07:00
Phil Estes	1e5cb4edcb	Merge pull request #5368 from mxpv/runtime_cleanup Runtime cleanup	2021-04-16 14:50:15 -04:00
Iceber Gu	34780d67ad	runtime/shim: check the namespace flag first Signed-off-by: Iceber Gu <wei.cai-nat@daocloud.io>	2021-04-16 17:32:21 +08:00
Samuel Karp	b431fe4fc0	freebsd: don't run shim delete in deleted dir fork/exec can fail and log a warning like this in containerd's log: failed to clean up after shim disconnected error=": fork/exec /usr/local/bin/containerd-shim-[my-shim]: no such file or directory" id=test namespace=default Passing the bundle path on the command line allows the shim delete command to run successfully. Signed-off-by: Samuel Karp <me@samuelkarp.com>	2021-04-15 18:09:29 -07:00
Maksym Pavlenko	993b863993	Add shim start opts Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-04-15 11:55:24 -07:00
Maksym Pavlenko	0ad8c0a169	Decouple shim start from task creation Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-04-11 18:51:27 -07:00
Sebastiaan van Stijn	7bb73da6b9	runtime/v2/shim: remove unused SetScore() and remove sys.OOMScoreMaxKillable The shim.SetScore() utility was no longer used since `7dfc605fc6`. Checking for uses outside of this repository, I found only one external use of this in gVisor; `a9441aea27/pkg/shim/service.go (L262-L264)` Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-04-07 19:16:58 +02:00
Sebastiaan van Stijn	91e7d21ee8	sys: add AdjustOOMScore() utility Handle the limits in this function so that consumers don't have to perform the boundary checks. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-04-07 19:16:56 +02:00
Sebastiaan van Stijn	708299ca40	Move RunningInUserNS() to its own package This allows using the utility without bringing whole of "sys" with it. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-03-23 11:29:53 +01:00
Maksym Pavlenko	56f17a0856	Merge pull request #5148 from wzshiming/fix/defer-cleanup runtime/v2: Fix defer cleanup for TaskManager.Create	2021-03-20 13:24:42 -07:00
Shiming Zhang	30e1e66e5c	runtime/v2: Fix defer cleanup Signed-off-by: Shiming Zhang <wzshiming@foxmail.com>	2021-03-20 18:40:36 +08:00
Michael Crosby	e0c94bb269	Merge pull request #4708 from kzys/enable-criu Re-enable CRIU tests by not using overlayfs snapshotter	2021-03-19 14:23:05 -04:00
Wei Fu	9fdc96c095	runtime/v2: add comment for checkCopyShimLogError After #4906, containerd opens fifo in read/write mode in linux platform The original comment doesn't correct and is removed by #5174. ``` // original comment // When using a multi-container shim, the fifo of the 2nd to Nth // container will not be opened when the ctx is done. This will // cause an ErrReadClosed that can be ignored. ``` However, we should add comment for checkCopyShimLogError to mention why we call checkCopyShimLogError. The checkCopyShimLogError, it is to prevent the flood of expected error messages after task die and the expected errors depend on platform. Signed-off-by: Wei Fu <fuweid89@gmail.com>	2021-03-18 13:02:28 +08:00
Phil Estes	a0cc9b432d	Merge pull request #5195 from fuweid/fix-5173 runtime/v2/runc: fix leaking socket path	2021-03-17 09:33:41 -04:00
Kazuyoshi Kato	b520428b5a	Fix CRIU - process.Init#io could be nil - Make sure CreateTaskRequest#Options is not empty before unmarshaling Signed-off-by: Kazuyoshi Kato <katokazu@amazon.com>	2021-03-16 16:46:45 -07:00
Iceber Gu	5e484c9613	runtime/v2/runc: fix the defer cleanup of the NewContainer Signed-off-by: Iceber Gu <wei.cai-nat@daocloud.io>	2021-03-16 11:41:17 +08:00
Phil Estes	a1138182d5	Merge pull request #5180 from dmcgowan/lint-enforce-comments Fix exported comments enforcer in CI	2021-03-15 10:50:06 -04:00
Wei Fu	d895118c7c	runtime/v2/runc: fix leaking socket path When runC shimv2 starts, the StartShim interface will re-exec itself as long-running process, which will read the `address` during initializing. ```happycase Process containerd-shim-runc-v1/v2 start containerd-shim-runc-v1/v2 initializing socket reexec containerd-shim-runc-v1/v2 write address into file initializing read address write back to containerd daemon serving ... remove address in Shutdown call ``` However, there is no synchronization after reexec. Then the data race is like: ```leaking-case Process containerd-shim-runc-v1/v2 start containerd-shim-runc-v1/v2 initializing socket reexec containerd-shim-runc-v1/v2 initializing read address write address into file write back to containerd daemon serving ... fail to remove address because of empty address ``` The `address` should be writen into file first before reexec. And if shutdown the whole service before cleanup temporary resource (like socket file), the Shutdown caller will receive `ttrpc: closed` sometime, which depends on go runtime scheduler. Then it also causes leaking socket files. Since the shimV2-Delete binary API must be called to cleanup shim temporary resource and shimV2-runC-v1 doesn't support grouping multi containers in one, it is safe to remove the socket file in the binary call for shimV2-runC-v1. But for the shimV2-runC-v2 shim, we still cleanup socket in Shutdown. Hopefully we can find a way to cleanup socket in shimV2-Delete binary call. Fix: #5173 Signed-off-by: Wei Fu <fuweid89@gmail.com>	2021-03-15 18:32:00 +08:00
Wei Fu	eabd9b98b6	runtime: ignore file-already-closed error if dead shim fix: #5130 Signed-off-by: Wei Fu <fuweid89@gmail.com>	2021-03-15 12:18:26 +08:00
Derek McGowan	35eeb24a17	Fix exported comments enforcer in CI Add comments where missing and fix incorrect comments Signed-off-by: Derek McGowan <derek@mcg.dev>	2021-03-12 08:47:05 -08:00
Kevin Parsons	c9afc4250a	Fix error checking when resolving shim binary path Previously a typo was introduced that caused the wrong error to be checked against when calling exec.LookPath. This had the effect that containerd would never locate the shim binary if it was in the same directory as containerd's binary, but not in PATH. Signed-off-by: Kevin Parsons <kevpar@microsoft.com>	2021-03-08 16:24:19 -08:00
Maksym Pavlenko	134f7a7370	Merge pull request #5007 from fidencio/wip/allow-shimv2-to-also-be-loaded-from-an-arbitrary-path v2, util: Take the full binary path when starting the shimv2 process	2021-03-01 14:52:27 -08:00
Shiming Zhang	05ef2fe2fb	Fix missing close Signed-off-by: Shiming Zhang <wzshiming@foxmail.com>	2021-02-18 13:21:42 +08:00
Fabiano Fidêncio	d80dbdae68	v2, util: Take the full binary path when starting the shimv2 process The current code simply ignores the full binary path when starting the shimv2 process, and instead fallbacks to a binary in the path, and this is problematic (and confusing) for those using CRI-O, which has this bits vendored. The reason it's problematic with CRI-O is because the user can simply set the full binary path and, instead of having that executed, CRI-O will simply fail to create the container unless that binary is part of the path, which may not be case in a few different scenarios (testing being the most common one). Fixes: #5006 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-02-05 13:35:22 +01:00
IceberGu	b458583b76	runtime: fix shutdown runc v2 service Signed-off-by: IceberGu <wei.cai-nat@daocloud.io>	2021-02-02 15:36:49 +08:00
Phil Estes	49c5c14879	Merge pull request #4906 from payall4u/bugfix/fix-open-shim-fifo bugfix: change the flag of open log fifo to avoid containerd hang on syscall open	2021-02-01 09:01:38 -05:00
payall4u	957fa3379d	change flag from RDONLY to RDWR and close the fifo correct Signed-off-by: Zhiyu Li <payall4u@qq.com>	2021-01-31 19:00:42 +08:00
Aditi Sharma	1423e9199d	Update gogo/protobuf to v1.3.2 bump version 1.3.2 for gogo/protobuf due to CVE-2021-3121 discovered in gogo/protobuf version 1.3.1, CVE has been fixed in 1.3.2 Signed-off-by: Aditi Sharma <adi.sky17@gmail.com>	2021-01-28 12:57:50 +00:00
Maksim An	ddb5e1651a	Enhance logging driver and ctr tasks to support windows Signed-off-by: Maksim An <maksiman@microsoft.com>	2021-01-21 12:17:32 -08:00
Wei Fu	846cb963cc	runtime/v2: should use defer ctx to cleanup Signed-off-by: Wei Fu <fuweid89@gmail.com>	2021-01-11 23:22:38 +08:00
Maksym Pavlenko	c1b01eabc0	Add copyright header to proto files Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-01-05 10:44:07 -08:00
Michael Crosby	dc207b654d	Merge pull request #4860 from masters-of-cats/pr-process-not-found-err Return GRPC not found error instead of plain one	2020-12-21 10:25:11 -05:00
Georgi Sabev	7451dd1ed1	Return GRPC not found error instead of plain one When the shim returns a plain error when a process does not exist, the server is unable to recognise its GRPC status code and assumes UnknownError. This is awkward for containerd client users as they are unable to recognise the actual reason for the error. When the shim returns a NotFound GRPC error, it is properly translated by the server and clients receive a proper NotFound error instead of Unknown Please note that we (CF Garden) would like to have the eventual fix backported to 1.4 as well. Co-authored-by: Danail Branekov <danailster@gmail.com> Signed-off-by: Danail Branekov <danailster@gmail.com> Signed-off-by: Georgi Sabev <georgethebeatle@gmail.com>	2020-12-18 15:33:48 +02:00
Simon Kaegi	da2fd657ab	Add bounds on max oom_score_adj value for AdjustOOMScore oom_score_adj must be in the range -1000 to 1000. In AdjustOOMScore if containerd's score is already at the maximum value we should set that value for the shim instead of trying to set 1001 which is invalid. Signed-off-by: Simon Kaegi <simon_kaegi@ca.ibm.com>	2020-12-14 15:09:24 -05:00
Akihiro Suda	7126310a09	Merge pull request #4784 from fuweid/fix-4769 runtime: should not send duplicate task exit event	2020-12-02 15:26:57 +09:00
Wei Fu	faec5d4ffd	runtime: should not send duplicate task exit event If the shim has been killed and ttrpc connection has been closed, the shimErr will not be nil. For this case, the event subscriber, like moby/moby, might have received the exit or delete events. Just in case, we should allow ttrpc-callback-on-close to send the exit and delete events again. And the exit status will depend on result of shimV2.Delete. If not, the shim has been delivered the exit and delete events. So we should remove the task record and prevent duplicate events from ttrpc-callback-on-close. Fix: #4769 Signed-off-by: Wei Fu <fuweid89@gmail.com>	2020-12-01 21:54:04 +08:00
Derek McGowan	4a4bb851f5	Merge pull request from GHSA-36xw-fx78-c5r4 Use path based unix socket for shims	2020-11-30 10:32:18 -08:00
Maksym Pavlenko	0d4734655f	Merge pull request #4647 from katiewasnothere/task_update_annotations_upstream Add annotations to task update request api	2020-11-18 14:44:19 -08:00
Michael Crosby	bd908acabd	Use path based unix socket for shims This allows filesystem based ACLs for configuring access to the socket of a shim. Co-authored-by: Samuel Karp <skarp@amazon.com> Signed-off-by: Samuel Karp <skarp@amazon.com> Signed-off-by: Michael Crosby <michael@thepasture.io> Signed-off-by: Michael Crosby <michael.crosby@apple.com>	2020-11-11 11:47:46 -08:00
Kathryn Baldauf	95ba6e9f75	Add annotations to task update request api Signed-off-by: Kathryn Baldauf <kabaldau@microsoft.com>	2020-11-09 14:13:33 -08:00
Maksym Pavlenko	4da306e1e9	Fix panic in shim not logged Fix #4274 Carry #4298 Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2020-10-26 09:05:47 -07:00
Giuseppe Capizzi	8eda32e107	Check if a process exists before returning it Fixes #4632. Signed-off-by: Giuseppe Capizzi <gcapizzi@pivotal.io> Co-authored-by: Danail Branekov <danailster@gmail.com>	2020-10-22 16:50:14 +03:00
Akihiro Suda	915263f269	Merge pull request #4502 from akshat-kmr/master Add logging binary support when terminal is true	2020-10-08 12:14:39 +09:00
Maksym Pavlenko	c59d1cd5b0	Fix linter issues Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2020-10-07 15:42:01 -07:00
Wei Fu	4b05d03903	runtime/v2: cleanup dead shim before delete bundle The shim delete action needs bundle information to cleanup resources created by shim. If the cleanup dead shim is called after delete bundle, the part of resources maybe leaky. The ttrpc client UserOnCloseWait() can make sure that resources are cleanup before delete bundle, which synchronizes task deletion and cleanup deadshim. It might slow down the task deletion, but it can make sure that resources can be cleanup and avoid EBUSY umount case. For example, the sandbox container like Kata/Firecracker might have mount points over the rootfs. If containerd handles task deletion and cleanup deadshim parallelly, the task deletion will meet EBUSY during umount and fail to cleanup bundle, which makes case worse. And also update cleanupAfterDeadshim, which makes sure that cleanupAfterDeadshim must be called after shim disconnected. In some case, shim fails to call runc-create for some reason, but the runc-create already makes runc-init into ready state. If containerd doesn't call shim deletion, the runc-init process will be leaky and hold the cgroup, which makes pod terminating :(. Signed-off-by: Wei Fu <fuweid89@gmail.com>	2020-09-20 11:24:31 +08:00

1 2 3 4 5 ...

344 Commits