containerd

Author	SHA1	Message	Date
Maksym Pavlenko	2b87d4554f	Add retries when deleting a devmapper device Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2020-12-09 09:13:34 -08:00
Kazuyoshi Kato	a1f6c9dd88	snapshots/devmapper: fix rollback The rollback mechanism is implemented by calling deleteDevice() and RemoveDevice(). But RemoveDevice() is internally calling deleteDevice() as well. Since a device will be deleted by first deleteDevice(), RemoveDevice() always will see ENODATA. The specific error must be ignored to remove the device's metadata correctly. Signed-off-by: Kazuyoshi Kato <katokazu@amazon.com>	2020-08-17 15:41:03 -07:00
Kazuyoshi Kato	c383436af7	snapshots/devmapper: suspend a device to avoid data corruption According to https://github.com/torvalds/linux/blob/v5.7/Documentation/admin-guide/device-mapper/thin-provisioning.rst#internal-snapshots; > If the origin device that you wish to snapshot is active, you > must suspend it before creating the snapshot to avoid corruption. However the devmapper snapshotter was not doing that. Signed-off-by: Kazuyoshi Kato <katokazu@amazon.com>	2020-07-16 15:08:07 -07:00
Eric Ren	63b7587cd6	snapshots/devmapper: fix race windown causing IO hangup The issue beblow happens several times beforing the root cause found: 1. A `fdisk -l` process has being hung up for a long time; 2. A image layer snapshot device is visiable to dmsetup, which should not happen because it should be deactivated after `Commit()`; The backtrace of `fdisk` is always the same over time: ```bash [<ffffffff810bbc6a>] io_schedule+0x2a/0x80 [<ffffffff81295a3f>] do_blockdev_direct_IO+0x1e9f/0x2f10 [<ffffffff81296aea>] __blockdev_direct_IO+0x3a/0x40 [<ffffffff81290e43>] blkdev_direct_IO+0x43/0x50 [<ffffffff811b8a14>] generic_file_read_iter+0x374/0x960 [<ffffffff81291ad5>] blkdev_read_iter+0x35/0x40 [<ffffffff8125229b>] new_sync_read+0xfb/0x240 [<ffffffff81252406>] __vfs_read+0x26/0x40 [<ffffffff81252b96>] vfs_read+0x96/0x130 [<ffffffff812540e5>] SyS_read+0x55/0xc0 [<ffffffff81003c04>] do_syscall_64+0x74/0x180 ``` The root cause is, in Commit(), there's a race window between `SuspendDevice()` and `DeactivateDevice()`, which may cause the IOs of a process or command like `fdisk` on the "suspended" device hang up forever. It has twofold: 1. The IOs suspends on the devices; 2. The device is in `Suspended` state, because it's deactivated with `deferred` flag and without `force` flag; So they cannot make progress. One reproducer is: 1. enlarge the race window by putting sleep seconds there; 2. run `while true; do sudo fdisk -l; sleep 0.5; done` on one terminal; 3. and pull image on another terminal; Fixes it by: 1. Resume the devices again after flushing IO by suspend; 2. Remove device without `deferred` flag; Fix: #4234 Signed-off-by: Eric Ren <renzhen@linux.alibaba.com>	2020-05-07 07:46:45 +08:00
Eric Ren	b6bf7b97c2	devmapper: async remove device using Cleanup Fix: #3923 Signed-off-by: Eric Ren <renzhen@linux.alibaba.com>	2020-02-29 13:32:48 +08:00
Maksym Pavlenko	75efbaf678	Attempt to make device mapper snapshotter tests less flaky Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-12-13 13:20:23 -08:00
renzhen.rz	4d11bb36ad	devmapper: activate dm device if snap device marked as activated - reproducer 1. stop a container; 2. reboot, or dmsetup remove its corresponding dm device; 3. start the container, it will fail like: """ Error: failed to start containers: {"message":"failed to create container(4f33d2760760c41518a84821153ccdf7f80980b797b783cdd75178fc6ca0bf4b) on containerd: failed to create task for container(4f33d2760760c41518a84821153ccdf7f80980b797b783cdd75178fc6ca0bf4b): failed to mount rootfs component &{ext4 /dev/mapper/vg0-mythinpool-snap-2 []}: no such file or directory: unknown"} """ - how the fix works activate the dm device if necessary, and give a warn msg: """ time="2019-08-21T22:44:08.422695797+08:00" level=warning msg="devmapper device \"vg0-mythinpool-snap-2\" marked as \"Activated\" but not active, activating it" """ Signed-off-by: Eric Ren <renzhen@linux.alibaba.com>	2019-08-23 10:19:28 +08:00
Maksym Pavlenko	0a4bf1bd1e	Mark faulty devices Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-08-05 12:05:36 -07:00
Maksym Pavlenko	3741fd8591	Remove deferred flag when removing devmapper device Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-07-31 11:28:33 -07:00
Maksym Pavlenko	4d5a0e19eb	Mark faulty device in one transaction Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-07-30 16:26:55 -07:00
Maksym Pavlenko	878a3205cd	Better error recovery in devmapper Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-07-30 15:17:17 -07:00
renzhen.rz	3887053177	snapshots/devmapper: deactivate thin device after committed 1. reason to deactivate committed snapshot The thin device will not be used for IO after committed, and further thin snapshotting is OK using an inactive thin device as origin. The benefits to deactivate are: - device is not unneccesary visible avoiding any unexpected IO; - save useless kernel data structs for maintaining active dm. Quote from kernel doc (Documentation/device-mapper/provisioning.txt): " ii) Using an internal snapshot. Once created, the user doesn't have to worry about any connection between the origin and the snapshot. Indeed the snapshot is no different from any other thinly-provisioned device and can be snapshotted itself via the same method. It's perfectly legal to have only one of them active, and there's no ordering requirement on activating or removing them both. (This differs from conventional device-mapper snapshots.) " 2. an thinpool metadata bug is naturally removed An problem happens when failed to suspend/resume origin thin device when creating snapshot: "failed to create snapshot device from parent vg0-mythinpool-snap-3" error="failed to save initial metadata for snapshot "vg0-mythinpool-snap-19": object already exists" This issue occurs because when failed to create snapshot, the snapshotter.store can be rollbacked, but the thin pool metadata boltdb failed to rollback in PoolDevice.CreateSnapshotDevice(), therefore metadata becomes inconsistent: the snapshotID is not taken in snapshotter.store, but saved in pool metadata boltdb. The cause is, in PoolDevice.CreateSnapshotDevice(), the defer calls are invoked on "first-in-last-out" order. When the error happens on the "resume device" defer call, the metadata is saved and snapshot is created, which has no chance to be rollbacked. Signed-off-by: Eric Ren <renzhen@linux.alibaba.com>	2019-05-09 10:58:21 +08:00
Maksym Pavlenko	87289a0c62	devmapper: implement Usage Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2019-03-27 14:50:12 -07:00
Maksym Pavlenko	95f0a4903c	devmapper: rollback thin devices on error Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-02-21 17:40:10 -08:00
Maksym Pavlenko	adf5c640f4	devmapper: don't create or reload thin-pool from snapshotter Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-02-21 16:26:46 -08:00
Maksym Pavlenko	7efda48c53	devmapper: more precise way of checking if device is activated Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-02-21 16:26:46 -08:00
Maksym Pavlenko	37cdedc61c	devmapper: add linux tags, fix build Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-02-21 16:26:46 -08:00
Maksym Pavlenko	3a75882520	devmapper: add pool device manager Signed-off-by: Maksym Pavlenko <makpav@amazon.com>	2019-02-21 16:25:55 -08:00

18 Commits