github/ocf - ocf - Gitea: Git with a cup of tea

github/ocf

Author	SHA1	Message	Date
Michal Mielewczyk	5d74aec921	Add missing return in raw_ram_zero() in error path Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2022-01-12 07:46:49 +01:00
Adam Rutkowski	294e02bc1b	Fail cache recovery in case of erroneous mapping Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2022-01-10 11:10:02 +01:00
Adam Rutkowski	9693b82cf9	Only flush superblock at the end of cache attach The purpose of this change is not to write superblock to the cache drive untill all other sections are initilized on disk in attach() path. Combined with superblock clearing at the erarlier stage of attach(), this assures there are no residual mappings in the collision section in case of power failure during attach with pre-existing metadata. This is implemented by removing ocf_metadata_flush_all_set_status() step at the beginning of ocf_metadata_flush_all(). ocf_metadata_flush_all() is called, except for the attach() case described above, in two cases: 1. at the end of cache load - potentially after cache recovery 2. during detaching cache drive in cache stop. To make sure there are no regressions in the first case, an explicit _ocf_mngt_attach_shutdown_status() is added to load pipeline before ocf_metadata_flush_all(). The second case is always ran after cache drive is attached, so dirty status bit must have already be written to the disk. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2022-01-05 13:06:59 +01:00
Adam Rutkowski	196437f9bc	Zero superblock before writing metadata This is the first step towards atomic initialization of metadata on cache disk. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2022-01-05 13:06:59 +01:00
Robert Baldyga	c6644116ae	Merge pull request #614 from robertbaldyga/redesign-standby Redesign failover standby API	2022-01-04 14:07:05 +01:00
Robert Baldyga	4aa3d8f9df	Remove "unsafe" path from standby load Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-01-03 20:10:40 +01:00
Jan Musial	ae18ce274e	Fix cache size requirements and some logging Signed-off-by: Jan Musial <jan.musial@intel.com>	2022-01-03 14:30:07 +01:00
Robert Baldyga	b40fa0c2bf	Fix closing volume on standby stop Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-29 20:54:45 +01:00
Robert Baldyga	86a2896bcf	Rename "bind" to "standby" Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-29 20:32:03 +01:00
Robert Baldyga	b25cd91b86	Remove unused ocf_metadata_load_unsafe() Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-29 20:31:43 +01:00
Robert Baldyga	716b5751d6	Redesign failover standby API Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-29 20:31:40 +01:00
Robert Baldyga	4cabc60d40	Avoid loading runtime metadata sections during recovery Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-29 14:04:19 +01:00
Robert Baldyga	4625763df5	Return error on CRC mismatch during recovery Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-29 14:04:19 +01:00
Robert Baldyga	e73cbad2c7	Merge pull request #631 from mmichal10/dont-stop-cleaner Don't stop cleaner in activate rollback	2021-12-27 16:51:32 +01:00
Robert Baldyga	0ac66ce4aa	Fix cache stop after standby detach Don't attempt to close cache volume if cache is in standby detached state. Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-23 22:39:37 +01:00
Michal Mielewczyk	a8bdba0cb2	Don't stop cleaner in activate rollback Activate is not responsible for starting cleaner so rollback shouldn't stop it eiter. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-12-23 14:46:28 +01:00
Bob Chen	b6de614ada	fix volume_close completion order Signed-off-by: Bob Chen <beef9999@qq.com>	2021-12-22 15:18:34 +08:00
Robert Baldyga	a2916313ee	Revert "fix volume_close completion order"	2021-12-21 20:33:34 +01:00
chenbo	aa6e674034	fix volume_close completion order	2021-12-20 20:10:07 +08:00
Robert Baldyga	0751b2c0c0	Fix metadata flapping Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-15 22:21:57 +01:00
Robert Baldyga	cac5869406	Merge pull request #603 from robertbaldyga/metadata-flapping Introduce flapping of metadata config sections	2021-12-15 17:11:15 +01:00
Robert Baldyga	df9a9f2722	Read superblock sections from cache volume during activate Because of metadata flapping it is much more complicated to capture those sections in flight in standby mode, so we read them directly from the cache volume during the activate. Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-15 15:30:34 +01:00
Robert Baldyga	99c8c05f3f	Introduce flapping of metadata config sections This feature provides double buffering of config sections to prevent situation when power failure during metadata flush leads to partially updated metadata. Flapping mechanism makes it always possible to perform graceful rollback to previous config metadata content in such situation. Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-15 15:30:34 +01:00
Neil Sun	7f82ef3048	Fix incorrect page count calculation with large PAGE_SIZE e.g., PAGE_SIZE 65536, cache line 8k. fix https://github.com/Open-CAS/open-cas-linux/issues/1015 Signed-off-by: Sun Feng <loyou85@gmail.com>	2021-12-14 20:07:59 +08:00
Robert Baldyga	60218759d2	Merge pull request #597 from rafalste/fix_core_zero_size_error Fix core-zero-size error	2021-12-08 22:04:27 +01:00
Robert Baldyga	21c4673251	Merge pull request #600 from mmichal10/cleaning-cmpl Call completion if failed to perform cleaning	2021-12-08 22:00:58 +01:00
Michal Mielewczyk	d6f2998890	Call completion if failed to perform cleaning Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-12-08 14:50:50 +01:00
Michal Mielewczyk	911a5cddf0	Deinit all registered volume types Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-12-08 14:16:49 +01:00
Michal Mielewczyk	655f732748	Don't access freed memory Instead of accessing memory of a freed IO, redo size calculations Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-12-08 14:16:49 +01:00
Michal Mielewczyk	244712b020	Prevent race condition in fast path Request submitted in fast path may be freed before the sequential cutoff stats are updated. Increment request reference counter to prevent it. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-12-08 09:00:04 +01:00
Rafal Stefanowski	b57bad4652	Fix core-zero-size error Move error print to where it belongs, preventing this message to pop up when same error code is reported elsewhere for other reason. Signed-off-by: Rafal Stefanowski <rafal.stefanowski@intel.com>	2021-12-06 12:30:29 +01:00
Adam Rutkowski	b1494f4642	Remove option to failover without detach Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-11-30 15:18:08 +01:00
Adam Rutkowski	b455a393dd	extra assertion in metadata passive update Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-11-30 12:04:57 +01:00
Adam Rutkowski	d0b00817f3	fix cacheline reset in passive metadata update Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-11-30 19:34:52 +01:00
Krzysztof Majzerowicz-Jaszcz	133ea307c8	Fix for issues #988 and #997 This patch fixes the issue 988 (and 997) causing a kernel stack overflow. Signed-off-by: Krzysztof Majzerowicz-Jaszcz <krzysztof.majzerowicz-jaszcz@intel.com>	2021-11-24 08:15:07 +01:00
Michal Mielewczyk	4ab22ee2dc	Maintain runtime struct during failover standby To allow the fastest switching from the passive-standby to active mode, the runtime metadata must be kept 100% synced with the metadata on the drive and in the RAM thus recovery is required after each collision section update. To avoid long-lasting recovering of all the cachelines each time the collision section is being updated, the passive update procedure recovers only those which have its MD entries on the updated pages. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:58:09 +01:00
Michal Mielewczyk	a6989d1881	Pio concurrency	2021-11-19 11:58:09 +01:00
Michal Mielewczyk	52824adaaf	Additional cleaning policy info outside of the SB Starting cache in a standby mode requires access to a valid cleaning policy type. If the policy is stored only in the superblock, it may be overridden by one of the metadata passive updates. To prevent losing the information it should be stored in cache's runtime metadata. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	0e529479d6	Init cleaner during passive start Initializing cleaning policy is very time consuming. To reduce the time required for activating cache instance the initialization sholud be done during passitve start Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	390e80794d	Refactor cleaning policy initialization Extract cleaning policy initialization to a separate function Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	6d4e6af5b6	Recovery on passive start Adjust recovery procedure to allow rebuilding metadata from partialy valid metadata Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	11dacd6a84	Set dirty shutdown status on standby init Since part of the recovery is done during `standby init`, the correct shutdown status has to be set Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	8f58add152	Lru populate unsafe The unsafe mode is useful if the metadata of added cores is incomplete. Such scenario is possible when starting cache to standby mode from partially vaild metadata. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	4deaa1e133	Reset all the status bits during recovery Make sure all the invalid cachelines have reset status bits. This allows to recognize invalid cachelines easily during populate. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	fc7c901c8b	Skip collision init on cache start passive Recovery during passive start is based on the assuption that metadata collision section stored on disk might be partially valid. Reseting this data would make rebuilding metadata impossible. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	048bbedd71	Fix metadata_clear_valid_if_clean() The function should return the cacheline's valid status Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	bb0ff67fe9	Metadata `clear_dirty_if_invalid()` utility Fix cacheline's metadata if it is dirty and invalid Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	800190153b	Extend lru list API Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	5ca1404f06	Fix spelling in the error message Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	ccd0abfea5	Add cache line recovery utils to OCF internal API Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00

1 2 3 4 5 ...

757 Commits