github/ocf - ocf - 悟空信创化平台

github/ocf

Author	SHA1	Message	Date
Robert Baldyga	d4df912f46	Add option to disable cleaner This allows to avoid allocating cleaner metadata section and effectively save up to 20% of metadata memory footprint. Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-04-28 13:04:27 +02:00
Adam Rutkowski	8f24556cec	Add missing pio deinitialization in standby stop pipeline Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2022-04-07 12:23:03 +02:00
Adam Rutkowski	550a479cde	fix typo in cache mngmt Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2022-04-07 12:23:03 +02:00
Robert Baldyga	c677f65212	Avoid double initialization of cleaning policy in standby mode Cleaning policy is initialized on standby activate, after all the metadata from primary cache is flushed and the actual recovery is being performed. Thus initializing it earlier on standby attach is incorrect. Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-04-04 12:08:27 +02:00
Adam Rutkowski	77380d6579	Fix core load cleanup loop conf_meta->core_count is not modified during load/recovery in the latest version. Thus in case of error in cores initialization, in order to iterate over the initialized cores we must depend on core->added only, regardles of conf_meta->core_count value. for_each_core() macro does exactly this. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2022-04-01 13:53:25 +02:00
Robert Baldyga	9ebb0de878	Do not modify core_count on cache load / activate Increment core_count only on core addition, and decrement it only on core removal. Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-03-31 10:00:24 +02:00
Robert Baldyga	9c751dd2b8	Manage valid_core_bitmap properly Set bit only on core addition and clean it on core removal. This allows to avoid conf metadata modification in load / standby load paths, which effectively prevents issues with metadata mismatch during consequent standby activate attempts after initial activate failure. Previously the first attempt changed the metadata, so on comparison with metadata on drive failed on any following attempt, leading to inability to activate the cache. Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-03-30 23:46:06 +02:00
Robert Baldyga	d550c8f4ef	Fix minor coding style issues Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-03-30 22:15:50 +02:00
Robert Baldyga	af43a240d3	Return more specific error on CRC mismatch Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-03-28 22:42:59 +02:00
Robert Baldyga	84aa968877	Check for load error before accessing metadata content Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-03-28 22:08:05 +02:00
Adam Rutkowski	4a839cd332	Verify standby/active cache state in OCF entry points Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2022-03-28 09:42:02 +02:00
Robert Baldyga	d5b2c65a39	Remove "metadata_layout" parameter of the cache This feature is replaced with LRU list shuffling. Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-03-07 17:48:25 +01:00
Michal Mielewczyk	116676c18d	Verify cache id duing the activate Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2022-02-17 15:02:03 +01:00
Robert Baldyga	49abe816ce	Merge pull request #649 from pdebski21/1023 fix for issue #1023	2022-02-07 16:17:14 +01:00
Robert Baldyga	805ea14529	Remove runtime recovery in standby mode Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-02-01 03:11:50 +01:00
Robert Baldyga	76684ed8a9	Merge pull request #642 from robertbaldyga/parallelize Parallelize metadata initialization	2022-02-07 13:53:45 +01:00
Robert Baldyga	c176daeec1	Merge pull request #640 from pdebski21/superblock_mismatch added error code for superblock mismatch	2022-02-03 15:30:03 +01:00
Robert Baldyga	b70492ad3d	Parallelize ALRU recovery Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-01-31 06:59:28 +01:00
Robert Baldyga	8cc71cc9cb	Remove ocf_cleaning_init_cache_block() from metadata rebuild Cleaning policy initializaton initializes metadata for all cache lines anyway, so this step is not needed. Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-01-28 19:30:41 +01:00
Robert Baldyga	48bed40dd7	Reconstruct freelist during metadata rebuild Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-01-28 19:30:39 +01:00
Robert Baldyga	f3e4f8c2db	Parallelize ocf_mngt_rebuild_metadata() Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-01-28 19:29:52 +01:00
Robert Baldyga	036aca41b3	Parallelize ocf_lru_populate() Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-01-28 19:29:21 +01:00
Robert Baldyga	25e2551964	Check core status during recovery based on core metadata Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-01-28 19:29:21 +01:00
Robert Baldyga	568c565497	Init properties before loading superblock Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-01-28 19:29:21 +01:00
Piotr Debski	9b980d3f22	fix for issue #1023 Better error for core size mismatch during activation/load adding pyocf test for new error code Signed-off-by: Piotr Debski <piotr.debski@intel.com>	2022-01-25 05:18:16 +01:00
Adam Rutkowski	a32a787e3d	Fix error handling in cache attach Only close cores in error handling if attach parameter "open_cores" is set to true. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2022-01-13 17:26:47 +01:00
Adam Rutkowski	294e02bc1b	Fail cache recovery in case of erroneous mapping Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2022-01-10 11:10:02 +01:00
Piotr Debski	609a22cfda	added ERROR code for superblock mismatch Signed-off-by: Piotr Debski <piotr.debski@intel.com>	2022-01-08 23:06:10 +01:00
Adam Rutkowski	9693b82cf9	Only flush superblock at the end of cache attach The purpose of this change is not to write superblock to the cache drive untill all other sections are initilized on disk in attach() path. Combined with superblock clearing at the erarlier stage of attach(), this assures there are no residual mappings in the collision section in case of power failure during attach with pre-existing metadata. This is implemented by removing ocf_metadata_flush_all_set_status() step at the beginning of ocf_metadata_flush_all(). ocf_metadata_flush_all() is called, except for the attach() case described above, in two cases: 1. at the end of cache load - potentially after cache recovery 2. during detaching cache drive in cache stop. To make sure there are no regressions in the first case, an explicit _ocf_mngt_attach_shutdown_status() is added to load pipeline before ocf_metadata_flush_all(). The second case is always ran after cache drive is attached, so dirty status bit must have already be written to the disk. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2022-01-05 13:06:59 +01:00
Adam Rutkowski	196437f9bc	Zero superblock before writing metadata This is the first step towards atomic initialization of metadata on cache disk. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2022-01-05 13:06:59 +01:00
Robert Baldyga	c6644116ae	Merge pull request #614 from robertbaldyga/redesign-standby Redesign failover standby API	2022-01-04 14:07:05 +01:00
Robert Baldyga	4aa3d8f9df	Remove "unsafe" path from standby load Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2022-01-03 20:10:40 +01:00
Jan Musial	ae18ce274e	Fix cache size requirements and some logging Signed-off-by: Jan Musial <jan.musial@intel.com>	2022-01-03 14:30:07 +01:00
Robert Baldyga	b40fa0c2bf	Fix closing volume on standby stop Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-29 20:54:45 +01:00
Robert Baldyga	86a2896bcf	Rename "bind" to "standby" Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-29 20:32:03 +01:00
Robert Baldyga	716b5751d6	Redesign failover standby API Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-29 20:31:40 +01:00
Robert Baldyga	4cabc60d40	Avoid loading runtime metadata sections during recovery Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-29 14:04:19 +01:00
Robert Baldyga	e73cbad2c7	Merge pull request #631 from mmichal10/dont-stop-cleaner Don't stop cleaner in activate rollback	2021-12-27 16:51:32 +01:00
Robert Baldyga	0ac66ce4aa	Fix cache stop after standby detach Don't attempt to close cache volume if cache is in standby detached state. Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-23 22:39:37 +01:00
Michal Mielewczyk	a8bdba0cb2	Don't stop cleaner in activate rollback Activate is not responsible for starting cleaner so rollback shouldn't stop it eiter. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-12-23 14:46:28 +01:00
Robert Baldyga	df9a9f2722	Read superblock sections from cache volume during activate Because of metadata flapping it is much more complicated to capture those sections in flight in standby mode, so we read them directly from the cache volume during the activate. Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2021-12-15 15:30:34 +01:00
Adam Rutkowski	b1494f4642	Remove option to failover without detach Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-11-30 15:18:08 +01:00
Michal Mielewczyk	4ab22ee2dc	Maintain runtime struct during failover standby To allow the fastest switching from the passive-standby to active mode, the runtime metadata must be kept 100% synced with the metadata on the drive and in the RAM thus recovery is required after each collision section update. To avoid long-lasting recovering of all the cachelines each time the collision section is being updated, the passive update procedure recovers only those which have its MD entries on the updated pages. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:58:09 +01:00
Michal Mielewczyk	52824adaaf	Additional cleaning policy info outside of the SB Starting cache in a standby mode requires access to a valid cleaning policy type. If the policy is stored only in the superblock, it may be overridden by one of the metadata passive updates. To prevent losing the information it should be stored in cache's runtime metadata. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	0e529479d6	Init cleaner during passive start Initializing cleaning policy is very time consuming. To reduce the time required for activating cache instance the initialization sholud be done during passitve start Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	390e80794d	Refactor cleaning policy initialization Extract cleaning policy initialization to a separate function Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	6d4e6af5b6	Recovery on passive start Adjust recovery procedure to allow rebuilding metadata from partialy valid metadata Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	11dacd6a84	Set dirty shutdown status on standby init Since part of the recovery is done during `standby init`, the correct shutdown status has to be set Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	8f58add152	Lru populate unsafe The unsafe mode is useful if the metadata of added cores is incomplete. Such scenario is possible when starting cache to standby mode from partially vaild metadata. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00
Michal Mielewczyk	4deaa1e133	Reset all the status bits during recovery Make sure all the invalid cachelines have reset status bits. This allows to recognize invalid cachelines easily during populate. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-11-19 11:53:48 +01:00

1 2 3 4 5

212 Commits