github/ocf - ocf - Gitea: Git with a cup of tea

github/ocf

Author	SHA1	Message	Date
Adam Rutkowski	d5b16c273e	Check for hit after upgrading hash bucket lock Lookup is repeated after request is identified as miss and hash bucket lock is upgraded (in order to map missing cachelines). At this point cachelines status might change and the request might turn out to be a hit after all. Adding check for this condition removes unnecessary calls to remap logic. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-06-15 23:11:02 +02:00
Adam Rutkowski	719676c444	Fix repartitioning in request refresh path update_req_info() should include REMAPPED cachelines in repart stats (number of cachelines within request belonging to other than the target partition). Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-31 12:13:48 -05:00
Michał Mielewczyk	a6c8cbb1ac	Merge pull request #479 from arutk/lru_fix3 Always call LRU_set_hot() under hash bucket lock	2021-03-26 11:04:59 +01:00
Adam Rutkowski	9486b7796f	Remove early return from engine_map() Removing conditional early return from engine_map() function in case of insufficient free cachelines. The reasons are: 1. current implementation does not treat unssufficient free cachelines condition as an error, 2. the check is based on stale request info, so it is inaccurate, 3. it is easier to hit more paths with functional tests, 4. partially mapping request from the freelist becomes more common rather than being a corner case dependent on racy timings between threads Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-26 07:40:24 -05:00
Adam Rutkowski	a3f2a214b6	Always call LRU_set_hot() under hash bucket lock set_hot() depends on cacheline metadata status to determine on which list the element is located (dirty cs clean list). Thus at least hash bucket lock is required when calling set_hot(). Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-25 18:50:13 -05:00
Robert Baldyga	87244c04d7	Merge pull request #472 from mmichal10/lock-on-setting-hot Update cleaning lru under metadata lock	2021-03-19 09:54:32 +01:00
Michal Mielewczyk	841f8122d7	Update cleaning lru under metadata lock This prevents deinitializing cleaning policy structures during IO. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-03-18 09:55:21 +01:00
Michał Mielewczyk	df969cde16	Merge pull request #470 from arutk/lru_fix Parallel eviction fixes	2021-03-17 11:41:07 +01:00
Adam Rutkowski	c565c5c3f5	Add comments warning about stale request map info Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-17 11:28:02 -05:00
Adam Rutkowski	98124aa13d	Add missing lookup in engine_map() Early return from engine_map() in case of insufficient free cachelines on the freelist is opportunistic, as both request map info and freelist count are not accurate. Map info is stale as it is to be refreshed in engine_map() after hash bucket lock had been upgraded. Freelist count on other hand is subject to change asynchronously. The implementation assumption however is that after engine_map() request is fully traversed (engine_map() is equivalent to engine_lookup() followed by an attempt to map missing cachelines). So in case of early return we must take care of repeating the lookup. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-17 11:23:24 -05:00
Adam Rutkowski	e5fa15bdb2	Remove early return from engine_map() in case of hit At this point cacheline status in request map is stale, as lookup was performed before upgrading hash bucket lock. If indeed all cachelines are mapped, this will be determined in the main loop of engine_map(). Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-17 11:21:03 -05:00
Adam Rutkowski	736fb2efc0	Call LRU set_hot() immediately after cache insert This assures that cacheline with LOOKUP_INSERTED status is always present on the LRU list. This fixes an ENV_BUG() caused by an attempt to remove a cacheline from LRU list which was not there. This happened when cacheline was mapped from freelist (LOOKUP_INSERTED) but the entire request mapping failed and generic cleanup routines attempted to invalidate cacheline, including removing it from the LRU list. As engine_set_hot() is called after successfull mapping, the inserted cacheline was not yet present on the LRU list. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-16 19:59:09 -05:00
Michal Mielewczyk	4e8c037d7b	Fix `ocf_engine_unmapped_count()` Inserted entries should be considered mapped. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-03-16 15:36:47 +01:00
Adam Rutkowski	7927b0b74f	Optimize set_hot calls in eviction path Split traversation into two distinct phases: lookup() and lru set_hot(). prepare_cachelines() now only calls set_hot() once after lookup and insert are finished. lookup() is called explicitly only once in prepare_cachelines() at the very beginning of the procedure. If request is a miss then then map() performs operations equivalent to lookup() supplemented by an attempt to map cachelines. Both lookup() and set_hot() are called via traverse() from the engines which do not attempt mapping, and thus do not call prepare_clines(). Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-05 11:20:47 +01:00
Adam Rutkowski	1c6168d82b	Do not unmap inserted cachelines before eviction Unmapping cachelines previously mapped from freelist before eviction is a waste of resources. Also if map does not erarly exit upon first mapping error, we can have request fully traversed (and partially mapped) after mapping and thus skip lookup in eviction. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-05 11:20:47 +01:00
Adam Rutkowski	81fc7ab5c5	Parallel eviction Eviction changes allowing to evict (remap) cachelines while holding hash bucket write lock instead of global metadata write lock. As eviction (replacement) is now tightly coupled with request, each request uses eviction size equal to number of its unmapped cachelines. Evicting without global metadata write lock is possible thanks to the fact that remaping is always performed while exclusively holding cacheline (read or write) lock. So for a cacheline on LRU list we acquire cacheline lock, safely resolve hash and consequently write-lock hash bucket. Since cacheline lock is acquired under hash bucket (everywhere except for new eviction implementation), we are certain that noone acquires cacheline lock behind our back. Concurrent eviction threads are eliminated by holding eviction list lock for the duration of critial locking operations. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-05 11:20:47 +01:00
Adam Rutkowski	1411314678	Add getter function for cache->device->concurrency.cache_line The purpose of this change is to facilitate unit testing. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-05 11:20:47 +01:00
Adam Rutkowski	ce2ff14150	Move request engine callbacks to req structure Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-05 11:20:47 +01:00
Adam Rutkowski	0e699fc982	Refactor ocf_engine_remap .. so that the main part, responsible strictly for mapping given LBA to given collision index, is encapsulated in a function ocf_map_cache_line with external linkage. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-05 11:20:46 +01:00
Adam Rutkowski	3bd0f6b6c4	Change sequential request detection logic Changing sequential request detection so that a miss request is recognized as sequential after needed cachelines are evicted and mapped to the request in a sequential order. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-05 11:20:46 +01:00
Adam Rutkowski	056217d103	Rename cleaner attribute cache_line_lock to lock_cacheline .. to make it clean that true means cleaner must lock cachelines rather than the lock is already being held. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-05 11:20:46 +01:00
Adam Rutkowski	07d1079baa	Add LOOKUP_REMAPPED status to allow iterative cacheline lock Allowing request cacheline lock to be called on partially locked request. This is going to be usefull for upcomming eviction improvements, where request will first have evicted (LOOKUP_REMAPPED) cachelines assigned to it in a locked state, followed by standard request cacheline lock call in order to lock previously inserted (LOOKUP_HIT) or mapped from freelist (LOOKUP_INSERTED) cachelines. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-05 11:20:46 +01:00
Adam Rutkowski	b34f5fd721	Rename LOOKUP_MAPPED to LOOKUP_INSERTED Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-03-05 11:20:46 +01:00
Adam Rutkowski	cf5f82b253	Use cline concurrency ctx instead of cache Cacheline concurrency functions have their interface changed so that the cacheline concurrency private context is explicitly on the parameter list, rather than being taken from cache->device->concurrency.cache_line. Cache pointer is no longer provided as a parameter to these functions. Cacheline concurrency context now has a pointer to cache structure (for logging purposes only). The purpose of this change is to facilitate unit testing. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-02-24 17:29:39 -06:00
Adam Rutkowski	cd9e42f987	Properly lock hash bucket for status bits operations Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-02-18 15:02:50 -06:00
Adam Rutkowski	10c3c3de36	Renaming hash bucket locking functions 1. new abbreviated previx: ocf_hb (HB stands for hash bucket) 2. clear distinction between functions requiring caller to hold metadata shared global lock ("naked") vs the ones which acquire global lock on its own ("prot" for protected) 3. clear distinction between hash bucket locking functions accepting hash bucket id ("id"), core line and lba ("cline") and entire request ("req"). Resulting naming scheme: ocf_hb_(id/cline/req)_(prot/naked)_(lock/unlock/trylock)_(rd/wr) Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-02-12 18:08:15 -06:00
Robert Baldyga	af8177d2ba	Merge pull request #458 from mmichal10/fix-cleaning Fix updating hot cachelines cleaning list	2021-02-11 11:30:07 +01:00
Robert Baldyga	d03ea719cd	Merge pull request #451 from arutk/exact_evict_count only request evict size equal to request unmapped count	2021-02-11 10:47:12 +01:00
Michal Mielewczyk	fa41d4fc88	Fix updating hot cachelines cleaning list Update cacheline's timestamp each time it's being written. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-02-10 10:02:57 -05:00
Adam Rutkowski	5538a5a95d	Only request evict size equal to request unmapped count Removing the logic for oportunistic partition overflow reduction by evicting more cachelines than actually required by the request being serviced. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2021-02-09 11:11:15 -06:00
Robert Baldyga	3543f5c5cc	Merge pull request #443 from rafalste/update_copyright Update copyright statements (2021)	2021-02-03 11:59:39 +01:00
Michal Mielewczyk	3a7b55c4c2	Don't evict on hit If request is hit, simply try to acquire cachelines instead of verifying whether target partition's size is not exceeded. Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2021-01-29 17:15:32 -05:00
Rafal Stefanowski	6ed4cf8a24	Update copyright statements (2021) Signed-off-by: Rafal Stefanowski <rafal.stefanowski@intel.com>	2021-01-21 13:17:34 +01:00
Rafal Stefanowski	d3b61e474c	Remove init_mode and use metadata.is_volatile instead Signed-off-by: Rafal Stefanowski <rafal.stefanowski@intel.com>	2020-12-23 16:31:55 +01:00
Michal Mielewczyk	60680b15b2	Accessors for `req->info.mapping_error` Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2020-12-21 08:23:34 -05:00
Michal Mielewczyk	9e11a88f2e	Occupancy per ioclass Respect occpuancy limit set single ioclass Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2020-12-21 08:23:34 -05:00
Michal Mielewczyk	9d80882b00	Remove `re_part` field from `struct ocf_req_info` Since the request carries an explicit information about number of the cacheliens to be reparted, no need of keeping the boolean information if some of the request's cachelines are assigned to a wrong partition Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2020-12-21 08:00:25 -05:00
Michal Mielewczyk	e26ca30399	Track explicit number of cachelines to be reparted Instead of redunant calculating number of cachlines to be reparted, keep this information in request's info Signed-off-by: Michal Mielewczyk <michal.mielewczyk@intel.com>	2020-12-21 08:00:25 -05:00
Adam Rutkowski	44efe3e49e	Refactor LRU code to use part rather than part_id Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2020-12-17 14:35:27 +01:00
Robert Baldyga	990f5160eb	Cleanup request map entries in error handling path Signed-off-by: Robert Baldyga <robert.baldyga@intel.com>	2020-09-02 14:30:28 +02:00
Robert Baldyga	7d889fa1fc	Merge pull request #385 from arutk/pt_write_double_inv Two pass write invalidate	2020-07-28 07:42:44 +02:00
Adam Rutkowski	b232f2b633	Service WA write misses in WI engine WA write must follow follow the same two-pass pattern as WI does. This change modifies WA engine to default to WI in case of any miss (either partial or full), not only partial miss. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2020-07-20 17:26:36 +02:00
Adam Rutkowski	91b6098fda	Two pass write invalidate Add second pass of write invalidate. It is necessary only if concurrent I/O had inserted target LBAs to cache after WI request did traversation. These LBAs might have been written by WI request behind the concurrent I/O's back, resulting in making these sectors effectively invalid. In this case we must update these sectors' metadata to reflect this. However we won't know about this after we traverse the request again - hence calling ocf_write_wi again with req->wi_second_pass set to indicate that this is the second pass (core write should be skipped). Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2020-07-20 17:26:35 +02:00
Robert Baldyga	ec6eae6a5f	Merge pull request #377 from arutk/fix_map Set entry->core_id in ocf_engine_lookup_map_entry	2020-07-10 21:32:09 +02:00
Adam Rutkowski	b14312dcef	Set entry->core_id in ocf_engine_lookup_map_entry core_id should be set in this function. The fact that it is missing might lead to incorrect behaviour e.g. in case of promotion policy. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2020-06-09 13:15:50 +02:00
Adam Rutkowski	7776bd6485	WO: read clean sectors from cache In case of partial hit WO engine first reads data for the entire request address range from core device. Then it plumbs it by fetching dirty sectors from cache device. For unindentified reason this leads to a data corruption in YCSB workload A. After flushing dirty data and re-loading cache the data is correct. This change modifies WO read handler to read clean data from the cache. This is not optimal, as the clean sectors are now read twice in case of partial hit. For now it seems to be good enough work-around for the data corruption problem. The symptoms, combined with the fact that this change seems to make the problem go away, indicates that at some point WB write handler (and/or special I/O request handlers like discard) puts CAS in a state where in-memory medata wrongly indicates that a sector is clean while in fact it is dirty, as marked in the on-disk metadata. Signed-off-by: Adam Rutkowski <adam.j.rutkowski@intel.com>	2020-05-27 12:31:53 +02:00
Slawomir Jankowski	f516ed62e3	Remove unused parameter Signed-off-by: Slawomir Jankowski <slawomir.jankowski@intel.com>	2020-05-19 16:23:32 +02:00
Robert Baldyga	1c9312842a	Merge pull request #369 from rafalste/copyright_update Update copyright statements	2020-05-06 12:42:10 +02:00
Rafal Stefanowski	38e7e19290	Update copyright statements Signed-off-by: Rafal Stefanowski <rafal.stefanowski@intel.com>	2020-04-28 13:37:54 +02:00
Michal Rakowski	67577fc1ef	Force pass-through for requests bigger than cache Signed-off-by: Michal Rakowski <michal.rakowski@intel.com>	2020-04-24 15:34:27 +02:00

1 2 3

116 Commits