Bug #5745
closed
[Keep] keepstore should serialize file reads/writes, but not directory lookups, when -serialize=true
Description
background
The most obvious benefit of this change is that, even when keepstore is busy, the process of probing N volumes to find an existing block can proceed quickly, using the OS's filesystem cache, instead of waiting in every volume's serialize queue along the way. With the current implementation, a keepstore can easily accumulate a queue of PUT requests (each consuming a decent chunk of RAM) that are just waiting to find out whether the blocks already exist -- and of course they'll wait in the queue an N+1st time when it's time to actually write them to disk.
solution / implementation
Suggest refactoring the "serialize" implementation to use a sync.Mutex instead of passing channels to a goroutine over a "queue" channel.
In Read, before calling ioutil.ReadFile or doing any locking, do a stat() to see whether the file exists. If not, return an error without acquiring the volume lock.
In Read and Write, just before reading or writing data:
    if v.mutex != nil {
        v.mutex.Lock()
        defer v.mutex.Unlock()
    }
Then we can rename Write to Put, rename Read to Get, and delete the old Get and Put wrappers, IOHandler, IORequest, IOResponse, and KeepGet/KeepPut constants.
Updated by Tom Clegg over 9 years ago
- Description updated (diff)
- Category set to Keep
Updated by Tom Clegg over 9 years ago
- Target version changed from Arvados Future Sprints to 2015-05-20 sprint
Updated by Peter Amstutz over 9 years ago
867263b 5745-serialize-content-only LGTM
Updated by Tom Clegg over 9 years ago
- Status changed from In Progress to Resolved
- % Done changed from 0 to 100
Applied in changeset arvados|commit:fde21d34d011af2123668983c559632221390fd4.