Hi,
Apologies if this is not the right place to ask this; I didn't know where best to ask.
I'm a graduate student building tools to simplify data management and improve data transfer performance for scientific applications. I've been looking at a cache hierarchy built on the available local storage media (e.g. RAM, SSD, HDD) to speed up writes, along with more pipeline-dependent features such as prefetching and data management within the cache (i.e. asynchronous flushing and eviction).
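A rough sketch of the kind of tiered write cache described above, to make the idea concrete. Everything in it is an assumption for illustration: the class name `TieredWriteCache`, the per-tier byte capacities, and the local directory standing in for the S3 bucket are hypothetical and are not part of s3fs or fsspec. Prefetching is left out.

```python
import os
import queue
import shutil
import tempfile
import threading


class TieredWriteCache:
    """Hypothetical sketch: write to the fastest tier with room, flush in the background."""

    def __init__(self, tiers, backing_dir):
        # tiers: list of (directory, capacity_bytes), fastest first (e.g. tmpfs, SSD, HDD).
        self.tiers = [{"dir": d, "cap": cap, "used": 0} for d, cap in tiers]
        # backing_dir is a local stand-in for the remote store (e.g. an S3 bucket).
        self.backing_dir = backing_dir
        self.lock = threading.Lock()
        self.pending = queue.Queue()
        threading.Thread(target=self._flush_loop, daemon=True).start()

    def write(self, name, data):
        """Write bytes under a flat file name; return the tier directory used, or None."""
        with self.lock:
            # Reserve space in the first (fastest) tier that can hold the data.
            tier = next(
                (t for t in self.tiers if t["used"] + len(data) <= t["cap"]), None
            )
            if tier is not None:
                tier["used"] += len(data)
        if tier is None:
            # No tier has room: fall back to a synchronous write-through.
            with open(os.path.join(self.backing_dir, name), "wb") as f:
                f.write(data)
            return None
        with open(os.path.join(tier["dir"], name), "wb") as f:
            f.write(data)
        self.pending.put((tier, name, len(data)))
        return tier["dir"]

    def _flush_loop(self):
        # Background worker: copy each cached file to the backing store, then evict it.
        while True:
            tier, name, size = self.pending.get()
            try:
                src = os.path.join(tier["dir"], name)
                if os.path.exists(src):  # may already be gone if a name was rewritten
                    shutil.copyfile(src, os.path.join(self.backing_dir, name))
                    os.remove(src)
                with self.lock:
                    tier["used"] -= size
            finally:
                self.pending.task_done()

    def flush(self):
        """Block until every queued file has been flushed and evicted."""
        self.pending.join()


if __name__ == "__main__":
    fast, slow, remote = (tempfile.mkdtemp() for _ in range(3))
    cache = TieredWriteCache([(fast, 1024), (slow, 1024 * 1024)], remote)
    cache.write("result.bin", b"x" * 512)
    cache.flush()
    print(sorted(os.listdir(remote)))
```

Here eviction happens immediately after a successful flush, which is the simplest policy; a real implementation would likely keep flushed data around for reads and evict by some policy such as LRU.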
For a research project, I was thinking of applying these features to s3fs, but wanted to check with you first whether that would be useful at all. I know there is a caching mechanism in fsspec, and it looks like it can only cache to a specific path (rather than a multitude of potential paths), but I could be wrong.
Thanks!