Skip to content

Hierarchical caching for s3fs #408

@ValHayot

Description

@ValHayot

Hi,

Apologies if this is not the right place to ask this; didn't know where it was best to ask.

I'm a graduate student working on building tools to simplify data management (and improve data transfer performance) for scientific applications. I've been looking at using a cache hierarchy using available local storage media (eg. RAM, SSD, HDD) to speed up writes, as well as perhaps more pipeline-dependent attributes such as prefetching, and data management within the cache (i.e. asynchronous flushing and eviction).

For a research project I was thinking of applying these features to s3fs but wanted to check with you first if it would be useful at all. I know there is a caching mechanism in fsspec and it looks like it can only be to specific path (rather than a multitude of potential paths), but I could be wrong.

Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions