-
Notifications
You must be signed in to change notification settings - Fork 383
FTPFileSystem
incompatibility with pandas
/pyarrow
#705
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Upon further investigation, simply creating the directory before trying to store to parquet does not work because of the partition. The creation of the partition directories also depend on the |
Hm, I don't know - people are certainly using fsspec with arrow/parquet. |
I have a test that runs the snippet for local, s3, and ftp filesystems. Local passes because If the If you feel confident that |
It may well be something non-ideal in arrow, but you should also be aware that both the filesystem and parquet/dataset APIs have been replaced in recent times. |
Oh I see. At what point would that get set to |
Maybe it should depend on the pyarrow version (cc #411 , @jorisvandenbossche ) |
Late answer:
|
I've run across an issue, and to be honest I'm not quite sure which library it belongs in.
It starts with the line
which takes me through the
pandas
library until reaching this function inpyarrow
https://github.com/apache/arrow/blob/e2238582e2a2bf20a68a967145fe1a7b2337a997/python/pyarrow/parquet.py#L1891
which internally calls the helper function
_mkdir_if_not_exists()
https://github.com/apache/arrow/blob/e2238582e2a2bf20a68a967145fe1a7b2337a997/python/pyarrow/parquet.py#L1883
The problem is that for
FTPFileSystem
, this helper function does not actually create the directory because of this linehttps://github.com/intake/filesystem_spec/blob/85bb2f3fef2aa12f7ec8497ea116c78c644b49ec/fsspec/spec.py#L1219
The simplest solution would be to override this
_is_filestore()
method inFTPFileSystem
to returnTrue
.The text was updated successfully, but these errors were encountered: