Updated read_excel docstring to include parse_dates and date_parser #11527

litchfield · 2015-11-06T01:29:04Z

No description provided.

…arameter info

chris-b1 · 2015-11-06T13:04:21Z

I'm not sure those keywords actually do anything with the excel parser?

In [12]: dti = pd.date_range('2014-1-1', periods=10)

In [13]: df = pd.DataFrame({'dates':dti, 'strings':dti.strftime('%m/%d/%Y')})

In [14]: df.dtypes
Out[14]: 
dates      datetime64[ns]
strings            object
dtype: object

In [15]: df.to_excel('test.xlsx')

In [16]: pd.read_excel('test.xlsx').dtypes
Out[16]: 
dates      datetime64[ns]
strings            object
dtype: object

In [17]: pd.read_excel('test.xlsx', parse_dates=True).dtypes
Out[17]: 
dates      datetime64[ns]
strings            object
dtype: object

In [18]: pd.read_excel('test.xlsx', parse_dates=False).dtypes
Out[18]: 
dates      datetime64[ns]
strings            object
dtype: object

jreback · 2015-11-07T14:45:39Z

yeh, I don't think these are valid keywords. Actually what we really need here is a check on non-implemented keywords. Closing this and I will create another issue.

jorisvandenbossche · 2015-11-09T22:35:31Z

@chris-b1 It actually even gives an error and is not just ignored. With your example (as parse_dates=True if for parsing the index, so if you want to see if it can parse the string column, you have to pass its name):

In [37]: pd.read_excel('test.xlsx', parse_dates=['strings'])

....

C:\Anaconda\lib\site-packages\pandas\io\parsers.pyc in _should_parse_dates(self,
 i)
    812             return self.parse_dates
    813         else:
--> 814             name = self.index_names[i]
    815             j = self.index_col[i]
    816

TypeError: 'NoneType' object has no attribute '__getitem__'

But if you set the strings column as the index, parse_dates=True is indeed ignored.

jorisvandenbossche · 2015-12-22T11:21:47Z

@chris-b1 This error actually only happens if you have an implicit index due to the structure of the excel file. If you don't have this, parse_dates works as expected:

In [46]: df.to_excel('test.xlsx', index=False)

In [47]: pd.read_excel('test.xlsx').dtypes
Out[47]:
dates      datetime64[ns]
strings            object
dtype: object

In [48]: pd.read_excel('test.xlsx', parse_dates=['strings']).dtypes
Out[48]:
dates      datetime64[ns]
strings    datetime64[ns]
dtype: object

Updated read_excel docstring to include parse_dates and date_parser p…

5cc54fb

…arameter info

jreback closed this Nov 7, 2015

jreback added the IO Excel read_excel, to_excel label Nov 7, 2015

jreback mentioned this pull request Nov 7, 2015

ERR: raise NotImplemented error if keywords are passed to read_excel which are not supported #11544

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updated read_excel docstring to include parse_dates and date_parser #11527

Updated read_excel docstring to include parse_dates and date_parser #11527

litchfield commented Nov 6, 2015

chris-b1 commented Nov 6, 2015

jreback commented Nov 7, 2015

jorisvandenbossche commented Nov 9, 2015

jorisvandenbossche commented Dec 22, 2015

Updated read_excel docstring to include parse_dates and date_parser #11527

Updated read_excel docstring to include parse_dates and date_parser #11527

Conversation

litchfield commented Nov 6, 2015

chris-b1 commented Nov 6, 2015

jreback commented Nov 7, 2015

jorisvandenbossche commented Nov 9, 2015

jorisvandenbossche commented Dec 22, 2015