-
-
Notifications
You must be signed in to change notification settings - Fork 18.5k
infer_dtype() function slower in latest version #28814
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
for completeness
master
we should be short-cutting on a numpy dtype, maybe was changed @borisaltanov if you want to take a look would be appreciated. |
The block of code that seems to be responsible is: Lines 1262 to 1263 in ad0539b
That code was introduced in this PR: #23422 A profiling of master with test case above:
The call to |
Code Sample
Problem description
When I run the above code on my machine, there is major difference in the performance between different versions. I ran the code using new conda environments created with Python 3.7 and the specific version of Pandas. The table below shows the execution times.
The example code is only an example. In the most cases I need this function to get more specific type of the column than the type returned by the .dtype parameter (usually when working with mixed data).
The text was updated successfully, but these errors were encountered: