Make short test runs faster #1073

JukkaL · 2015-12-13T20:26:07Z

Currently we create a new Python process for each unit test suite, which is wasteful when running just a small number of test cases, as the interpreter startup overhead has to be paid many times. Instead, we could use long-running processes that run multiple unit test tasks that get passed via a pipe or something from the main test runner process.

The goal is to make something like this run faster:

$ time ./runtests.py unit-test -a "missingtestname"
PARALLEL 8
SUMMARY  13 tasks selected
SUMMARY  all 13 tasks and 0 tests passed
*** OK ***

real    0m0.584s
user    0m2.556s
sys     0m0.187s

If we can get this to ~0.3s on typical hardware I'd be happy.
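The long-running worker idea above could look roughly like the following. This is a minimal sketch, not the actual runtests.py design: the `Worker` class, the `run_tasks` helper, and the one-JSON-object-per-line protocol are all assumptions made for illustration, and the worker only pretends to run a test suite.

```python
import json
import subprocess
import sys

# Hypothetical worker program: reads one JSON task per line from stdin,
# "runs" it, and writes one JSON result per line to stdout.
WORKER_SOURCE = r'''
import json, sys
for line in sys.stdin:
    task = json.loads(line)
    # Stand-in for running a real unit test suite.
    result = {"name": task["name"], "passed": True}
    sys.stdout.write(json.dumps(result) + "\n")
    sys.stdout.flush()
'''


class Worker:
    """One interpreter that stays alive and serves many tasks."""

    def __init__(self):
        self.proc = subprocess.Popen(
            [sys.executable, "-c", WORKER_SOURCE],
            stdin=subprocess.PIPE,
            stdout=subprocess.PIPE,
            text=True,
        )

    def run(self, name):
        # Send one task down the pipe and wait for its result line.
        self.proc.stdin.write(json.dumps({"name": name}) + "\n")
        self.proc.stdin.flush()
        return json.loads(self.proc.stdout.readline())

    def close(self):
        # Closing stdin ends the worker's read loop.
        self.proc.stdin.close()
        self.proc.wait()
        self.proc.stdout.close()


def run_tasks(names):
    """Run all tasks in a single worker, paying startup cost once."""
    worker = Worker()
    try:
        return [worker.run(name) for name in names]
    finally:
        worker.close()
```

A real runner would presumably keep several such workers (one per core) and hand each one the next pending task, so interpreter startup is paid once per worker rather than once per suite.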

elazarg · 2017-08-21T21:32:44Z

pytest is also slow for zero tests (4 seconds on my machine). Most of the time is spent on collecting tests. The problem is that collecting the tests involves setting them up, including creation of temporary files. This can be fixed (taking it down to 0.5 seconds) but requires refactoring.

emmatyping · 2018-08-06T21:11:46Z

@elazarg do you have suggestions for how this could be accomplished? I often find myself running just a few tests, so trimming down the startup overhead would be great.

elazarg · 2018-08-06T23:46:27Z

The collection process needs to do only one thing: find the data and the names of all the tests. Instead, it currently opens files, parses them, opens other files, writes new files, etc. All this work should be done at the time of test setup, not at test collection.
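As a rough illustration of that split, collection can record only each test's name and raw text and leave everything else to setup. The sketch below assumes a data-file format in which every test starts with a `[case name]` header line; the `LazyCase` class and `collect` function are hypothetical names, not mypy's actual test framework.

```python
import re
from dataclasses import dataclass
from typing import List, Optional

# Assumed header format, for illustration: a line such as "[case testName]".
CASE_HEADER = re.compile(r"^\[case ([A-Za-z_]\w*)\][ \t]*$", re.MULTILINE)


@dataclass
class LazyCase:
    name: str
    raw: str                           # unparsed body, untouched at collection
    lines: Optional[List[str]] = None  # filled in only by setup()

    def setup(self) -> None:
        # Expensive work (parsing sections, creating files, ...) belongs here,
        # so it runs only for the tests that were actually selected.
        self.lines = self.raw.strip().splitlines()


def collect(text: str) -> List[LazyCase]:
    """Find test names and slice out their raw bodies; parse nothing else."""
    matches = list(CASE_HEADER.finditer(text))
    cases = []
    for i, match in enumerate(matches):
        # A body runs from the end of its header to the next header (or EOF).
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        cases.append(LazyCase(match.group(1), text[match.end():end]))
    return cases
```

With this shape, selecting one test by name costs a regex scan of the file plus a single `setup()` call, instead of a full parse of every case.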

I have implemented it several times; strangely enough, even though it worked very well in the past (an attempt I abandoned due to the high code churn), it did not work when I retried it recently.

Another thing that I'm not sure about is why we need the temporary files at all. Almost everything should be possible to perform using StringIO.

One can also imagine having a cache of name->testdata, which would make collection trivial when testing the same feature repeatedly (effectively bypassing the decision to put several tests in the same file instead of storing them as different files in the same folder), but that's probably overkill.

elazarg · 2018-08-06T23:49:37Z

One difference between my first and second implementations is that in the first I used itertools.groupby() and regexes for the initial parsing of the files. Perhaps I should try doing that again.
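For reference, a groupby-plus-regex pass of that kind might look like this. It is only a guess at the general shape, not the code from either implementation, and it again assumes `[case name]` header lines; `split_cases` is a hypothetical name.

```python
import re
from itertools import groupby

# Assumed header format, for illustration: a line such as "[case testName]".
HEADER = re.compile(r"^\[case (\w+)\]\s*$")


def split_cases(lines):
    """Yield (name, body_lines) pairs, grouping lines by the case they follow."""
    current = None

    def key(line):
        # groupby calls this once per line, in order, so we can remember
        # the most recent header and label every following line with it.
        nonlocal current
        match = HEADER.match(line)
        if match:
            current = match.group(1)
        return current

    for name, group in groupby(lines, key=key):
        if name is None:
            continue  # lines before the first header
        # Drop the header line itself; keep only the body.
        yield name, [line for line in group if not HEADER.match(line)]
```

One limitation of this approach: two adjacent cases that share a name would be merged into a single group, so duplicate names would need a separate check.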

emmatyping · 2018-08-07T01:11:02Z

> Another thing that I'm not sure about is why do we need the temporary files at all. Almost everything should be possible to perform using StringIO.

Mostly due to the way mypy is structured. I've wanted to allow passing a StringIO to the API for a while, though that requires a bit of restructuring of the build process, and figuring out what that means (for example, it won't have a file name, which we usually assume sources have; we will likely need to translate that StringIO into a BuildSource, etc.).

Michael0x2a · 2018-08-07T01:40:19Z

One idea might be to cleanly divorce the filesystem interaction code from the actual build process entirely -- e.g. pull out all of the IO logic into some sort of "filesystem" object that understands how to map file names to module names and vice versa and returns IO objects on request.

We could then swap that object with something that uses StringIO instead of reading temp files for the tests.
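A minimal sketch of such a swappable object follows. None of these names exist in mypy: `FileSystem`, `InMemoryFileSystem`, `open_text`, `module_to_path`, and the `count_lines` stand-in for a build step are all hypothetical, and the module lookup is deliberately naive.

```python
import io
from typing import Dict, Protocol, TextIO


class FileSystem(Protocol):
    """Hypothetical interface the build would depend on instead of real IO."""

    def open_text(self, path: str) -> TextIO: ...

    def module_to_path(self, module: str) -> str: ...


class InMemoryFileSystem:
    """Test double: serves file contents from a dict via StringIO."""

    def __init__(self, files: Dict[str, str]) -> None:
        self._files = dict(files)

    def open_text(self, path: str) -> TextIO:
        try:
            return io.StringIO(self._files[path])
        except KeyError:
            raise FileNotFoundError(path) from None

    def module_to_path(self, module: str) -> str:
        # Naive lookup: a module is either a .py file or a package __init__.
        base = module.replace(".", "/")
        for candidate in (base + ".py", base + "/__init__.py"):
            if candidate in self._files:
                return candidate
        raise FileNotFoundError(module)


def count_lines(fs: FileSystem, module: str) -> int:
    """Stand-in for a build step that only ever talks to the fs object."""
    with fs.open_text(fs.module_to_path(module)) as f:
        return sum(1 for _ in f)
```

A production counterpart would implement the same two methods on top of the real filesystem, and the tests would pass an `InMemoryFileSystem` built from the test data instead of writing temporary files.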

Probably the trickiest part would be making sure this abstraction works cleanly with both incremental and daemon mode?

I guess I'm also not entirely sure if there actually is a one-to-one correspondence between the module name and the file name -- you could have files that have the same name living in different places, for example.

emmatyping · 2018-08-07T02:23:04Z

> I guess I'm also not entirely sure if there actually is a one-to-one correspondence between the module name and the file name

AFAIK there is always a one-to-one mapping from the fully resolved module name to the file path, so I can have both foo.bar.baz and foo.rab.baz.

I think a refactoring of the build would be great. It would also make it easier for editors to interface with mypy.

There's already #4365, so we should probably continue over there.

elazarg · 2018-08-12T17:05:34Z

I managed to get it from 2.88 to 1.4 seconds. The collection itself seems to be taking much of the time. I think we'll need caching to take it all the way down to 0.3.

Parse tests on collection only enough to find the name, so that a small number of tests runs faster.

On my machine, `pytest -n0 -k testAttrsSimple` takes at least 2.24 seconds to finish on master, and at most 0.95 seconds to finish with this PR.

- 'skip-cache' is changed to 'only_when_nocache' and similarly 'skip-nocache'
- I have replaced while loops with "if True" to make the diff simpler.

Further cleanup and optimizations are also possible.

Fixes #1073

JukkaL added the feature label Dec 13, 2015

ddfisher added this to the Future milestone Mar 2, 2016

gvanrossum added the priority-2-low label Jan 6, 2017

gvanrossum removed this from the Future milestone Mar 29, 2017

JukkaL added priority-0-high topic-tests and removed priority-2-low labels May 18, 2018

elazarg mentioned this issue Aug 12, 2018

Lazy tests: parse using regex first #5459

Merged

msullivan closed this as completed in #5459 Sep 26, 2018
