Fix/Improve Test Driver #721

o11c · 2015-07-08T04:27:37Z

The current travis.sh is broken in a number of ways. This fixes most of them.

Note: there are legitimate bugs exposed in mypy here, that's why the tests fail. The stubs do not appear to be incorrect.

o11c · 2015-07-25T04:51:56Z

Okay, I've refactored so that this can be merged before the XML one.

o11c · 2015-07-26T22:05:12Z

@JukkaL I'm really not sure if I can solve the bug this exposes. I just investigated my hypothesis and it turned out to be fruitless.

(An example of) The failing test is:
mypy -c 'import requests.packages.urllib3.connection'

Which is curious because this works and does result in an import of the connection module:
mypy -c 'import requests.packages.urllib3'

But this doesn't work:
mypy -c 'import requests.packages.urllib3, requests.packages.urllib3.connection'

This changes the error message:
mypy -c 'import requests.packages.urllib3.connectionpool, requests.packages.urllib3.connection'

And this somehow works:
mypy -c 'import requests.adapters, requests.packages.urllib3.connectionpool, requests.packages.urllib3.connection'

So all I can really say is that this is some subtle ordering problem that is beyond my skill/knowledge.

o11c · 2015-07-31T03:33:54Z

I added a list of XFAILs so at least we can get meaningful test results until the real bug is fixed.

Also managed to find a non-racy way to run tasks in parallel without os.waitid.

JukkaL · 2015-08-04T03:58:43Z

Thanks for the PR! This is pretty big so it might take a while but I hope to have time to review it this weekend.

o11c · 2015-08-17T04:51:31Z

Ugh, your recent pushes made there be conflicts.

JukkaL · 2015-08-24T04:46:08Z

I've now merged your changes to the current master (haven't pushed them yet) and have been playing around with them a little. It's much better to run tests in parallel, type check all stubs and to have automatic test discovery!

There are some regressions that shouldn't be difficult to fix or work around:

Running individual test cases requires more work than before (python3 -m mypy.munit -m mypy.test.testfoo "*foobar*" vs python3 tests.py "*check*foobar*"). It's probably worth figuring a simpler way of running a subset of unit tests.
The output is way noisier than before. I don't want many pagefuls of output if things go right. By default, it should not output a lot of stuff (but it's okay to write more verbose output to a log file so that it's possible to debug if things go wrong).
If travis.py is the only entry point to running multiple test suites, we should probably rename it to something else as people will also want to run it locally (tests.py?).
Previously there was an easy way to run only inexpensive (unit) tests (tests.py). It would be nice to be still able to easily skip expensive integration tests but run everything else (at least pythoneval, which is really slow). Or maybe we could run python eval tests in parallel by splitting it into multiple chunks.

o11c · 2015-08-24T19:26:10Z

Running individual test cases requires more work than before (python3 -m mypy.munit -m mypy.test.testfoo "foobar" vs python3 tests.py "_check_foobar*"). It's probably worth figuring a simpler way of running a subset of unit tests.

mypy.myunit is not intended to be run by hand at the moment (this may change if we switch from distutils to setuptools and get easy entry_point management.

What I usually do is run ./travis.py patterns.... For example, I have my vim configured to run ./travis.py mypy.syntax to only run syntax-related tests. This has the advantage of also running the type-checker for those modules, not just the unit test modules.

If it's really worth running just a single unit test (since other than pythoneval, test modules are fast), I briefly experimented with ./travis.py modulepatterns... -- arguments but stopped since it wasn't clear what should happen if there was more than one module. Perhaps ./travis.py modulepattern:argument?

Really, mypy.myunit needs more love; it is currently a half-assed approach to testing. Of particular relevance to this issue, it ought to have discovery of its own. I have been thinking about switching to layout compatible with py.test however.

The output is way noisier than before.

Am I the only one who always runs test drivers in verbose mode? It's not like there is anything valuable in the terminal's scrollback, and the failures are nicely collected at the end. But I'm certainly not opposed to adding command-line options to travis.py to control it.

I prefer immediate output because deferred output and log files can be problematic, though I admit that I am used to working in C and C++ where memory corruption and sudden exit are more common (though they're certainly possible in python also).

If travis.py is the only entry point to running multiple test suites, we should probably rename it to something else as people will also want to run it locally (tests.py?).

I deliberately did not name it tests.py because it is not at all similar to the tests.py that existed before this patch. And I think people should draw comfort in knowing that the way they are running tests matches the way tests will be run automatically.

Bikeshed alternative: runtests.py ?

Previously there was an easy way to run only inexpensive (unit) tests

Currently, ./travis.py unit-test is my preferred way to run all unit tests, which includes pythoneval tests. Perhaps the collector should be given additional knowledge about the pythoneval tests specifically (some form of this may be needed anyway for #732), or possibly you could add negative patterns as well as positive patterns.

This PR is not an attempt to provide perfection, only an improvement. Ideally I think mypy.myunit would integrate all the abilities of travis.py and mypy.waiter but I don't have a clear design yet and I intend to work on other things first.

o11c · 2015-09-21T19:58:32Z

@JukkaL Can you please (rebase and) merge this? I have no clue what is acceptable to you, and you keep on giving painful delays.

I'm almost done with the new parser (still need to finish expression parsing and port billions of unit tests), but for lowering I really need to be able to use the updated version of mypy.

JukkaL · 2015-10-01T13:08:45Z

@o11c: Sorry for having being unresponsive; I've been out of the country for almost 3 weeks now and been very busy. Things will calm down next week. I may have a few hours later this week, but I can't promise anything.

o11c · 2015-10-01T16:47:02Z

Note, there is a rebased version of this at https://github.com/o11c/mypy/tree/driver-rebased but I haven't investigated the new test failures yet.

o11c · 2015-10-01T17:25:32Z

Okay, fixed.

o11c · 2015-10-01T17:27:57Z

Okay, fixed. More cyclic import problems that probably should be fixed elsewhere, but ... explicit annotations will do for now.

o11c · 2015-10-01T18:35:29Z

Rebased to fix another bug caused by upstream changes, and made the linter stop complaining.

I really think the linter is being stupid for some of those things though.

o11c · 2015-10-01T18:41:56Z

@JukkaL note that I still have to xfail 4 of the stubs, even though they are correct. Obviously there's still a cyclic import problem somewhere.

o11c · 2015-10-11T05:03:27Z

Rebased on top of #903 for easier maintenance of my other branches.

But seriously, this is embarrassing.

JukkaL · 2015-10-11T16:59:35Z

@o11c: This PR has taken me a lot of work to review, and because it has sometimes felt like a burden it's also been slow. I've done some reflection and tried to understand why this happened and how we can make this less likely to happen in the future. Here are my thoughts: http://www.mypy-lang.org/wiki/GoodPullRequest

Anyway, I'm going to do another round of review today and hopefully I can finally merge it.

o11c force-pushed the driver branch 3 times, most recently from 52b3e53 to 94b204e Compare July 13, 2015 03:28

This was referenced Jul 14, 2015

Update travis ci script to check all module stubs #594

Closed

Bad circular imports are not rejected #61

Open

On mypy libraries #724

Closed

o11c force-pushed the driver branch from 9345908 to 05cc8c8 Compare July 25, 2015 04:45

o11c force-pushed the driver branch from 05cc8c8 to a7ee0a6 Compare July 26, 2015 21:01

o11c force-pushed the driver branch from a7ee0a6 to ac3b304 Compare July 31, 2015 03:25

o11c force-pushed the driver branch 2 times, most recently from e9401c1 to f04f262 Compare August 1, 2015 03:26

This was referenced Sep 24, 2015

Parser redesign #880

Closed

Policy regarding python version specific stub changes #874

Closed

Typeshed #884

Merged

o11c force-pushed the driver branch from 8053a15 to 41b6e46 Compare October 1, 2015 17:23

o11c force-pushed the driver branch from 41b6e46 to de0ce10 Compare October 1, 2015 18:33

o11c and others added 24 commits October 10, 2015 21:41

Rename some stubs from .py to .pyi

a0c7b31

Miscellaneous small stub changes

bf40f0b

Stub for docutils should be third-party

3e6ae1a

Stub for token

4fd1040

Fix codec-related stubs

8d3a11e

Fix broken stub caused by JukkaL not merging my PR yet

4cfdbaf

Add stub for pipes

95f35a2

Add stubs for textwrap

f41a270

Stubs for bisect

dfcd744

Stub for inspect.stack() and related classes

9861d51

Use SupportsBytes

06b670d

Distinguish third-party stubs from 3.2 stubs

2e591dc

Move script to mypy.main module and just leave stubs

2c009ca

Fix bugs in test driver

e45f793

Allow testsuite to run in parallel

9d8c671

Run myunit test modules individually

f6b23d3

Use multiline repr when tests fail

129215e

Fix test failures after rebase

30a3d67

Appease the silly linter

300a19c

make test logging less verbose

b91362d

Move linting into travis.py

66da905

Add verbosity options

ba47837

Rename test driver

7febd53

Implement suggestions

46cd254

o11c force-pushed the driver branch from 1d2a197 to 46cd254 Compare October 11, 2015 05:01

JukkaL merged commit 46cd254 into python:master Oct 11, 2015

o11c deleted the driver branch October 11, 2015 19:29

JukkaL mentioned this pull request Oct 11, 2015

New test driver #907

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix/Improve Test Driver #721

Fix/Improve Test Driver #721

o11c commented Jul 8, 2015

o11c commented Jul 25, 2015

o11c commented Jul 26, 2015

o11c commented Jul 31, 2015

JukkaL commented Aug 4, 2015

o11c commented Aug 17, 2015

JukkaL commented Aug 24, 2015

o11c commented Aug 24, 2015

o11c commented Sep 21, 2015

JukkaL commented Oct 1, 2015

o11c commented Oct 1, 2015

o11c commented Oct 1, 2015

o11c commented Oct 1, 2015

o11c commented Oct 1, 2015

o11c commented Oct 1, 2015

o11c commented Oct 11, 2015

JukkaL commented Oct 11, 2015

Fix/Improve Test Driver #721

Fix/Improve Test Driver #721

Conversation

o11c commented Jul 8, 2015

o11c commented Jul 25, 2015

o11c commented Jul 26, 2015

o11c commented Jul 31, 2015

JukkaL commented Aug 4, 2015

o11c commented Aug 17, 2015

JukkaL commented Aug 24, 2015

o11c commented Aug 24, 2015

o11c commented Sep 21, 2015

JukkaL commented Oct 1, 2015

o11c commented Oct 1, 2015

o11c commented Oct 1, 2015

o11c commented Oct 1, 2015

o11c commented Oct 1, 2015

o11c commented Oct 1, 2015

o11c commented Oct 11, 2015

JukkaL commented Oct 11, 2015