check-aux: test core, alloc, std in Miri #123506

RalfJung · 2024-04-05T17:05:46Z

Let's see if this works, and how long it takes.

rustbot · 2024-04-05T17:05:54Z

rustbot has assigned @onur-ozkan.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

RalfJung · 2024-04-05T18:43:38Z

Oh...

     Running benches/lib.rs (build/x86_64-unknown-linux-gnu/stage1-std/miri/x86_64-unknown-linux-gnu/debug/deps/allocbenches-a59974b85394f408)

This runs some benchmarks it seems. That will ~never terminate under Miri.^^

RalfJung · 2024-04-06T08:11:17Z

Oh, interesting... the macOS build of compiler-builtins is trying to run a C compiler.

For now let's just run tests on Linux only, then.

RalfJung · 2024-04-06T08:23:17Z

Strangely the macos tests work fine locally. Not sure what is different on CI...

RalfJung · 2024-04-06T10:34:37Z

All right, tests are passing!

core, alloc lib tests: 20 minutes
core, alloc doc tests: 20 minutes
std lib tests: 3 minutes
std doc tests: 1 minute

On std we're unfortunately only testing one target as somehow the compiler-builtins build script fails for other targets. I'll extend the std tests a bit (we need some filters here as not all of std is supported by Miri).

There are likely some testcases that are unproportionally slow. They are somewhat hard to identify. For doctests we get the "has been running for a long time" warning, which points at this test that creates a 100k element linked list. For libtests, the test harness runs inside Miri where it disables timing in Miri to make sure it can run with isolation enabled. Now that we have the fake clock in isolation mode, maybe we can fix that. Whether that gives useful results is a different question...

RalfJung · 2024-04-06T10:44:17Z

library/alloc/src/sync.rs

-    /// for i in 0..100000 {
+    /// let size = 100000;
+    /// # let size = if cfg!(miri) { 100 } else { size };
+    /// for i in 0..size {


Is that an okay approach to make the test shorter in Miri? The alternative would be more convoluted but would let the code look as before on the web view:

# let size = if cfg!(miri) { 100 } else { 100000 }; # for i in 0..size { # /* for i in 0..100000 { # */

Seems fine to me, especially given it's relatively isolated. Realistically I'm not convinced this example needs a huge length (100 elements or 100,000 is about equally bad if you're pushing a new stack frame per element, IMO, given that you might already be close to the end of the allowed stack when starting to execute).

RalfJung · 2024-04-06T13:17:37Z

It's actually a bit surprising that these take so long, since the out-of-tree test tests alloc and core in 45 minutes total and it runs all tests on 2 targets. So the test here should have been about twice as fast.

Do we have debug assertions enabled on the gnu-aux runner, or something like that?

RalfJung · 2024-04-06T13:25:30Z

Cc @Mark-Simulacrum

ChrisDenton · 2024-04-06T13:28:08Z

Oh, interesting... the macOS build of compiler-builtins is trying to run a C compiler.

presumably the build.optimized-compiler-builtins config is set to true.

Mark-Simulacrum · 2024-04-06T13:39:02Z

Oh, interesting... the macOS build of compiler-builtins is trying to run a C compiler.

presumably the build.optimized-compiler-builtins config is set to true.

Yes:

rust/src/ci/run.sh

Line 91 in 01f7f3a

    
           RUST_CONFIGURE_ARGS="$RUST_CONFIGURE_ARGS --set build.optimized-compiler-builtins"

Do we have debug assertions enabled on the gnu-aux runner, or something like that?

Yes:

rust/src/ci/run.sh

Lines 125 to 140 in 01f7f3a

    
           # We almost always want debug assertions enabled, but sometimes this takes too 
        
           # long for too little benefit, so we just turn them off. 
        
           if [ "$NO_DEBUG_ASSERTIONS" = "" ]; then 
        
             RUST_CONFIGURE_ARGS="$RUST_CONFIGURE_ARGS --enable-debug-assertions" 
        
           fi 
        
           # Same for overflow checks 
        
           if [ "$NO_OVERFLOW_CHECKS" = "" ]; then 
        
             RUST_CONFIGURE_ARGS="$RUST_CONFIGURE_ARGS --enable-overflow-checks" 
        
           fi 
        
           # In general we always want to run tests with LLVM assertions enabled, but not 
        
           # all platforms currently support that, so we have an option to disable. 
        
           if [ "$NO_LLVM_ASSERTIONS" = "" ]; then 
        
             RUST_CONFIGURE_ARGS="$RUST_CONFIGURE_ARGS --enable-llvm-assertions" 
        
           fi

RalfJung · 2024-04-07T08:01:28Z

src/bootstrap/mk/Makefile.in

+	# In `std` we cannot test everything.
+	$(Q)MIRIFLAGS="-Zmiri-disable-isolation" BOOTSTRAP_SKIP_TARGET_SANITY=1 \
+		$(BOOTSTRAP) miri --stage 2 library/std \
+		--no-doc -- \


I didn't add mips-unknown-linux-gnu here as I think std has much less code that depends on bitwidth or endianess, so the extra 10min CI time does not seem worth it. Instead we have the macOS and Windows checks below, as std does contain a lot of OS-specific code. (And it's a 32bit Windows so at least that kind of architecture is covered.)

requires disabling some tests that do not work

RalfJung · 2024-04-07T08:06:45Z

Ah, that's more like it. Now even when testing two targets for core and alloc we get

core+alloc libtests: 30min
core+alloc doctests: 10min
std libtests: 10min
std doctests: 1min
std reduced libtests for extra targets: 11min

The total duration of the gnu-aux job is around 104min.

That is just 2min above the anticipated 1h. I can remove some of the extra targets if that is desired. Testing core and alloc only on one target brings it down by about 15min.

r? @Mark-Simulacrum

RalfJung · 2024-04-07T15:40:16Z

Some data from #123560: enabling just overflow checks add around 10min to total CI time, enabling just debug assertions adds around 30min.

RalfJung · 2024-04-07T15:40:36Z

@rustbot ready

Mark-Simulacrum · 2024-04-07T22:45:06Z

@bors r+

I think this is good to land. We can always tweak it further in future PRs (e.g., get rid of the Makefile, maybe try to enable benchmarks running once or something like that).

bors · 2024-04-07T22:45:08Z

📌 Commit 596908b has been approved by Mark-Simulacrum

It is now in the queue for this repository.

bors · 2024-04-08T00:08:47Z

⌛ Testing commit 596908b with merge a2c72ce...

bors · 2024-04-08T02:11:58Z

☀️ Test successful - checks-actions
Approved by: Mark-Simulacrum
Pushing a2c72ce to master...

rust-timer · 2024-04-08T04:07:29Z

Finished benchmarking commit (a2c72ce): comparison URL.

Overall result: ❌ regressions - no action needed

@rustbot label: -perf-regression

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	0.5%	[0.4%, 0.8%]	4
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	-	-	0

Max RSS (memory usage)

This benchmark run did not return any relevant results for this metric.

Cycles

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	1.2%	[1.0%, 1.4%]	4
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	1.2%	[1.0%, 1.4%]	4

Binary size

This benchmark run did not return any relevant results for this metric.

Bootstrap: 668.322s -> 668.882s (0.08%)
Artifact size: 318.48 MiB -> 318.31 MiB (-0.05%)

RalfJung · 2024-04-08T05:27:05Z

I think benchmarks are already only run once in test mode? This is based on a vague memory, nothing more. However some benchmarks create large data structures in the preparation phase of the benchmark and I think that is what makes them slow on Miri.

Kobzol · 2024-04-08T07:20:55Z

This PR had nothing to do with our benchmarking infrastructure :) So this has to be pure noise.

rustbot assigned onur-ozkan Apr 5, 2024

RalfJung force-pushed the miri-test-libstd branch 2 times, most recently from 49acb75 to ff3fccd Compare April 5, 2024 19:35

This comment has been minimized.

Sign in to view

RalfJung force-pushed the miri-test-libstd branch from ff3fccd to 4a36dfb Compare April 5, 2024 21:11

This comment has been minimized.

Sign in to view

onur-ozkan added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Apr 6, 2024

RalfJung force-pushed the miri-test-libstd branch from 139edf9 to b2801f7 Compare April 6, 2024 06:08

This comment has been minimized.

Sign in to view

RalfJung force-pushed the miri-test-libstd branch 2 times, most recently from 3776b53 to 18955b2 Compare April 6, 2024 08:21

RalfJung force-pushed the miri-test-libstd branch from 18955b2 to e72edf3 Compare April 6, 2024 10:38

RalfJung commented Apr 6, 2024

View reviewed changes

RalfJung force-pushed the miri-test-libstd branch from e72edf3 to 31fa4b3 Compare April 6, 2024 11:26

This comment has been minimized.

Sign in to view

RalfJung mentioned this pull request Apr 6, 2024

disable the thread-leaking test instead of allowing leaks rust-lang/miri-test-libstd#58

Merged

RalfJung force-pushed the miri-test-libstd branch from be4aa35 to 7e33fd3 Compare April 7, 2024 07:59

RalfJung marked this pull request as ready for review April 7, 2024 08:00

RalfJung commented Apr 7, 2024

View reviewed changes

RalfJung added 5 commits April 7, 2024 10:05

also test parts of std

1242093

requires disabling some tests that do not work

make a doctest less slow in Miri

a986c0a

run some std tests on more targets

2408981

disable debug assertions to speed up the check-aux job

d0346c5

also test core+alloc on a 32bit big-endian target

596908b

RalfJung force-pushed the miri-test-libstd branch from 7e33fd3 to 596908b Compare April 7, 2024 08:06

rustbot assigned Mark-Simulacrum and unassigned onur-ozkan Apr 7, 2024

RalfJung changed the title ~~check-aux: test core and alloc in Miri~~ check-aux: test core, alloc, std in Miri Apr 7, 2024

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Apr 7, 2024

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Apr 7, 2024

bors added the merged-by-bors This PR was explicitly merged by bors. label Apr 8, 2024

bors merged commit a2c72ce into rust-lang:master Apr 8, 2024

rustbot added this to the 1.79.0 milestone Apr 8, 2024

RalfJung deleted the miri-test-libstd branch April 8, 2024 20:26

check-aux: test core, alloc, std in Miri #123506

check-aux: test core, alloc, std in Miri #123506

Uh oh!

Conversation

RalfJung commented Apr 5, 2024

Uh oh!

rustbot commented Apr 5, 2024

Uh oh!

RalfJung commented Apr 5, 2024

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

RalfJung commented Apr 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RalfJung commented Apr 6, 2024

Uh oh!

RalfJung commented Apr 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RalfJung Apr 6, 2024

Choose a reason for hiding this comment

Uh oh!

Mark-Simulacrum Apr 7, 2024

Choose a reason for hiding this comment

Uh oh!

This comment has been minimized.

RalfJung commented Apr 6, 2024

Uh oh!

RalfJung commented Apr 6, 2024

Uh oh!

ChrisDenton commented Apr 6, 2024

Uh oh!

Mark-Simulacrum commented Apr 6, 2024

Uh oh!

RalfJung Apr 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

RalfJung commented Apr 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RalfJung commented Apr 7, 2024

Uh oh!

RalfJung commented Apr 7, 2024

Uh oh!

Mark-Simulacrum commented Apr 7, 2024

Uh oh!

bors commented Apr 7, 2024

Uh oh!

bors commented Apr 8, 2024

Uh oh!

bors commented Apr 8, 2024

Uh oh!

rust-timer commented Apr 8, 2024

Overall result: ❌ regressions - no action needed

Instruction count

Max RSS (memory usage)

Cycles

Binary size

Uh oh!

RalfJung commented Apr 8, 2024 via email

Uh oh!

Kobzol commented Apr 8, 2024

Uh oh!

Uh oh!

RalfJung commented Apr 6, 2024 •

edited

Loading

RalfJung commented Apr 6, 2024 •

edited

Loading

RalfJung Apr 7, 2024 •

edited

Loading

RalfJung commented Apr 7, 2024 •

edited

Loading