Fix long interleaving times #55

jmid · 2022-04-25T22:42:25Z

Investigating long running times where the following Thread test took long to the point of suspecting a deadlock/infinite loop (or a bug in the run-time):

   dune exec src/neg_tests/thread_lin_tests.exe -- -v -s 219916560

Because of other recent fixes enhancing reproducability, I was able to pin-point the following cmd triple:

([(Member 982); (Add_node 919); (Member 443); (Member 66); (Add_node 3); (Add_node 3); (Member 5)],
 [(Add_node 9); (Member 64); (Member 5); (Add_node 72); (Add_node 989); (Member 0); (Add_node 4); (Add_node 9); (Add_node 78); (Member 67); (Member 56); (Member 76); (Member 223); (Add_node 6); (Member 89)],
 [(Add_node 855); (Add_node 36); (Member 1); (Member 142); (Add_node 6); (Add_node 4); (Member 36); (Member 4); (Member 1); (Add_node 7); (Add_node 54); (Member 9); (Add_node 8)])

with a 15 and 13 element cmd list running in parallel and needing interleaving.

Depending on the scheduling, 50 repetitions of the above would take between 1min39sec and 15min to run - with most computation time being spent in the interleaving search. Since the interleaving is costly (exponential in input length I believe) we therefore reduce the input cmd list size. This brings the interleaving search time significantly down (to around 1min, worst case).

While we are at it, we similarly adjust STM's par_len to at most 12.

jmid · 2022-04-26T08:23:44Z

I remembered an old remark in Claessen-al:ICFP09 (state-machine based):

"We generate parallel test cases by parallelizing a suffix of an eqc_statem test case, separating it into two lists
of commands of roughly equal length, [...]"

I then changed arb_cmds_par to such an approach, as there's a smaller chance of triggering concurrency issues when running a (0 or) 1-element cmd list in parallel with a, say 10-element cmd list.
This has a statistical significant impact, which then led me to reduce rep_count back to 100 from 125 for Thread.

As all three interpretations (Domain, Thread, Effect) are affected by the arb_cmds_par change I noticed that Effect tests in neg_tests started taking needlessly long. I therefore reduced their test count from 20.000 down to 1.000 like the others. They still trigger the expected errors.

jmid added 4 commits April 26, 2022 00:09

adjust par cmd list length

62bad5f

adjust par cmd list length for STM too

83e9480

switch arb_cmds_par to generate cmd lists of roughly equal length

999c8b9

reduce Effect test count

595e034

jmid merged commit a64b114 into main May 2, 2022

jmid deleted the fix-long-interleaving branch May 2, 2022 07:12

jmid mentioned this pull request May 3, 2022

Port previous Lin tests to the new signature DSL #54

Closed

jmid mentioned this pull request Mar 14, 2023

STM tests ocaml-multicore/saturn#61

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix long interleaving times #55

Fix long interleaving times #55

Uh oh!

jmid commented Apr 25, 2022

Uh oh!

jmid commented Apr 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix long interleaving times #55

Fix long interleaving times #55

Uh oh!

Conversation

jmid commented Apr 25, 2022

Uh oh!

jmid commented Apr 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants