Don't modify the on disk cache in fine-grained mode #4664

msullivan · 2018-03-02T02:39:51Z

This is a little subtle, because interface_hash still needs to be
computed, as it is a major driver of the coarse-grained build process.

Since metas are no longer computed for files that get rechecked during
build, to avoid spuriously reprocessing them we need to find initial
file state in cache mode as well.

They'll still merge conflict, but the resolution will be trivial now (I'm trying avoid making one depend on the other)

JukkaL

Looks good, just a few notes below. Please test this change manually with and without a remote cache before merging.

It would be useful to have the motivation for this change spelled out in the commit message. Is it more about performance of incremental updates or about correctness?

Longer-term, we may want to write cache files in fine-grained incremental mode, at least when not using a remote cache. This could improve performance if users frequently restart the daemon and don't use a remote cache (or if we decide to shut down the daemon after some time of inactivity). Can you create an issue about this?

JukkaL · 2018-03-02T14:10:06Z

mypy/dmypy_server.py

-        if not self.options.use_fine_grained_cache:
-            # Stores the initial state of sources as a side effect.
-            self.fswatcher.find_changed()
+        # Stores the initial state of sources as a side effect.


Why was this changed to be executed unconditionally?

So that we have accurate fswatcher cache information for files that we didn't read from the on-disk cache, now that we don't generate CacheMetas

JukkaL · 2018-03-02T14:11:09Z

mypy/dmypy_server.py

        # Run a fine-grained update starting from the cached data
        if self.options.use_fine_grained_cache:
            # Pull times and hashes out of the saved_cache and stick them into
            # the fswatcher, so we pick up the changes.
            for state in self.fine_grained_manager.graph.values():
                meta = state.meta
+                # If there isn't a meta, that means the current
+                # version got checked in the initial build.


Relying on this logic here seems a bit fragile. Maybe move the meta is None check to a helper method in State and use the helper method here instead?

Changing it to use is_cache_skeleton instead.

msullivan · 2018-03-05T17:29:39Z

The main rationale is closer to correctness than performance. Currently we write and delete cache files during the initial load of fine-grained mode, but never during fine-grained updates. This means that fine-grained can mess the cache up but won't fix it without a restart, which seems not great.

This is a little subtle, because interface_hash still needs to be computed, as it is a major driver of the coarse-grained build process. Since metas are no longer computed for files that get rechecked during build, to avoid spuriously reprocessing them we need to find initial file state in cache mdoe as well.

msullivan · 2018-03-05T19:34:16Z

This has a bad interaction with #4669 and needs changes. My plan is to land #4669 and revise this after.

msullivan · 2018-03-08T00:22:59Z

Withdrawing this. Some changes I am making as part of my deletion optimization path is going to make this fall out much more simply.

msullivan requested a review from JukkaL March 2, 2018 02:39

msullivan added a commit that referenced this pull request Mar 2, 2018

Tweak flushing logic to better match #4664

4c6d59e

They'll still merge conflict, but the resolution will be trivial now (I'm trying avoid making one depend on the other)

msullivan added a commit that referenced this pull request Mar 2, 2018

Tweak flushing logic to better match #4664

0f5a411

They'll still merge conflict, but the resolution will be trivial now (I'm trying avoid making one depend on the other)

JukkaL approved these changes Mar 2, 2018

View reviewed changes

msullivan added 2 commits March 5, 2018 09:51

Tweak some of the logic for pulling info from the metas

8b3702d

msullivan force-pushed the no_fg_cache_write branch from f9076ed to 8b3702d Compare March 5, 2018 17:52

msullivan mentioned this pull request Mar 7, 2018

Make testfinegrained use dmypy_server #4699

Merged

msullivan closed this Mar 8, 2018

msullivan deleted the no_fg_cache_write branch March 8, 2018 00:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Don't modify the on disk cache in fine-grained mode #4664

Don't modify the on disk cache in fine-grained mode #4664

Uh oh!

msullivan commented Mar 2, 2018

Uh oh!

JukkaL left a comment

Uh oh!

JukkaL Mar 2, 2018

Uh oh!

msullivan Mar 5, 2018

Uh oh!

JukkaL Mar 2, 2018

Uh oh!

msullivan Mar 5, 2018

Uh oh!

msullivan commented Mar 5, 2018

Uh oh!

msullivan commented Mar 5, 2018

Uh oh!

msullivan commented Mar 8, 2018

Uh oh!

Uh oh!

Uh oh!

Don't modify the on disk cache in fine-grained mode #4664

Don't modify the on disk cache in fine-grained mode #4664

Uh oh!

Conversation

msullivan commented Mar 2, 2018

Uh oh!

JukkaL left a comment

Choose a reason for hiding this comment

Uh oh!

JukkaL Mar 2, 2018

Choose a reason for hiding this comment

Uh oh!

msullivan Mar 5, 2018

Choose a reason for hiding this comment

Uh oh!

JukkaL Mar 2, 2018

Choose a reason for hiding this comment

Uh oh!

msullivan Mar 5, 2018

Choose a reason for hiding this comment

Uh oh!

msullivan commented Mar 5, 2018

Uh oh!

msullivan commented Mar 5, 2018

Uh oh!

msullivan commented Mar 8, 2018

Uh oh!

Uh oh!