Store all incremental data into single file #748

jerhard · 2022-05-31T15:39:51Z

With this PR, the incremental data is stored to a single file.

Now, incremental data is kept in-memory, maintained by Serialize.Cache. Data stored on disc can be loaded into memory with load_data, and stored from the cache to disk with store_data.
Retrieving the data from there can be done using a GADT, giving some type safety for the stored data with static type. As the the types for solver and analysis data are determined at run time, they are still kept as Obj.t in the cache.

Closes #357.

Incremental data is kept in-memory, maintained by Serialize.Cache. Retrieving the data from there can be done using a GADT.

src/framework/control.ml

src/incremental/serialize.ml

sim642 · 2022-06-01T07:26:29Z

src/incremental/serialize.ml

+    solver_data: Obj.t option ref;
+    analysis_data: Obj.t option ref;
+    version_data: MaxIdUtil.max_ids option ref;
+    cil_file: Cil.file option ref;


Instead of refs, the fields could simply be mutable.

Also, why are they all options? When would we ever have incremental data that doesn't have all four components present? I think we always need all of them (or have none of them), so the whole record could be optional.

Instead of refs, the fields could simply be mutable.

Done.

Also, why are they all options? When would we ever have incremental data that doesn't have all four components present? I think we always need all of them (or have none of them), so the whole record could be optional.

The issue is that during the initial analysis, these fields would not all become available at the same time. Namely, the Cil file and the max ids would become available after parsing (in Maingoblint), while the solver and analysis data only becomes availabe after solving the constraint system.

If one were to make the fields in the record non-optional, one would have to drag the cil file and max ids around until all of the data is available, so that one can set them in the record here.

One could potentially have two different records for the data that was read from the previous run and for the data that one produces in the current run. Then in first record the fields would be non-optional and in the other they would be optional.

these fields would not all become available at the same time

Right. At some point this bothered me before, but I think we can keep them optional for now.

At some point it might be useful to have them all in an immutable record that's read and created as a whole, because storing them at different times maybe can cause some inconsistencies during server mode or such. For example, if the new file is parsed and stored, the solving starts but is aborted, then the cache possibly has inconsistent state.

src/incremental/serialize.ml

in some intermediate version of the code refs were used, as they make it possbile to extract the code that obtains the reference to a field into a separate method. This is no longer needed.

Remove "Request" suffix for variants of the data_query constructor.

… incremental data

…ype. This way, it can be avoided to do an explicit cast via Obj.obj when querying the AnalysisData.

src/framework/control.ml

sim642

I didn't test this, but the code looks like it should.

jerhard added 2 commits May 31, 2022 17:20

Store all incremental data into one file, see issue #357.

a7a1d02

Incremental data is kept in-memory, maintained by Serialize.Cache. Retrieving the data from there can be done using a GADT.

Add type annotation for loaded analysis data

3878b40

jerhard added the cleanup Refactoring, clean-up label May 31, 2022

jerhard requested a review from stilscher May 31, 2022 15:39

sim642 self-requested a review May 31, 2022 15:41

sim642 reviewed Jun 1, 2022

View reviewed changes

jerhard added 6 commits June 1, 2022 10:55

Remove redundant comment

472c8ef

Remove incremental_data type, replace its usage with data_query

340f63c

Make fields of Cache.data mutable instead of refs.

9a3e7a8

in some intermediate version of the code refs were used, as they make it possbile to extract the code that obtains the reference to a field into a separate method. This is no longer needed.

Rename variants of data_query constructor.

31a6542

Remove "Request" suffix for variants of the data_query constructor.

Remove dead code dealing with no longer necessary tmp directories for…

3f872f3

… incremental data

Remove type variable for constructor AnalysisData of the data_query t…

a9dff9a

…ype. This way, it can be avoided to do an explicit cast via Obj.obj when querying the AnalysisData.

sim642 reviewed Jun 3, 2022

View reviewed changes

src/framework/control.ml Outdated Show resolved Hide resolved

src/framework/control.ml Outdated Show resolved Hide resolved

jerhard added 2 commits June 3, 2022 18:00

Remove unnecessary Obj.repr in control in call to update_data

5b00f11

Remove type variable for Cache.SolverData

140fec6

jerhard force-pushed the issue_357_new branch from 6fe9f43 to 140fec6 Compare June 3, 2022 16:05

sim642 approved these changes Jun 6, 2022

View reviewed changes

Fix comments

c3962dc

jerhard merged commit af3a084 into master Jun 8, 2022

jerhard deleted the issue_357_new branch June 8, 2022 08:18

jerhard mentioned this pull request Jun 8, 2022

Handle renaming of local variables in incremental analysis (AST) #731

Merged

sim642 added this to the v2.0.0 milestone Aug 12, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Store all incremental data into single file #748

Store all incremental data into single file #748

Uh oh!

jerhard commented May 31, 2022

Uh oh!

Uh oh!

Uh oh!

sim642 Jun 1, 2022

Uh oh!

jerhard Jun 1, 2022

Uh oh!

jerhard Jun 1, 2022

Uh oh!

sim642 Jun 1, 2022

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sim642 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Store all incremental data into single file #748

Store all incremental data into single file #748

Uh oh!

Conversation

jerhard commented May 31, 2022

Uh oh!

Uh oh!

Uh oh!

sim642 Jun 1, 2022

Choose a reason for hiding this comment

Uh oh!

jerhard Jun 1, 2022

Choose a reason for hiding this comment

Uh oh!

jerhard Jun 1, 2022

Choose a reason for hiding this comment

Uh oh!

sim642 Jun 1, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sim642 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants