Most of our regression tests have expected results to check against, but none of our benchmarks do. As a result, we cannot tell when Goblint has become better or worse on the benchmarks, and we cannot track benchmark regressions automatically.
Various degrees of expected results would be possible: