- There are currently 6-7 tests in the evaluation framework that belong in the benchmarking framework.
- For those tests, we want to make sure that the testing environment variable is set and that running them does not trigger a release, open a PR, or cause similar side effects.
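
One way to enforce this is a small guard around any step with side effects. The sketch below is only an illustration: the variable name `TESTING` and the helpers `is_testing` and `maybe_publish` are hypothetical, since the notes do not say which variable or functions the frameworks actually use.

```python
import os

# Hypothetical variable name; the notes do not say which variable is used.
TESTING_ENV_VAR = "TESTING"


def is_testing(environ=None):
    """Return True when the (assumed) testing flag is set to a truthy value."""
    env = os.environ if environ is None else environ
    return env.get(TESTING_ENV_VAR, "").strip().lower() in {"1", "true", "yes"}


def maybe_publish(publish, environ=None):
    """Call `publish` (release, open a PR, ...) only outside testing mode.

    Returns True if `publish` was called, False if it was skipped.
    """
    if is_testing(environ):
        return False
    publish()
    return True
```

A moved test could then assert `is_testing()` at the start, and any release or PR step could be routed through `maybe_publish` so that it becomes a no-op while the flag is set.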