improve ergonomics of test execution

The test format implementation is difficult to work with for a number of reasons.

## Cannot Run Coverage Code Locally
Attempting to run
```
pytest tests/test_extensions.py::test_substrait_extension_coverage
```
locally results in errors like
```
__________________________________________________ ERROR collecting tests/test_extensions.py __________________________________________________
ImportError while importing test module '/Users/victor.barua/scratch/substrait/tests/test_extensions.py'.
Hint: make sure your test modules/packages have valid Python names.
Traceback:
tests/test_extensions.py:4: in <module>
    from tests.coverage.case_file_parser import load_all_testcases
tests/coverage/case_file_parser.py:4: in <module>
    from antlr4 import CommonTokenStream, FileStream
E   ModuleNotFoundError: No module named 'antlr4'
=========================================================== 1 error in 0.02 seconds ===========================================================
ERROR: not found: /Users/victor.barua/scratch/substrait/tests/test_extensions.py::test_substrait_extension_coverage
(no name '/Users/victor.barua/scratch/substrait/tests/test_extensions.py::test_substrait_extension_coverage' in any of [<Module test_extensions.py>])
```
even with then antlr4 runtime available
```
➜ pip3 freeze
antlr4-python3-runtime==4.13.2
...
```

As a result I cannot test changes locally AND because this is included as a pre-commit hook, not of my commits are able to use the hooks.

## Adding Tests Requires Mucking With Coverage Asserts
When trying to fix the lints in https://github.com/substrait-io/substrait/pull/769 as a result of me _adding_ coverage to existing tests, I have to muck with some of the numbers in
https://github.com/substrait-io/substrait/blob/2d8b1b673718df3deb9deae73de1f53dedba75f1/tests/test_extensions.py#L10-L38 It's not clear what I need to be modifying based on the error, and also it's hard to verify what needs to change as I can't run this locally.

# Desired State
In order of increasing opinionatedness
1. If were going to have Python tooling, we should have way to configure environments reproducibly (i.e. requirements.tx/pipfiles/etc) so that folks can execute test and hooks locally.
2. We should seperate out the execution of the tests from coverage.
3. CI should block on test failures, but not on coverage changes. Coverage should be a report that we can look at.



	# NOTE: this test is run as part of pre-commit hook
	def test_substrait_extension_coverage():
	script_dir = os.path.dirname(os.path.abspath(__file__))
	extensions_path = os.path.join(script_dir, "../extensions")
	registry = Extension.read_substrait_extensions(extensions_path)
	assert len(registry.registry) >= 161
	num_overloads = sum([len(f) for f in registry.registry.values()])
	assert num_overloads >= 510
	assert len(registry.dependencies) >= 13
	assert len(registry.scalar_functions) >= 162
	assert len(registry.aggregate_functions) >= 29
	assert len(registry.window_functions) >= 0

	test_case_dir = os.path.join(script_dir, "./cases")
	all_test_files = load_all_testcases(test_case_dir)
	coverage = get_test_coverage(all_test_files, registry)

	assert coverage.test_count >= 1077
	assert (
	coverage.num_tests_with_no_matching_function == 0
	), f"{coverage.num_tests_with_no_matching_function} tests with no matching function"
	assert coverage.num_covered_function_variants >= 226
	assert coverage.total_function_variants >= 513
	assert (
	coverage.total_function_variants - coverage.num_covered_function_variants
	) <= 287, (
	f"Coverage gap too large: {coverage.total_function_variants - coverage.num_covered_function_variants} "
	f"function variants with no tests, out of {coverage.total_function_variants} total function variants."
	)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

improve ergonomics of test execution #771

Cannot Run Coverage Code Locally

Adding Tests Requires Mucking With Coverage Asserts

Desired State

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

improve ergonomics of test execution #771

Description

Cannot Run Coverage Code Locally

Adding Tests Requires Mucking With Coverage Asserts

Desired State

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions