test: example from wiki by paulz · Pull Request #63 · thisisartium/continuous-alignment-testing

paulz · 2025-03-25T03:18:19Z

This pull request includes several changes to the examples/team_recommender/tests suite, focusing on statistical significance testing and sample size calculations. The most important changes include the addition of new functions, modifications to existing test functions, and the removal of a test file.

New functions and modifications:

examples/team_recommender/tests/helpers.py: Added a new function is_statistically_significant to determine statistical significance.
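The PR text does not show the body of is_statistically_significant, so the following is only a minimal sketch of one way such a check could work. It assumes the argument order (expected success rate, failure count, sample size) seen in the review comments, a two-sided one-sample z-test for a proportion, and the 90% confidence level mentioned in the commit log. The confidence parameter and the zero-variance handling are assumptions made for illustration, not the repository's implementation.

```python
from math import sqrt
from statistics import NormalDist


def is_statistically_significant(
    expected_success_rate: float,
    failure_count: int,
    sample_size: int,
    confidence: float = 0.90,  # assumed default, taken from the commit log note
) -> bool:
    """Hypothetical sketch: does the observed success rate differ from the
    expected one by more than sampling noise would explain?"""
    if sample_size <= 0:
        raise ValueError("sample_size must be positive")
    observed = (sample_size - failure_count) / sample_size
    # Standard error under the null hypothesis (true rate == expected rate).
    standard_error = sqrt(
        expected_success_rate * (1 - expected_success_rate) / sample_size
    )
    if standard_error == 0:
        # Expected rate of exactly 0 or 1: any deviation counts as significant.
        return observed != expected_success_rate
    # Two-sided critical value, about 1.645 for 90% confidence.
    z_critical = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    z = abs(observed - expected_success_rate) / standard_error
    return z > z_critical
```

Under these assumptions, 0 failures out of 3 against an expected rate of 0.7 is not significant (z is about 1.13), while 0 failures out of 10 is (z is about 2.07).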

Test function updates:

examples/team_recommender/tests/test_helpers.py: Updated test_next_success_rate to use pytest.approx so that floating-point results are compared within a tolerance rather than for exact equality.
examples/team_recommender/tests/test_helpers.py: Added parameterized tests for test_next_sample_size_with_1_failure and test_next_no_failure_sample_size_via_loop to cover more cases.
examples/team_recommender/tests/test_helpers.py: Added a new test test_example_on_wiki to validate statistical significance and sample size calculations.
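The two testing techniques named above, pytest.approx and parameterized tests, can be sketched together as follows. The helper observed_success_rate and its test cases are hypothetical stand-ins chosen for illustration; they are not the functions or data from test_helpers.py.

```python
import pytest


def observed_success_rate(failure_count: int, sample_size: int) -> float:
    """Hypothetical helper: fraction of runs that succeeded."""
    return (sample_size - failure_count) / sample_size


# Each tuple becomes its own test case with its own pass/fail report.
@pytest.mark.parametrize(
    "failure_count, sample_size, expected",
    [
        (0, 10, 1.0),
        (1, 3, 0.6667),  # 2/3 rounded to four places
        (3, 10, 0.7),
    ],
)
def test_observed_success_rate(failure_count, sample_size, expected):
    # approx compares within a tolerance, so a rounded expectation still
    # matches the floating-point result.
    assert observed_success_rate(failure_count, sample_size) == pytest.approx(
        expected, abs=1e-4
    )
```

Exact equality is fragile for floats: 0.1 + 0.2 == 0.3 is False in Python, while 0.1 + 0.2 == pytest.approx(0.3) is True.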

File removals and dependency updates:

examples/team_recommender/tests/test_proportions_ztest.py: Removed the entire file as it is no longer needed.
pyproject.toml: Removed the statsmodels dependency, which was used in the deleted test file.

Signed-off-by: Paul Zabelin <paulzabelin@artium.ai>

Copilot

Pull Request Overview

A refactoring to correct statistical calculations in calculate_ztest and improve test clarity for sample size functions.

Corrected the parameters passed to proportions_ztest in calculate_ztest.
Renamed and introduced new sample size functions to handle both one failure and no failure cases with accompanying tests.
Improved error messages and added a test case based on a wiki example.
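For the no-failure case, one way to reason about sample size is to ask for the smallest number of consecutive successes that a z-test would call significant against an expected success rate p. With zero failures the statistic simplifies to z = sqrt(n(1 - p)/p), so the requirement z > z_crit gives n > z_crit^2 * p / (1 - p). The sketch below computes this in closed form and cross-checks it with a loop. Both function names are hypothetical, and the z-test model at 90% confidence is an assumption; the PR's next_sample_size_no_failure may be defined differently.

```python
from math import floor, sqrt
from statistics import NormalDist

# Two-sided critical value for 90% confidence (about 1.645).
Z_CRITICAL_90 = NormalDist().inv_cdf(0.95)


def min_sample_size_no_failure_closed_form(expected_success_rate: float) -> int:
    """Smallest n with n > z_crit^2 * p / (1 - p); requires 0 < p < 1."""
    p = expected_success_rate
    return floor(Z_CRITICAL_90**2 * p / (1 - p)) + 1


def min_sample_size_no_failure_via_loop(
    expected_success_rate: float, limit: int = 100_000
) -> int:
    """Same quantity found by trying n = 1, 2, ... until z exceeds z_crit."""
    p = expected_success_rate
    for n in range(1, limit + 1):
        # Observed rate is 1.0 when there are no failures.
        z = (1 - p) / sqrt(p * (1 - p) / n)
        if z > Z_CRITICAL_90:
            return n
    raise ValueError("no sample size up to limit reaches significance")
```

Under these assumptions, an expected rate of 0.7 needs 7 failure-free runs and an expected rate of 0.97 needs 88.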

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

File	Description
examples/team_recommender/tests/test_helpers.py	Refactored sample size helper functions and updated tests with consistent naming.
examples/team_recommender/tests/test_proportions_ztest.py	Corrected calculate_ztest parameters and updated significance tests with a wiki example.

Comments suppressed due to low confidence (3)

examples/team_recommender/tests/test_proportions_ztest.py:126

The error message does not match the test parameters; it refers to '0 out of 3' while the sample size is 1000. Please update the message to accurately reflect the test scenario.

assert is_statistically_significant(0.7, 0, 1000), "not significant result for 0 out of 3"

examples/team_recommender/tests/test_proportions_ztest.py:131

The assertion expects a statistically significant result, but the error message suggests otherwise. Please align the error message with the expected outcome.

assert is_statistically_significant(0.7, 0, 10), "no improvement detected at 10"

examples/team_recommender/tests/test_proportions_ztest.py:139

The error message implies a lack of improvement despite the assertion requiring statistical significance. Please update the message to match the intended check.

assert is_statistically_significant(0.97, 0, 100), "no improvement detected at 100"
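All three suppressed comments describe the same problem: a hand-written assertion message that no longer matches the arguments or the intent of the check. One possible remedy, sketched here with a hypothetical helper named assert_significant, is to build the message from the same arguments that are passed to the check so the two cannot drift apart. The check is passed in as a callable so the sketch does not depend on any particular implementation.

```python
from typing import Callable


def assert_significant(
    check: Callable[[float, int, int], bool],
    expected_rate: float,
    failures: int,
    sample_size: int,
) -> None:
    """Hypothetical helper: assert significance with a message derived
    from the actual arguments, shown only when the assertion fails."""
    assert check(expected_rate, failures, sample_size), (
        f"expected a significant result for {failures} failures "
        f"out of {sample_size} against an expected success rate of {expected_rate}"
    )
```

A call such as assert_significant(is_statistically_significant, 0.7, 0, 1000) would then report "0 failures out of 1000" on failure instead of a stale "0 out of 3".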

carl · 2025-03-25T19:55:41Z

🐻 approved

paulz marked this pull request as ready for review March 25, 2025 18:47

carl added 3 commits March 25, 2025 11:47

test: example from wiki

8206d38

Signed-off-by: Paul Zabelin <paulzabelin@artium.ai>

test: add assertion for next_rate against upper bound

736e20c

Signed-off-by: Paul Zabelin <paulzabelin@artium.ai>

fix: add next_sample_size_no_failure to calculate wiki page example

879b9e8

Signed-off-by: Paul Zabelin <paulzabelin@artium.ai>

paulz force-pushed the test-examples-on-wiki branch from d66d649 to 879b9e8 Compare March 25, 2025 18:47

tkersey requested a review from Copilot March 25, 2025 18:48

Copilot AI reviewed Mar 25, 2025

carl added 5 commits March 25, 2025 12:05

test: refactor next_sample_size tests to use parameterization

da6558e

Signed-off-by: Paul Zabelin <paulzabelin@artium.ai>

fix: update next_sample_size_with_1_failure function and test data

d35dba8

Signed-off-by: Paul Zabelin <paulzabelin@artium.ai>

feat: add statistical significance functions and corresponding tests

2c84e53

test_no_failures_always_cause_insignificance

Signed-off-by: Paul Zabelin <paulzabelin@artium.ai>

refactor: move test to helpers

dd6b2af

Signed-off-by: Paul Zabelin <paulzabelin@artium.ai>

cleanup: remove statsmodels

9311248

as they generate stats only for 95% confidence and we use 90%

Signed-off-by: Paul Zabelin <paulzabelin@artium.ai>

paulz merged commit 2329c15 into thisisartium:main Mar 25, 2025
1 check passed

paulz merged 8 commits into thisisartium:main from paulz:test-examples-on-wiki
