Add API v2 by micahwiesner67 · Pull Request #385 · CDCgov/cfa-epinow2-pipeline

micahwiesner67 · 2026-02-23T16:07:51Z

@natemcintosh re-opening that PR here.

Notes:

Adding parameter for facility_active_proportion with a default of 0.94
Exposing api parameters in the makefile and passing them into azure/generate_config.py for simplicity (user can now simply run 'make config DATA_API=v2' and this will route the input data container as nssp-etl-api-v2 (default is still set to API v1 - nssp-etl)
Adding unit tests to test the read_data() with facility_active_proportion passed or defaulted for both API v1 and v2
The current stan-dev install from github stan-dev/cmdstanr fails out. It seems the latest dev updates don't play well with our dependencies. Pinning to the 0.9.0 commit hash (da99e2b) appears to work
test with a command like this make test-batch JOB=test_name DATA_API=v2
Added documentation in the read_data function to highlight how facility_active_proportion is used only in the API v2 DATA

Some additional changes for continuity:

To run 'make run' for local testing it was necessary to update the test.json config in rt-epinow2-config with the low_case_count_thresholds field
"low_case_count_thresholds": {
"COVID-19": 10,
"Influenza": 10,
"RSV": 5
},

The initial version of this test file had only some of the reference dates of the whole time span

More to come

for more information, see https://pre-commit.ci

Need to do a bit more testing I think

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

These changes should only be merged into main at the same time as the config generator library gets these changse merged

natemcintosh · 2026-02-23T18:59:37Z

Yea, I think something lower than 1.0 would be good. I don't remember exactly what worked well in the runs when I did it. @kgostic do you remember if there was a particular threshold you liked?

micahwiesner67 · 2026-02-24T14:13:35Z

To start, I resolved the merge conflicts. In the 'main' branch we have 0.94 as the proportion for active facilities threshold so that's what I set to here.

github-actions · 2026-02-24T14:15:08Z

Thank you for your contribution @micahwiesner67 🚀! Your pkgdown-site is ready for download 👉 here 👈!
_{(The artifact expires on 2026-04-15T15:13:37Z. You can re-generate it by re-running the workflow here.)}

zsusswein

I'm still not following around testing. Added some more detail in the relevant comment chain.

Co-authored-by: Zachary Susswein <46581799+zsusswein@users.noreply.github.com>

for more information, see https://pre-commit.ci

micahwiesner67 · 2026-04-07T16:35:45Z

@zsusswein
So the facility_active_proportion field is not used in the read_data() function when we are pulling data from API v1. The read_data() function will look for the any_visits_this_day column in the data - if it exists it will determine that data is from API v2 and proceed as such (filtering out based on the facility_active_proportion threshold of 0.94 which we set).

In regard to the 4 tests you mentioned.

Is still being tested in some test that references the sample_config_with_exclusion.json file ("Pipeline run produces expected outputs and returns success"
- Maybe the confusion is in the naming of the CA_*.parquet files? CA_test.parquet is still a file that has API v1 data. Would it make more sense if it was explicitly named so (CA_apiv1_test.parquet) to match the naming schema of CA_apiv2_test.parquet?

zsusswein · 2026-04-07T16:45:32Z

@zsusswein So the facility_active_proportion field is not used in the read_data() function when we are pulling data from API v1. The read_data() function will look for the any_visits_this_day column in the data - if it exists it will determine that data is from API v2 and proceed as such (filtering out based on the facility_active_proportion threshold of 0.94 which we set).

That explanation makes sense. My request here is add tests for the expected behavior, especially for the key production use-cases.

Is still being tested in some test ...

And the above is a little concerning because we have a change to the core API here. I believe you when you tell me that this runs in both cases and there's a test somewhere, but I think we need a test to cover explicitly this behavior in each of its permutations.

micahwiesner67 · 2026-04-08T14:41:26Z

Documenting our conversation yesterday for thoroughness. Zach raised concerns over testing in these scenarios. The main distinction to make is that if the facility_active_proportion is not passed through the config that is totally fine as there is a default value (0.94) set.

Do not pass facility_active_proportion in the config and run on API v1: this is the existing behavior on main --> This is the branch behavior prior to this PR, correct.
Do pass facility_active_proportion in the config and run on API v1: this is my understanding of the "production" behavior on this branch --> This is correct that this will be "production" behavior. Essentially, facility active proportion is passed as default (0.94) but ignored as it's not used for API v1.
Do not pass facility_active_proportion in the config and run on API v2 --> This is not correct, this will not error out as read_data has a default facility_active_proportion of 0.94 set.
Do pass facility_active_proportion in the config and run on API v2: my understanding is that this is the API v2 backtest behavior

In our meeting yesterday we agreed it wasn't necessary to test the handling of the config as long as we had a respective case in read_data. As there is a default for facility_active_proportion this field is essentially always passed in. To cover these cases, I've added some tests in the tests/testthat/test-read_data.R script.

incoming
"Replace COVID-19/Omicron with COVID-19, US (API v1)"
"API v2 with COIVD-19, US (default facility_active_proportion)"
"facility_active_proportion affects counts (API v2)"

natemcintosh and others added 18 commits June 17, 2025 15:28

start of reading either API

b71e2d7

add news entry

091b21c

start of new tests

f815018

add new api test file

7b593d1

needed to make sure every reference date existed

1df7505

The initial version of this test file had only some of the reference dates of the whole time span

Merge branch 'main' into nam-api-v2-prep

19648a3

Merge branch 'main' into nam-api-v2-prep

9c49ad7

first working version of reader for api v2

de138bf

More to come

[pre-commit.ci] auto fixes from pre-commit.com hooks

36c3359

for more information, see https://pre-commit.ci

set up proportion active argument.

4e27878

Need to do a bit more testing I think

Air formatter

ebc05ce

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

fix test issues

d1f8397

attempting fixes suggested on the internet

0b4da22

love messing up my less than, equal, greater than

645c69b

changes to allow use of the new field

114462b

These changes should only be merged into main at the same time as the config generator library gets these changse merged

Merge branch 'main' into nam-api-v2-prep

e7a1b20

Merge branch 'main' into nam-api-v2-prep

ae1ce90

forgot to add facility active field to metadata

487be6d

Merge branch 'main' into dev-nam_api_v2_prep

39dab0f

micahwiesner67 changed the title ~~Dev nam api v2 prep~~ Add API v2 Feb 24, 2026

micahwiesner67 added 4 commits February 24, 2026 14:52

roxygenize + tagging cmdstanr release version

69848a4

removing rogue comma in list

c36abb6

adding facility active proportion to test json

16aaa81

removing big exclusions test as this test parquet does not exist

40b4337

micahwiesner67 temporarily deployed to production February 24, 2026 17:23 — with GitHub Actions Inactive

micahwiesner67 temporarily deployed to production February 24, 2026 17:25 — with GitHub Actions Inactive

micahwiesner67 requested a review from natemcintosh February 24, 2026 17:31

zsusswein reviewed Apr 7, 2026

View reviewed changes

Comment thread R/config.R Outdated

Comment thread R/read_data.R Outdated

Comment thread R/read_data.R Outdated

Comment thread tests/testthat/data/README.md Outdated

micahwiesner67 and others added 2 commits April 7, 2026 11:51

Apply suggestions from code review

bbb45b9

Co-authored-by: Zachary Susswein <46581799+zsusswein@users.noreply.github.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

3e3a143

for more information, see https://pre-commit.ci

pre-commit-ci Bot temporarily deployed to production April 7, 2026 15:55 Inactive

pre-commit-ci Bot temporarily deployed to production April 7, 2026 15:56 Inactive

micahwiesner67 commented Apr 7, 2026

View reviewed changes

Comment thread R/read_data.R

updating documentation in read_data function

4466890

micahwiesner67 temporarily deployed to production April 7, 2026 19:17 — with GitHub Actions Inactive

micahwiesner67 temporarily deployed to production April 7, 2026 19:18 — with GitHub Actions Inactive

adding tests for facility active proportion

088557c

micahwiesner67 temporarily deployed to production April 8, 2026 14:39 — with GitHub Actions Inactive

micahwiesner67 temporarily deployed to production April 8, 2026 14:41 — with GitHub Actions Inactive

roxygenizeing

ebc6a65

micahwiesner67 temporarily deployed to production April 8, 2026 14:45 — with GitHub Actions Inactive

micahwiesner67 temporarily deployed to production April 8, 2026 14:47 — with GitHub Actions Inactive

micahwiesner67 requested a review from zsusswein April 8, 2026 14:51

zsusswein approved these changes Apr 8, 2026

View reviewed changes

Comment thread R/read_data.R Outdated

Comment thread R/read_data.R Outdated

making DATA lowercase

fa93197

micahwiesner67 had a problem deploying to production April 8, 2026 15:14 — with GitHub Actions Error

micahwiesner67 had a problem deploying to production April 8, 2026 15:45 — with GitHub Actions Error

natemcintosh approved these changes Apr 8, 2026

View reviewed changes

micahwiesner67 temporarily deployed to production April 8, 2026 16:11 — with GitHub Actions Inactive

micahwiesner67 temporarily deployed to production April 8, 2026 16:13 — with GitHub Actions Inactive

micahwiesner67 merged commit 924733e into main Apr 8, 2026
23 of 27 checks passed

micahwiesner67 deleted the dev-nam_api_v2_prep branch April 8, 2026 16:16

micahwiesner67 temporarily deployed to production April 8, 2026 16:16 — with GitHub Actions Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add API v2#385

Add API v2#385
micahwiesner67 merged 45 commits intomainfrom
dev-nam_api_v2_prep

micahwiesner67 commented Feb 23, 2026 •

edited

Loading

Uh oh!

natemcintosh commented Feb 23, 2026

Uh oh!

micahwiesner67 commented Feb 24, 2026

Uh oh!

github-actions Bot commented Feb 24, 2026 •

edited

Loading

Uh oh!

zsusswein left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

micahwiesner67 commented Apr 7, 2026

Uh oh!

zsusswein commented Apr 7, 2026

Uh oh!

micahwiesner67 commented Apr 8, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

micahwiesner67 commented Feb 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

natemcintosh commented Feb 23, 2026

Uh oh!

micahwiesner67 commented Feb 24, 2026

Uh oh!

github-actions Bot commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zsusswein left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

micahwiesner67 commented Apr 7, 2026

Uh oh!

zsusswein commented Apr 7, 2026

Uh oh!

micahwiesner67 commented Apr 8, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

micahwiesner67 commented Feb 23, 2026 •

edited

Loading

github-actions Bot commented Feb 24, 2026 •

edited

Loading

zsusswein left a comment •

edited

Loading