-
Notifications
You must be signed in to change notification settings - Fork 16
Add stop time update likely arrivals + initial metrics by trip-stop grain #3917
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Warehouse report 📦 Checks/potential follow-upsChecks indicate the following action items may be necessary.
New models 🌱calitp_warehouse.mart.gtfs.fct_stop_time_arrivals calitp_warehouse.mart.gtfs.fct_stop_time_updates_metrics calitp_warehouse.mart.gtfs.fct_stop_time_updates_with_arrivals Changed incremental models 🔀calitp_warehouse.intermediate.gtfs.int_gtfs_rt__trip_updates_trip_day_map_grouping DAGLegend (in order of precedence)
|
Noting that the DAG diagram shows....NTD tables too? It should be a fairly limited diagram with |
@vevetron: Changed this to a view based on what we discussed so I think it's ready to be merged? |
9d5128a
to
78e2f5f
Compare
78e2f5f
to
f960149
Compare
7ef1f44
to
b989297
Compare
Why are there partition by configs if there is no materialized table? Are they necessary? |
I think the 2 I wanted to turn incremental were Noting for now - can't rerun dbt because this error comes up:
|
263451a
to
2b50c64
Compare
Description
3 new tables:
Add back the table (renamed to
mart_gtfs.fct_stop_time_arrivals
), last created inmart_adhoc
for RT trip updates / likely stop arrival. Implement the sql in Feature: Calculate each trip's likely arrival at each stop #2684.fct_stop_time_updates_with_arrivals
: add an intermediate step wherefct_stop_time_updates
is merged with thefct_stop_time_arrivals
fct_stop_time_update_metrics
: start adding the minute binned metrics (trip update completeness) and summarizing across all the minutes of predictions within 30 min of the actual arrival.header/vehicle_timestamp
+ explore whether some deduping is neededType of change
mart_gtfs.fct_stop_time_arrivals
, which usesfct_stop_time_updates
andfct_trip_updates_summaries
How has this been tested?
Added
trip_instance_key
to the column, but found that I had to switch theservice_date
to be a date that was available infct_trip_updates_summaries
(which might be run weekly as an incremental table). Picked one date in common with the lookback period for RT tables and what has been materialized infct_trip_updates_summaries
to see iftrip_instance_key
is added correctly.This first table just ran the existing sql, updating only column names to reflect what we have. Noting here that because it's a view, whether I query on one operator's base64_url or all, a single service_date will touch 1+ TB!
Decision points pre-merge
fct_trip_update_summaries
first to see what desired metrics we want, before we formally make it something that's run weekly? --> decided on making it a viewPost-merge follow-ups
Document any actions that must be taken post-merge to deploy or otherwise implement the changes in this PR (for example, running a full refresh of some incremental model in dbt). If these actions will take more than a few hours after the merge or if they will be completed by someone other than the PR author, please create a dedicated follow-up issue and link it here to track resolution.