🌉 Bridge Tables

1) Concept Explanation

Bridge tables resolve many-to-many relationships in dimensional models.
Without bridge tables, many-to-many joins can double count metrics.

Interview framing:

Fact and dimensions are usually many-to-one joins
When a relationship is many-to-many (e.g., customer↔account, title↔genre), use bridge table
Sometimes include allocation weights to distribute measures correctly

2) Text-Based Diagrams

2.1 Customer to Account (many-to-many)

dim_customer      bridge_customer_account       dim_account
------------      -----------------------       -----------
customer_key  <-> customer_key            <->   account_key
                account_key
                relationship_type
                effective_date
                end_date
                allocation_pct (optional)

2.2 Fact join pattern

fact_transaction
----------------
transaction_id
account_key
amount
date_key

To analyze by customer:
fact_transaction -> dim_account -> bridge_customer_account -> dim_customer

2.3 Genre bridge (Netflix-like)

dim_title <-- bridge_title_genre --> dim_genre

3) Real-World Use Case

Banking/fintech customer-account ownership

One account can have multiple holders; one customer can have multiple accounts.
Bridge preserves ownership relationships and enables customer-level reporting.

Netflix title categorization

A title can belong to multiple genres/subgenres. Bridge avoids storing repeated genre arrays in fact tables.

Uber enterprise rides

Corporate cost centers can map to multiple departments with weighted allocation.

4) When to Use / When NOT to Use

Use when

Genuine many-to-many relationship exists
Need analytically correct aggregation across both sides
Need temporal ownership history in relationships

Avoid when

Relationship is actually one-to-many (simpler FK works)
Bridge is used to patch upstream data quality issues
Team cannot maintain weighting and SCD logic correctly

5) Advantages & Disadvantages

Advantages

Correct modeling of many-to-many
Prevents schema hacks and repeated denormalized arrays
Supports weighted attribution

Disadvantages

More complex joins
Risk of double counting if not weighted
Harder for analysts without semantic layer guidance

6) Common Mistakes

No allocation logic for shared ownership
Counting fact amount fully for each bridge match (inflation)
Missing effective dates in bridge when relationships change
Using bridge without clear business definition
Not documenting whether bridge is exclusive/non-exclusive

7) Performance Considerations

Keep bridge narrow and indexed on both keys
Add effective date filters for temporal joins
Precompute customer-attributed facts for hot dashboards
Use allocation_pct with clear default rules
Validate bridge cardinality drift regularly

8) 🔥 Interview Questions

Conceptual

What problem does a bridge table solve?
Difference between bridge table and factless fact table?
Why can bridge tables cause double counting?

Scenario-based

One account has 2 owners and $100 transaction. How do you report customer revenue?
Relationship changes over time. How do you model historical ownership?
Dashboard totals exceed source by 40% after adding bridge joins. Debug steps?

Product-based

Design customer-account bridge for Amazon co-branded credit products.
Design Netflix title-genre bridge with evolving taxonomy.
Design Uber enterprise cost allocation bridge with weighted splits.

Follow-ups

When do you pre-aggregate instead of joining bridge at query time?
How do you test allocation correctness?
Could this be solved in semantic layer only?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🌉 Bridge Tables

1) Concept Explanation

2) Text-Based Diagrams

2.1 Customer to Account (many-to-many)

2.2 Fact join pattern

2.3 Genre bridge (Netflix-like)

3) Real-World Use Case

Banking/fintech customer-account ownership

Netflix title categorization

Uber enterprise rides

4) When to Use / When NOT to Use

Use when

Avoid when

5) Advantages & Disadvantages

Advantages

Disadvantages

6) Common Mistakes

7) Performance Considerations

8) 🔥 Interview Questions

Conceptual

Scenario-based

Product-based

Follow-ups

FilesExpand file tree

bridge_tables.md

Latest commit

History

bridge_tables.md

File metadata and controls

🌉 Bridge Tables

1) Concept Explanation

2) Text-Based Diagrams

2.1 Customer to Account (many-to-many)

2.2 Fact join pattern

2.3 Genre bridge (Netflix-like)

3) Real-World Use Case

Banking/fintech customer-account ownership

Netflix title categorization

Uber enterprise rides

4) When to Use / When NOT to Use

Use when

Avoid when

5) Advantages & Disadvantages

Advantages

Disadvantages

6) Common Mistakes

7) Performance Considerations

8) 🔥 Interview Questions

Conceptual

Scenario-based

Product-based

Follow-ups