For each table:
And for each column in the table:
- How many unique values?
- 3 most common values (truncated), to help it understand things like whether a state is stored as "CA" vs. "California"
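
A rough sketch of the per-column stats above, assuming the data lives in SQLite (the notes don't say which database). The names `profile_database`, `quote_ident`, and `MAX_VALUE_LEN`, and the 40-character truncation, are made up for illustration. NULLs are left out of both the unique count and the common values.

```python
import sqlite3

MAX_VALUE_LEN = 40  # assumed truncation length for sample values


def quote_ident(name: str) -> str:
    """Quote an SQLite identifier so unusual table/column names are safe."""
    return '"' + name.replace('"', '""') + '"'


def profile_database(conn: sqlite3.Connection, top_n: int = 3) -> dict:
    """Return {table: {column: {"unique_count": int, "top_values": [str]}}}."""
    tables = [row[0] for row in conn.execute(
        "SELECT name FROM sqlite_master "
        "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' ORDER BY name"
    )]
    profile = {}
    for table in tables:
        tbl = quote_ident(table)
        # PRAGMA table_info rows are (cid, name, type, notnull, dflt_value, pk).
        columns = [row[1] for row in conn.execute(f"PRAGMA table_info({tbl})")]
        profile[table] = {}
        for column in columns:
            col = quote_ident(column)
            unique_count = conn.execute(
                f"SELECT COUNT(DISTINCT {col}) FROM {tbl}"
            ).fetchone()[0]
            # Most frequent first; ties are broken by value so output is stable.
            top_rows = conn.execute(
                f"SELECT {col}, COUNT(*) AS n FROM {tbl} "
                f"WHERE {col} IS NOT NULL "
                f"GROUP BY {col} ORDER BY n DESC, {col} LIMIT ?",
                (top_n,),
            ).fetchall()
            profile[table][column] = {
                "unique_count": unique_count,
                "top_values": [str(value)[:MAX_VALUE_LEN] for value, _ in top_rows],
            }
    return profile


# Example: a tiny in-memory table with mixed state spellings.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (name TEXT, state TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?)",
    [("Ann", "CA"), ("Bob", "CA"), ("Cy", "California"), ("Di", "NY")],
)
print(profile_database(conn)["customers"]["state"])
```

This runs two queries per column, which is why wide tables get slow.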
I should cache this, as it may be expensive to calculate, especially for wide tables.
YAML is a good format here.
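
A possible shape for the cache, as a sketch only. It uses just the standard library and writes the YAML by hand, relying on JSON-style quoted strings and `[...]` lists also being valid YAML. The names `profile_to_yaml` and `cached_profile_yaml`, the file layout, and the "reuse the file if it exists" rule are all assumptions; there is no invalidation when the data changes. Reading the file back into Python would need a YAML parser such as PyYAML.

```python
import json
from pathlib import Path


def profile_to_yaml(profile: dict) -> str:
    """Render {table: {column: stats}} as a small YAML document."""
    lines = []
    for table, columns in profile.items():
        # json.dumps yields double-quoted strings, which YAML also accepts.
        lines.append(f"{json.dumps(table, ensure_ascii=False)}:")
        for column, stats in columns.items():
            lines.append(f"  {json.dumps(column, ensure_ascii=False)}:")
            lines.append(f"    unique_count: {int(stats['unique_count'])}")
            top_values = json.dumps(stats["top_values"], ensure_ascii=False)
            lines.append(f"    top_values: {top_values}")
    return "\n".join(lines) + "\n"


def cached_profile_yaml(cache_path: Path, compute_profile) -> str:
    """Return the cached YAML text if present; otherwise compute and store it."""
    if cache_path.exists():
        return cache_path.read_text(encoding="utf-8")
    text = profile_to_yaml(compute_profile())
    cache_path.write_text(text, encoding="utf-8")
    return text


# Example with a hand-written profile for one column.
example = {
    "customers": {
        "state": {"unique_count": 3, "top_values": ["CA", "California", "NY"]},
    }
}
print(profile_to_yaml(example), end="")
```

Passing the expensive step in as `compute_profile` means it only runs on a cache miss.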