Using percentages instead of counts to compare distribution of two tables #834

Open
@borisRa

Hi,

How can I compare the train and test distributions?
I am using this code:
plot_diff([train_df[train_df.columns[~train_df.columns.isin(['Survived'])]], test_df], config={"diff.label": ["train_df", "test_df"]})

This gives me raw counts, but I would like to compare percentages instead,
similar to this plot of the Age distribution:
image
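
To make the request concrete, here is a minimal sketch in plain Python of the normalization meant here. It is not dataprep's API: each bin count is divided by the number of values in its own table, so tables of different sizes become comparable. The function name `percent_histogram`, the bin edges, and the sample ages are all hypothetical and only serve as an illustration.

```python
from bisect import bisect_right


def percent_histogram(values, edges):
    """Return the percentage of in-range, non-missing values in each bin.

    Bins are [edges[i], edges[i+1]), except the last bin, which also
    includes its right edge. Missing values and values outside the
    edges are skipped and do not count towards the total.
    """
    counts = [0] * (len(edges) - 1)
    kept = 0
    for v in values:
        if v is None or v != v:  # skip None and NaN
            continue
        if v < edges[0] or v > edges[-1]:
            continue
        # Index of the bin containing v; clamp so the right edge lands in the last bin.
        i = min(bisect_right(edges, v) - 1, len(counts) - 1)
        counts[i] += 1
        kept += 1
    if kept == 0:
        return [0.0] * len(counts)
    return [100.0 * c / kept for c in counts]


# Hypothetical Age samples of different sizes (made-up numbers, not real data).
train_age = [22, 38, 26, 35, 35, 54, 2, 27]
test_age = [34, 47, 62, 27]
edges = [0, 20, 40, 60, 80]

train_pct = percent_histogram(train_age, edges)
test_pct = percent_histogram(test_age, edges)

for lo, hi, tr, te in zip(edges, edges[1:], train_pct, test_pct):
    print(f"{lo:>2}-{hi:<2}  train {tr:5.1f}%  test {te:5.1f}%")
```

With counts, the larger table always looks taller in every bin; with percentages, each table sums to 100, so only the shape of the distribution is compared.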

Thanks!
Boris
