Skip to content

Simpler interface for computing distances #40

@bobot

Description

@bobot

The correlation function in polars has a simple interface. It takes two polars expressions as arguments. It allows to compute those column dynamically.

On the other hand polars-distance seems to really only compute the distance on arrays. On list for example it is turning it into a set before computing something. It makes impossible to filter the datas, or grouping them before computing the distance (since arrays have a fixed length and can't be computed dynamically).

For example, computing the correlation matrix of the time taken by different algorithms is easy:

df = cross_results.group_by("algo1","algo2").agg(corr=pl.corr("cpuTime1","cpuTime2", method="pearson"))

I don't see how to do it using distances defined by polars-distance.

Thanks for this library it can be a great addition to polars.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions