Skip to content

Conversation

e-strauss
Copy link
Contributor

[SYSTEMDS-3740] Python API Builtin countDistinct

This patch adds the builtin operator for countDistinct to the python api.

elif axis is None:
return Scalar(self.sds_context, 'countDistinct', [self])
raise ValueError(
f"Axis has to be either 0, 1 or None, for column, row or complete {self.operation}")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This PR is missing a test case for the error.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Baunsgaard added, I also removed the print statements and squashed + rebased

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Baunsgaard the older test cases for the aggregation like min, max and co, it makes sense to add test cases there as well - or are there some in place which I havent found?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

then we need to add it for them as well,
unfortunately we do not see these things because we do not have code coverage for the Python API,
It would be cool to have that as well.

@e-strauss e-strauss force-pushed the MISSING_BUILTIN_COUNT_DISTINCT branch from 73f4b29 to 2f63100 Compare September 3, 2024 10:53
@Baunsgaard Baunsgaard closed this in e0a62ff Sep 3, 2024
@Baunsgaard
Copy link
Contributor

LGTM, merged

@e-strauss e-strauss deleted the MISSING_BUILTIN_COUNT_DISTINCT branch September 4, 2024 09:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

2 participants