Skip to content
This repository was archived by the owner on Apr 22, 2020. It is now read-only.

Enable to use only topK param with similarity algos#784

Open
tomasonjo wants to merge 1 commit into
neo4j-contrib:3.4from
tomasonjo:topk_similarity
Open

Enable to use only topK param with similarity algos#784
tomasonjo wants to merge 1 commit into
neo4j-contrib:3.4from
tomasonjo:topk_similarity

Conversation

@tomasonjo

@tomasonjo tomasonjo commented Jan 8, 2019

Copy link
Copy Markdown
Collaborator

This is an option how to allow users to use only topK parameter as for example:

CALL algo.similarity.cosine.stream(data, {topK:1})

Idea came from #729 and #730 and this PR is a nice and clean solution,
which I think improves user experience and causes less confusion with the user. On the other hand, there are cases where dissimilar relationships would get written back because we don't limit it anymore when providing topK param. This should be more of a problem with smaller datasets and with large datasets this won't usually happen in my experience.

What are your thoughts @mneedham @jexp?

Will add the code to all similarity algos if this is ok.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant