Add relationship_type by straeter · Pull Request #123 · futuresearch/futuresearch-python

straeter · 2026-02-10T10:14:27Z

add one_on_one parameter for merge
add / uncomment some system files to gitignore

src/everyrow/ops.py

jackwildman

I think this needs better explained to the user, and the bug sentry spotted also looks genuine. Other than that, good to go.

src/everyrow/ops.py

CallumMcMahon

does this need to be a bool? what about "1:1"/"m:1"/"1:m" or enums, to handle the other options?

straeter · 2026-02-11T08:03:58Z

does this need to be a bool? what about "1:1"/"m:1"/"1:m" or enums, to handle the other options?

ok good point. At the moment we can only have m:1 and 1:1 but in the future we might have 1:m and m:m. I will instead introduce a parameter "relationship_type" this an enum now and default to m:1

straeter · 2026-02-11T10:38:17Z

@jackwildman is it now clearer with the relationship_type enum?

jackwildman

Looks good, and definitely more obvious now. I'm not the keenest on relationship_type, but I'm happy to go with whatever you ultimately go with

jackwildman · 2026-02-11T11:36:24Z

src/everyrow/generated/models/merge_operation.py

            right_key=right_key,
            use_web_search=use_web_search,
-            one_on_one=one_on_one,
+            relationship_type=relationship_type,


Nittiest of nits: relationship_type is kind of vague, and arguably a bit inaccurate. cardinality is maybe more accurate (or at least where terms like "many-to-many" often come in), but at the expense of not being immediately obvious to anyone who doesn't live in a database. We already have a strategy field on dedupe, so strategy here might be a good choice. If nothing else, it would add some harmony between the operations.

I was thinking about cardinality but for me this is a very mathematical term that most people have never heard of, whereas relationship should also be more familiar for every person that has worked with SQL. I think strategy would not be a good choice here, it would rather refer / understood as the strategy how to perform the merge like "first try fuzzy, then web agents". What we want to describe is really an existing relationship between the data / rows and our algorithm then figures out the best way + strategy to cope with that relationship

update: I just realized that cardinality has two very different meanings in mathematics and computer science

what I am a bit confused about is that you call relationship_type "inaccurate" -> is it not just a synonym of cardinality? https://www.geeksforgeeks.org/dbms/types-of-relationship-in-database/

Yeah, I think cardinality also suffers the same inaccuracy now that I think about it more. I think, fundamentally, it's about whether we call this a "relationship", as we're not really establishing a relationship but more operating in a manner where x on one side can match and merge with y on the other side, so it is more like a mode or principle of operation than a relationship.

Saying this, I think basically any term can be nitpicked for this, so probably best just to pick relationship_type and move on with it. The key thing is that even if the term isn't immediately obvious, it doesn't take long to figure out from reading the doc string or the enumerated values, and from there it's easy enough to understand

... and the blog articles we will soon write about it :)

update: I just realized that cardinality has two very different meanings in mathematics and computer science

Yet more confusingly, computer science often uses both definitions. I mean, I do see why one might arrive at "cardinality" for n:m relationships in a database if we consider that element x has a set of connections of cardinality m, but it's definitely a bit of an overloaded term.

straeter added 2 commits February 10, 2026 12:13

add one_on_one merge parameter

4b51530

add vscode, pycharm, macos files to gitignore

bf4667a

straeter requested review from CallumMcMahon and jackwildman February 10, 2026 10:14

sentry bot reviewed Feb 10, 2026

View reviewed changes

src/everyrow/ops.py Outdated Show resolved Hide resolved

jackwildman approved these changes Feb 10, 2026

View reviewed changes

src/everyrow/ops.py Outdated Show resolved Hide resolved

src/everyrow/ops.py Outdated Show resolved Hide resolved

CallumMcMahon reviewed Feb 10, 2026

View reviewed changes

straeter added 2 commits February 11, 2026 12:01

merge main

d16d24f

one_on_one -> relationship_type

1e18b44

straeter changed the title ~~Add one on one~~ Add relationship_type Feb 11, 2026

jackwildman approved these changes Feb 11, 2026

View reviewed changes

straeter merged commit f599f82 into main Feb 11, 2026
3 checks passed

straeter deleted the add_one_on_one branch February 11, 2026 12:45

Conversation

straeter commented Feb 10, 2026

Uh oh!

Uh oh!

jackwildman left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

CallumMcMahon left a comment

Choose a reason for hiding this comment

Uh oh!

straeter commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

straeter commented Feb 11, 2026

Uh oh!

jackwildman left a comment

Choose a reason for hiding this comment

Uh oh!

jackwildman Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

straeter Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

straeter Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

jackwildman Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

straeter Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

jackwildman Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

straeter commented Feb 11, 2026 •

edited

Loading

straeter Feb 11, 2026 •

edited

Loading