find_best_match

find_best_match

Summary

An open source Python project to consume a set of candidates (each consisting of a list of attributes with an associated score for each attribute) and match them against a comparator (a list of attributes with an associated weight). Return a list of the best match(es), as defined by the lowest difference score (a sum of the difference scores of all the attributes). Examples below may clarify this!

Collaborators should be aware, and follow this Team Agreement.

Problem Statement

There are many use cases to take a set of candidates with attributes and scores and match them against a set of attributes and weights to see which is the best match. For instance, candidates for a job, matches on a dating site etc.

But the algorithms are often hidden, proprietary and may contain bias.

This tool is open sourced and the algorythm is exposed through code, documentation etc. So consumers can see how the matching decision is made.

Initial Setup

Create virtual environment python -m venv venv <-- NOTE: The second venv ref is the directory for the virtual environment. Feel free to choose your own directory name, and change the next instruction to match!
activate virtual environment . venv/Scripts/activate
Install pip packages pip install -r requirements.txt

Note on Naming

Naming is hard :) Naming the list of candidates was easy (candidates) but naming the set of attributes, scores and weights that we are matching against was strangely difficult. I landed on the imperfect comparator. Let's just say that is the term I'm least unhappy with!

If any contributors can think of a better name please start a discussion. A universal name change will be easy and, because of the high level of automatic testing, safe.

Early Design Decisions

The request will always receive a response rather than a failure. So if required values are not sent in the request, they will be defaulted, a warning emitted and our best result sent back. The warnings show where issues with the request will affect the matching choice. Here are initial defaults. Each results in a warning in the response
- In the comparator:
  - If an attribute exists more than once, delete all but the first such attribute
  - If no perfect_score exists, or it isn't an integer, default it to 100
  - If no weight exists, or it isn't an integer between 1 and 100, default to 100
- In the candidates
  - If a candidate exists more than once, delete all but the first such candidate
  - If an attribute does not exist for a candidate, default its value to 0
  - If a candidate has extra attributes, ignore them
Do NOT choose randomly where more than one candidate has the highest score. Return the full set of top matches, and the requester can choose how to treat these results
Once published as v_1.0.0 and beyond, the API request and response contract should never change - never break code that calls it!

Sample JSON

Readme with notes on JSON samples are here
Sample request is here - request.json
Sample response is here - response.json

Bonus Documentation

Here is a handy wee directory containing light documentation I built over time

Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
.github/workflows		.github/workflows
_design		_design
_hints_docs		_hints_docs
_sandbox		_sandbox
doc		doc
sample_json		sample_json
src		src
tests		tests
.gitignore		.gitignore
CODEOWNERS		CODEOWNERS
LICENSE		LICENSE
README.md		README.md
_TeamAgreement.md		_TeamAgreement.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

find_best_match

Summary

Problem Statement

Initial Setup

Note on Naming

Early Design Decisions

Sample JSON

Bonus Documentation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

pritchardians/find-best-match

Folders and files

Latest commit

History

Repository files navigation

find_best_match

Summary

Problem Statement

Initial Setup

Note on Naming

Early Design Decisions

Sample JSON

Bonus Documentation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages