Search Relevance Tests

This script generates Elasticsearch performance/relevancy reports for configured applications.

For monitored applications, this tool should be used to detect search performance changes any time the application changes. In general, search performance is only likely to change following changes to code that builds Elasticsearch queries.

In the case of the discovery-api, this should only happen when there are updates to:

lib/resources.js, or
lib/elasticsearch/*

As a rule of thumb, if either of the above paths appears in the PR, the PR likely needs to include a candidate relevancy report.
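The rule of thumb above can be sketched as a small check over the list of paths changed in a PR. This is only an illustration and not part of the tool: the function name and the use of fnmatch are assumptions, and the two patterns are the discovery-api paths listed above.

```python
from fnmatch import fnmatch

# Patterns taken from the discovery-api guidance above.
QUERY_BUILDING_PATTERNS = ["lib/resources.js", "lib/elasticsearch/*"]


def needs_relevancy_report(changed_paths, patterns=QUERY_BUILDING_PATTERNS):
    """Return True if any changed path matches a query-building pattern.

    Hypothetical helper: fnmatch's "*" also matches "/" so files in
    subdirectories of lib/elasticsearch/ are matched as well.
    """
    return any(
        fnmatch(path, pattern)
        for path in changed_paths
        for pattern in patterns
    )
```

The list of changed paths could come from something like `git diff --name-only main...HEAD`, split into lines.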

Setup

To install all dependencies and activate a venv:

make venv
source .venv/bin/activate

Running

make venv
source .venv/bin/activate

To re-run all tests against all registered commits for a named application (for example when targets are modified):

python main.py APPLICATION test-all [--rows ROWS]

To rebuild the report for a named application using saved manifests:

python main.py APPLICATION rebuild-report

Building candidate relevancy reports for local changes

To run tests for a named, local application (for example to assess changes under development) use the test-local command. This allows you to build a candidate relevancy report based on a local app, even for code that is not yet committed, and optionally publish the resulting report.

python main.py APPLICATION test-local --appdir APPDIR --description DESC [--rows ROWS] [--publish]

For example, to build a report against local discovery-api changes and publish a sharable report:

python main.py discovery-api test-local --appdir ../discovery-api --description "my feature wip" --publish

You'll be prompted to choose a folder in which to publish the temporary report.

Include this report in your PR.

Docker:

To build a local image for local invocation:

docker build . -t search-relevance-tests --target local

You can then run arbitrary commands:

docker run -it -v $HOME/.aws/credentials:/root/.aws/credentials:ro search-relevance-tests APP COMMAND [--options...]

Sam:

To run the function in a simulated Lambda environment locally:

sam build -t sam.template.yml
sam local invoke -t .aws-sam/build/template.yaml SearchRelevanceTests -e events/...

Deployment

Deployment is handled by GitHub Actions (GHA) on merge to main. The deployment steps include:

./provisioning/push-image.sh
terraform -chdir=provisioning init
terraform -chdir=provisioning apply -var 'environment=qa'

Testing an application

To test an application's search performance and relevance, establish a directory in ./applications/ (e.g. ./applications/discovery-api).

The application folder should contain the following files:

commits.csv: A CSV, with header, that defines:

commit: The git commit hash
description: A friendly description for the significant change(s)
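As a rough illustration of the commits.csv layout described above, the sketch below parses a sample file and checks for the two columns. The header names are assumed to be exactly "commit" and "description", the sample rows are placeholders rather than real commits, and read_commits is a hypothetical helper, not the tool's own loader.

```python
import csv
import io

# Placeholder sample; the hashes and descriptions are invented for illustration.
SAMPLE_COMMITS_CSV = """commit,description
0123abc,Baseline
4567def,Boost exact title matches
"""


def read_commits(csv_file):
    """Return a list of {"commit", "description"} dicts from a file object."""
    reader = csv.DictReader(csv_file)
    missing = {"commit", "description"} - set(reader.fieldnames or [])
    if missing:
        raise ValueError(f"commits.csv is missing column(s): {sorted(missing)}")
    return [
        {"commit": row["commit"], "description": row["description"]}
        for row in reader
    ]


commits = read_commits(io.StringIO(SAMPLE_COMMITS_CSV))
```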

targets.yaml: A YAML file with multiple documents defining:

params: A hash of the query params recognized by the application (e.g. q, isbn, search_scope, etc.)
metric: The rank-eval metric (e.g. "precision", "recall")
metric_at: The rank-eval metric-at param
relevant: The set of ids constituting "good" hits
notes: An array of notes explaining the premise, origin, and expectations of the target.
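To make the target fields more concrete, here is a sketch of how one target might map onto a request body for the Elasticsearch Ranking Evaluation API (_rank_eval). The mapping is an assumption about how the fields are used (metric_at as the metric's "k", each id in relevant rated 1); build_rank_eval_body and the "target" request id are hypothetical and may differ from what the tool actually does.

```python
def build_rank_eval_body(target, es_query, index):
    """Build a _rank_eval request body for a single target.

    target:   dict with "metric", "metric_at" and "relevant" keys
    es_query: the application-generated Elasticsearch query clause
    index:    the name of the index that the relevant ids belong to
    """
    # Assumption: every id listed under "relevant" gets a rating of 1.
    ratings = [
        {"_index": index, "_id": doc_id, "rating": 1}
        for doc_id in target["relevant"]
    ]
    return {
        "requests": [
            {"id": "target", "request": {"query": es_query}, "ratings": ratings}
        ],
        # Assumption: metric_at is passed through as the metric's "k".
        "metric": {target["metric"]: {"k": target["metric_at"]}},
    }
```

For example, a target with metric "precision", metric_at 5 and two relevant ids would yield a body with two ratings and the metric {"precision": {"k": 5}}.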

initialize.sh: A BASH script that initializes the app at the specified location (e.g. using git) and installs dependencies. The script expects two arguments:

BASEDIR: The location on disk to initialize the application.
COMMIT: The git commit hash to check out

get-config.sh: A BASH script that accepts two arguments:

BASEDIR: The location of the app on disk.
OUTFILE: The location that the script should write the config to (e.g. /tmp/config.json). The outfile is expected to be a JSON file that defines:
- nodes: ES host(s)
- apiKey: ES api-key
- index: ES index
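A minimal sketch of checking that a get-config.sh outfile carries the three keys listed above. The sample values are placeholders, load_config is a hypothetical helper, and whether nodes is a string or a list is not specified here, so the check only looks for the presence of each key.

```python
import json

REQUIRED_CONFIG_KEYS = ("nodes", "apiKey", "index")

# Placeholder config; none of these values refer to a real cluster.
SAMPLE_CONFIG = '{"nodes": "https://es.example.com:9200", "apiKey": "xxxx", "index": "example-index"}'


def load_config(text):
    """Parse config JSON and fail loudly if a required key is absent."""
    config = json.loads(text)
    missing = [key for key in REQUIRED_CONFIG_KEYS if key not in config]
    if missing:
        raise ValueError(f"config is missing key(s): {missing}")
    return config


config = load_config(SAMPLE_CONFIG)
```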

get-query.sh: A BASH script that accepts three arguments:

BASEDIR: The location of the app on disk.
INFILE: The location of a JSON file on disk defining the query (e.g. /tmp/query-params.json)
OUTFILE: The location on disk that the script should write the application-generated ES query to (e.g. /tmp/es-query.json)
