Document search and retrieval with deep learning (part of ML institute programme)
- Ensure Python is installed on the system. If you need a specific version, e.g. 3.9, run:

  ```
  add-apt-repository ppa:deadsnakes/ppa
  apt-get update
  apt-get install python3.9
  ```

- Install PDM with `curl -sSL https://pdm-project.org/install-pdm.py | python3 -` (you may need to add it to your PATH afterwards to be able to run the `pdm` command; see the output of the installation script for details)
- Point PDM to a Python interpreter; for example, if you installed Python 3.9 in step 1, run `pdm use python3.9`. PDM will automatically create a virtual environment in the `.venv` folder
- Run `source .venv/bin/activate` to activate the virtual environment
- Run `pdm install` to install dependencies
- Run `pdm run load` to preprocess the dataset into a CSV (a sketch of this step follows this list)
- Run `pdm run train` to train the model in minimode
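The load script itself isn't reproduced in this README, but conceptually it boils down to reading the raw dataset and writing a flat CSV for the training step. The following is a minimal sketch of that idea only; the file paths, column handling, and use of pandas are assumptions rather than the project's actual implementation.

```python
# Minimal sketch of a "load"-style preprocessing step (illustrative only).
# The input and output paths below are assumptions.
import pandas as pd

RAW_PATH = "data/raw/documents.jsonl"    # assumed raw dataset location
OUT_PATH = "data/dataset.generated.csv"  # assumed CSV consumed by `pdm run train`


def load() -> None:
    # Read the raw records (one JSON object per line in this sketch).
    df = pd.read_json(RAW_PATH, lines=True)

    # Write the preprocessed rows to a single CSV for the training step.
    df.to_csv(OUT_PATH, index=False)


if __name__ == "__main__":
    load()
```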
- Run `./ssh.sh`, providing the IP and port when prompted, to open VS Code on the GPU machine remotely
- Follow the steps from setup to install PDM and Python on the GPU machine
- Run `FULLRUN=1 pdm run load` to preprocess the dataset into a CSV
- Run `FULLRUN=1 pdm run train` to train the model in full mode with all the data (a sketch of the `FULLRUN` toggle follows this list)
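How the `FULLRUN` flag is read isn't shown in this README; the sketch below illustrates one plausible way an environment toggle like this could switch between minimode and a full run. The row limit and epoch count are made-up values, not the project's actual configuration.

```python
# Hypothetical FULLRUN toggle: minimode uses a small slice of the data for
# quick local iteration, full mode uses everything. Values are illustrative.
import os

FULLRUN = os.environ.get("FULLRUN") == "1"

MAX_ROWS = None if FULLRUN else 1_000   # rows of the dataset to keep (assumed)
NUM_EPOCHS = 10 if FULLRUN else 1       # training epochs (assumed)

print(f"full run: {FULLRUN}, max rows: {MAX_ROWS}, epochs: {NUM_EPOCHS}")
```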
- Run `docker compose -f docker-compose.dev.yml up` to spin up the Chroma (vector database) instance
- Run `pdm run cache` to run a script that stores the encoded vectors for each document in Chroma (see the sketch after this list)
- Run `pdm run serve` to launch the web server. It should open on http://localhost:8080
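The caching script is not shown here, but the idea is to encode every preprocessed document and insert the resulting vectors into the Chroma instance started above. The sketch below uses the `chromadb` Python client against Chroma's default port; the CSV path, collection name, text column, and the stand-in embedding function are assumptions — the real script would use the project's trained document encoder and projector.

```python
# Illustrative caching step: embed each document and store it in Chroma.
# The embedding function here is a stand-in for the project's trained encoder.
import hashlib

import chromadb
import pandas as pd


def fake_embed(text: str) -> list[float]:
    # Deterministic placeholder embedding so this sketch runs on its own;
    # the real project would produce vectors with its document encoder + projector.
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    return [b / 255.0 for b in digest[:16]]


def cache_documents() -> None:
    docs = pd.read_csv("data/dataset.generated.csv")            # assumed output of `pdm run load`
    client = chromadb.HttpClient(host="localhost", port=8000)   # Chroma's default HTTP port
    collection = client.get_or_create_collection("documents")   # assumed collection name

    collection.add(
        ids=[str(i) for i in docs.index],
        documents=docs["text"].tolist(),                        # assumed text column name
        embeddings=[fake_embed(t) for t in docs["text"]],
    )


if __name__ == "__main__":
    cache_documents()
```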
By default, inference is run using model weights downloaded from wandb (see `src/util/artifacts.py`). Override these by setting environment variables; for example, to override the weights for the projector during caching you could run `DOC_PROJECTOR_WEIGHTS_PATH=data/epoch-weights/doc-weights_epoch-30.generated.pt pdm run cache`.
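The sketch below shows one way such an override could be wired up: prefer the environment variable if it is set, otherwise download the weights artifact from wandb. The artifact name, file name, and team/project identifiers are placeholders; the real resolution logic lives in `src/util/artifacts.py`.

```python
# Hypothetical weights resolution: env-var override first, wandb artifact otherwise.
# Artifact and entity/project names are placeholders, not the project's real ones.
import os

import torch
import wandb


def resolve_doc_projector_weights() -> str:
    # e.g. DOC_PROJECTOR_WEIGHTS_PATH=data/epoch-weights/doc-weights_epoch-30.generated.pt
    override = os.environ.get("DOC_PROJECTOR_WEIGHTS_PATH")
    if override:
        return override

    # Fall back to the latest weights artifact on wandb (names assumed).
    artifact = wandb.Api().artifact("my-team/doc-search/doc-projector-weights:latest")
    return os.path.join(artifact.download(), "doc-weights.pt")


state_dict = torch.load(resolve_doc_projector_weights(), map_location="cpu")
```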
- Run `pdm run build` to build the server Docker image and push it to Docker Hub
- Open `inventory.ini` and update it to reflect the IP and port of the server you want to deploy to
- Run `pdm run ansible` to run the `playbook.yml` file, which should SSH to the remote server and launch a Chroma instance and the server

Note that the server will take a long time to start up because it needs to run the document caching logic from `pdm run cache` to insert the document vectors into Chroma before it can handle requests. You can check its progress by SSH-ing to the server and running `sudo docker compose -f /root/mlx/docker-compose.yml logs server`.
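For context on what handling a request involves once the cache is populated: a search is essentially "embed the query, then ask Chroma for the nearest document vectors". The sketch below shows that flow with the `chromadb` client; the collection name, number of results, and the stand-in query encoder are assumptions, not the server's actual code.

```python
# Illustrative query path once the document vectors are cached in Chroma.
# The query encoder is a stand-in; the real server uses its trained query encoder.
import hashlib

import chromadb


def fake_embed(text: str) -> list[float]:
    # Placeholder query embedding so the sketch is self-contained.
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    return [b / 255.0 for b in digest[:16]]


def search(query: str, n_results: int = 5) -> list[str]:
    client = chromadb.HttpClient(host="localhost", port=8000)  # assumed Chroma address
    collection = client.get_or_create_collection("documents")  # assumed collection name
    results = collection.query(query_embeddings=[fake_embed(query)], n_results=n_results)
    return results["documents"][0]                             # top matching document texts


if __name__ == "__main__":
    print(search("example query"))
```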