NeuroRank: High-Performance Neural Information Retrieval

NeuroRank is a production-ready neural reranking service designed for low-latency Information Retrieval (IR). It leverages Knowledge Distillation to compress a large, highly accurate BERT-based "teacher" model into a smaller, faster "student" model. Further performance gains are achieved through ONNX runtime optimization and 8-bit quantization, making it suitable for real-time search applications.

🚀 Key Features

Knowledge Distillation: Achieves 97% of the teacher model's accuracy with a 6x reduction in model size and 10x faster inference.
Production-Optimized: Deployed using ONNX Runtime with dynamic quantization for CPU-based inference.
Scalable API: Includes a FastAPI-based REST service ready for containerization (Docker).
Standard Benchmarks: Trained and evaluated on the MS MARCO passage ranking dataset.

🛠️ Architecture

graph LR
    A[MS MARCO Data] --> B(Teacher Model<br/>Cross-Encoder BERT);
    B -->|Distillation Logs| C{Distillation<br/>Trainer};
    A --> C;
    C --> D(Student Model<br/>MiniLM);
    D --> E[ONNX Export &<br/>Quantization];
    E --> F(NeuroRank<br/>Service API);

⚡ Performance Benchmarks

Model Version	MRR@10	Latency (p99)	Model Size
Teacher (BERT-Base)	0.382	120ms	420MB
Student (MiniLM-L6)	0.371	15ms	90MB
NeuroRank (Quantized ONNX)	0.369	8ms	23MB

> Note: Benchmarks run on Intel Xeon CPU @ 2.20GHz, 4 vCPUs.

📦 Quick Start

The project uses pyproject.toml for dependencies and neurorank as the central CLI tool.

Clone and Install Dependencies:

git clone [https://github.com/yourusername/neurorank.git](https://github.com/yourusername/neurorank.git)
cd neurorank
# Install the project and all dependencies from pyproject.toml
pip install .

Build the Production Model: If you don't have a pre-trained model, you must run the full training pipeline (train-teacher, train-student) followed by the export step.

# 1. Run the full training pipeline (skipped here for brevity)
# 2. Convert the trained .pt model to the final quantized .onnx model
bash scripts/build_release_model.sh

Run the Service (Recommended: Docker):
```
docker-compose up --build
```
Alternatively, run it locally via the CLI:
```
neurorank runserver --port 8000
```

Test the API: Note: The API expects the document list field to be named texts.

curl -X POST "http://localhost:8000/rerank" \
     -H "Content-Type: application/json" \
     -d '{"query": "machine learning", "texts": ["intro to ML", "advanced AI", "cooking recipes"]}'

🏗️ Project Structure

src/neuro_ranker/: Core library for model definitions, encoders, and helpers.
training_pipeline/: Scripts for training, distillation, and evaluation (e.g., train_teacher.py).
scripts/: Utility scripts for data prep, ONNX export, and benchmarking.
ranker_service/: FastAPI application for serving the model.
configs/: Training configuration files.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
.github/workflows		.github/workflows
configs		configs
data		data
docs		docs
ranker_service		ranker_service
scripts		scripts
src/neuro_ranker		src/neuro_ranker
tests		tests
training_pipeline		training_pipeline
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yaml		docker-compose.yaml
manage.py		manage.py
pyproject.toml		pyproject.toml
requirements.lock		requirements.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NeuroRank: High-Performance Neural Information Retrieval

🚀 Key Features

🛠️ Architecture

⚡ Performance Benchmarks

📦 Quick Start

🏗️ Project Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NeuroRank: High-Performance Neural Information Retrieval

🚀 Key Features

🛠️ Architecture

⚡ Performance Benchmarks

📦 Quick Start

🏗️ Project Structure

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages