Genetic Syndrome Classification

Context

You are being hired by a fictional biotech company specializing in genetic research. The task involves analyzing embeddings derived from images to classify genetic syndromes. These embeddings are outputs from a pre-trained classification model. The company wants to improve its understanding of the data distribution and enhance the classification accuracy of genetic syndromes based on these embeddings. Your objective is to implement a comprehensive pipeline that includes data preprocessing, visualization, classification, manual implementation of key metrics, and insightful analysis.

Prerequisites

You must have installed:

Python 3.11.10

Optional:

UV

Setup

In the root of the project you must run:

# If you have python UV installed
uv sync

# If you have only pyhton and pip installed
pip install -r requirements.txt

Usage

Starting the Server

From the project root directory:

# Run pipeline and then start server (default)
python main.py

# Run only the inference server
python main.py --server

# Start server on a specific port
python main.py --server --port 9000

# Run only the pipeline
python main.py --pipeline

# Run pipeline and then start server
python main.py --all

Testing with cURL

# Health check
curl -X GET "http://localhost:8000/health"

# Single prediction with sample data
curl -X POST "http://localhost:8000/predict" \
  -H "Content-Type: application/json" \
  -d @artifacts/sample_inputs/sample_1.json

# Batch prediction
curl -X POST "http://localhost:8000/predict_batch" \
  -H "Content-Type: application/json" \
  -d @artifacts/sample_inputs/batch_input.json

# Get model info
curl -X GET "http://localhost:8000/model/info"

Troubleshooting

"Model not loaded" Error

Cause: The trained model file is missing.
Solution: Run the pipeline first:

python main.py --pipeline

Port Already in Use

Cause: Another process is using port 8000.
Solution: Use a different port:

python main.py --server --port 9000

Reports

Check the REPORTS.md file with all reports about the problem, solutions, analysis, and possible improvements

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
src		src
.gitignore		.gitignore
README.md		README.md
REPORT.md		REPORT.md
create_sample_data.py		create_sample_data.py
main.py		main.py
mini_gm_public_v0.1.p		mini_gm_public_v0.1.p
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Genetic Syndrome Classification

Prerequisites

Setup

Usage

Starting the Server

Testing with cURL

Troubleshooting

"Model not loaded" Error

Port Already in Use

Reports

About

Uh oh!

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Genetic Syndrome Classification

Prerequisites

Setup

Usage

Starting the Server

Testing with cURL

Troubleshooting

"Model not loaded" Error

Port Already in Use

Reports

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages