AI-Powered OCR Prescription Reader

The OCR Prescription Reader is a hybrid system that combines image preprocessing, Tesseract OCR for text extraction, and AI-powered validation of the extracted text. It automates extracting, validating, and structuring prescription data, with the aim of reducing transcription errors and streamlining healthcare workflows.

Key Features

Image Preprocessing:
- Noise reduction, adaptive thresholding, and morphological operations for improved OCR accuracy.
Text Extraction:
- Utilizes Tesseract OCR to extract text from prescription images.
AI Integration:
- Validates and refines extracted text using the Gemini AI model to improve consistency and accuracy.
Drug Interaction Check:
- Uses RxNorm to look up extracted medications for drug interaction checks.
PDF Output:
- Generates a structured PDF report with extracted and validated data.

System Workflow

1. Image Preprocessing

Steps:
1. Convert image to grayscale for simplicity.
2. Apply denoising using Non-Local Means filtering.
3. Perform adaptive thresholding to binarize the image.
4. Invert the binary image and apply dilation for contour detection.
Purpose: Enhances text clarity and prepares the image for OCR.

2. Optical Character Recognition (OCR)

Uses Tesseract OCR to detect and extract text regions from preprocessed images.
Bounding boxes are generated for potential text areas, which are cropped, resized, and analyzed.

3. AI Validation

Extracted text is passed to the Gemini AI Model for validation and correction.
Gemini Enhancements:
- Corrects abbreviations and errors in medication names.
- Standardizes dosage units and frequencies (e.g., "QD" → "once daily").
- Structures the data into a clear and consistent format.
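
The validation step can be pictured as: build a prompt around the OCR text, send it to the model, and parse a structured reply. The sketch below keeps the model call behind a plain callable so the logic can be read and run without an API key. The prompt wording and the JSON shape (`medications` with `name`, `dosage`, `frequency`) are hypothetical and are not taken from the project.

```python
import json

# Hypothetical prompt; the project's actual wording is not shown in this README.
PROMPT_TEMPLATE = """You are validating OCR output from a medical prescription.
Correct misspelled medication names, expand abbreviations (for example, "QD" means "once daily"),
and standardize dosage units and frequencies.
Return only JSON with a "medications" list; each item has "name", "dosage", and "frequency".

OCR text:
{ocr_text}
"""


def build_prompt(ocr_text):
    """Embed the raw OCR text in the validation prompt."""
    return PROMPT_TEMPLATE.format(ocr_text=ocr_text.strip())


def parse_model_reply(reply):
    """Extract the JSON object from a reply, tolerating extra text around it."""
    start, end = reply.find("{"), reply.rfind("}")
    if start == -1 or end < start:
        raise ValueError("no JSON object found in model reply")
    return json.loads(reply[start:end + 1])


def validate_prescription(ocr_text, generate):
    """`generate` is any callable that maps a prompt string to the model's text reply."""
    return parse_model_reply(generate(build_prompt(ocr_text)))
```

With the google.generativeai package, `generate` could plausibly be a small wrapper such as `lambda prompt: model.generate_content(prompt).text`, where `model` is a `genai.GenerativeModel` created with whichever Gemini model name the project uses. Model output should still be reviewed before clinical use, since the parser only checks that the reply is valid JSON.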

4. Output Generation

Results are saved as:
- A JSON file for programmatic use.
- A PDF report for easy sharing and documentation.
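
As a rough illustration of the two outputs, the sketch below writes the structured data to JSON and renders one line per medication into a PDF with fpdf. The data layout and function names are assumptions carried over for the example; the project's actual report format is not described here.

```python
import json


def save_json(data, path):
    """Write the structured prescription data as UTF-8 JSON."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(data, f, indent=2, ensure_ascii=False)


def report_lines(data):
    """Format one numbered line per medication (assumed keys: name, dosage, frequency)."""
    lines = []
    for i, med in enumerate(data.get("medications", []), start=1):
        lines.append(
            f"{i}. {med.get('name', '?')} | {med.get('dosage', '?')} | {med.get('frequency', '?')}"
        )
    return lines


def save_pdf(data, path):
    """Render the report lines into a simple one-column PDF."""
    from fpdf import FPDF  # imported here so the JSON helpers work without fpdf installed

    pdf = FPDF()
    pdf.add_page()
    pdf.set_font("Arial", "B", 14)
    pdf.cell(0, 10, "Prescription Report", 0, 1)  # width 0 = full line; final 1 = line break
    pdf.set_font("Arial", "", 11)
    for line in report_lines(data):
        # The built-in fonts only cover Latin-1, so non-Latin text needs an embedded font.
        pdf.cell(0, 8, line, 0, 1)
    pdf.output(path)
```

Keeping the line formatting in `report_lines` separate from the PDF call means the same text can be shown in the web interface without regenerating the file.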

Example Workflow

Image Input:
- Upload a prescription image via the web interface.
Processing:
- Preprocess the image and extract text using Tesseract OCR.
Validation:
- AI validates and structures the extracted text.
Output:
- View structured text and download the PDF report.

Limitations

Handwritten Text:
- Tesseract OCR struggles with highly variable handwriting.
- Designed primarily for printed prescriptions.
Image Quality:
- Poor-quality or low-resolution images may impact accuracy.

Future Directions

Enhanced Models:
- Replace Tesseract with advanced deep learning-based OCR models like CRNN or Vision Transformers for better handwriting recognition.
Edge Deployment:
- Optimize for mobile and IoT devices for on-the-go prescription analysis.
Multilingual Support:
- Extend support to non-English prescriptions and international drug standards.

Screenshots

[Screenshot: Image Preprocessing]

[Screenshot: Extracted and Validated Text]

Requirements

Environment:
- Python 3.8+
- Tesseract OCR installed locally.
Dependencies:
- numpy, opencv-python, pytesseract, requests, fpdf, python-dotenv, and google-generativeai (imported as dotenv and google.generativeai, respectively).
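
A quick way to confirm the environment before running the project is to check that the Tesseract binary is on the PATH and that each dependency can be imported. The helper below is a hypothetical convenience script, not part of the repository; the module names are the import names of the packages listed above.

```python
import importlib.util
import shutil

# Import names for the listed dependencies (opencv-python is imported as cv2).
REQUIRED_MODULES = [
    "numpy", "cv2", "pytesseract", "requests", "fpdf", "dotenv", "google.generativeai",
]


def module_available(name):
    """Return True if the module can be located without fully importing it."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # Raised when a parent package (for example `google`) is missing.
        return False


def check_environment(modules=REQUIRED_MODULES):
    """Report whether Tesseract is on PATH and which modules are missing."""
    return {
        "tesseract_on_path": shutil.which("tesseract") is not None,
        "missing_modules": [m for m in modules if not module_available(m)],
    }


if __name__ == "__main__":
    print(check_environment())
```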

Authors

Name	GitHub	LinkedIn
Abdulmonem Elsherif	@AbdulmonemElsherif
Sharif Ehab	@Sharif_Ehab

Repository: SharifEhab/OCR-Prescription-Reader