pdf2md

Convert PDF files to Markdown using Mistral's OCR API.

Setup

uv as package manager is highly recommended (https://docs.astral.sh/uv/getting-started/installation)

Clone/copy this project to your machine

Copy .env.example to .env and add your Mistral API key:

cp .env.example .env
# Edit .env and set MISTRAL_API_KEY

Install dependencies:
```
uv sync
```

Usage

Run from the project directory:

uv run --env-file .env main.py input.pdf

With custom output folder:

uv run --env-file .env main.py input.pdf -o output_folder

Installing as a global command

Windows (not tested)

Add the project folder to PATH (windows add folder to path)

Then run from anywhere: pdf2md input.pdf

Linux/MacOS

Add an alias to your shell profile (.bashrc or .zshrc):

alias pdf2md='uv run --project <path_to_this_folder> --env-file <path_to_this_folder>/.env <path_to_this_folder>/main.py'

Output

Creates a folder (same name as the PDF) containing:

filename.md - The converted markdown
img-*.jpeg - Extracted images (if any)

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
main.py		main.py
pdf2md.bat		pdf2md.bat
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

pdf2md

Setup

Usage

Installing as a global command

Windows (not tested)

Linux/MacOS

Output

About

Uh oh!

Releases

Packages

Languages

davemaier/pdf2md

Folders and files

Latest commit

History

Repository files navigation

pdf2md

Setup

Usage

Installing as a global command

Windows (not tested)

Linux/MacOS

Output

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages