Convert Word & PDF documents to LLM-ready Markdown - 100% Private, No Data Upload
A privacy-first web application that converts Word documents (.docx) and PDFs to clean Markdown format entirely in your browser. Perfect for preparing documents for local AI/LLMs like Ollama, LM Studio, or GPT4All - without ever uploading your sensitive data to the cloud.
Why Privacy? • Features • Quick Start • Usage • Contributing • License
Launch Doc to Markdown - Use it right now in your browser!
Even the online version processes everything locally - your documents never leave your device.
- Click the download link above
- Extract the ZIP file
- Double-click
word-to-markdown.htmlto open it in your browser - Works completely offline - no internet required!
When working with AI and LLMs, you often need to convert documents containing sensitive information - business reports, legal documents, personal notes, medical records, or confidential research. Most online converters upload your files to their servers, creating privacy and security risks.
Doc to Markdown is different:
- Zero data transmission - Your documents never leave your device
- No server, no cloud - Everything runs locally in your browser
- Works completely offline - No internet connection required after loading
- No accounts or tracking - No sign-ups, no analytics, no cookies
- Open source - Audit the code yourself to verify privacy claims
This makes it ideal for:
- Converting confidential business documents for AI analysis
- Preparing sensitive legal or medical documents for local LLMs
- Processing proprietary research papers offline
- Anyone who values data privacy when using AI tools
- 100% Private & Offline - All processing happens locally in your browser, no data ever uploaded
- Word & PDF Support - Convert both
.docxWord documents and PDF files - No Server Required - Works as a standalone HTML file you can double-click
- Drag & Drop Interface - Simply drag your document or click to browse
- Copy to Clipboard - Instantly copy the converted markdown with one click
- Markdown Preview - Expandable preview area to review your converted content
- Three Image Handling Modes:
- Separate Images (Recommended) - Clean markdown with images in a folder
- Base64 Embedded - Single file with embedded images
- Text Only - Lightweight output without images
- ZIP Bundle Output - Download everything in a ready-to-use package
- Dark Glassmorphic UI - Modern, beautiful interface
- Optimized for Local LLMs - Clean markdown output perfect for Ollama, LM Studio, GPT4All, and other local AI tools
If you want to modify the code or run the development server:
- Node.js (v18 or higher)
- npm (comes with Node.js)
# Clone the repository
git clone https://github.com/yourusername/word-to-markdown.git
cd word-to-markdown
# Install dependencies
npm install
# Start the development server
npm run devThe app will be available at http://localhost:5173
# Build for production (outputs to dist/)
npm run build
# Build standalone HTML file (outputs to standalone/)
npm run build:standalone
# Preview the production build
npm run preview
# Run linter
npm run lint- Open the app - Double-click
standalone/word-to-markdown.html(or runnpm run devfor development) - Upload your document - Drag and drop a
.docxor.pdffile, or click to browse - Choose image handling - Select how you want images to be processed
- Convert - Click "Convert to Markdown"
- Preview & Copy - Expand the preview to review, or click "Copy" to copy to clipboard
- Download - Get your ZIP bundle with markdown and images
- Convert your document and click "Copy" to copy the markdown
- Paste directly into Ollama's chat interface
- Or extract the downloaded ZIP and open the markdown file
- Your document is ready for AI analysis!
| Mode | Description | Best For |
|---|---|---|
| Separate Images | Images saved in a folder, markdown references them | Most LLMs, clean output |
| Base64 Embedded | Images encoded directly in markdown | Single-file convenience |
| Text Only | Images replaced with placeholders | Text-only LLMs, smallest size |
- React 19 - UI framework
- TypeScript - Type safety
- Vite - Build tool
- Tailwind CSS - Styling
- Mammoth.js - Word document parsing
- PDF.js - PDF document parsing
- JSZip - ZIP file generation
- Turndown - HTML to Markdown conversion
word-to-markdown/
├── standalone/ # Standalone build (double-click to run)
│ └── word-to-markdown.html
├── src/
│ ├── components/ # React components
│ │ ├── Logo.tsx
│ │ ├── DropZone.tsx
│ │ ├── ImageOptions.tsx
│ │ ├── ConvertButton.tsx
│ │ ├── ProgressIndicator.tsx
│ │ └── DownloadSection.tsx
│ ├── utils/ # Utility functions
│ │ ├── converter.ts # Word to Markdown conversion
│ │ ├── pdfConverter.ts # PDF to Markdown conversion
│ │ └── zipGenerator.ts
│ ├── types/ # TypeScript types
│ │ └── index.ts
│ ├── App.tsx # Main application
│ ├── main.tsx # Entry point
│ └── index.css # Global styles
├── public/ # Static assets
├── package.json
├── vite.config.ts
├── tailwind.config.js
└── tsconfig.json
- Chrome/Edge (latest)
- Firefox (latest)
- Safari (latest)
Requires a modern browser with File API support.
- Word: Only
.docxfiles are supported (not.doc) - PDF: Text extraction works best on text-based PDFs (scanned documents may have limited results)
- Maximum file size: 200MB
- Complex formatting may not convert perfectly
- Track changes and comments are not preserved
Contributions are welcome! Please see our Contributing Guide for details.
See the Development Setup section above for instructions on setting up your local environment.
This project is licensed under the MIT License - see the LICENSE file for details.
- Adappt AI for sponsoring and building this tool
- Mammoth.js for excellent Word document parsing
- Mozilla PDF.js for robust PDF parsing
- Ollama for making local LLMs accessible
- All our contributors
If you're the repository owner, add these topics to improve discoverability:
privacy offline local-llm ollama document-converter markdown word-to-markdown pdf-to-markdown docx pdf ai-privacy client-side no-upload gpt4all lm-studio private-ai data-privacy
This tool is useful for people searching for:
- Private Word to Markdown converter
- Private PDF to Markdown converter
- Offline document converter for AI
- Convert Word to Markdown without uploading
- Convert PDF to Markdown without uploading
- Local LLM document preparation
- Privacy-focused document converter
- Ollama document converter
- Client-side docx to markdown
- Client-side PDF to markdown
- No-upload file converter
- Secure document conversion for AI
- GDPR compliant document converter
- Air-gapped document processing
Made with care for the privacy-conscious open source community
This project sponsored by Adappt AI