Advanced-PDF-Document-Utility 🗂️

A fully offline, browser-based PDF & document utility. No uploads. No server. No data leaves your machine.

🔗 Live Demo: https://rajathchannasetty-sys.github.io/Advanced-PDF-Document-Utility/

✨ Features

Feature	Description
✂️ Extract PDF Pages	Pick specific pages or ranges (e.g. `1, 3-5, 8`) from any PDF
🔗 Merge PDFs	Combine multiple PDFs into one file in your chosen order
📄 PDF → DOC	Extract text content from a PDF into a Word-compatible document
📝 DOCX → PDF	Convert a DOCX file's text into a clean, paginated PDF
🖼️ Image → PDF	Combine PNG/JPG images into a single PDF document
📋 Extract DOCX Pages	Pull specific pages from a DOCX and export as PDF

🔒 Privacy & Security

100% local processing — your files never leave your browser
No server, no backend, no database
No file uploads to any external service
MIME type validation on every file
Magic byte (%PDF) verification before processing
Filename sanitization (strips XSS, path traversal, special chars)
Page range input hardened against injection and DoS attacks
Object URLs revoked after download (no memory leaks)
Buttons locked during operations (no race conditions)

🚀 How to Use

There are two ways to use this tool — pick whichever suits you:

▶️ Option A — Use it online (nothing to install)

Just click the link and it opens in your browser:

👉 https://rajathchannasetty-sys.github.io/Advanced-PDF-Document-Utility/

Works in Chrome, Edge, Firefox, and Safari. No login, no sign-up.

💾 Option B — Download and open the file directly

You can download a single .html file and use it forever — even without internet.

Step 1 — Download the file

Click here → index.html → then click the ⬇ Download raw file button (top right of the file view on GitHub)

Step 2 — Open it in your browser

Your OS	How to open
Windows	Right-click `index.html` → Open with → Chrome or Edge
Mac	Double-click `index.html` — it opens in your default browser
Linux	Right-click → Open with → choose your browser

Or drag the file directly into an open browser window.

✅ That's it. No install, no setup, no internet required after the first open. The file works completely offline — your documents never leave your computer.

📋 Using the tool (same for both options)

Drop your file — drag and drop onto the upload area, or click Browse
Pick what to do — choose from the 6 operation cards
Fill in any options — page numbers, output filename (optional)
Click the action button — the result downloads automatically
Done or start again — use the ↺ Clear or ⟳ Reload buttons to reset

Supported file types: PDF · DOCX · PNG · JPG | Max size: 50 MB per file

🛠️ Tech Stack

Library	Version	Purpose
pdf-lib	1.17.1	PDF creation, manipulation, merging, page extraction
mammoth.js	1.6.0	DOCX text extraction

No build tools. No frameworks. No npm. Pure HTML + CSS + JavaScript.

📁 Project Structure

Advanced-PDF-Document-Utility/
├── index.html          # The entire app (single file)
├── README.md           # This file
├── LICENSE             # MIT License
├── SECURITY.md         # Security policy
├── CHANGELOG.md        # Version history
└── .github/
    └── ISSUE_TEMPLATE/
        ├── bug_report.md
        └── feature_request.md

🖥️ Run Locally

No installation required. Just open index.html in your browser:

# Clone the repo
git clone https://github.com/rajathchannasetty-sys/Advanced-PDF-Document-Utility.git

# Open in browser (macOS)
open Advanced-PDF-Document-Utility/index.html

# Open in browser (Windows)
start Advanced-PDF-Document-Utility/index.html

# Open in browser (Linux)
xdg-open Advanced-PDF-Document-Utility/index.html

Or serve it locally with Python (for development):

cd pdf-toolkit
python3 -m http.server 8080
# Then open http://localhost:8080

🤝 Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch: git checkout -b feature/my-feature
Commit your changes: git commit -m "Add: my feature"
Push to the branch: git push origin feature/my-feature
Open a Pull Request

Please read SECURITY.md before reporting vulnerabilities.

📋 Known Limitations

PDF → DOC text extraction is best-effort. Scanned PDFs (image-based) may return no text.
DOCX → PDF uses plain text rendering — formatting, fonts, and images are not preserved.
Page extraction from DOCX splits by ~3000 characters per page, not by document-defined page breaks.
Files above 50 MB are rejected to protect browser performance.

📄 License

👤 Author

Rajath Channasetty

LinkedIn: @rajath-channasetty
GitHub: @rajathchannasetty-sys

Made with ❤️ — 100% private, 100% free

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Advanced-PDF-Document-Utility 🗂️

✨ Features

🔒 Privacy & Security

🚀 How to Use

▶️ Option A — Use it online (nothing to install)

💾 Option B — Download and open the file directly

📋 Using the tool (same for both options)

🛠️ Tech Stack

📁 Project Structure

🖥️ Run Locally

🤝 Contributing

📋 Known Limitations

📄 License

👤 Author

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
bug_report.md		bug_report.md
deploy.yml		deploy.yml
feature_request.md		feature_request.md
index.html		index.html

Folders and files

Latest commit

History

Repository files navigation

Advanced-PDF-Document-Utility 🗂️

✨ Features

🔒 Privacy & Security

🚀 How to Use

▶️ Option A — Use it online (nothing to install)

💾 Option B — Download and open the file directly

📋 Using the tool (same for both options)

🛠️ Tech Stack

📁 Project Structure

🖥️ Run Locally

🤝 Contributing

📋 Known Limitations

📄 License

👤 Author

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages