Extract main content and convert to Markdown.
Mark Harvest is a simple yet powerful Chrome extension that extracts the meat of any web page using Mozilla's Readability library and converts it into clean Markdown using Turndown.
- Smart Extraction: Uses @mozilla/readability (the engine behind Firefox's Reader View) to identify and extract the main content while stripping away ads and navigation.
- Clean Markdown: Converts HTML to Markdown using turndown.
- Easy Copy: One click copies the extracted Markdown to your clipboard.
- Developer Friendly: Built with modern web technologies: Vite, TypeScript, and Vitest.
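
The extract-then-convert flow described above could be sketched roughly as follows. This is a minimal illustration, not the extension's actual source: `harvest`, `ArticleLike`, `Extractor`, and `HtmlToMarkdown` are hypothetical names, and the two libraries are passed in as plain functions so the snippet runs without any dependencies. Prefixing the title as a heading is also an assumption.

```typescript
// Subset of the article object this sketch relies on. Readability's parse()
// returns an object with these fields (among others), or null on failure.
interface ArticleLike {
  title?: string | null;
  content?: string | null; // cleaned HTML of the main content
}

// Hypothetical function types standing in for the two libraries.
type Extractor = () => ArticleLike | null;
type HtmlToMarkdown = (html: string) => string;

// Hypothetical helper: run extraction, then convert the result to Markdown.
// Returns null when no main content could be identified.
function harvest(extract: Extractor, toMarkdown: HtmlToMarkdown): string | null {
  const article = extract();
  if (article === null) return null;

  const body = toMarkdown(article.content ?? "").trim();
  // Assumption: prepend the page title as a top-level heading when present.
  return article.title ? `# ${article.title}\n\n${body}` : body;
}

// With the real libraries, the wiring would presumably look like:
//   harvest(
//     () => new Readability(document.cloneNode(true) as Document).parse(),
//     (html) => new TurndownService().turndown(html),
//   );
```

Cloning the document before handing it to Readability is a common precaution, since parsing modifies the DOM it is given.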
(See "For Developers" section below.)
- Clone this repository.
- Install dependencies:
pnpm install
- Build the extension:
pnpm build
- Load the extension in Chrome:
  - Open Chrome and navigate to chrome://extensions/.
  - Enable Developer mode (toggle in the top right).
  - Click Load unpacked.
  - Select the dist directory in this project folder.
- Navigate to a web page you want to capture.
- Click the Mark Harvest icon in your browser toolbar.
- Click the Harvest Content button.
- The content is now in your clipboard!
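
The final copy step can be illustrated with the standard asynchronous Clipboard API. This is only a sketch under the assumption that the extension uses `navigator.clipboard.writeText`; `copyMarkdown` and `ClipboardLike` are hypothetical names introduced here.

```typescript
// Minimal subset of the Clipboard API used by this sketch;
// the browser's navigator.clipboard object satisfies it.
interface ClipboardLike {
  writeText(text: string): Promise<void>;
}

// Hypothetical helper: try to copy the Markdown and report success,
// so the UI can show a confirmation or an error message.
async function copyMarkdown(markdown: string, clipboard: ClipboardLike): Promise<boolean> {
  try {
    await clipboard.writeText(markdown);
    return true;
  } catch {
    // writeText rejects when clipboard access is denied.
    return false;
  }
}

// In a popup script this might be called as:
//   const ok = await copyMarkdown(markdown, navigator.clipboard);
```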
- pnpm dev: Start Vite dev server.
- pnpm build: Build production artifacts (outputs to dist/).
- pnpm test: Run the test suite with Vitest.
MIT