Comparison

crawler.sh vs MarkItDown

Full-site Markdown extraction with metadata vs a single-file converter from Microsoft.

MarkItDown is a Python utility from Microsoft that converts a single file or URL to Markdown. It is great for one-off conversions. crawler.sh is built for scale: crawl an entire site, render JavaScript, and produce a Markdown archive where every file includes YAML frontmatter with its URL, title, word count, and language.

Side by side

Honest comparison on the axes that decide which tool fits your workflow.

Primary use case
Convert one file or URL to Markdown.
Crawl an entire site and export every page as Markdown.
Installation
Python package install.
One command installs a single static binary.
Batch processing
Manual or scripted per item.
One command crawls the whole site and exports a ZIP archive.
JavaScript rendering
No. Static HTML only.
Built-in custom engine. Auto-detected per site.
YAML frontmatter
Not included by default.
Every file includes url, title, captured_at, word_count, and language.
Output format
Raw Markdown string.
Structured Markdown archive ready for RAG or fine-tuning pipelines.
Link discovery
None. You provide the input.
Built-in BFS link extraction with depth and page limits.
robots.txt support
None.
Honored by default. Per-host adaptive backoff.
Desktop app
No.
Yes.

Feature notes reflect publicly listed information at the time of writing.

When to pick which

Both tools solve real problems. Pick based on where the work actually runs and what you are billing against.

Pick MarkItDown when

You need a quick one-off conversion of a single file or URL, you are already in a Python environment, and you do not need JavaScript rendering or site-wide extraction.

Pick crawler.sh when

You need the whole site, not one page. You want a structured Markdown archive with metadata, JavaScript rendering, and link discovery built in. You want a desktop app and a CLI that work the same way.

Try the local-first path

Install in one command. Crawl any site into clean Markdown in seconds. Free up to 1,000 pages, $99 a year for 10,000.

Crawler.sh - Free Local AEO & SEO Spider and a Markdown content extractor | Product Hunt