v0.2.5: Content Extraction Fixes & SEO Glossary
Improved content extraction accuracy, fixed edge cases in markdown conversion, and added an SEO glossary to the documentation.
What’s New in v0.2.5
Content Extraction Improvements
Fixed several edge cases in the content extraction pipeline where certain HTML structures would produce broken or incomplete markdown output. Pages with complex nested layouts, tables, and embedded media now convert more reliably.
SEO Glossary
The documentation now includes an SEO glossary covering all 18 check categories in the SEO audit - from missing titles and short descriptions to noindex pages and non-self canonicals. Each entry explains what the issue is, why it matters, and how to fix it.
Bug Fixes
- Fixed an issue where some discovered links were not being followed correctly
- Updated the llms.txt reference documentation
Related
Wrap-up
A CMS shouldn't slow you down. Crawler aims to expand into your workflow — whether you're coding content models, collaborating on product copy, or launching updates at 2am.
If that sounds like the kind of tooling you want to use — try Crawler .