Thursday, May 22, 2025
Show HN: Defuddle, an HTML-to-Markdown alternative to Readability https://ift.tt/PR5S6If
Show HN: Defuddle, an HTML-to-Markdown alternative to Readability Defuddle is an open-source library I built to parse and extract the main content and metadata from web pages. It can also return the content as Markdown. I built Defuddle while working on Obsidian Web Clipper[1] (also MIT-licensed) because Mozilla's Readability appears to be mostly abandoned, and didn't work well for many sites. It's still very much a work in progress, but I thought I'd share it today, in light of the announcement that Mozilla is shutting down Pocket. This library could be helpful to anyone building a read-it-later app. Defuddle is also available as a CLI: https://ift.tt/VMb1pfl [1] https://ift.tt/oT7xD5U https://ift.tt/0TgUebs May 23, 2025 at 01:40AM
Labels:
Hacker News
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment