html-conduit

Parse HTML documents using xml-conduit datatypes.

https://github.com/snoyberg/xml

Version on this page:1.2.1.1@rev:1
LTS Haskell 22.39:1.3.2.2
Stackage Nightly 2024-10-31:1.3.2.2
Latest on Hackage:1.3.2.2

See all snapshots html-conduit appears in

MIT licensed by Michael Snoyman
Maintained by [email protected]
This version can be pinned in stack with:html-conduit-1.2.1.1@sha256:de32ca4d6df94a7c027a11db1b2e32ef1a7ccfe0565923f24528613ade821343,1970

Module documentation for 1.2.1.1

This package uses tagstream-conduit for its parser. It automatically balances mismatched tags, so that there shouldn’t be any parse failures. It does not handle a full HTML document rendering, such as adding missing html and head tags.

Changes

1.2.1.1

  • Allow xml-conduit 1.4

1.2.1

  • Add strict and lazy text parsing #66

1.2.0

  • Drop system-filepath

1.1.1.2

  • Fix a bug with double-unescaping of entities