4 releases

0.0.8 Dec 25, 2023
0.0.7 Dec 16, 2023
0.0.6 Nov 26, 2023
0.0.5 Nov 26, 2023

#71 in #compatible


Used in rs_html_parser

MIT license

4KB
50 lines

Very compatible HTML tokenizer and parser.

Differences to HTML specification.

  • Errors are not thrown for invalid input
  • Just like web browsers, this parser automatically handles missing closing tags
  • <? and <! always opens a new processing instruction
  • Orphan end tags are ignored ( This might still change )

Notes:

  • According to the spec attribute handling is case-insensitive

Dependencies

~1MB
~18K SLoC