#html-parser #error #browser #tokenizer #web #compatible #input

rs_html_parser_tokenizer

Rs Html Parser Tokenizer

4 releases

0.0.10 Jan 9, 2024
0.0.9 Jan 9, 2024
0.0.6 Nov 26, 2023
0.0.5 Nov 26, 2023

#57 in #compatible


Used in rs_html_parser

MIT license

40KB
936 lines

Very compatible HTML tokenizer and parser.

Differences to HTML specification.

  • Errors are not thrown for invalid input
  • Just like web browsers, this parser automatically handles missing closing tags
  • <? and <! always opens a new processing instruction
  • Orphan end tags are ignored ( This might still change )

Notes:

  • According to the spec attribute handling is case-insensitive

Dependencies

~240–485KB