python-html5lib
Python - html5lib - HTML parser
2.6.4
2016-07-26
eisfair-1
the eisfair team, team(at)eisfair(dot)org
stable
base 2.7.4
python 2.6.4
python-six 2.6.4
python-webencodings 2.6.4
Internal Program Version: html5lib 0.999999999
Built for Python 2.7
HTML parser designed to follow the HTML5 specification. The parser is
designed to handle all flavours of HTML and parses invalid documents using
well-defined error handling rules compatible with the behaviour of major
desktop web browsers.
Output is a tree structure; the current release supports output to
DOM, ElementTree, lxml and BeautifulSoup tree formats as well as a
simple custom format.
https://github.com/html5lib/html5lib-python