Age | Commit message | Author | Files | Lines | |
---|---|---|---|---|---|
2009-10-19 | Update to html5lib-0.11.1. No detailed changes. | joerg | 2 | -6/+6 | |
2009-06-14 | Remove @dirrm entries from PLISTs | joerg | 1 | -7/+1 | |
2009-01-27 | Import py-html5lib-0.11: | joerg | 4 | -0/+159 | |
html5lib is a pure-python library for parsing HTML. The parser is designed to handle all flavours of HTML and parses invalid documents using well-defined error handling rules compatible with the behaviour of major desktop web browsers. Output is to a tree structure; the current release supports output to DOM, ElementTree, lxml and BeautifulSoup tree formats as well as a simple custom format. |