summaryrefslogtreecommitdiff
path: root/textproc/hs-tagsoup/DESCR
blob: e1ab2427747397226edc8b0cdebbf8ca7e95cfbd (plain)
1
2
3
4
5
6
7
TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making it
ideal for screen-scraping.

Users should start from the Text.HTML.TagSoup module.