lib/bs4/NEWS.txt

125     XML parser or HTML parser, and prints out the parser events as
132 * In an HTML document, the contents of a <script> or <style> tag will
286 * Got rid of the 4.0.2 workaround for HTML documents--it was
371   Python's built-in HTML parser.
421   This actually affects all attributes that the HTML standard defines
433 * Unicode, Dammit now detects the encoding in HTML 5-style <meta> tags
459   created to parse HTML.
462   improving html.parser's ability to handle bad HTML.
484 Soup's custom HTML parser in favor of a system that lets you write a
485 little glue code and plug in any HTML or XML parser you want.
490  * lxml's HTML and XML parsers
491  * html5lib's HTML parser
517 improving the HTML parser that comes with Python 3.2.
521 Currently, the lxml and html5lib HTML parsers ignore CDATA sections in
551 builders set it to False. If you want to parse XHTML with an HTML
576 which is gone in Python 3. There's some bad HTML that SGMLParser
585 HTML.
667 Information inside a TEXTAREA tag is now parsed literally, not as HTML
684 Beautiful Soup is now usable as a way to clean up invalid XML/HTML (be
685 sure to pass in an appropriate value for convertEntities, or XML/HTML
686 entities might stick around that aren't valid in HTML/XML). The result
708 When run from the command line, Beautiful Soup now acts as an HTML
758 BeautifulSoup's HTML-specific rules, but no tag nesting rules. [Doc
805 XML or HTML entities into the corresponding Unicode characters. [Doc
819 HTML tag names showed up within a SCRIPT tag.
891 trying to build a parse tree out of ambiguous HTML tags. This version
909 characters with their HTML entity equivalents.
911 You can now get a pretty-print version of parsed HTML to get a visual
951 well-structured HTML, this makes your code a lot cleaner.
977 level of the parse tree. For instance, if you have HTML like this:
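
The match at line 125 mentions a diagnostic that feeds a document through a parser and prints the parser events. The listing does not show how that tool is implemented, so what follows is only a minimal sketch of the same idea using Python's built-in html.parser module. The names EventTracer and trace are hypothetical and are not part of Beautiful Soup.

```python
from html.parser import HTMLParser


class EventTracer(HTMLParser):
    """Hypothetical helper: records parser events instead of building a tree."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.events = []

    def handle_starttag(self, tag, attrs):
        # Called once for every opening tag the parser recognizes.
        self.events.append(("start", tag))

    def handle_endtag(self, tag):
        # Called once for every closing tag the parser recognizes.
        self.events.append(("end", tag))

    def handle_data(self, data):
        # Called for text found between tags.
        self.events.append(("data", data))


def trace(markup):
    """Feed markup to the stdlib parser and return the recorded events."""
    tracer = EventTracer()
    tracer.feed(markup)
    tracer.close()
    return tracer.events


if __name__ == "__main__":
    for event in trace("<p>Hello <b>world</b></p>"):
        print(event)
```

Running the sketch on a small well-formed snippet prints one tuple per event, which can be a quick way to see how a parser tokenizes markup before any tree is built.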