[Zope] Strip all HTML

Chris Withers chrisw@nipltd.com
Wed Aug 6 12:30:06 EDT 2003


ken@practical.org wrote:

> However, this converter, like the others I have tried (Strip-o-Gram, as well as an external method based on striphtml.py), seems unable to remove the content of <style></style> or <script></script> tags. So I get plenty of hits with a search for 'children' or 'window' or 'background'...

I beg to differ:

Python 2.2.2 (#37, Oct 14 2002, 17:02:34) [MSC 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> from stripogram import html2text
>>> html = "seem unable to remove the content of <style>stuff</style> or <script>more stuff</script>"
>>> html2text(html)
'seem unable to remove the content of stuff or more stuff'
>>>
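
For comparison, here is a minimal sketch of dropping the text inside <style> and <script> as well as the tags themselves. It is not how stripogram works; it assumes a modern Python 3 and uses only the standard library's html.parser. The names TextOnly and strip_html are made up for illustration.

```python
from html.parser import HTMLParser


class TextOnly(HTMLParser):
    """Collect text, skipping anything inside <style> or <script>.

    Hypothetical helper for illustration; not part of stripogram.
    """

    SKIP = {"style", "script"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []      # text fragments that will be kept
        self.skip_depth = 0  # > 0 while inside a skipped element

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside <style> or <script>.
        if not self.skip_depth:
            self.parts.append(data)


def strip_html(html):
    """Return the text of html with tags and style/script bodies removed."""
    parser = TextOnly()
    parser.feed(html)
    parser.close()
    # Collapse the whitespace runs left behind by the removed elements.
    return " ".join("".join(parser.parts).split())


html = ("seem unable to remove the content of "
        "<style>stuff</style> or <script>more stuff</script>")
print(strip_html(html))
```

With the same input string as in the session above, this should print the sentence without 'stuff' or 'more stuff'.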

How are you using stripogram?

cheers,

Chris




