[Zope] Indexing dtml/html files

Geir B Hansen <geirh@funcom.com>
Thu, 9 Mar 2000 13:02:20 +0100


on Thursday, March 09, 2000 Anders Holmbech Nielsen wrote :
AHN> Hi,

AHN> Is it possible to use the catalog to index ordinary HTML pages intelligently?

AHN> If I have a html(dtml) file like:

AHN> <HTML>
AHN> <p id=title>
AHN> Are you Intelligent ?
AHN> </p>
AHN> <p id=author>
AHN> A. Einstein
AHN> </p>
AHN> </HTML>

AHN> I would like to index the title and author ids, so that I could search on them.

AHN> Is it possible ??

I guess you could do it with XML documents, which are
inherently more machine-readable than HTML.
Otherwise, storing the title and author in properties would
do the trick for ZCatalog. Not exactly what you asked for, but
it is as close as I can get with my skills.
You could also write a Python Method to extract the title
and author from the HTML and store them in properties, so
that they can be easily indexed.
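
A rough sketch of the extraction step, only as an illustration: it
assumes a current Python with the standard html.parser module, and the
names IdTextExtractor and extract_fields are made up for this example,
not part of Zope. It collects the text inside elements whose id is
"title" or "author". Storing the results as properties is left out,
since that depends on how your Zope objects are set up.

```python
from html.parser import HTMLParser


class IdTextExtractor(HTMLParser):
    """Collect the text inside elements whose id is in `wanted`.

    Hypothetical helper for illustration. It assumes the wanted
    elements contain plain text only (no nested tags).
    """

    def __init__(self, wanted):
        super().__init__()
        self.wanted = set(wanted)
        self.current = None  # id of the element we are inside, if any
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        elem_id = dict(attrs).get("id")
        if elem_id in self.wanted:
            self.current = elem_id
            self.fields[elem_id] = ""

    def handle_endtag(self, tag):
        # Any closing tag ends the current capture (no nesting support).
        self.current = None

    def handle_data(self, data):
        if self.current is not None:
            self.fields[self.current] += data


def extract_fields(html, wanted=("title", "author")):
    """Return a dict mapping each wanted id to its whitespace-normalized text."""
    parser = IdTextExtractor(wanted)
    parser.feed(html)
    parser.close()
    return {key: " ".join(value.split()) for key, value in parser.fields.items()}


# The page from the question, with unquoted id attributes.
page = """<HTML>
<p id=title>
Are you Intelligent ?
</p>
<p id=author>
A. Einstein
</p>
</HTML>"""

print(extract_fields(page))
```

The returned dict could then be copied into properties named title and
author on the document, which ZCatalog can index in the usual way.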

--
Geir B Hansen
web-developer/designer
geirh@funcom.com
http://www.funcom.com