[Zope] Zcatalog searches

Andreas Jung lists at andreas-jung.com
Thu Jan 20 12:48:57 EST 2005



--On Donnerstag, 20. Januar 2005 12:24 Uhr -0500 Jonathan Hobbs 
<toolkit at magma.ca> wrote:

>
> ----- Original Message -----
> From: "Ken Ara" <feedreader at yahoo.com>
>> Some code you can use as a starting point to produce
>> KWIC (key word in context) can be found here:
>> http://zope.org/Members/Ioan/SiteSearch. It's old but
>> works for me under Zope 2.7.3.
>>
>> This implementation requires the searchable text to be
>> in Catalog metadata, which seems to be a bad thing,
>> but I have never really understood just how bad..
>
> We have experimented with storing compressed and uncompressed metadata (up
> to 20k bytes per zcatalog record).  This worked fine for us (in terms of
> retrieval speed and zodb size) until we hit about 500k records (we used
> our own KWIC scripts not SiteSearch).  At that time retrieval time
> started to increase to unacceptable times (over 2 seconds per search) and
> the zodb started to become unwieldly (5gb+).  At that time (slow
> retrieval speed) we had about 5 million objects in the zodb (after
> packing).
>
> I would suggest that storing lots of metadata is workable if:
>
> 1) you don't have a lot of records
> 2) you don't store too much metadata/record
> 3) you have lots (1gb+) of RAM
> 4) you have fast disks
> 5) you have a fast cpu
>
> We currently have a zcatalog with a single ZCTextindex which holds about 1
> million records (zodb size is under 3gb).  Our retrieval speed, include
> KWIC processing, is under 1.5 seconds per search.  We have very little
> metadata (less than 100 bytes per record), and access the final result
> set objects to get the data we need for KWIC processing and result set
> display.

Dieters AdvancedQuery and ManagableIndexes products might help you all
to optimize your catalogs.

-aj


More information about the Zope mailing list