Re: integration of fulltext search in bytea/docs - Mailing list pgsql-general

From Pavel Stehule
Subject Re: integration of fulltext search in bytea/docs
Msg-id 162867790907290838q5cf5a640x241fb14a99d269f2@mail.gmail.com
In response to integration of fulltext search in bytea/docs  (Radek Novotný <radek.novotny@mediawork.cz>)
Responses Re: integration of fulltext search in bytea/docs
List pgsql-general
Hello

On 29 July 2009 16:46, Radek Novotný <radek.novotny@mediawork.cz>
wrote:
> Hello,
> is integration of full-text searching in documents saved in blobs (bytea)
> on the PostgreSQL roadmap?
>

As far as I know, no. PostgreSQL doesn't know about other binary formats,
so it cannot do it.

> For example linux antiword can export fine text output that can be inserted
> into varchar field.

I know this approach well. Three years ago we used antiword. We stored
two values: the original Word document as bytea, and the antiword output
as text. The overhead is minimal, and this solution worked very well.
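
The two-column arrangement described above could look roughly like the
following. This is only a sketch, not the code we actually ran: the table
name docs, the column names original and body, the index name, the 'simple'
text search configuration, the %s placeholder style (as used by DB-API
drivers such as psycopg2) and the UTF-8 decoding of antiword's output are
all assumptions made for illustration.

```python
import subprocess

# Hypothetical schema: table, column and index names are illustrative only.
SCHEMA_SQL = """
CREATE TABLE docs (
    id       serial PRIMARY KEY,
    original bytea NOT NULL,   -- the Word document exactly as uploaded
    body     text  NOT NULL    -- plain text produced by antiword
);
CREATE INDEX docs_body_fts ON docs USING gin (to_tsvector('simple', body));
"""

# Parameterized statements; the %s placeholder style is an assumption.
INSERT_SQL = "INSERT INTO docs (original, body) VALUES (%s, %s)"

SEARCH_SQL = (
    "SELECT id FROM docs "
    "WHERE to_tsvector('simple', body) @@ plainto_tsquery('simple', %s)"
)


def extract_text(path, run=subprocess.run):
    """Convert a Word file to plain text by calling the antiword CLI."""
    result = run(["antiword", path], capture_output=True, check=True)
    # Assumes antiword emits UTF-8; undecodable bytes are replaced.
    return result.stdout.decode("utf-8", errors="replace")


def insert_params(path, run=subprocess.run):
    """Return the (original, body) parameters for INSERT_SQL."""
    with open(path, "rb") as f:
        original = f.read()
    return original, extract_text(path, run=run)
```

The conversion runs once, when the document is stored, so searches only
touch the indexed text column and never the bytea value.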

>
> It would be very nice (PostgreSQL users could be proud to be first) to save
> documents into bytea and search that field via to_tsvector, to_tsquery ...
>

It would be very slow. The transformation to text would have to be
done repeatedly.

Pavel Stehule

> --
>
> www.publicstream.cz - we have created a product for live broadcasts (online
> streaming) and a video archive in FLASH format.
>
> Bc. Radek Novotný
> managing director, Mediawork group s.r.o.
>
> tel.: +420 724 020 361
> email: radek.novotny@mediawork.cz
> http://www.mediawork.cz
>
>
