Re: tsearch2 and pdf files - Mailing list pgsql-general

From Magnus Hagander
Subject Re: tsearch2 and pdf files
Date
Msg-id 6BCB9D8A16AC4241919521715F4D8BCEA0FDF8@algol.sollentuna.se
Whole thread Raw
In response to Re: tsearch2 and pdf files  (Henrik Zagerholm <henke@mac.se>)
Responses Re: tsearch2 and pdf files  ("philip johnson" <philip.johnson@atempo.com>)
List pgsql-general
> 1. Convert PDF to file with e.g xpdf
> 2. Insert parsed text to a table of your choice.
> 3. Make vectors from the text.

Actually, if you're not going to use the headline() function, you cna
just store it directly in a vector, cutting down on the size
requirements. Just insert to the to_tsvector() result. The full text is
required for headline() though, so you can't cheat on that.

//Magnus

pgsql-general by date:

Previous
From: Henrik Zagerholm
Date:
Subject: Re: tsearch2 and pdf files
Next
From: "Jonathan Ellis"
Date:
Subject: forcing compression of text field