Re: Replacement for Oracle Text - Mailing list pgsql-general

From Stephen Davies
Subject Re: Replacement for Oracle Text
Date
Msg-id 56C7AF03.30701@sdc.com.au
Whole thread Raw
In response to Re: Replacement for Oracle Text  (Bruce Momjian <bruce@momjian.us>)
Responses Re: Replacement for Oracle Text  (Chris Travers <chris.travers@gmail.com>)
List pgsql-general
On 20/02/16 00:24, Bruce Momjian wrote:
> On Fri, Feb 19, 2016 at 02:49:16PM +0100, s d wrote:
>> On 19 February 2016 at 14:19, Bruce Momjian <bruce@momjian.us> wrote:
>>      >     Ah, no. That's not possible
>>      >
>>      >
>>      > ...not possible, Yet.
>>      >
>>      > PostgreSQL grows by adding the features people need and its changing
>>      rapidly.
>>
>>      I wonder if PLPerl could be used to extract the words from a PDF
>>      document and create a tsvector column from it.
>>
>>   I don't know about PLPerl(I'm pretty sure it could be used for this purpose,
>> though.).  On the other hand I've written code for this in Python which should
>> be easy to adapt for PLPython, if necessary.
>
> Right, so you would write a PL/Perl or PL/Python trigger function that
> would populate the tsvector column on every INSERT or UPDATE.
>
FWIW, I just use pdftotext in my CGI.

--
=============================================================================
Stephen Davies Consulting P/L                             Phone: 08-8177 1595
Adelaide, South Australia.                                Mobile:040 304 0583


pgsql-general by date:

Previous
From: Oleg Bartunov
Date:
Subject: Re: Replacement for Oracle Text
Next
From: Chris Travers
Date:
Subject: Re: Replacement for Oracle Text