Re: Can tsearch do some basic text mining - Mailing list pgsql-general

From Oleg Bartunov
Subject Re: Can tsearch do some basic text mining
Date
Msg-id Pine.LNX.4.64.0708242151450.2727@sn.sai.msu.ru
Whole thread Raw
In response to Can tsearch do some basic text mining  ("Phoenix Kiula" <phoenix.kiula@gmail.com>)
Responses Re: Can tsearch do some basic text mining  ("Phoenix Kiula" <phoenix.kiula@gmail.com>)
List pgsql-general
On Fri, 24 Aug 2007, Phoenix Kiula wrote:

> Hi,
>
> We have big blobs of text (average 10,000 characters) in a database,
> from which we would like to discover the most often repeated words or
> phrases. Can tsearch be used for this kind of pattern search? I
> suppose it's Text Mining 101 sort of stuff, nothing complex.

there is stat() function, see
http://www.sai.msu.su/~megera/wiki/Tsearch_V2_Notes
for more details.
It's not fast, so better to save results in a table

>
> TIA!
>
> ---------------------------(end of broadcast)---------------------------
> TIP 2: Don't 'kill -9' the postmaster
>

     Regards,
         Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83

pgsql-general by date:

Previous
From: Tom Lane
Date:
Subject: Re: FATAL: could not reattach to shared memory (Win32)
Next
From: Cody Pisto
Date:
Subject: lc_collate issue