Re: What is the simpliest text search configuration? - Mailing list pgsql-general

From Oleg Bartunov
Subject Re: What is the simpliest text search configuration?
Date
Msg-id Pine.LNX.4.64.0911121923190.6801@sn.sai.msu.ru
Whole thread Raw
In response to Re: What is the simpliest text search configuration?  (Jérôme Etévé <jerome.eteve@gmail.com>)
List pgsql-general
We submitted unaccent dictionary for 8.5
See http://www.sai.msu.su/~megera/wiki/unaccent for some information

Oleg
On Thu, 12 Nov 2009, Jrme Etv wrote:

> Hi Michael,
>
> I actually found that the 'simple' dictionary doesn't enforce a
> stopword list by default. so i defined my search conf like this and it
> works:
>
> create text search configuration sbsimple  ( parser = 'default' ) ;
> alter text search configuration sbsimple ALTER MAPPING FOR
> word,hword,asciiword,asciihword WITH simple
>
> Cheers!
>
> J.
>
> 2009/11/12 Michael Nacos <m.nacos@gmail.com>:
>> Dear Jerome,
>>
>> from personal experience full-text searching in PostgreSQL can be quite
>> powerful
>> but it's not simple, it requires thought, planning and coding. PostgreSQL
>> mainly
>> provides an efficient token matching mechanism supporting positional
>> information
>> and weights, but natural language processing and normalization is pretty
>> basic.
>>
>> If you don't mind writing a couple of user-defined functions to take control
>> of lexeme
>> normalization, then tsvector/tsquery support can be a very powerful tool for
>> custom
>> search engines.
>>
>> regards,
>>
>> Michael
>>
>> 2009/11/12 JЪЪrЪЪme EtЪЪvЪЪ <jerome.eteve@gmail.com>
>>>
>>> Hi all,
>>>
>>>  I'd like to implement a full text search with postgresql, and I can't
>>> find
>>> a text search configuration that would just:
>>>
>>> map unicode accentuated letters to an un-accentuated equivalent
>>> tokenize the words (and skip any non word characters)
>>> no stopwords
>>> lower case the tokens
>>>
>>> How can I achieve this? I'm particularly interested in deactivating
>>> the stopwords filtering.
>>>
>>> I tried pg_catalog.simple, but despite its name, it still considers stop
>>> words.
>>>
>>> Thanks for your help!
>>>
>>> Jerome.
>>>
>>
>>
>
>
>
>

     Regards,
         Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83

pgsql-general by date:

Previous
From: Andreas Kretschmer
Date:
Subject: Re: re-using RETURNING
Next
From: Alvaro Herrera
Date:
Subject: Re: [pgeu-general] pgday.eu