Need help with full text index configuration - Mailing list pgsql-general

From Brian Hirt
Subject Need help with full text index configuration
Date
Msg-id 1897DC4E-FBDE-437C-AF21-37DFDF07F2D7@mobygames.com
Whole thread Raw
Responses Re: Need help with full text index configuration
Re: Need help with full text index configuration
List pgsql-general
I have some data that can be searched, and it looks like the parser is making some assumptions about the data that
aren'ttrue in our case and I'm trying to figure out how to exclude a token type.   I haven't been able to find the
answerto my question so far, so I thought I would ask here. 

The data I have are english words, and sometimes there are words separated by a / without spaces.   The parser finds
thesethings and tokenizes them as files.   I'm sure in some situations that's the right assumption, but based on my
data,I know there will never be a file name in the column.    

For example instead of the parser recognizing three asciiword it recognizes one asciiword and one file.   I'd like a
wayto have the / just get parsed as blank.  

db=# select * from ts_debug('english','maybe five/six');
   alias   |    description    |  token   |  dictionaries  |  dictionary  |  lexemes
-----------+-------------------+----------+----------------+--------------+------------
 asciiword | Word, all ASCII   | maybe    | {english_stem} | english_stem | {mayb}
 blank     | Space symbols     |          | {}             |              |
 file      | File or path name | five/six | {simple}       | simple       | {five/six}
(3 rows)

I thought that maybe I could create a new configuration and drop the file mapping, but that doesn't seem to work
either.

db=# CREATE TEXT SEARCH CONFIGURATION public.testd ( COPY = pg_catalog.english );
CREATE TEXT SEARCH CONFIGURATION
db=# ALTER TEXT SEARCH CONFIGURATION testd DROP MAPPING FOR file;
ALTER TEXT SEARCH CONFIGURATION
db=# SELECT * FROM ts_debug('testd','mabye five/six');
   alias   |    description    |  token   |  dictionaries  |  dictionary  | lexemes
-----------+-------------------+----------+----------------+--------------+---------
 asciiword | Word, all ASCII   | mabye    | {english_stem} | english_stem | {maby}
 blank     | Space symbols     |          | {}             |              |
 file      | File or path name | five/six | {}             |              |
(3 rows)


Is there anyway to do this?

Thanks for the help in advance.  I'm running 8.4.4

pgsql-general by date:

Previous
From: "Joshua D. Drake"
Date:
Subject: Re: Which CMS/Ecommerce/Shopping cart ?
Next
From: Peter Bex
Date:
Subject: Re: How Big is Too Big for Tables?