Re: directional marks - Mailing list pgsql-hackers

From nngodinh@tiscali.it
Subject Re: directional marks
Date
Msg-id 3D6DC7120001AA51@mail-4.tiscalinet.it
In response to directional marks  (nngodinh@tiscali.it)
Responses Re: directional marks  (Peter Eisentraut <peter_e@gmx.net>)
List pgsql-hackers
I'm speaking about directional marks that are ignored by, for instance,
Microsoft SQL 7.0 because they're useless in that position (as when they
appear in text that runs in a single direction, either left-to-right or
right-to-left). Symbols of this kind can end up inserted at random. For
example:

The data-entry user types an English text like "test". Then he switches
the keyboard layout to Arabic and types something in Arabic, but he realizes
he doesn't want that, erases the Arabic text, switches the keyboard back
and continues typing English text after "test". Some directional marks have
been inserted, but they're useless.

The problem is that sometimes the directional mark is inside a word, not
just at the end. Moreover, if you try to index using txt2txtidx,
directional marks are not recognized as delimiters (and indeed they aren't),
so the txtidx array will contain the adjacent word with a directional mark
appended.
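
As a rough illustration of the "remove them" option, here is a minimal
Python sketch that strips directional marks from a string before it is
loaded or indexed. The function name strip_directional_marks and the exact
set of code points are my assumptions (the Unicode bidirectional control
characters), not anything PostgreSQL or txt2txtidx provides. It only makes
sense for data where the marks are known to be spurious, since in genuinely
mixed-direction text they carry real information.

```python
# Unicode bidirectional control characters, assumed here to be the
# "directional marks" in question.
BIDI_CONTROLS = (
    "\u200e\u200f"                      # LRM, RLM
    "\u202a\u202b\u202c\u202d\u202e"    # LRE, RLE, PDF, LRO, RLO
    "\u2066\u2067\u2068\u2069"          # LRI, RLI, FSI, PDI
    "\u061c"                            # ALM (Arabic letter mark)
)

# str.translate deletes every code point that maps to None.
_STRIP_TABLE = dict.fromkeys(map(ord, BIDI_CONTROLS))


def strip_directional_marks(text: str) -> str:
    """Return text with all bidi control characters removed (hypothetical helper)."""
    return text.translate(_STRIP_TABLE)


if __name__ == "__main__":
    dirty = "te\u200fst\u200e more text"
    print(repr(strip_directional_marks(dirty)))
```

Running the cleanup once on the exported data, before the import, would
keep both plain queries and the txtidx index free of the stray marks.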

Maybe you'll say that the source I exported the db from is malformed, and
you are absolutely right. Anyway, I know that some programs (especially
Microsoft's) make this mistake. I'm not speaking of PHP.

Bye.

>-- Original Message --
>Date: Mon, 16 Sep 2002 19:25:30 +0200 (CEST)
>From: Peter Eisentraut <peter_e@gmx.net>
>To: nngodinh@tiscali.it
>cc: pgsql-hackers@postgresql.org
>Subject: Re: [HACKERS] directional marks
>
>
>nngodinh@tiscali.it writes:
>
>> The only way to fix the problem is to remove any directional mark occurrence,
>> or to make PostgreSQL ignore that kind of characters during UNICODE queries.
>>
>> What do you think about it?
>
>Either remove the directional marks or consistently use them in all your
>queries (or use wildcards to paint over the difference).  The directional
>mark characters aren't just for amusement -- they contain real information
>so they cannot be ignored.
>
>--
>Peter Eisentraut   peter_e@gmx.net
>
>
