Re: Filtering dictionaries support and unaccent dictionary - Mailing list pgsql-hackers

From Bruce Momjian
Subject Re: Filtering dictionaries support and unaccent dictionary
Date
Msg-id 200908131914.n7DJETG02802@momjian.us
Whole thread Raw
In response to Re: Filtering dictionaries support and unaccent dictionary  (Peter Eisentraut <peter_e@gmx.net>)
Responses Re: Filtering dictionaries support and unaccent dictionary
List pgsql-hackers
Peter Eisentraut wrote:
> On Thursday 13 August 2009 18:07:51 Alvaro Herrera wrote:
> > Oleg Bartunov wrote:
> > > Peter,
> > >
> > > how to write accented characters in sgml ? Is't not allowed to write
> > > them as is ?
> >
> > á for ?, etc.  You can't use characters that aren't in Latin-1 I
> > think. Writing them literally is not allowed.
> 
> It's somehow possible, but it's not as straightforward as say with XML.  And 
> you might get into a Latin-1 vs UTF-8 mixup.  At least that's what I noticed 
> in my limited testing the other day.

The top of release.sgml has instructions on that because that is often
something we need to do for names in release notes:
       non-ASCII characters            convert to HTML4 entity (&) escapes
       official:      http://www.w3.org/TR/html4/sgml/entities.html       one page:
http://www.zipcon.net/~swhite/docs/computers/browsers/entities_page.html      other lists:
http://www.zipcon.net/~swhite/docs/computers/browsers/entities.html
http://www.zipcon.net/~swhite/docs/computers/browsers/entities_page.html
http://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references
       we cannot use UTF8 because SGML Docbook       does not support it
http://www.pemberley.com/janeinfo/latin1.html#latexta

--  Bruce Momjian  <bruce@momjian.us>        http://momjian.us EnterpriseDB
http://enterprisedb.com
 + If your life is a hard drive, Christ can be your backup. +


pgsql-hackers by date:

Previous
From: Andrew Dunstan
Date:
Subject: Re: Hot standby and synchronous replication status
Next
From: Bruce Momjian
Date:
Subject: Re: Alpha 1 release notes