Thread: print formated special characteres
Hi all,
Somebody help me, please.
How to make the texts are aligned with 10 characters?
elog(INFO, "\n%-10s Fim\n%-10s Fim\n", "Variável", "Variavel");
Output:
INFO:
Variável Fim
Variavel Fim
thank you
Att,
Attachment
El día sábado, octubre 17, 2020 a las 03:37:46p. m. -0300, Celso Lorenzetti escribió: > Somebody help me, please. > > How to make the texts are aligned with 10 characters? > > > > elog(INFO, "\n%-10s Fim\n%-10s Fim\n", "Variável", "Variavel"); Hola Celso, You can reproduce the same on the UNIX shell with: $ printf "\n%-10s Fim\n%-10s Fim\n" "Variável" "Variavel" Variável Fim Variavel Fim $ printf "\n%-10s Fim\n%-10s Fim\n" "VariXvel" "Variavel" VariXvel Fim Variavel Fim The second test (changing the accented char 'á' by 'X'), shows that the problem/bug is a) more generic, not only in PostgreSQL and b) has todo with being the UTF-8 char 'á' a two byte char, while 'X' is only one byte. I have no solution, though at the moment... Obrigado matthias -- Matthias Apitz, ✉ guru@unixarea.de, http://www.unixarea.de/ +49-176-38902045 Public GnuPG key: http://www.unixarea.de/key.pub Без книги нет знания, без знания нет коммунизма (Влaдимир Ильич Ленин) Without books no knowledge - without knowledge no communism (Vladimir Ilyich Lenin) Sin libros no hay saber - sin saber no hay comunismo. (Vladimir Ilich Lenin)
On 2020-10-17 20:51:36 +0200, Matthias Apitz wrote: > El día sábado, octubre 17, 2020 a las 03:37:46p. m. -0300, Celso Lorenzetti escribió: > > > Somebody help me, please. > > > > How to make the texts are aligned with 10 characters? > > > > > > > > elog(INFO, "\n%-10s Fim\n%-10s Fim\n", "Variável", "Variavel"); Which programming language is this? PL/pgSQL? > Hola Celso, > > You can reproduce the same on the UNIX shell with: > > $ printf "\n%-10s Fim\n%-10s Fim\n" "Variável" "Variavel" > > Variável Fim > Variavel Fim Hmm. Zsh gets it right: trintignant:~ 0:31 :-) 1032% printf "\n%-10s Fim\n%-10s Fim\n" "Variável" "Variavel" Variável Fim Variavel Fim As do Perl and Python. > The second test (changing the accented char 'á' by 'X'), shows that the > problem/bug is a) more generic, not only in PostgreSQL and b) has todo > with being the UTF-8 char 'á' a two byte char, while 'X' is only one > byte. Yes, determining how much space a UTF-8 sequence occupies on screen is surprisingly hard. I'm not sure what the C standard says about that. But these days I would expect any programming language to get it right at least for the simple cases. hp -- _ | Peter J. Holzer | Story must make more sense than reality. |_|_) | | | | | hjp@hjp.at | -- Charles Stross, "Creative writing __/ | http://www.hjp.at/ | challenge!"
Attachment
"Peter J. Holzer" <hjp-pgsql@hjp.at> writes: > On 2020-10-17 20:51:36 +0200, Matthias Apitz wrote: >> El día sábado, octubre 17, 2020 a las 03:37:46p. m. -0300, Celso Lorenzetti escribió: >>> elog(INFO, "\n%-10s Fim\n%-10s Fim\n", "Variável", "Variavel"); > Which programming language is this? PL/pgSQL? Looks like C in the backend. >> The second test (changing the accented char 'á' by 'X'), shows that the >> problem/bug is a) more generic, not only in PostgreSQL and b) has todo >> with being the UTF-8 char 'á' a two byte char, while 'X' is only one >> byte. > Yes, determining how much space a UTF-8 sequence occupies on screen is > surprisingly hard. I'm not sure what the C standard says about that. But > these days I would expect any programming language to get it right at > least for the simple cases. Our version of snprintf intentionally counts bytes not characters, so that it does not have to make assumptions about what encoding the given string uses. It's somewhat unclear whether the C/POSIX standard mandates either of these interpretations. The GNU implementation of snprintf tries to count characters. But in the cases where that's mattered to us at all, it's generally been the wrong thing, because glibc didn't necessarily know the encoding to use. That's one reason why we stopped relying on libc's snprintf. The way to get this to work as Celso wishes would be to count characters and then do his own arithmetic about how much padding to add. regards, tom lane
Hi Peter, is language C standart.. On 2020-10-17 20:51:36 +0200, Matthias Apitz wrote: > El día sábado, octubre 17, 2020 a las 03:37:46p. m. -0300, Celso Lorenzetti escribió: > > > Somebody help me, please. > > > > How to make the texts are aligned with 10 characters? > > > > > > > > elog(INFO, "\n%-10s Fim\n%-10s Fim\n", "Variável", "Variavel"); Which programming language is this? PL/pgSQL? > Hola Celso, > > You can reproduce the same on the UNIX shell with: > > $ printf "\n%-10s Fim\n%-10s Fim\n" "Variável" "Variavel" > > Variável Fim > Variavel Fim Hmm. Zsh gets it right: trintignant:~ 0:31 :-) 1032% printf "\n%-10s Fim\n%-10s Fim\n" "Variável" "Variavel" Variável Fim Variavel Fim As do Perl and Python. > The second test (changing the accented char 'á' by 'X'), shows that > the problem/bug is a) more generic, not only in PostgreSQL and b) has > todo with being the UTF-8 char 'á' a two byte char, while 'X' is only > one byte. Yes, determining how much space a UTF-8 sequence occupies on screen is surprisingly hard. I'm not sure what the C standard says about that. But these days I would expect any programming language to get it right at least for the simple cases. hp -- _ | Peter J. Holzer | Story must make more sense than reality. |_|_) | | | | | hjp@hjp.at | -- Charles Stross, "Creative writing __/ | http://www.hjp.at/ | challenge!" -- Este email foi escaneado pelo Avast antivírus. https://www.avast.com/antivirus
Thanks Tom Lane, got it, I will follow the suggestion Att, -----Mensagem original----- De: Tom Lane [mailto:tgl@sss.pgh.pa.us] Enviada em: sábado, 17 de outubro de 2020 20:08 Para: Peter J. Holzer Cc: pgsql-general@lists.postgresql.org Assunto: Re: print formated special characteres "Peter J. Holzer" <hjp-pgsql@hjp.at> writes: > On 2020-10-17 20:51:36 +0200, Matthias Apitz wrote: >> El día sábado, octubre 17, 2020 a las 03:37:46p. m. -0300, Celso Lorenzetti escribió: >>> elog(INFO, "\n%-10s Fim\n%-10s Fim\n", "Variável", "Variavel"); > Which programming language is this? PL/pgSQL? Looks like C in the backend. >> The second test (changing the accented char 'á' by 'X'), shows that the >> problem/bug is a) more generic, not only in PostgreSQL and b) has todo >> with being the UTF-8 char 'á' a two byte char, while 'X' is only one >> byte. > Yes, determining how much space a UTF-8 sequence occupies on screen is > surprisingly hard. I'm not sure what the C standard says about that. But > these days I would expect any programming language to get it right at > least for the simple cases. Our version of snprintf intentionally counts bytes not characters, so that it does not have to make assumptions about what encoding the given string uses. It's somewhat unclear whether the C/POSIX standard mandates either of these interpretations. The GNU implementation of snprintf tries to count characters. But in the cases where that's mattered to us at all, it's generally been the wrong thing, because glibc didn't necessarily know the encoding to use. That's one reason why we stopped relying on libc's snprintf. The way to get this to work as Celso wishes would be to count characters and then do his own arithmetic about how much padding to add. regards, tom lane -- Este email foi escaneado pelo Avast antivírus. https://www.avast.com/antivirus
po 19. 10. 2020 v 3:18 odesílatel Celso Lorenzetti <celso@sysrs.com.br> napsal:
Hi Peter,
is language C standart..
On 2020-10-17 20:51:36 +0200, Matthias Apitz wrote:
> El día sábado, octubre 17, 2020 a las 03:37:46p. m. -0300, Celso
Lorenzetti escribió:
>
> > Somebody help me, please.
> >
> > How to make the texts are aligned with 10 characters?
> >
> >
> >
> > elog(INFO, "\n%-10s Fim\n%-10s Fim\n", "Variável", "Variavel");
Which programming language is this? PL/pgSQL?
This is an internal function available in internal C API.
As Tom Lane said in other mail - this internal API just doesn't support alignment for multibyte encodings.
Regards
Pavel
> Hola Celso,
>
> You can reproduce the same on the UNIX shell with:
>
> $ printf "\n%-10s Fim\n%-10s Fim\n" "Variável" "Variavel"
>
> Variável Fim
> Variavel Fim
Hmm. Zsh gets it right:
trintignant:~ 0:31 :-) 1032% printf "\n%-10s Fim\n%-10s Fim\n" "Variável"
"Variavel"
Variável Fim
Variavel Fim
As do Perl and Python.
> The second test (changing the accented char 'á' by 'X'), shows that
> the problem/bug is a) more generic, not only in PostgreSQL and b) has
> todo with being the UTF-8 char 'á' a two byte char, while 'X' is only
> one byte.
Yes, determining how much space a UTF-8 sequence occupies on screen is
surprisingly hard. I'm not sure what the C standard says about that. But
these days I would expect any programming language to get it right at least
for the simple cases.
hp
--
_ | Peter J. Holzer | Story must make more sense than reality.
|_|_) | |
| | | hjp@hjp.at | -- Charles Stross, "Creative writing
__/ | http://www.hjp.at/ | challenge!"
--
Este email foi escaneado pelo Avast antivírus.
https://www.avast.com/antivirus