Re: popcount - Mailing list pgsql-hackers

From David Fetter
Subject Re: popcount
Date
Msg-id 20201230164108.GK13234@fetter.org
Whole thread Raw
In response to Re: popcount  ("Daniel Verite" <daniel@manitou-mail.org>)
Responses Re: popcount
List pgsql-hackers
On Wed, Dec 30, 2020 at 05:27:04PM +0100, Daniel Verite wrote:
>     David Fetter wrote:
> 
> +Datum
> +byteapopcount(PG_FUNCTION_ARGS)
> +{
> +    bytea    *t1 = PG_GETARG_BYTEA_PP(0);
> +    int        len, result;
> +
> +    len = VARSIZE_ANY_EXHDR(t1);
> +    result = pg_popcount(VARDATA_ANY(t1), len);
> +
> +    PG_RETURN_INT32(result);
> +}
> 
> The input may have more than 2 billion bits set to 1. The biggest possible
> result should be 8 billion for bytea (1 GB with all bits set to 1).
> So shouldn't this function return an int8?

It does now, and thanks for looking at this.

> There is a pre-existing function in core: bit_length(bytea) returning int4,
> but it errors out for large values (in fact it's a SQL function that returns
> octet_length($1)*8).
> 
> For the varbit case, it doesn't seem like it can hold so many bits, although
> I don't see the limit documented in
> https://www.postgresql.org/docs/current/datatype-bit.html

Out of an abundance of caution, I changed that one, too.

I'd noticed earlier that ceil() doesn't need to be quite as wasteful
as the way I had it earlier, and fixed that with help from Andrew
Gierth.

Best,
David.
-- 
David Fetter <david(at)fetter(dot)org> http://fetter.org/
Phone: +1 415 235 3778

Remember to vote!
Consider donating to Postgres: http://www.postgresql.org/about/donate

Attachment

pgsql-hackers by date:

Previous
From: "Daniel Verite"
Date:
Subject: Re: popcount
Next
From: Alexey Kondratov
Date:
Subject: Re: [PATCH] postgres_fdw connection caching - cause remote sessions linger till the local session exit