Re: Allow multi-byte characters as escape in SIMILAR TO and SUBSTRING - Mailing list pgsql-hackers

From	Heikki Linnakangas
Subject	Re: Allow multi-byte characters as escape in SIMILAR TO and SUBSTRING
Date	August 25, 2014 13:01:30
Msg-id	53FB3399.2070802@vmware.com
In response to	Re: Allow multi-byte characters as escape in SIMILAR TO and SUBSTRING (Jeff Davis <pgsql@j-davis.com>)
Responses	Re: Allow multi-byte characters as escape in SIMILAR TO and SUBSTRING
List	pgsql-hackers

On 07/12/2014 05:16 AM, Jeff Davis wrote:
> I was able to see about a 2% increase in runtime when using the
> similar_escape function directly. I made a 10M tuple table and did:
>
>      explain analyze
>        select
> similar_escape('ΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣ','#') from t;
>
> which was the worst reasonable case I could think of. (It appears that
> selecting from a table is faster than from generate_series. I'm curious
> what you use when testing the performance of an individual function at
> the SQL level.)

A large table like that is what I usually do. A large generate_series() 
spends a lot of time building the tuplestore, especially if it doesn't 
fit in work_mem and spills to disk. Sometimes I use this to avoid it:

     explain analyze
       select similar_escape('ΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣ','#')
       from generate_series(1, 10000) a, generate_series(1,1000);

although in my experience it still has somewhat more overhead than a 
straight seqscan because.

- Heikki
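
For readers who want to reproduce the table-based measurement discussed above, here is a minimal sketch. The thread only says that a 10M tuple table named t was used; how it was built and the column name "i" are assumptions made for illustration.

```sql
-- Hypothetical setup: build a 10M-row table named t.
-- The single integer column "i" is an assumption; the original
-- message does not say what the table contained.
create table t as
  select g as i from generate_series(1, 10000000) g;

-- Evaluate the function once per row of a plain sequential scan,
-- so that most of the measured time is spent in similar_escape().
explain analyze
  select similar_escape('ΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣ','#')
  from t;
```

Running the second statement several times and comparing runtimes before and after a patch should give roughly the kind of comparison described in the quoted message, though absolute numbers will depend on hardware and configuration.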
