Re: Allow multi-byte characters as escape in SIMILAR TO and SUBSTRING - Mailing list pgsql-hackers

From	Heikki Linnakangas
Subject	Re: Allow multi-byte characters as escape in SIMILAR TO and SUBSTRING
Date	August 25, 2014 17:52:15
Msg-id	53FB4D87.7040507@vmware.com
In response to	Re: Allow multi-byte characters as escape in SIMILAR TO and SUBSTRING (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: Allow multi-byte characters as escape in SIMILAR TO and SUBSTRING
List	pgsql-hackers

On 08/25/2014 04:48 PM, Tom Lane wrote:
> Heikki Linnakangas <hlinnakangas@vmware.com> writes:
>> On 07/12/2014 05:16 AM, Jeff Davis wrote:
>>> I was able to see about a 2% increase in runtime when using the
>>> similar_escape function directly. I made a 10M tuple table and did:
>>>
>>> explain analyze
>>> select
>>>
similar_escape('Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£Î£','#')
fromt;
 
>>>
>>> which was the worst reasonable case I could think of. (It appears that
>>> selecting from a table is faster than from generate_series. I'm curious
>>> what you use when testing the performance of an individual function at
>>> the SQL level.)
>
>> A large table like that is what I usually do. A large generate_series()
>> spends a lot of time building the tuplestore, especially if it doesn't
>> fit in work_mem and spills to disk. Sometimes I use this to avoid it:
>
>> explain analyze
>> select
>> similar_escape('ΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣ','#')
>> from generate_series(1, 10000) a, generate_series(1,1000);
>
>> although in my experience it still has somewhat more overhead than a
>> straight seqscan because.
>
> [ scratches head... ]  Surely similar_escape is marked immutable, and
> will therefore be executed exactly once in either of these formulations,
> because the planner will fold the expression to a constant.

Yeah, just noticed that myself..

- Heikki
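
A minimal sketch of one way to avoid the constant-folding problem Tom points out. It assumes a hypothetical table t with a text column pat (the column name and the string length are illustrative and not taken from the thread). Passing a column value instead of a literal makes the argument non-constant, so the planner cannot pre-evaluate the immutable function and it should run once per row:

```sql
-- Hypothetical setup: the column name "pat" and the length 64 are assumptions.
create table t as
  select repeat('Σ', 64) as pat
  from generate_series(1, 10000000);

-- The argument is a column reference, not a constant, so the call
-- cannot be folded at plan time and is evaluated for every row.
explain analyze
select similar_escape(pat, '#')
from t;

-- For comparison: with literal arguments, the Output line of EXPLAIN VERBOSE
-- should show an already-computed text constant instead of a function call.
explain verbose
select similar_escape('ΣΣΣ', '#')
from t;
```

Comparing the two plans is a quick way to check whether a micro-benchmark is actually exercising the function.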
