Re: Regex performance issue - Mailing list pgsql-performance

From Dave Dutcher
Subject Re: Regex performance issue
Msg-id 030b01c71651$984577c0$8300a8c0@tridecap.com
In response to Regex performance issue  ("Alexandru Coseru" <alexandru.coseru@totaltelecom.ro>)
List pgsql-performance
> -----Original Message-----
> From: pgsql-performance-owner@postgresql.org On Behalf Of Alexandru Coseru
> asterisk=> explain analyze SELECT * FROM destlist WHERE '0039051248787' ~ prefix AND id_ent='-2' AND dir=0 ORDER by length(prefix) DESC;
>
>                                QUERY PLAN
> ------------------------------------------------------------------------
>  Sort  (cost=7925.07..7925.15 rows=31 width=67) (actual time=857.715..857.716 rows=2 loops=1)
>    Sort Key: length((prefix)::text)
>    ->  Bitmap Heap Scan on destlist  (cost=60.16..7924.30 rows=31 width=67) (actual time=2.156..857.686 rows=2 loops=1)
>          Recheck Cond: ((id_ent = -2) AND (dir = 0))
>          Filter: ('0039051248787'::text ~ (prefix)::text)
>          ->  Bitmap Index Scan on destlist_indx2  (cost=0.00..60.16 rows=6193 width=0) (actual time=1.961..1.961 rows=5205 loops=1)
>                Index Cond: ((id_ent = -2) AND (dir = 0))
>  Total runtime: 857.804 ms
> (8 rows)
>
>
>     "mmumu" btree (prefix varchar_pattern_ops)
>

I'm surprised Postgres isn't using the index on prefix, seeing as the index
uses the varchar_pattern_ops operator class.  It could be that the index
isn't selective enough, or is Postgres not able to use an index with POSIX
regular expressions?  The docs seem to say that it can, but I'd be curious
to see what happens if you use LIKE instead of ~.
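
Something along these lines might be worth a try.  This is only an untested
sketch: it assumes prefix holds plain literal prefixes with no regex
metacharacters, which may not match your data, and the appended '%' is an
assumption made to keep the "starts with prefix" meaning under LIKE.

```sql
-- Untested sketch: same query as above with LIKE in place of ~.
-- Assumption: prefix contains literal digit strings only (no regex syntax).
-- Assumption: '%' is appended so that LIKE still means "starts with prefix".
EXPLAIN ANALYZE
SELECT *
FROM destlist
WHERE '0039051248787' LIKE prefix || '%'
  AND id_ent = '-2'
  AND dir = 0
ORDER BY length(prefix) DESC;
```

Comparing the Filter line and the runtime of this plan with the one above
would show whether the choice of operator makes any difference here.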

Dave
