Re: tuple radix sort - Mailing list pgsql-hackers

From	Chao Li
Subject	Re: tuple radix sort
Date	December 3 11:22:13
Msg-id	606C775A-4C1A-482B-BE7D-2E7A46AE14B9@gmail.com
In response to	Re: tuple radix sort (John Naylor <johncnaylorls@gmail.com>)
List	pgsql-hackers

Hi John,

I played with this again today and found an optimization that seems to improve performance dramatically (roughly 2x in my test):

```
+static void
+radix_sort_tuple(SortTuple *begin, size_t n_elems, int level, Tuplesortstate *state)
+{
+    RadixPartitionInfo partitions[256] = {0};
+    uint8_t        remaining_partitions[256] = {0};
```

Here partitions and remaining_partitions are just temporary buffers. Allocating them on the stack and zero-initializing them on every call seems slow. Passing them in as function parameters is much faster. See the attached diff for my change.
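
To illustrate the idea in isolation, here is a minimal standalone sketch. It is not the attached diff and does not use the patch's types: it sorts plain uint32_t keys, and RadixScratch, radix_sort_u32 and sort_u32 are hypothetical names made up for this example. The scratch arrays live in one caller-owned struct with a slot per recursion level, so the recursive function never declares 256-entry arrays in its own frame. Note that this sketch still clears the counters for its level on each call; it only shows the buffers being hoisted out of the recursive function, and whether that helps will depend on the compiler and the workload.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define RADIX_LEVELS 4			/* one level per byte of a 32-bit key */

/* Hypothetical scratch space: one slot per recursion level. */
typedef struct RadixScratch
{
	size_t		counts[RADIX_LEVELS][256];	/* bucket sizes */
	size_t		next[RADIX_LEVELS][256];	/* next free slot per bucket */
} RadixScratch;

/* MSD radix sort, most significant byte first, buffers passed in. */
static void
radix_sort_u32(uint32_t *a, size_t n, int level, RadixScratch *scratch)
{
	size_t	   *counts;
	size_t	   *next;
	int			shift;
	size_t		offset = 0;

	assert(level >= 0 && level < RADIX_LEVELS);
	if (n < 2)
		return;

	counts = scratch->counts[level];
	next = scratch->next[level];
	shift = 8 * (RADIX_LEVELS - 1 - level);

	/* Count how many keys fall into each bucket at this level. */
	memset(counts, 0, sizeof(scratch->counts[level]));
	for (size_t i = 0; i < n; i++)
		counts[(a[i] >> shift) & 0xFF]++;

	/* Turn the counts into the start position of each bucket. */
	for (int b = 0; b < 256; b++)
	{
		next[b] = offset;
		offset += counts[b];
	}

	/* Permute in place: swap each key into the bucket it belongs to. */
	offset = 0;
	for (int b = 0; b < 256; b++)
	{
		size_t		end = offset + counts[b];

		while (next[b] < end)
		{
			uint32_t	v = a[next[b]];
			int			d = (v >> shift) & 0xFF;

			if (d == b)
				next[b]++;
			else
			{
				a[next[b]] = a[next[d]];
				a[next[d]++] = v;
			}
		}
		offset = end;
	}

	/* Recurse into each bucket; children only touch the slot for level + 1. */
	if (level + 1 < RADIX_LEVELS)
	{
		offset = 0;
		for (int b = 0; b < 256; b++)
		{
			radix_sort_u32(a + offset, counts[b], level + 1, scratch);
			offset += counts[b];
		}
	}
}

/* Entry point: the scratch space is set up once, not once per recursive call. */
static void
sort_u32(uint32_t *a, size_t n)
{
	RadixScratch scratch;

	radix_sort_u32(a, n, 0, &scratch);
}
```

The per-level slots matter because a parent still needs its bucket sizes while its children run; a single shared pair of arrays would be overwritten by the recursive calls.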

v5 patch (by the way, v5 is much faster than v1):
```
evantest=# select * from test_multi order by category, name;
Time: 299.912 ms
evantest=# select * from test_multi order by category, name;
Time: 298.793 ms
evantest=# select * from test_multi order by category, name;
Time: 300.306 ms
evantest=# select * from test_multi order by category, name;
Time: 302.140 ms
```

v5 + my change:
```
evantest=# select * from test_multi order by category, name;
Time: 152.572 ms
evantest=# select * from test_multi order by category, name;
Time: 143.296 ms
evantest=# select * from test_multi order by category, name;
Time: 151.606 ms
```

The test I ran today is the same high-cardinality first-column test I ran before:
```
drop table if exists test_multi;
create unlogged table test_multi (category int, name text);
insert into test_multi select (random() * 1000000)::int as category, md5(random()::text) || md5(random()::text) as name
from generate_series(1, 1000000);
vacuum freeze test_multi;
\timing on
\o /dev/null
set wip_radix_sort = 'on';
set work_mem = '2GB';
select * from test_multi order by category, name;
```

Best regards,
--
Chao Li (Evan)
HighGo Software Co., Ltd.
https://www.highgo.com/

Attachment

tuplesort_chaoli.diff
