
Re: Fast insertion indexes: why no developments - Mailing list pgsql-hackers

From	Simon Riggs
Subject	Re: Fast insertion indexes: why no developments
Date	November 5, 2013 10:49:09
Msg-id	CA+U5nMJpR+_aHdXowrd2L5OFTrcRLWx9_umS+SGw=MfLRkVjpQ@mail.gmail.com
In response to	Re: Fast insertion indexes: why no developments (Yann Fontana <yann.fontana@gmail.com>)
Responses	Re: Fast insertion indexes: why no developments
List	pgsql-hackers


On 30 October 2013 14:34, Yann Fontana <yann.fontana@gmail.com> wrote:
>
>
>> On 30 October 2013 11:23, Leonardo Francalanci <m_lists@yahoo.it> wrote:
>>
>> >> In terms of generality, do you think it's worth a man-year of developer
>> >> effort to replicate what you have already achieved? Who would pay?
>
>
> I work on an application that does exactly what Leonardo described. We hit
> the same problem and came up with exactly the same solution (down to the
> 15-minute interval). But I have also worked on various other
> datamarts (all using Oracle), and they are all subject to this problem in
> some form: B-tree indexes slow down bulk data inserts too much and need to
> be disabled or dropped and then recreated after the load. In some cases this
> is done easily enough, in others it's more complicated (example: every day,
> a process imports from 1 million to 1 billion records into a table partition
> that may contain from 0 to 1 billion records. To be as efficient as
> possible, you need some logic to compare the number of rows to insert to the
> number of rows already present, in order to decide whether to drop the
> indexes or not).
>
> Basically, my point is that this is a common problem for datawarehouses and
> datamarts. In my view, indexes that don't require developers to work around
> poor insert performance would be a significant feature in a
> "datawarehouse-ready" DBMS.
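
The row-count comparison described above could be sketched roughly as follows. This is only an illustration: the function name `should_drop_indexes` and the 0.5 ratio threshold are hypothetical and do not come from any of the applications mentioned; a real cutoff would have to be measured on the actual table and hardware.

```python
def should_drop_indexes(existing_rows, incoming_rows, ratio_threshold=0.5):
    """Decide whether to drop indexes before a bulk load and rebuild them after.

    existing_rows:   rows already in the target partition
    incoming_rows:   rows about to be loaded
    ratio_threshold: hypothetical cutoff; above it a full rebuild is assumed
                     to be cheaper than maintaining the indexes row by row
    """
    if incoming_rows <= 0:
        # Nothing to load, so there is no reason to touch the indexes.
        return False
    if existing_rows == 0:
        # Empty partition: building the indexes after the load is always a fresh build.
        return True
    return incoming_rows / existing_rows >= ratio_threshold


# Small load into a large partition: keep the indexes in place.
print(should_drop_indexes(1_000_000_000, 1_000_000))
# Large load into a small partition: drop and recreate.
print(should_drop_indexes(1_000_000, 1_000_000_000))
```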


Everybody on this thread is advised to look closely at MinMax indexes
before starting any further work.

MinMax indexes will give us access to many new kinds of plan, and with
regard to inserts they are about as close to perfectly efficient, by
which I mean almost zero overhead, as it is possible to get.
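
As a rough illustration of why insert overhead can be so low for this kind of index, here is a toy model that keeps one (min, max) pair per fixed-size range of rows. It is a sketch under assumptions, not the PostgreSQL implementation: the class name `MinMaxSummary`, the `rows_per_range` parameter, and the append-only row layout are all hypothetical.

```python
class MinMaxSummary:
    """Toy min/max summary: one [min, max] pair per fixed-size range of rows."""

    def __init__(self, rows_per_range=4):
        self.rows_per_range = rows_per_range
        self.ranges = []      # ranges[i] = [min_value, max_value] for range i
        self.row_count = 0    # rows appended so far

    def insert(self, value):
        """Append a row; only the summary of the last range is touched (O(1))."""
        idx = self.row_count // self.rows_per_range
        if idx == len(self.ranges):
            # First row of a new range starts a fresh summary.
            self.ranges.append([value, value])
        else:
            summary = self.ranges[idx]
            if value < summary[0]:
                summary[0] = value
            if value > summary[1]:
                summary[1] = value
        self.row_count += 1

    def candidate_ranges(self, lo, hi):
        """Return the ranges whose [min, max] overlaps the query interval [lo, hi]."""
        return [i for i, (mn, mx) in enumerate(self.ranges)
                if mn <= hi and mx >= lo]


summary = MinMaxSummary(rows_per_range=4)
for v in range(1, 13):          # values arrive roughly in order, as in a bulk load
    summary.insert(v)
print(summary.candidate_ranges(6, 7))
```

Each insert compares the new value against one small summary instead of descending and possibly splitting a B-tree, which is where the low overhead comes from. The trade-off is that a lookup only narrows the search to candidate ranges, which must still be scanned.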

--
 Simon Riggs                   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services
