Home > mailing lists

Re: Using Postgres to store high volume streams of sensor readings - Mailing list pgsql-general

From	Ciprian Dorin Craciun
Subject	Re: Using Postgres to store high volume streams of sensor readings
Date	November 23, 2008 02:35:02
Msg-id	8e04b5820811222234p3afd26aav85b915093a48c031@mail.gmail.com Whole thread Raw
In response to	Re: Using Postgres to store high volume streams of sensor readings (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: Using Postgres to store high volume streams of sensor readings
List	pgsql-general

Tree view

On Sun, Nov 23, 2008 at 1:02 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Alvaro Herrera <alvherre@commandprompt.com> writes:
>> The problem is, most likely, on updating the indexes.  Heap inserts
>> should always take more or less the same time, but index insertion
>> requires walking down the index struct for each insert, and the path to
>> walk gets larger the more data you have.
>
> It's worse than that: his test case inserts randomly ordered keys, which
> means that there's no locality of access during the index updates.  Once
> the indexes get bigger than RAM, update speed goes into the toilet,
> because the working set of index pages that need to be touched also
> is bigger than RAM.  That effect is going to be present in *any*
> standard-design database, not just Postgres.
>
> It's possible that performance in a real-world situation would be
> better, if the incoming data stream isn't so random; but it's
> hard to tell about that with the given facts.
>
> One possibly useful trick is to partition the data by timestamp with
> partition sizes chosen so that the indexes don't get out of hand.
> But the partition management might be enough of a PITA to negate
> any win.
>
>                        regards, tom lane

    Thanks for your feedback! This is just as I supposed, but i didn't
had the Postgres experience to be certain.
    I'll include your conclusion to my report.

    Ciprian Craciun.

pgsql-general by date:

From: Bruce Momjian
Date: 22 November 2008, 23:08:08
Subject: Re: Postgres mail list traffic over time

From: "Ciprian Dorin Craciun"
Date: 23 November 2008, 02:35:54
Subject: Re: Using Postgres to store high volume streams of sensor readings

Re: Using Postgres to store high volume streams of sensor readings - Mailing list pgsql-general

Previous

Next