Re: Introducing the TPC-V benchmark, and its relationship to PostgreSQL - Mailing list pgsql-performance

From Reza Taheri
Subject Re: Introducing the TPC-V benchmark, and its relationship to PostgreSQL
Date
Msg-id 66CE997FB523C04E9749452273184C6C137CB88B48@exch-mbx-113.vmware.com
Whole thread Raw
In response to Re: Introducing the TPC-V benchmark, and its relationship to PostgreSQL  (Craig Ringer <ringerc@ringerc.id.au>)
Responses Re: Introducing the TPC-V benchmark, and its relationship to PostgreSQL
List pgsql-performance
Thanks for reply, Craig. As far as publishing a draft, we are planning to do something along those lines.

For the schema and the queries, we are pretty much taking those wholesale from TPC-E, whose specification is public
(http://www.tpc.org/tpce/spec/v1.12.0/TPCE-v1.12.0.pdf).The high-level differences with TPC-E are detailed in the 2010
and2012 TPC TC papers I mentioned. We will stick closely to the TPC-E schema and queries. Anything new means a long
specificationwriting process, which we are trying to avoid. We want to get this benchmark out there quickly. 

I am not an expert in licensing. What I can tell you is that the kit will be available to anyone to download and use
witha simple EULA based on existing TPC EULAs (although TPC hasn't had a complete end-to-end kit before, it has
publishedpartial code modules for its benchmarks). We broached the idea of open sourcing the kit, but it didn't pan
out.The people on the subcommittee represent their companies, and different companies have different rules when their
employeescontribute to open source code.  Satisfying the armies of lawyers would have been impossible. So the kit won't
beopen source, but readily available for use. It will probably be similar to the licensing for SPEC benchmarks if you
arefamiliar with them. 

I'll pick up Greg's book. We had been focusing on functionality, but our focus will shift to performance soon. To be
blunt,the team is very experienced in benchmarks and in database performance, but most of us are new to PGSQL. 

Thanks,
Reza

> -----Original Message-----
> From: Craig Ringer [mailto:ringerc@ringerc.id.au]
> Sent: Tuesday, July 03, 2012 10:19 PM
> To: pgsql-performance@postgresql.org
> Cc: Reza Taheri; Andy Bond (abond@redhat.com); Greg Kopczynski; Jignesh
> Shah; Greg Smith; Dave Page
> Subject: Re: [PERFORM] Introducing the TPC-V benchmark, and its
> relationship to PostgreSQL
>
> On 07/04/2012 07:08 AM, Reza Taheri wrote:
>
> > ... so the subcommittee moved forward with developing its own
> > reference kit. The reference kit has been developed to run on
> > PostgreSQL, and we are focusing our development efforts and testing on
> > PostgreSQL.
> That's a very positive step. The TPC seems to me to have a pretty poor
> reputation among open source database users and vendors. I think that's
> largely because the schema and tools are typically very closed and
> restrictively licensed, though the prohibition against publishing benchmarks
> by big commercial vendors doesn't help.
>
> This sounds like a promising change. The TPC benchmarks are really good for
> load-testing and regression testing, so having one that's directly PostgreSQL
> friendly will be a big plus, especially if it is appropriately licensed.
>
> The opportunity to audit the schema, queries, and test setup before the
> tool is finalized would certainly be appealing. What can you publish in draft
> form now?
>
> What license terms does the TPC plan to release the schema, queries, and
> data for TPC-V under?
>
> I've cc'd Greg Smith and Dave Page, both of whom I suspect will be
> interested in this development but could easily miss your message. If you
> haven't read Greg' book "PostgreSQL High Performance" it's probably a good
> idea to do so.
>
> --
> Craig Ringer


pgsql-performance by date:

Previous
From: Claudio Freire
Date:
Subject: Re: The need for clustered indexes to boost TPC-V performance
Next
From: Mark Kirkwood
Date:
Subject: Re: Introducing the TPC-V benchmark, and its relationship to PostgreSQL