Thread: Data sets for download

Data sets for download

From
Jayadevan M
Date:

Hello all,

Does anyone know of reasonably-sized data dumps (csv or excel or xml..) that can be used for learning/teaching about performance tuning. Say – a set of 6-7 tables, may be two of them with a few million records etc? Total data volume would be in a few GB range. There are tools which generate data, but most of them seem to generate junk data. I came across this one (pretty good) –

http://www.ourairports.com/data/

If there were schedule and bookings tables to go with this, it would have been great.

There is http://www.imdb.com/interfaces also. But the data extraction process not simple.

Anything similar – the typical warehouse/customer/order tables or emp/dept/project ?

Regards,

Jayadevan

 



DISCLAIMER: "The information in this e-mail and any attachment is intended only for the person to whom it is addressed and may contain confidential and/or privileged material. If you have received this e-mail in error, kindly contact the sender and destroy all copies of the original communication. IBS makes no warranty, express or implied, nor guarantees the accuracy, adequacy or completeness of the information contained in this email or any attachment and is not liable for any errors, defects, omissions, viruses or for resultant loss or damage, if any, direct or indirect."

Re: Data sets for download

From
Thomas Kellerer
Date:
Jayadevan M, 25.10.2012 05:15:
> There are tools which generate data, but most of them seem to
> generate junk data.

Have a look a Benerator. It can create quite reasonable test data (e.g. valid addresses, "real" looking names and so
on).

It has a bit steep learning curve, but I'm quite happy with the results
http://databene.org/databene-benerator


Another option might be the Dell DVD Store Loadtest:
http://linux.dell.com/dvdstore/

It can generate testdata with a specific scale and it works well with Postgres.

Regards
Thomas

Re: Data sets for download

From
Jayadevan M
Date:
>Have a look a Benerator. It can create quite reasonable test data (e.g. valid
>addresses, "real" looking names and so on).
>
>It has a bit steep learning curve, but I'm quite happy with the results
>http://databene.org/databene-benerator
>
>
>Another option might be the Dell DVD Store Loadtest:
>http://linux.dell.com/dvdstore/
>
>It can generate testdata with a specific scale and it works well with Postgres.
>
Thank you. Will try these.
Regards,
Jayadevan



DISCLAIMER:   "The information in this e-mail and any attachment is intended only for the person to whom it is
addressedand may contain confidential and/or privileged material. If you have received this e-mail in error, kindly
contactthe sender and destroy all copies of the original communication. IBS makes no warranty, express or implied, nor
guaranteesthe accuracy, adequacy or completeness of the information contained in this email or any attachment and is
notliable for any errors, defects, omissions, viruses or for resultant loss or damage, if any, direct or indirect."  

Re: Data sets for download

From
Thomas Boussekey
Date:
Hi,

I'm using Dell DVD store for training purposes, and I met some problems with it!
Once they are corrected it works well (except the load test config on my environment, problem encountered with a RSA fingerprint!)

The following slideshow tracks down the problems:

Have fun,


-- Thomas BOUSSEKEY


2012/10/25 Jayadevan M <jayadevan.maymala@ibsplc.com>
>Have a look a Benerator. It can create quite reasonable test data (e.g. valid
>addresses, "real" looking names and so on).
>
>It has a bit steep learning curve, but I'm quite happy with the results
>http://databene.org/databene-benerator
>
>
>Another option might be the Dell DVD Store Loadtest:
>http://linux.dell.com/dvdstore/
>
>It can generate testdata with a specific scale and it works well with Postgres.
>
Thank you. Will try these.
Regards,
Jayadevan



DISCLAIMER:   "The information in this e-mail and any attachment is intended only for the person to whom it is addressed and may contain confidential and/or privileged material. If you have received this e-mail in error, kindly contact the sender and destroy all copies of the original communication. IBS makes no warranty, express or implied, nor guarantees the accuracy, adequacy or completeness of the information contained in this email or any attachment and is not liable for any errors, defects, omissions, viruses or for resultant loss or damage, if any, direct or indirect."
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general