Re: US Census database (Tiger 2004FE) - 4.4G - Mailing list pgsql-hackers

From Mark Woodward
Subject Re: US Census database (Tiger 2004FE) - 4.4G
Date
Msg-id 22672.24.91.171.78.1123190535.squirrel@mail.mohawksoft.com
Whole thread Raw
In response to Re: US Census database (Tiger 2004FE) - 4.4G  (Ron Mayer <rm_pg@cheapcomplexdevices.com>)
List pgsql-hackers
I thought bout it, but it isn't the best program around, but it does work.
My program also reformats numbers, i.e. long/lat become properly
decimal-ed numerics, zips become integers, etc.

The question is...

Do you download the raw data and convert it into a database, or do you
download the pre-formatted database?

I would say the preformated database is easier to manage. There are
hundreds of individual zips files, in each of those files 10 or so data
files.



> Mark Woodward wrote:
>> It is 4.4G in space in a gzip package.
>>
>> I'll mail a DVD to two people who promise to host it for Hackers.
>
> Would it be easier to release the program you did to do
> this conversion?
>
>
> I use this pretty short (274 line) C program:
>     http://www.forensiclogic.com/tmp/tgr2sql.c
> to convert the raw tiger files
> from http://www.census.gov/geo/www/tiger/index.html
> into SQL statements that can be loaded by postgresql.
>
> The #define SQL line controls if it makes data
> with INSERT statements or for COPY statements.
>
> ---------------------------(end of broadcast)---------------------------
> TIP 6: explain analyze is your friend
>



pgsql-hackers by date:

Previous
From: Marko Kreen
Date:
Subject: Re: openbsd, plpython, missing threading symbols
Next
From: Ian Burrell
Date:
Subject: Re: Solving the OID-collision problem