Re: Gsoc2012 Idea --- Social Network database schema - Mailing list pgsql-hackers

From Qi Huang
Subject Re: Gsoc2012 Idea --- Social Network database schema
Date
Msg-id BAY159-W23CA8E42EE8CE5B1975B6CA3400@phx.gbl
In response to Re: Gsoc2012 Idea --- Social Network database schema  (Andrew Dunstan <andrew@dunslane.net>)
List pgsql-hackers

> Date: Wed, 21 Mar 2012 11:00:59 -0400
> From: andrew@dunslane.net
> To: alvherre@commandprompt.com
> CC: tgl@sss.pgh.pa.us; robertmhaas@gmail.com; huangqiyx@hotmail.com; neil.conway@gmail.com; daniel@heroku.com; josh@agliodbs.com; pgsql-hackers@postgresql.org
> Subject: Re: [HACKERS] Gsoc2012 Idea --- Social Network database schema
>
>
>
> On 03/21/2012 10:47 AM, Alvaro Herrera wrote:
> > Excerpts from Tom Lane's message of Wed Mar 21 11:35:54 -0300 2012:
> >
> >> Now that would all be fine if this were a widely-desired feature, but
> >> AFAIR the user demand for it has been about nil. So I'm leaning to
> >> the position that we don't want it.
> > I disagree with there being zero interest ... the "order by random()"
> > stuff does come up occasionally.
> >
>
> Presumably the reason that's not good enough is that it scans the whole
> table (as well as being non-portable)? Maybe we could find some less
> invasive way of avoiding that.
>
> cheers
>
> andrew


Thanks for the discussion and ideas.
From what I have checked, MS SQL Server and DB2 already implement TABLESAMPLE. At the least, it is useful for quickly retrieving a sample from a large dataset. I expect the clause itself would be much faster than "order by random()", since it could avoid scanning the whole table.
As for the implementation, will the code change really be very large? The general structure should still be about the same, right?
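
To make the speed comparison between "order by random()" and a sampling clause concrete, here is a rough toy model in Python. It is only a sketch under stated assumptions: the table is modeled as a list of fixed-size pages, and the helper names (make_table, order_by_random, block_sample) are made up for illustration. It is not PostgreSQL code and not how any particular database implements TABLESAMPLE; it only counts how many rows each approach has to read.

```python
import random


def make_table(n_pages, rows_per_page):
    """Hypothetical table: a list of pages, each page a list of row ids."""
    return [
        [p * rows_per_page + r for r in range(rows_per_page)]
        for p in range(n_pages)
    ]


def order_by_random(pages, n, rng):
    """Model of ORDER BY random() LIMIT n.

    Every row is read and given a random sort key, so the whole
    table is scanned no matter how small n is.
    """
    rows_read = 0
    keyed = []
    for page in pages:
        for row in page:
            rows_read += 1
            keyed.append((rng.random(), row))
    keyed.sort(key=lambda pair: pair[0])
    return [row for _, row in keyed[:n]], rows_read


def block_sample(pages, fraction, rng):
    """Model of block-level sampling (an assumption, not a real API).

    Each page is kept with probability `fraction`; rows are read
    only from the pages that were kept.
    """
    rows_read = 0
    sample = []
    for page in pages:
        if rng.random() < fraction:
            rows_read += len(page)
            sample.extend(page)
    return sample, rows_read


if __name__ == "__main__":
    rng = random.Random(2012)
    table = make_table(n_pages=1000, rows_per_page=100)

    _, full_reads = order_by_random(table, 100, rng)
    sample, block_reads = block_sample(table, 0.001, rng)

    print("order by random(): rows read =", full_reads)
    print("block sampling:    rows read =", block_reads,
          "sample size =", len(sample))
```

In this model the random-sort approach always reads every row, while block sampling reads only the pages it picks. The trade-off is that rows from the same page always arrive together, so the sample is less uniform than a true row-level random sample and its size varies from run to run.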

Best Regards and Thanks
Huang Qi Victor
Computer Science of National University of Singapore
