Thread: Enums again

Enums again

From

"Kaare Rasmussen"

Date:

08 November 2005, 03:41:10

I've been going through the thread that Andrew Dunstan started with his 
enumkit. 

Maybe I missed it, but I didn't see any conclusion. If I want to design an 
Open Source system now that may be in beta in three to six months and I'd 
like to use enums, is this a good place to look? 

I guess I'm wondering about the kit going into PgFoundry, being accepted in 
the main branch or being dropped. And the timeframe for all that.

Re: Enums again

From

Christopher Kings-Lynne

Date:

08 November 2005, 03:58:59

> Maybe I missed it, but I didn't see any conclusion. If I want to design 
> an Open Source system now that may be in beta in three to six months and 
> I'd like to use enums, is this a good place to look?

There's no way you're going to be using enums.

> I guess I'm wondering about the kit going into PgFoundry, being accepted 
> in the main branch or being dropped. And the timeframe for all that.

Time frame is more like 18 months.  The kit is also very rough - not 
like a finished feature would be like at all.  Plus, there's no 
guarantee the feature would ever make it into postgres.

Just don't use enums - they're awful.

Chris

Re: Enums again

From

"Kaare Rasmussen"

Date:

08 November 2005, 04:10:22

> Just don't use enums - they're awful.

In general? 

So, instead of using enums for order states or originating system, I'll user 
numbers or text? Or implement lookup tables ?

Re: Enums again

From

Christopher Kings-Lynne

Date:

08 November 2005, 04:18:37

> So, instead of using enums for order states or originating system, I'll 
> user numbers or text? Or implement lookup tables ?

Use a text field and a CHECK constraint if you have just a couple of 
states, and a lookup table if you have many.

Always use a lookup table if you plan on adding new states regularly.

Chris

Re: Enums again

From

Andrew Dunstan

Date:

08 November 2005, 09:46:42

Christopher Kings-Lynne wrote:

>> Maybe I missed it, but I didn't see any conclusion. If I want to 
>> design an Open Source system now that may be in beta in three to six 
>> months and I'd like to use enums, is this a good place to look?
>
>
> There's no way you're going to be using enums.
>
>> I guess I'm wondering about the kit going into PgFoundry, being 
>> accepted in the main branch or being dropped. And the timeframe for 
>> all that.
>
>
> Time frame is more like 18 months.  The kit is also very rough - not 
> like a finished feature would be like at all.  Plus, there's no 
> guarantee the feature would ever make it into postgres.
>
> Just don't use enums - they're awful.
>
>

Well, perhaps I should speak ;-)

First, enumkit is not intended for pgfoundry or contrib. It's a 
byproduct of some research I was doing. It occurred to me that making 
what I had done generic would be very simple - the only variables in 
fact are the type name and the list of enumeration values. So I spent 30 
minutes making it generic and enumkit was born.

So of course it's rough - it is a research workproduct, not finished code.

Now, my intention is to use that work product as part of allowing first 
class enumeration types. Currently the proposal is till in my head, but 
basically it would involve a fairly small set of changes. There would 
(probably) need to be a new unreserved keyword plus a new rule set in 
the grammar to allow for type creation, In the catalog, pg_type would 
get a new column of type text[] that would hold the list of values, and 
typtype would have a new possible value of 'e' for enumeration. There 
might be other consequential changes too, but I think that would be most 
of it. The only functions that actually need to have any knowledge of 
the enumeration strings are the input/output functions and the to/from 
text casts. These would get the relevant info from fcinfo.flinfo ... and 
then looking up the type cache - not sure yet if an extra cache 
operation is needed. I haven't yet worked out how to build the qsorted 
table that enumkit uses for bsearch lookup on input, or even whether it 
is worth doing. And I haven't done a line of code. Like I say it's all 
in my head right now.

As for the timeframe - if this direction is acceptable I want to get it 
into 8.2. I really hope that 8.2 is not 18 months away. I think we 
should aim for a release cycle no longer than a year. This last cycle 
worked pretty well, and I think we should try to repeat it. But within 6 
months ... no.

So ... since Kaare asks, would a feature along the lines I outlined 
above be acceptable? And does anyone have an alternative proposal that 
they are prepared to work on?

If the answer to both of these is "no", then I will probably produce a 
patch at some stage that would be hosted on pgfoundry. But that would be 
far from ideal.

BTW, does the standard have a way of doing this? ISTR hearing something 
about distinct types. If so, what should the grammar look like, and can 
we use it (or something close)? I'd rather not just invent syntax freely.

Also, Christopher - I was somewhat motivated to work on this by your 
recent comment about enums being the number one demand of migrating 
MySQL users, so I am mildly amused by your last sentence ;-)

cheers

andrew

Re: Enums again

From

Alvaro Herrera

Date:

08 November 2005, 10:01:21

Andrew Dunstan wrote:

> In the catalog, pg_type would 
> get a new column of type text[] that would hold the list of values, and 
> typtype would have a new possible value of 'e' for enumeration. There 
> might be other consequential changes too, but I think that would be most 
> of it.

Huh, why not have the actual values in a separate catalog like
pg_enumvalues or some such?

> The only functions that actually need to have any knowledge of 
> the enumeration strings are the input/output functions and the to/from 
> text casts. These would get the relevant info from fcinfo.flinfo ... and 
> then looking up the type cache - not sure yet if an extra cache 
> operation is needed.

It'd be interesting to measure the difference of having the cache vs.
not having it.

Thinking on how to pg_dump the whole thing is important too.

-- 
Alvaro Herrera                                http://www.CommandPrompt.com/
The PostgreSQL Company - Command Prompt, Inc.

Re: Enums again

From

Andrew Dunstan

Date:

08 November 2005, 10:18:23


Alvaro Herrera wrote:

>Andrew Dunstan wrote:
>
>  
>
>>In the catalog, pg_type would 
>>get a new column of type text[] that would hold the list of values, and 
>>typtype would have a new possible value of 'e' for enumeration. There 
>>might be other consequential changes too, but I think that would be most 
>>of it.
>>    
>>
>
>Huh, why not have the actual values in a separate catalog like
>pg_enumvalues or some such?
>  
>

Sure, could do that. I don't have strong feelings either way.

>  
>
>>The only functions that actually need to have any knowledge of 
>>the enumeration strings are the input/output functions and the to/from 
>>text casts. These would get the relevant info from fcinfo.flinfo ... and 
>>then looking up the type cache - not sure yet if an extra cache 
>>operation is needed.
>>    
>>
>
>It'd be interesting to measure the difference of having the cache vs.
>not having it.
>  
>

Possibly. I would expect it to make a noticeable difference.

>Thinking on how to pg_dump the whole thing is important too.
>
>  
>

Yes, that would certainly be part of the work. I should have mentioned 
that. It's not a showstopper, though - I see no reason in principal for 
it to be a difficulty.

cheers

andrew

Re: Enums again

From

Tom Lane

Date:

08 November 2005, 11:09:06

Andrew Dunstan <andrew@dunslane.net> writes:
> Alvaro Herrera wrote:
>> Huh, why not have the actual values in a separate catalog like
>> pg_enumvalues or some such?

> Sure, could do that. I don't have strong feelings either way.

I'd vote for the separate catalog instead of bloating pg_type.
        regards, tom lane

Re: Enums again

From

Andrew Dunstan

Date:

08 November 2005, 11:21:41


Tom Lane wrote:

>Andrew Dunstan <andrew@dunslane.net> writes:
>  
>
>>Alvaro Herrera wrote:
>>    
>>
>>>Huh, why not have the actual values in a separate catalog like
>>>pg_enumvalues or some such?
>>>      
>>>
>
>  
>
>>Sure, could do that. I don't have strong feelings either way.
>>    
>>
>
>I'd vote for the separate catalog instead of bloating pg_type.
>
>    
>  
>

Ok, consider that a done deal. Any other tweaks?

cheers

andrew

Re: Enums again

From

Christopher Kings-Lynne

Date:

08 November 2005, 21:29:46

> Also, Christopher - I was somewhat motivated to work on this by your 
> recent comment about enums being the number one demand of migrating 
> MySQL users, so I am mildly amused by your last sentence ;-)

They're not mutually exclusive statements :)