Re: fulltext parser strange behave - Mailing list pgsql-hackers

From Andrew Dunstan
Subject Re: fulltext parser strange behave
Date
Msg-id 47324C79.3090400@dunslane.net
Whole thread Raw
In response to Re: fulltext parser strange behave  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: fulltext parser strange behave  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-hackers

Tom Lane wrote:
> "Pavel Stehule" <pavel.stehule@gmail.com> writes:
>   
>> I am writing tsearch2 wrapper and I testing functionality. I found
>> some little bit strange on default parser. It can't parse tags with
>> numbers:
>>     
>
> Well, the state machine definitely thinks that tag names should contain
> only ASCII letters (with possibly a leading or trailing '/').  Given the
> HTML examples I suppose we should allow non-first digits too.  Is there
> anything else that should be considered a tag?  What about dash and
> underscore for instance?
>
>     
>   

The docs say we specifically accept HTML tags. Are we really just 
accepting anything that is a string of ASCII letters as the tag name? 
Then we should adjust the docs. <foo> and <foo1234> are not HTML tags.

cheers

andrew


pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Re: Feature Request: inline comments
Next
From: CaT
Date:
Subject: Re: Feature Request: inline comments