Thread: tsearch2api .. wrapper for integrated fultext

tsearch2api .. wrapper for integrated fultext

From
"Pavel Stehule"
Date:
Hello

I am sending light wrapper of integrated fulltext. API is compatible
with tsearch2.

Regards
Pavel Stehule

ToDo: rewrite functions

Attachment

Re: tsearch2api .. wrapper for integrated fultext

From
Bruce Momjian
Date:
Should I apply this or wait for a final version?

---------------------------------------------------------------------------

Pavel Stehule wrote:
> Hello
>
> I am sending light wrapper of integrated fulltext. API is compatible
> with tsearch2.
>
> Regards
> Pavel Stehule
>
> ToDo: rewrite functions

[ Attachment, skipping... ]

>
> ---------------------------(end of broadcast)---------------------------
> TIP 2: Don't 'kill -9' the postmaster

--
  Bruce Momjian  <bruce@momjian.us>        http://momjian.us
  EnterpriseDB                             http://postgres.enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +

Re: tsearch2api .. wrapper for integrated fultext

From
"Pavel Stehule"
Date:
wait two days, please.

Pavel

I'll send finished patch on Sunday.

On 09/11/2007, Bruce Momjian <bruce@momjian.us> wrote:
>
> Should I apply this or wait for a final version?
>
> ---------------------------------------------------------------------------
>
> Pavel Stehule wrote:
> > Hello
> >
> > I am sending light wrapper of integrated fulltext. API is compatible
> > with tsearch2.
> >
> > Regards
> > Pavel Stehule
> >
> > ToDo: rewrite functions
>
> [ Attachment, skipping... ]
>
> >
> > ---------------------------(end of broadcast)---------------------------
> > TIP 2: Don't 'kill -9' the postmaster
>
> --
>   Bruce Momjian  <bruce@momjian.us>        http://momjian.us
>   EnterpriseDB                             http://postgres.enterprisedb.com
>
>   + If your life is a hard drive, Christ can be your backup. +
>

Re: tsearch2api .. wrapper for integrated fultext

From
"Magnus Hagander"
Date:
A thought on this - should it not go in contrib/tsearch2 replacing the old deprecated stuff, instead of creating yet
aothercontrib dir? 

/Magnus

> ------- Original Message -------
> From: Bruce Momjian <bruce@momjian.us>
> To: Pavel Stehule <pavel.stehule@gmail.com>
> Sent: 07-11-09, 01:35:49
> Subject: Re: [PATCHES] tsearch2api .. wrapper for integrated fultext
>
> Should I apply this or wait for a final version?
>
> ---------------------------------------------------------------------------
>
> Pavel Stehule wrote:
> > Hello
> >
> > I am sending light wrapper of integrated fulltext. API is compatible
> > with tsearch2.
> >
> > Regards
> > Pavel Stehule
> >
> > ToDo: rewrite functions
>
> [ Attachment, skipping... ]
>
> >
> > ---------------------------(end of broadcast)---------------------------
> > TIP 2: Don't 'kill -9' the postmaster
>
> --
>   Bruce Momjian  <bruce@momjian.us>        http://momjian.us
>   EnterpriseDB                             http://postgres.enterprisedb.com
>
>   + If your life is a hard drive, Christ can be your backup. +
>
> ---------------------------(end of broadcast)---------------------------
> TIP 9: In versions below 8.0, the planner will ignore your desire to
>        choose an index scan if your joining column's datatypes do not
>        match
>

Re: tsearch2api .. wrapper for integrated fultext

From
Tom Lane
Date:
"Magnus Hagander" <magnus@hagander.net> writes:
> A thought on this - should it not go in contrib/tsearch2 replacing the old deprecated stuff, instead of creating yet
aothercontrib dir? 

That was the idea, I thought.

This proposed patch is in need of review, which I'd been hoping to get
to tomorrow; but since Pavel says he's about to send in a revised
version, I will wait for that ...

            regards, tom lane

Re: tsearch2api .. wrapper for integrated fultext

From
"Pavel Stehule"
Date:
On 09/11/2007, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> "Magnus Hagander" <magnus@hagander.net> writes:
> > A thought on this - should it not go in contrib/tsearch2 replacing the old deprecated stuff, instead of creating
yetaother contrib dir? 

It can be moved - no problem.

>
> That was the idea, I thought.
>
> This proposed patch is in need of review, which I'd been hoping to get
> to tomorrow; but since Pavel says he's about to send in a revised
> version, I will wait for that ...
>
>                         regards, tom lane
>

Re: tsearch2api .. wrapper for integrated fultext

From
"Pavel Stehule"
Date:
Hello

this is revised version

I can not remove files with diff, so patch add files to tsearch2api
still. Please change it.

Regards
Pavel Stehule

On 09/11/2007, Pavel Stehule <pavel.stehule@gmail.com> wrote:
> On 09/11/2007, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> > "Magnus Hagander" <magnus@hagander.net> writes:
> > > A thought on this - should it not go in contrib/tsearch2 replacing the old deprecated stuff, instead of creating
yetaother contrib dir? 
>
> It can be moved - no problem.
>
> >
> > That was the idea, I thought.
> >
> > This proposed patch is in need of review, which I'd been hoping to get
> > to tomorrow; but since Pavel says he's about to send in a revised
> > version, I will wait for that ...
> >
> >                         regards, tom lane
> >
>

Attachment

Re: tsearch2api .. wrapper for integrated fultext

From
Tom Lane
Date:
"Pavel Stehule" <pavel.stehule@gmail.com> writes:
> this is revised version

A couple of thoughts here:

* What is the point of creating stub functions for things that users
won't attempt to call directly, such as opclass support functions and
the old dictionary support functions?  Couldn't we just leave those out
and save some code?

* The WRAPPER_FUNCTION stuff seems unnecessarily inefficient --- can't
we just declare those as LANGUAGE INTERNAL and link the SQL definition
directly to the built-in function?

* The SQL file doesn't create any of the old types (public.tsvector
etc) so it seems still a long ways short of ensuring that an old
dump file can be reloaded.  Maybe I don't understand exactly how you
intend it to interact with the definitions that will be in the dump
file.

            regards, tom lane

Re: tsearch2api .. wrapper for integrated fultext

From
"Pavel Stehule"
Date:
Hello

On 10/11/2007, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> "Pavel Stehule" <pavel.stehule@gmail.com> writes:
> > this is revised version
>
> A couple of thoughts here:
>
> * What is the point of creating stub functions for things that users
> won't attempt to call directly, such as opclass support functions and
> the old dictionary support functions?  Couldn't we just leave those out
> and save some code?
>

we can. I don't understand to tsearch2 well, so this wrapper is
complete. There are not necessary all unsupported functions. But these
improvisation   can be done simply in C preprocessor.

> * The WRAPPER_FUNCTION stuff seems unnecessarily inefficient --- can't
> we just declare those as LANGUAGE INTERNAL and link the SQL definition
> directly to the built-in function?
>

it's little bit inefficient, but it's more consistent and readable. So
it's reason.

> * The SQL file doesn't create any of the old types (public.tsvector
> etc) so it seems still a long ways short of ensuring that an old
> dump file can be reloaded.  Maybe I don't understand exactly how you
> intend it to interact with the definitions that will be in the dump
> file.
>

with this version of wrapper you cannot load old dumps. It allows
application compatibility. Dump was readable with older variant which
is really ugly and that is on pgfoundry.

Steps with this wrapper:

a) uninstall tsearch2
b) dump
c) install 8.3 and configure fulltext
d) load
e) load wrapper

Pavel Stehule

>                         regards, tom lane
>

Re: tsearch2api .. wrapper for integrated fultext

From
Tom Lane
Date:
"Pavel Stehule" <pavel.stehule@gmail.com> writes:
> On 10/11/2007, Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> * The SQL file doesn't create any of the old types (public.tsvector
>> etc) so it seems still a long ways short of ensuring that an old
>> dump file can be reloaded.  Maybe I don't understand exactly how you
>> intend it to interact with the definitions that will be in the dump
>> file.

> with this version of wrapper you cannot load old dumps. It allows
> application compatibility. Dump was readable with older variant which
> is really ugly and that is on pgfoundry.

> Steps with this wrapper:

> a) uninstall tsearch2
> b) dump
> c) install 8.3 and configure fulltext
> d) load
> e) load wrapper

That seems like a non-starter.  Existing tsearch2 installations will
have tsvector columns in their tables, so I don't see how they are
going to "uninstall tsearch2" in the existing database.  The other
problem is that I think we have to provide a migration path for people
who already have dump files (and, maybe, no longer have the original
installation).

For people who have custom or tar-format dumps, the previously posted
script to strip out the old tsearch2 objects during pg_restore would
help ... but it's useless if you used text dump (notably including
pg_dumpall output).

Another problem that was already noted was that the dump might contain
explicit references to "public.tsvector", or some other schema that you
put the tsearch2 objects in.

The approach that I was hoping to see was

a) dump
b) install 8.3 and configure fulltext
c) load wrapper (into same schema as you used for tsearch2 before)
d) load dump file

Since pg_dump doesn't do CREATE OR REPLACE, step (d) would result in a
lot of error messages, but it wouldn't overwrite any of the function
definitions installed by the wrapper.  We could possibly deal with the
schema issue by having the wrapper create public.tsvector as a domain
for pg_catalog.tsvector, etc.

Thoughts?

            regards, tom lane

Re: tsearch2api .. wrapper for integrated fultext

From
"Pavel Stehule"
Date:
On 10/11/2007, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> "Pavel Stehule" <pavel.stehule@gmail.com> writes:
> > On 10/11/2007, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> >> * The SQL file doesn't create any of the old types (public.tsvector
> >> etc) so it seems still a long ways short of ensuring that an old
> >> dump file can be reloaded.  Maybe I don't understand exactly how you
> >> intend it to interact with the definitions that will be in the dump
> >> file.
>
> > with this version of wrapper you cannot load old dumps. It allows
> > application compatibility. Dump was readable with older variant which
> > is really ugly and that is on pgfoundry.
>
> > Steps with this wrapper:
>
> > a) uninstall tsearch2
> > b) dump
> > c) install 8.3 and configure fulltext
> > d) load
> > e) load wrapper
>
> That seems like a non-starter.  Existing tsearch2 installations will
> have tsvector columns in their tables, so I don't see how they are
> going to "uninstall tsearch2" in the existing database.  The other
> problem is that I think we have to provide a migration path for people
> who already have dump files (and, maybe, no longer have the original
> installation).
>
> For people who have custom or tar-format dumps, the previously posted
> script to strip out the old tsearch2 objects during pg_restore would
> help ... but it's useless if you used text dump (notably including
> pg_dumpall output).
>
> Another problem that was already noted was that the dump might contain
> explicit references to "public.tsvector", or some other schema that you
> put the tsearch2 objects in.
>

I forgot. With perl script, dump can be transformed to readable form.

I see forms of port TSearch to integrated full text:

1. clean and prefered  .. dump and application are modified
2. dump is modified, application works via some api
3. full wrapper with known issues
4. special smart mode where TSearch2 API functions are dynamicly
converted to fulltext api

like
   get function name
   if it some from tsearch2 api transform it or forgot it
   else create function

but perl script do it well and outside


> The approach that I was hoping to see was
>
> a) dump
> b) install 8.3 and configure fulltext
> c) load wrapper (into same schema as you used for tsearch2 before)
> d) load dump file

It was my original goal. But there is lot of issues.

I found different problem, that have to be solved if wrapper have to
be dump compatible.

there is about five functions with same name, and I have problem with
wrapping, because I create recursive calling. I am not able select
integrated functions. That was reason for use prefix tsa.

>
> Since pg_dump doesn't do CREATE OR REPLACE, step (d) would result in a
> lot of error messages, but it wouldn't overwrite any of the function
> definitions installed by the wrapper.  We could possibly deal with the
> schema issue by having the wrapper create public.tsvector as a domain
> for pg_catalog.tsvector, etc.
>
> Thoughts?
>
>                         regards, tom lane
>

Re: tsearch2api .. wrapper for integrated fultext

From
Tom Lane
Date:
"Pavel Stehule" <pavel.stehule@gmail.com> writes:
> I found different problem, that have to be solved if wrapper have to
> be dump compatible.

> there is about five functions with same name, and I have problem with
> wrapping, because I create recursive calling. I am not able select
> integrated functions. That was reason for use prefix tsa.

I don't follow.  The functions all have distinct names at the C-code
level, so what's the problem?

            regards, tom lane

Re: tsearch2api .. wrapper for integrated fultext

From
"Pavel Stehule"
Date:
On 10/11/2007, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> "Pavel Stehule" <pavel.stehule@gmail.com> writes:
> > I found different problem, that have to be solved if wrapper have to
> > be dump compatible.
>
> > there is about five functions with same name, and I have problem with
> > wrapping, because I create recursive calling. I am not able select
> > integrated functions. That was reason for use prefix tsa.
>
> I don't follow.  The functions all have distinct names at the C-code
> level, so what's the problem?

they don't have distinct names

tsearch_length
rewrite_query
...

>
>                         regards, tom lane
>

Re: tsearch2api .. wrapper for integrated fultext

From
Tom Lane
Date:
I wrote:
> The approach that I was hoping to see was

> a) dump
> b) install 8.3 and configure fulltext
> c) load wrapper (into same schema as you used for tsearch2 before)
> d) load dump file

I've modified Pavel's version into something that seems to support this
approach --- at least I can load the 8.2 tsearch regression test
database into 8.3 after loading this.  Still needs some polishing
probably, and some more testing.  Comments?

            regards, tom lane


Attachment

Re: tsearch2api .. wrapper for integrated fultext

From
Tom Lane
Date:
I wrote:
> I've modified Pavel's version into something that seems to support this
> approach --- at least I can load the 8.2 tsearch regression test
> database into 8.3 after loading this.  Still needs some polishing
> probably, and some more testing.  Comments?

I've committed this, replacing the old contrib/tsearch2 code.

It successfully runs most of the old module's regression test, after
some minor adjustments for default configuration names and suchlike.
One large omission is that the rewrite(ARRAY[]) aggregate isn't there.
AFAIR, we removed that just because it seemed a poorly designed API,
not because it didn't work.  I'm thinking we should probably pull the
code for it out of the CVS history and stick it into contrib/tsearch2.
Any thoughts pro or con?

            regards, tom lane

Re: tsearch2api .. wrapper for integrated fultext

From
"Pavel Stehule"
Date:
On 13/11/2007, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> I wrote:
> > I've modified Pavel's version into something that seems to support this
> > approach --- at least I can load the 8.2 tsearch regression test
> > database into 8.3 after loading this.  Still needs some polishing
> > probably, and some more testing.  Comments?
>
> I've committed this, replacing the old contrib/tsearch2 code.
>
> It successfully runs most of the old module's regression test, after
> some minor adjustments for default configuration names and suchlike.
> One large omission is that the rewrite(ARRAY[]) aggregate isn't there.
> AFAIR, we removed that just because it seemed a poorly designed API,
> not because it didn't work.  I'm thinking we should probably pull the
> code for it out of the CVS history and stick it into contrib/tsearch2.
> Any thoughts pro or con?
>

+1

Pavel
>                         regards, tom lane
>