Thread: tap tests driving the database via psql

tap tests driving the database via psql

From

Andres Freund

Date:

27 July 2019, 19:15:23

Hi,

The discussion in [1]
again reminded me how much I dislike that we currently issue database
queries in tap tests by forking psql and writing/reading from it's
stdin/stdout.

That's quite cumbersome to write, and adds a good number of additional
failure scenarios to worry about. For a lot of tasks you then have to
reparse psql's output to separate out columns etc.

I think we have a better approach for [1] than using tap tests, but I
think it's a more general issue preventing us from having more extensive
test coverage, and especially from having more robust coverage.

I think we seriously ought to consider depending on a proper perl
database driver.  I can see various approaches:

1) Just depend on DBD::Pg being installed. It's fairly common, after
   all. It'd be somewhat annoying that we'd often end up using a
   different version of libpq than what we're testing against. But in
   most cases that'd not be particularly problematic.

2) Depend on DBD::PgPP, a pure perl driver. It'd likely often be more
   lightweight to install. On the other hand, it's basically
   unmaintained (last commit 2010), and is a lot less commonly already
   installed than DBD::Pg. Also depends on DBI, which isn't part of a
   core perl IIRC.

3) Vendor a test-only copy of one of those libraries, and build them as
   part of the test setup. That'd cut down on the number of
   dependencies.

   But probably not that much, because we'd still depend on DBI, which
   IIRC isn't part of core perl.

   DBI by default does include C code, and is not that small. There's
   however a pure perl version maintained as part of DBI, and it looks
   like it might be reasonably enough sized. If we vendored that, and
   DBD::PgPP, we'd not have any additional dependencies.

   I suspect that the licensing (dual GPL *version 1* / Artistic
   License, also V1), makes this too complicated, however.

4) We develop a fairly minimal pure perl database driver, that doesn't
   depend on DBI. Include it somewhere as part of the test code, instead
   of src/interfaces, so it's clearer that it's not ment as an actual
   official driver.

   The obvious disadvantage is that this would be a noticable amount of
   code. But it's also not that crazily much.

   One big advantage I can see is that that'd make it a lot easier to
   write low-level protocol tests. Right now we either don't have them,
   or they have to go through libpq, which quite sensibly doesn't expose
   all the details to the outside.  IMO it'd be really nice if we had a
   way to to write low level protocol tests, especially for testing
   things like the v2 protocol.

I'm not volunteering to do 4), my perl skills aren't great (if the test
infra were python, otoh... I have the skeleton of a pure perl driver
that I used for testing somewhere). But I am leaning towards that being
the most reasonable choice.

Craig, IIRC you'd thought about this before too?

Greetings,

Andres Freund

[1] https://www.postgresql.org/message-id/31304.1564246011%40sss.pgh.pa.us

Re: tap tests driving the database via psql

From

David Fetter

Date:

27 July 2019, 20:32:37

On Sat, Jul 27, 2019 at 12:15:23PM -0700, Andres Freund wrote:
> Hi,
> 
> The discussion in [1]
> again reminded me how much I dislike that we currently issue database
> queries in tap tests by forking psql and writing/reading from it's
> stdin/stdout.
> 
> That's quite cumbersome to write, and adds a good number of additional
> failure scenarios to worry about. For a lot of tasks you then have to
> reparse psql's output to separate out columns etc.
> 
> I think we have a better approach for [1] than using tap tests, but I
> think it's a more general issue preventing us from having more extensive
> test coverage, and especially from having more robust coverage.
> 
> I think we seriously ought to consider depending on a proper perl
> database driver.  I can see various approaches:
> 
> 1) Just depend on DBD::Pg being installed. It's fairly common, after
>    all. It'd be somewhat annoying that we'd often end up using a
>    different version of libpq than what we're testing against. But in
>    most cases that'd not be particularly problematic.
> 
> 2) Depend on DBD::PgPP, a pure perl driver. It'd likely often be more
>    lightweight to install. On the other hand, it's basically
>    unmaintained (last commit 2010), and is a lot less commonly already
>    installed than DBD::Pg. Also depends on DBI, which isn't part of a
>    core perl IIRC.
> 
> 3) Vendor a test-only copy of one of those libraries, and build them as
>    part of the test setup. That'd cut down on the number of
>    dependencies.
> 
>    But probably not that much, because we'd still depend on DBI, which
>    IIRC isn't part of core perl.
> 
>    DBI by default does include C code, and is not that small. There's
>    however a pure perl version maintained as part of DBI, and it looks
>    like it might be reasonably enough sized. If we vendored that, and
>    DBD::PgPP, we'd not have any additional dependencies.
> 
>    I suspect that the licensing (dual GPL *version 1* / Artistic
>    License, also V1), makes this too complicated, however.
> 
> 4) We develop a fairly minimal pure perl database driver, that doesn't
>    depend on DBI. Include it somewhere as part of the test code, instead
>    of src/interfaces, so it's clearer that it's not ment as an actual
>    official driver.

There's one that may or may not need updates that's basically just a
wrapper around libpq.

https://ftp.postgresql.org/pub/projects/gborg/pgperl/stable/

>    The obvious disadvantage is that this would be a noticable amount of
>    code. But it's also not that crazily much.
> 
>    One big advantage I can see is that that'd make it a lot easier to
>    write low-level protocol tests. Right now we either don't have them,
>    or they have to go through libpq, which quite sensibly doesn't expose
>    all the details to the outside.  IMO it'd be really nice if we had a
>    way to to write low level protocol tests, especially for testing
>    things like the v2 protocol.

That sounds worth doing as a separate thing, and an obvious
application of it would be something like a febesmith, which would get
us a better idea as to whether we've implemented the protocol we say
we have.

Best,
David.
-- 
David Fetter <david(at)fetter(dot)org> http://fetter.org/
Phone: +1 415 235 3778

Remember to vote!
Consider donating to Postgres: http://www.postgresql.org/about/donate

Re: tap tests driving the database via psql

From

Andres Freund

Date:

27 July 2019, 21:14:55

Hi,

On 2019-07-27 22:32:37 +0200, David Fetter wrote:
> On Sat, Jul 27, 2019 at 12:15:23PM -0700, Andres Freund wrote:
> > 4) We develop a fairly minimal pure perl database driver, that doesn't
> >    depend on DBI. Include it somewhere as part of the test code, instead
> >    of src/interfaces, so it's clearer that it's not ment as an actual
> >    official driver.
> 
> There's one that may or may not need updates that's basically just a
> wrapper around libpq.
> 
> https://ftp.postgresql.org/pub/projects/gborg/pgperl/stable/

That's pretty darn old however (2002). Needs to be compiled. And is GPL
v1 / Artistic v1 licensed.  I think all of the other alternatives are
better than this.


> >    The obvious disadvantage is that this would be a noticable amount of
> >    code. But it's also not that crazily much.
> > 
> >    One big advantage I can see is that that'd make it a lot easier to
> >    write low-level protocol tests. Right now we either don't have them,
> >    or they have to go through libpq, which quite sensibly doesn't expose
> >    all the details to the outside.  IMO it'd be really nice if we had a
> >    way to to write low level protocol tests, especially for testing
> >    things like the v2 protocol.
> 
> That sounds worth doing as a separate thing

What would be the point of doing this separately? If we have a small
driver for writing protocol tests, why would we want something
separate for the tap tests?


> and an obvious application of it would be something like a febesmith,
> which would get us a better idea as to whether we've implemented the
> protocol we say we have.

Hm, not convinced that's useful. And fairly sure that's pretty
independent of what I was writing about.

Greetings,

Andres Freund

Re: tap tests driving the database via psql

From

Andrew Dunstan

Date:

27 July 2019, 21:48:39

On 7/27/19 3:15 PM, Andres Freund wrote:
> Hi,
>
> The discussion in [1]
> again reminded me how much I dislike that we currently issue database
> queries in tap tests by forking psql and writing/reading from it's
> stdin/stdout.
>
> That's quite cumbersome to write, and adds a good number of additional
> failure scenarios to worry about. For a lot of tasks you then have to
> reparse psql's output to separate out columns etc.
>
> I think we have a better approach for [1] than using tap tests, but I
> think it's a more general issue preventing us from having more extensive
> test coverage, and especially from having more robust coverage.
>
> I think we seriously ought to consider depending on a proper perl
> database driver.  I can see various approaches:
>
> 1) Just depend on DBD::Pg being installed. It's fairly common, after
>    all. It'd be somewhat annoying that we'd often end up using a
>    different version of libpq than what we're testing against. But in
>    most cases that'd not be particularly problematic.
>
> 2) Depend on DBD::PgPP, a pure perl driver. It'd likely often be more
>    lightweight to install. On the other hand, it's basically
>    unmaintained (last commit 2010), and is a lot less commonly already
>    installed than DBD::Pg. Also depends on DBI, which isn't part of a
>    core perl IIRC.
>
> 3) Vendor a test-only copy of one of those libraries, and build them as
>    part of the test setup. That'd cut down on the number of
>    dependencies.
>
>    But probably not that much, because we'd still depend on DBI, which
>    IIRC isn't part of core perl.
>
>    DBI by default does include C code, and is not that small. There's
>    however a pure perl version maintained as part of DBI, and it looks
>    like it might be reasonably enough sized. If we vendored that, and
>    DBD::PgPP, we'd not have any additional dependencies.
>
>    I suspect that the licensing (dual GPL *version 1* / Artistic
>    License, also V1), makes this too complicated, however.
>
> 4) We develop a fairly minimal pure perl database driver, that doesn't
>    depend on DBI. Include it somewhere as part of the test code, instead
>    of src/interfaces, so it's clearer that it's not ment as an actual
>    official driver.
>
>    The obvious disadvantage is that this would be a noticable amount of
>    code. But it's also not that crazily much.
>
>    One big advantage I can see is that that'd make it a lot easier to
>    write low-level protocol tests. Right now we either don't have them,
>    or they have to go through libpq, which quite sensibly doesn't expose
>    all the details to the outside.  IMO it'd be really nice if we had a
>    way to to write low level protocol tests, especially for testing
>    things like the v2 protocol.
>
> I'm not volunteering to do 4), my perl skills aren't great (if the test
> infra were python, otoh... I have the skeleton of a pure perl driver
> that I used for testing somewhere). But I am leaning towards that being
> the most reasonable choice.
>


+1 for #4.


I'll be happy to participate in any effort.


About 22 years ago I wrote a pure perl implementation of a wire protocol
of roughly similar complexity (RFC1459). I got it basically working in
just a few days, so this sort of thing is very doable. Let's see your
skeleton and maybe it's a good starting point.


cheers


andrew

-- 
Andrew Dunstan                https://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

Re: tap tests driving the database via psql

From

Andres Freund

Date:

27 July 2019, 22:37:58

Hi,

On 2019-07-27 17:48:39 -0400, Andrew Dunstan wrote:
> On 7/27/19 3:15 PM, Andres Freund wrote:
> > I'm not volunteering to do 4), my perl skills aren't great (if the test
> > infra were python, otoh... I have the skeleton of a pure perl driver
> > that I used for testing somewhere). But I am leaning towards that being
> > the most reasonable choice.
> 
> +1 for #4.
> 
> 
> I'll be happy to participate in any effort.
> 
> 
> About 22 years ago I wrote a pure perl implementation of a wire protocol
> of roughly similar complexity (RFC1459). I got it basically working in
> just a few days, so this sort of thing is very doable. Let's see your
> skeleton and maybe it's a good starting point.

The skeleton's in python though, not sure how helpful that is?

/me once more regrets that perl, not python, has been chosen as the
scripting language of choice for postgres...

Greetings,

Andres Freund

Re: tap tests driving the database via psql

From

Andrew Dunstan

Date:

27 July 2019, 22:57:58

On 7/27/19 6:37 PM, Andres Freund wrote:
> Hi,
>
> On 2019-07-27 17:48:39 -0400, Andrew Dunstan wrote:
>> On 7/27/19 3:15 PM, Andres Freund wrote:
>>> I'm not volunteering to do 4), my perl skills aren't great (if the test
>>> infra were python, otoh... I have the skeleton of a pure perl driver
>>> that I used for testing somewhere). But I am leaning towards that being
>>> the most reasonable choice.
>> +1 for #4.
>>
>>
>> I'll be happy to participate in any effort.
>>
>>
>> About 22 years ago I wrote a pure perl implementation of a wire protocol
>> of roughly similar complexity (RFC1459). I got it basically working in
>> just a few days, so this sort of thing is very doable. Let's see your
>> skeleton and maybe it's a good starting point.
> The skeleton's in python though, not sure how helpful that is?


Maybe I don't write much python but I can read it without too much
difficulty :-)


But you did say your skeleton was pure perl ... slip of the fingers?


>
> /me once more regrets that perl, not python, has been chosen as the
> scripting language of choice for postgres...
>


That ship has sailed long ago.


cheers


andrew



-- 
Andrew Dunstan                https://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

Re: tap tests driving the database via psql

From

Andres Freund

Date:

27 July 2019, 23:01:25

Hi,

On 2019-07-27 18:57:58 -0400, Andrew Dunstan wrote:
> Maybe I don't write much python but I can read it without too much
> difficulty :-)
> 
> 
> But you did say your skeleton was pure perl ... slip of the fingers?

Ooops, yea.


> > /me once more regrets that perl, not python, has been chosen as the
> > scripting language of choice for postgres...

> That ship has sailed long ago.

Indeed. Not proposing to change that. Just sad about it...

Greetings,

Andres Freund

Re: tap tests driving the database via psql

From

Craig Ringer

Date:

30 July 2019, 06:13:40

On Sun, 28 Jul 2019 at 03:15, Andres Freund <andres@anarazel.de> wrote:

1) Just depend on DBD::Pg being installed. It's fairly common, after
all. It'd be somewhat annoying that we'd often end up using a
different version of libpq than what we're testing against. But in
most cases that'd not be particularly problematic.

I advocated for this in the past, and still think it's the best option.

4) We develop a fairly minimal pure perl database driver, that doesn't
depend on DBI. Include it somewhere as part of the test code, instead
of src/interfaces, so it's clearer that it's not ment as an actual
official driver.

Why not write a new language interpreter while we're at it, and maybe a compiler and runtime? :p

The community IMO wastes *so* much time on not-invented-here make-work and on jumping through hoops to avoid depending on anything newer than the late '90s. I'd rather not go deeper down that path. If someone on SunOS or SCO OpenServer or whatever doesn't want to install DBD::Pg, have the TAP tests just skip the tests on that platform and make it the platform owner's problem.

Craig, IIRC you'd thought about this before too?

Right. But IIRC Tom vetoed it on grounds of not wanting to expect buildfarm operators to install it, something like that.

Craig Ringer http://www.2ndQuadrant.com/
2ndQuadrant - PostgreSQL Solutions for the Enterprise

Re: tap tests driving the database via psql

From

Tom Lane

Date:

30 July 2019, 13:40:54

Craig Ringer <craig@2ndquadrant.com> writes:
> On Sun, 28 Jul 2019 at 03:15, Andres Freund <andres@anarazel.de> wrote:
>> 1) Just depend on DBD::Pg being installed. It's fairly common, after
>> all. It'd be somewhat annoying that we'd often end up using a
>> different version of libpq than what we're testing against. But in
>> most cases that'd not be particularly problematic.

> I advocated for this in the past, and still think it's the best option.

I think the not-same-libpq issue would be a much bigger problem than either
of you are accounting for.  Some off-the-top-of-the-head reasons:

* Since we'd have to presume a possibly-very-back-rev libpq, we could
never add tests related to any recent libpq bug fixes or new features.

* The system libpq might have a different idea of default port
and/or socket directory than the test build.  Yeah, we could work
around that by forcing the choice all the time, but it would be a
constant hazard.

* We don't have control over what else gets brought in beside libpq.
Depending on the whims of the platform packagers, there might need
to be other parts of the platform's default postgres installation,
eg psql, sitting in one's PATH.  Again, this wouldn't be too much
of a hazard for pre-debugged test scripts --- but it would be a huge
hazard for developers, who do lots of things manually and would always be
at risk of invoking the wrong psql.  I learned long ago not to have any
part of a platform's postgres packages in place on my development systems,
and I will fight hard against any test procedure change that requires me
to do differently.

Now, none of these things are really a problem with DBD/DBI as such
--- rather, they are reasons not to depend on a pre-packaged build
of DBD::Pg that depends on a pre-packaged build of libpq.so.
I haven't looked at the size, or the license, of DBD::Pg ... but
could it be sane to include our own modified copy in the tree?

(Forking DBD::Pg would also let us add custom testing features
to it ...)

> The community IMO wastes *so* much time on not-invented-here make-work and
> on jumping through hoops to avoid depending on anything newer than the late
> '90s.

That's an unnecessary, and false, ad-hominem attack.

            regards, tom lane

Re: tap tests driving the database via psql

From

Andres Freund

Date:

30 July 2019, 14:42:49

Hi,

On 2019-07-30 09:40:54 -0400, Tom Lane wrote:
> Craig Ringer <craig@2ndquadrant.com> writes:
> > On Sun, 28 Jul 2019 at 03:15, Andres Freund <andres@anarazel.de> wrote:
> >> 1) Just depend on DBD::Pg being installed. It's fairly common, after
> >> all. It'd be somewhat annoying that we'd often end up using a
> >> different version of libpq than what we're testing against. But in
> >> most cases that'd not be particularly problematic.
> 
> > I advocated for this in the past, and still think it's the best option.
> 
> I think the not-same-libpq issue would be a much bigger problem than either
> of you are accounting for.  Some off-the-top-of-the-head reasons:

I came to the same conclusion?


> Now, none of these things are really a problem with DBD/DBI as such
> --- rather, they are reasons not to depend on a pre-packaged build
> of DBD::Pg that depends on a pre-packaged build of libpq.so.
> I haven't looked at the size, or the license, of DBD::Pg ... but
> could it be sane to include our own modified copy in the tree?

I had that as an alternative too. I think the license (Artistic v1/GPL
v1) probably makes that non-feasible. The pure-perl version of DBI
probably would otherwise be realistic.

Greetings,

Andres Freund

Re: tap tests driving the database via psql

From

Tom Lane

Date:

30 July 2019, 14:48:31

Andres Freund <andres@anarazel.de> writes:
> On 2019-07-30 09:40:54 -0400, Tom Lane wrote:
>> Now, none of these things are really a problem with DBD/DBI as such
>> --- rather, they are reasons not to depend on a pre-packaged build
>> of DBD::Pg that depends on a pre-packaged build of libpq.so.
>> I haven't looked at the size, or the license, of DBD::Pg ... but
>> could it be sane to include our own modified copy in the tree?

> I had that as an alternative too. I think the license (Artistic v1/GPL
> v1) probably makes that non-feasible. The pure-perl version of DBI
> probably would otherwise be realistic.

OK, so just lifting DBD::Pg in toto is out for license reasons.
However, maybe we could consider writing a new DBD driver from
scratch (while using a platform-provided DBI layer) rather than
doing everything from scratch.  I'm not sure how much actual
functionality is in the DBI layer, so maybe that approach
wouldn't buy much.

            regards, tom lane

Re: tap tests driving the database via psql

From

Alvaro Herrera

Date:

30 July 2019, 18:39:30

On 2019-Jul-30, Tom Lane wrote:

> OK, so just lifting DBD::Pg in toto is out for license reasons.
> However, maybe we could consider writing a new DBD driver from
> scratch (while using a platform-provided DBI layer) rather than
> doing everything from scratch.  I'm not sure how much actual
> functionality is in the DBI layer, so maybe that approach
> wouldn't buy much.

Then again, maybe we don't *need* all the functionality that DBI offers.
DBI is enormous, has a lot of extensibility, cross-database
compatibility ... and, well, just the fact that it's a layered design
(requiring a DBD on top of it before it even works) makes it even more
complicated.

I think a pure-perl standalone driver might be a lot simpler than
maintanining our own DBD ... and we don't have to convince animal
maintainers to install the right version of DBI in the first place.

-- 
Álvaro Herrera                https://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

Re: tap tests driving the database via psql

From

Craig Ringer

Date:

31 July 2019, 01:32:10

On Tue, 30 Jul 2019 at 21:40, Tom Lane <tgl@sss.pgh.pa.us> wrote:

Craig Ringer <craig@2ndquadrant.com> writes:
> On Sun, 28 Jul 2019 at 03:15, Andres Freund <andres@anarazel.de> wrote:
>> 1) Just depend on DBD::Pg being installed. It's fairly common, after
>> all. It'd be somewhat annoying that we'd often end up using a
>> different version of libpq than what we're testing against. But in
>> most cases that'd not be particularly problematic.

> I advocated for this in the past, and still think it's the best option.

I think the not-same-libpq issue would be a much bigger problem than either
of you are accounting for.

OK. So rather than building our own $everything from scratch, lets look at solving that. I'd argue that the PostgreSQL community has enough work to do maintaining the drivers that are fairly directly community-owned like PgJDBC and psqlODBC, let alone writing new ones for less-in-demand languages just for test use.

Perl modules support local installations. Would it be reasonable to require DBI to be present, then rebuild DBD::Pg against our libpq during our own test infra compilation, and run it with a suitable LD_LIBRARY_PATH or using rpath? That'd actually get us a side-benefit of some test coverage of DBD::Pg and libpq.

Note that I'm not saying this is the only or right way. Your concerns about using a system libpq are entirely valid, as are your concerns about trying to link to our libpq using a DBD::Pg originally built against the system libpq. I've just been dealing with issues similar to those today and I know how much of a hassle it can be. However, I don't think "it's hard" is a good reason to write a whole new driver and potentially do a Netscape/Mozilla.

* Since we'd have to presume a possibly-very-back-rev libpq, we could
never add tests related to any recent libpq bug fixes or new features.

We can feature-test. We're not dealing with pg_regress where it's essentially impossible to do anything conditionally without blocks of plpgsql. We're dealing with Perl and Test::More where we can make tests conditional, we can probe the runtime environment to decide whether we can run a given test, etc.

Also, lets compare to the status quo. Will it be worse than exec()ing psql with a string argument and trying to do sane database access via stdio? Definitely. Not.

* The system libpq might have a different idea of default port
and/or socket directory than the test build. Yeah, we could work
around that by forcing the choice all the time, but it would be a
constant hazard.

I'd call that a minor irritation personally, as all we have to do is set up the environment and we're done.

* We don't have control over what else gets brought in beside libpq.

That's a very significant point that we must pay attention to.

Anyone who's worked with nss will probably know the pain of surprise library conflicts arising from nss plugins being loaded unexpectedly into the program. I still twitch thinking about libnss-ldap.

It'd make a lot of sense to capture "ldd" output and/or do a minimal run with LD_DEBUG set during buildfarm test runs to help identify any interesting surprises.

Depending on the whims of the platform packagers, there might need
to be other parts of the platform's default postgres installation,
eg psql, sitting in one's PATH. Again, this wouldn't be too much
of a hazard for pre-debugged test scripts --- but it would be a huge
hazard for developers, who do lots of things manually and would always be
at risk of invoking the wrong psql. I learned long ago not to have any
part of a platform's postgres packages in place on my development systems,
and I will fight hard against any test procedure change that requires me
to do differently.

TBH I think that's a habit/procedure issue. That doesn't make it invalid, but it might not be as big a concern for everyone as for you. I have a shell alias that sets up my environment and another that prints the current setup. I never run into issues like this, despite often multi-tasking while tired. Not only that, I find platform postgres extremely useful to have on my systems to use for simple cross-version tests.

If we adopt something like I suggested above where we (re)build DBD::Pg against our installed pg and libpq, that wouldn't be much of an issue anyway. We'd want a nice way to set up the runtime environment in a shell for manual testing of course, like a generated .sh we can source. But frankly I'd find that a useful thing to have in postgres anyway. I can't count the number of times I've wished there was an easy way to pause pg_regress and launch a psql session against its temp instance, or do the same with a TAP test.

Now, none of these things are really a problem with DBD/DBI as such
--- rather, they are reasons not to depend on a pre-packaged build
of DBD::Pg that depends on a pre-packaged build of libpq.so.

Yeah. While I don't agree with your conclusion in terms of how you weigh the priorities, I agree that all your points are valid and make sense. I'd personally go ahead anyway, because the status quo is pretty bad and an incremental improvement would still be significant. But I understand your reluctance a bit better.

I haven't looked at the size, or the license, of DBD::Pg ... but
could it be sane to include our own modified copy in the tree?

Or ... clone it on demand? I won't suggest git submodules, because they're awkward bodgy hacks, but we can reasonably enough just fetch the sources when we need them so long as they aren't a hard dependency on building postgres. Which they wouldn't need to be; at worst we'd have to skip the TAP tests if no DBD::Pg was found.

We could also permit falling back to system DBD::Pg with a suitable visible message in logs to ensure nobody gets confused.

I'd very strongly prefer not to have a copy in-tree polluting the postgres history and generally creating a mess. We don't even have "blessed" drivers that live on community infra like psqlODBC or PgJDBC in-tree.

(Forking DBD::Pg would also let us add custom testing features
to it ...)

Lets not. Unless utterly unavoidable. There's enough to do already.

> The community IMO wastes *so* much time on not-invented-here make-work and
> on jumping through hoops to avoid depending on anything newer than the late
> '90s.

That's an unnecessary, and false, ad-hominem attack.

It's none of the above. It's an honest opinion, not an attack, and it's not directed at you or any one other person.

Craig Ringer http://www.2ndQuadrant.com/
2ndQuadrant - PostgreSQL Solutions for the Enterprise

Re: tap tests driving the database via psql

From

Andres Freund

Date:

31 July 2019, 02:20:53

Hi,

On 2019-07-31 09:32:10 +0800, Craig Ringer wrote:
> OK. So rather than building our own $everything from scratch, lets look at
> solving that.

IDK, a minimal driver that just does what we need it to do is a few
hundred lines, not more. And there's plenty of stuff that we simply
won't be able to test with any driver that's not purposefully written
for testing. There's e.g. basically no way with any of the drivers to
test intentionally bogus sequences of protocol messages, yet that's
something we really ought to test.

I've written custom hacks^Wprograms to tests things like that a few
times. That really shouldn't be necessary.

> Perl modules support local installations. Would it be reasonable to require
> DBI to be present, then rebuild DBD::Pg against our libpq during our own
> test infra compilation, and run it with a suitable LD_LIBRARY_PATH or using
> rpath? That'd actually get us a side-benefit of some test coverage of
> DBD::Pg and libpq.

You're intending to download DBD::Pg? Or how would you get around the
licensing issues? Downloading it will have some issues too: For one at least I
often want to be able to run tests when offline; there's security
concerns too.

Greetings,

Andres Freund

Re: tap tests driving the database via psql

From

Michael Paquier

Date:

31 July 2019, 04:20:01

On Tue, Jul 30, 2019 at 07:20:53PM -0700, Andres Freund wrote:
> IDK, a minimal driver that just does what we need it to do is a few
> hundred lines, not more. And there's plenty of stuff that we simply
> won't be able to test with any driver that's not purposefully written
> for testing. There's e.g. basically no way with any of the drivers to
> test intentionally bogus sequences of protocol messages, yet that's
> something we really ought to test.

Yes, I don't know if a hundred of lines measure that correctly, but
really we should avoid bloating the stuff we rely on like by fetching
an independent driver if we finish by not using most of its features.
300~500 lines would be fine for me, 10k definitely not.

> I've written custom hacks^Wprograms to tests things like that a few
> times. That really shouldn't be necessary.

Mine are usually plain hacks :), and usually on top of libpq but I
have done some python-ish script to send bogus and hardcoded protocol
messages.  If we can automate a bit this area, that could be useful.

>> Perl modules support local installations. Would it be reasonable to require
>> DBI to be present, then rebuild DBD::Pg against our libpq during our own
>> test infra compilation, and run it with a suitable LD_LIBRARY_PATH or using
>> rpath? That'd actually get us a side-benefit of some test coverage of
>> DBD::Pg and libpq.
>
> You're intending to download DBD::Pg? Or how would you get around the
> licensing issues? Downloading it will have some issues too: For one at least I
> often want to be able to run tests when offline; there's security
> concerns too.

As Craig has mentioned CPAN can be configured to install all the
libraries for a local user, so there is no need to be online if a copy
has been already downloaded.
--
Michael

Attachment

signature.asc

Re: tap tests driving the database via psql

From

Craig Ringer

Date:

31 July 2019, 05:08:12

On Wed, 31 Jul 2019 at 10:20, Andres Freund <andres@anarazel.de> wrote:

Hi,

On 2019-07-31 09:32:10 +0800, Craig Ringer wrote:
> OK. So rather than building our own $everything from scratch, lets look at
> solving that.

IDK, a minimal driver that just does what we need it to do is a few
hundred lines, not more. And there's plenty of stuff that we simply
won't be able to test with any driver that's not purposefully written
for testing. There's e.g. basically no way with any of the drivers to
test intentionally bogus sequences of protocol messages, yet that's
something we really ought to test.

I've written custom hacks^Wprograms to tests things like that a few
times. That really shouldn't be necessary.

That's a good point. I've had to write simple protocol hacks myself, and having a reusable base tool for it would indeed be valuable.

I'm just worried it'll snowball into Yet Another Driver We Don't Need, or land up as a half-finished poorly-maintained burden nobody wants to touch. Though the general fondness for and familiarity with Perl in the core circle of Pg contributors and committers would probably help here, since it'd inevitably land up being written in Perl...

I'm interested in evolution of the protocol and ways to enhance it, and I can see something we can actively use for protocol testing being valuable. If done right, the protocol implementation could probably pass-through inspect messages from regular libpq etc as well as serving as either a client or even as a dumb server simulator for pregenerated responses for client testing.

We certainly couldn't do anything like that with existing tools and by reusing existing drivers.

I wonder if there's any way to make writing and maintaining it less painful though. Having to make everything work with Perl 5.8.8 and no non-core modules leads to a whole pile of bespoke code and reinvention.

> Perl modules support local installations. Would it be reasonable to require
> DBI to be present, then rebuild DBD::Pg against our libpq during our own
> test infra compilation, and run it with a suitable LD_LIBRARY_PATH or using
> rpath? That'd actually get us a side-benefit of some test coverage of
> DBD::Pg and libpq.

You're intending to download DBD::Pg?

That'd be a reasonable option IMO, yes. Either via git with https, or a tarball where we check a signature. So long as it can use any pre-existing local copy without needing to hit the Internet, and the build can also succeed without the presence of it at all, I think that'd be reasonable enough.

But I'm probably getting contaminated by excessive exposure to Apache Maven when I have to work with Java. I'm rapidly getting used to downloading things being a part of builds. Personally I'd expect that unless specifically testing new libpq functionality etc, most people would just be using their system DBD::Pg... and I consider linking the system DBD::Pg against our new-compiled libpq more feature than bug, as if we retain the same soname we should be doing a reasonable job of not breaking upward compatibility anyway. (It'd potentially be an issue if you have a very new system libpq and are running tests on an old postgres, I guess).

Or how would you get around the licensing issues? Downloading it will have some issues too: For one at least I
often want to be able to run tests when offline; there's security
concerns too.

Roughly what I was thinking of was:

For Pg source builds (git, or dist tarball), we'd generally curl a tarball of DBD::Pg from a HTTPs URL specified in Makefile variables (with =? so it can be overridden to point to an alt location, different version, local path, etc) and unpack it, then build it. The makefile can also store a signing key fingerprint so we can download the sig file and check the sig is by the expected signer if the user has imported the signing key for DBD::Pg.

We could test if the target directory already exists and is populated and re-use it, so people can git-clone DBD::Pg if they prefer.

And we'd allow users to specify --with-system-dbd-pg at configure time, or --without-dbd-pg .

The Pg perl libraries for our TAP test harness/wrappers would offer a simple function to skip a test if DBD::Pg is missing, a simple function to skip a test if the loaded DBD::Pg lacks $some_feature_or_attribute, etc.

Greetings,

Andres Freund

Craig Ringer http://www.2ndQuadrant.com/
2ndQuadrant - PostgreSQL Solutions for the Enterprise