Thread: a comparison of 4 databases

From
Mark Kirkwood
Date:
A Comparison Of 4 Databases
---------------------------



Intro
-----

This paper shows the results of an evaluation of 4 databases. I am posting it
to this mail group as I think Postgresql emerged quite favourably.

The evaluated databases were Oracle, Informix, Mysql and Postgresql.

Features and performance were examined.



Hardware And Operating System
-----------------------------

2 x HP Vectra VE 7 each with 160M RAM + 1x3.2G + 1x13G Quantum IDE Drives were used.
Redhat 6.0 was used as the operating system. No kernel changes were made.



Initial Experiences
-------------------

Mysql was obtained in rpm format and was amazingly easy to install. The
installation process created and started a database. The version was 3.22.27.

Documentation was supplied and was good.


Postgresql was similarly elementary to install, and again a database was
created and started. The product is bundled with the flavour of Linux used
and was in rpm format. The version was 6.5.2.

Documentation was supplied and was very good.


Informix was more cryptic to install. It was obtained in rpm format and
installed. However, this merely unpacked an archive, and the real installation
process had to be run afterwards. Even then it was unclear what was
required next - the install does not create a database.
Using some of the (not supplied) documentation, it was discovered how to create
and configure a database. The version was 7.30. (This is old, but it is all they
are supplying on this platform - 9.x is current.)

Documentation was not supplied; it was available on the Informix web site. It is ok.


Oracle was difficult to judge, as the author installs it frequently. However,
approaching it as a newcomer would, it would be very difficult to install.
It does not come in rpm format; it is downloadable from the Oracle web site.
The small amount of included documentation is sufficient to enable someone
to work out how to start the installer program. This program is a rudimentary
wizard that asks questions and presents a list of components to choose from -
a newcomer would suffer confusion here. The installer can create a database as
part of the install. The version was 8.0.5. (This is slightly old - 8.1.5 is
current but buggy; 8.0.5 is the latest stable release on this platform.)

Documentation is not supplied; it is available from the Oracle web site. It is
ok.



Tests And Results
-----------------

Database Feature Comparison

Database   Cost       Trans   Row   Const   Program  Sec  Fail  Hot
                   actions Lock  raints  mable    ure  Safe  back

Mysql      0/3000  No      No    No      Partial  Yes  No    No
Postgresql 0       Yes     Yes   Partial Yes      Yes  Yes   No
Oracle     3200    Yes     Yes   Yes     Yes      Yes  Yes   Yes
Informix   2000    Yes     No    Yes     Yes      Yes  No    No


Cost

    NZ$ for a 10 user license. Two prices mean that the product is charged
    for if resold as part of an application (Mysql).
    Support is not included

Transactions

    Commit, rollback, and isolation levels of at least read committed

Row Locking

    A select for update that locks only the rows selected and does not
    block reads
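
The pattern being tested can be sketched as follows, using the master table from the appendix schema (illustrative only; the exact statements used per database are not recorded here):

```sql
-- Lock a single master row without blocking readers of that row.
begin;
select * from master where mas_id = 7000 for update;
-- Other sessions may still read this row, and may lock any OTHER row;
-- only a second "for update" on mas_id = 7000 should block.
update master set mas_amt = mas_amt + 10 where mas_id = 7000;
commit;
```

A "Yes" requires that only the selected row is locked; page or table level locks (which also block readers) fail this test.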

Constraints

    primary and foreign key, with the ability to enable/disable or drop/add.
    The existence of constraints gives "Partial"; the ability to enable,
    disable etc as well gives "Yes"
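
For illustration, the distinction is roughly the following (Oracle-flavoured syntax for the enable/disable part, since not all four products support it; constraint names reuse the appendix index names):

```sql
-- Declaring the constraints - "Partial" requires at least this much:
alter table master add constraint mas_pk primary key (mas_id);
alter table detail add constraint det_fk
  foreign key (det_mas_id) references master (mas_id);

-- Enabling/disabling without dropping - "Yes" requires something like:
alter table detail disable constraint det_fk;   -- e.g. around a bulk load
alter table detail enable constraint det_fk;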

Programmable

    create trigger, plus procedural language extensions to SQL.
    A "Partial" means that only external functions are supported
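
A sketch of the kind of capability meant, in Postgresql flavour against the appendix schema (the function name log_master is hypothetical, not from the tests; Postgresql 6.5 trigger functions return OPAQUE):

```sql
-- A procedural (PL/pgSQL) function...
create function log_master() returns opaque as '
begin
    insert into masterlog values (new.mas_id, ''now'');
    return new;
end;
' language 'plpgsql';

-- ...fired by a trigger on the master table.
create trigger master_log after update on master
    for each row execute procedure log_master();
```

Products scoring "Partial" could only call out to externally compiled functions rather than define logic in an SQL procedural language like this.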

Secure

    Requires username and password validation

Fail Safe

    delete of 100 out of 100000 rows, commit (if supported) then power
    off and on; a check after power on showing 99900 rows gets a "Yes"

Hot Backup

    Can a consistent backup of the database be taken while it is up,
    with transactions being performed against it?



Database Performance Comparison - Data Warehouse

Database    Bulk Load  Summarize 3%  Summarize 0.3%  Summarize 0.3%
            1M         of 3M         of 3M           of 3M
                       1 dim 1 fact  2 dim 1 fact    3 dim 1 fact

Mysql       20         14            90              57
Postgresql  420        16            4               7
Oracle      65         5             3               3
Informix    170        8             5               6

Notes



Bulk Load

    elapsed seconds to load 1000000 rows via whatever bulk load tool is
    supplied.
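
The bulk load tools in question differ per product; the exact invocations used in the tests are not recorded here, but the two SQL-level ones look roughly like this (Oracle uses the external SQL*Loader utility and Informix a loader such as dbload, rather than an SQL statement):

```sql
-- Postgresql: COPY reads a delimited file on the server side
COPY detail FROM '/tmp/detail.dat' USING DELIMITERS '|';

-- Mysql: LOAD DATA is the analogous bulk path
LOAD DATA INFILE '/tmp/detail.dat' INTO TABLE detail
  FIELDS TERMINATED BY '|';
```

The file path and delimiter here are placeholders for illustration.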

Summarize 3%
1 dim + 1 fact

    Measure elapsed seconds to display a summarized count and one timelike
    dimension attribute for a basic star query.
    Access path can be via an index scan

Summarize 0.3%
2 dim + 1 fact

    Measure elapsed seconds to display a summarized count and one non timelike
    dimension attribute for a less basic star query.
    Access path can be via index scans

Summarize 0.3%
3 dim + 1 fact

    Measure elapsed seconds to display a summarized count and one non timelike
    dimension attribute for a more complex star query.
    Access path can be via index scans



Database Performance Comparison - Online

Database    Create  Create  Query 1  Query 4  Query Summary
            10K     1M      of 10K   of 1M    10% of 1M

Mysql       7       891     0        0        1
Postgresql  21      4430    0        0        2
Oracle      31      3694    0        0        2
Informix                    0        0        10


Database    Tpm     Tpm     Tpm      Tpm
            1 sess. 2 sess. 4 sess.  8 sess.

Mysql       59/58   59/115  59/229   58/425
Postgresql  48      90      169      233
Oracle      55      108     180      291
Informix



Notes

    Unfortunately no Informix SDK was available, so there are no results for
    the first two tests or the last test.

Create 10K

    elapsed seconds for a PHP app to insert 10000 rows one by one

Create 1M

    elapsed seconds for a PHP app to insert 1000000 rows one by one


Query 1

    0 seconds means that no delay was measurable.
    Measure elapsed seconds to show a master record (1 row) located 70% of the
    way through the table. Access path can be via an index scan

Query 4

    Measure elapsed seconds to display 4 detail rows for one of the master
    records in the previous test. Access path can be via index scans

Query Summary

    Measure elapsed seconds to display a summarized sum and count
    for 5% of 1000000 rows. This is a 2 table join. Access path can be via
    index scans

Tpm n sessions

    Transactions per minute for n sessions, achievable by a lightweight Perl
    DBI application. The two results for Mysql refer to two possible locking
    schemes
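
The second Mysql locking scheme is its named application locks (GET_LOCK / RELEASE_LOCK); a sketch of how they mimic a row lock on one master record, assuming every access path cooperates by taking the same lock name:

```sql
-- Acquire an application lock named for the row, waiting up to 10s;
-- GET_LOCK returns 1 on success, 0 on timeout.
SELECT GET_LOCK('master_7000', 10);

UPDATE master SET mas_amt = mas_amt + 10 WHERE mas_id = 7000;

-- Release so other sessions contending on 'master_7000' can proceed.
SELECT RELEASE_LOCK('master_7000');
```

The lock name convention (table name plus key value) is an assumption for illustration; nothing enforces it, which is exactly the caveat noted above.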


Analysis
--------

Features

Oracle clearly has the most features. However it is Postgresql that is next in
line.

The features fail safe, transactions and security are seen as essential,
with row locking as highly desirable.

This means that at face value only Oracle and Postgresql satisfy the criteria.

Initially Postgresql did not force password entry and thus was not secure.
However, it was discovered that access to the database is configured at install
time as trusted, and has to be changed to require passwords.
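
The change is made in Postgresql's pg_hba.conf; a sketch of the kind of entry involved (field layout as in the 6.5-era file: type, database, IP address, mask, auth method):

```
# pg_hba.conf - the install default trusts local connections outright:
#   host   all   127.0.0.1   255.255.255.255   trust
# Changing the auth method to "password" forces validation:
host   all   127.0.0.1   255.255.255.255   password
```

The address and mask shown are illustrative; the point is replacing "trust" with "password".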

It should be noted that Oracle needed a patch to be fail safe, and that this
was not advertised on the web site; thus it is possible that Informix and Mysql
can also be similarly patched.

Repeated tests with Informix page level locks showed that readers were blocked
at this point. This was viewed as extremely undesirable.

Mysql's lack of transactions was a killer for it; the complexity of having to
programmatically undo an n-table operation was viewed as prohibitive.
(It was noted that the Mysql to-do list had atomic operations - pseudo
transactions - on it.)

In conclusion on the features front, Oracle and Postgresql emerge as the leaders.



Performance

1 Online operations

Initially Postgresql refused to use indexes to help with queries. A mail to one
of the Postgresql Development Team traced this to an unusual pickiness with
respect to the implied datatypes of constants in the where clause.
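
The exact queries that misbehaved are not recorded here, but the classic form of the problem was a bare integer literal (which Postgresql types as int4) compared against a column of another integer type; the table and column below are hypothetical:

```sql
-- Hypothetical int2 column "code": the bare literal is typed int4,
-- the types mismatch, and the planner may fall back to a sequential scan:
select * from items where code = 42;

-- Quoting or casting the constant to the column's type lets the index
-- be used:
select * from items where code = '42';
select * from items where code = 42::int2;
```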


Informix had problems with the summary query; it would not go faster than 10s.
However, it seemed to perform ok for the 10000 and 1000000 query searches.

Mysql demonstrates the overhead associated with multiple versioning and
transactions: it was obviously the fastest in raw speed terms, in
general 4-5 times faster than anything else.

Postgresql was in general about 20% slower than Oracle.

The multiple session transaction test is the most interesting. The idea was to
mimic a TPC-A style banking application with some user think time (1 second)
during the transaction.
Clearly Mysql with its table level locks could not scale at all. However, it has
application locks that are nameable and so can mimic row locks - as long as
every access uses them. When these were turned on, a dramatic turnaround was
observed.

Thus Mysql is fastest, and most scalable, if application locks can be used.
Oracle and Postgresql are similar in scalability, but Postgresql begins to
tail off a little before Oracle does.


2 Data warehousing operations

Mysql has a very fast bulk load facility.
Oracle also has a very fast bulk load.
Informix has a reasonably fast one.
Postgresql is noticeably slower than any of the others.

Mysql performed extremely poorly at the star type queries.
Oracle and Informix performed very well.
Postgresql performed very well in all but the first query - ironically the
simplest, though it scanned the most data. This points to Oracle and Informix
having faster nested loop or sort algorithms.

Both Oracle and Postgresql would experience a period of poor performance on a
table immediately after it was bulk loaded - typically until it was vacuumed
or analyzed.
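
The remedy is refreshing the optimizer's statistics after the load; for the two products concerned, using the fact table from the appendix schema:

```sql
-- Postgresql: reclaim space and gather statistics in one pass
VACUUM ANALYZE fact0;

-- Oracle: gather statistics for the cost-based optimizer
ANALYZE TABLE fact0 COMPUTE STATISTICS;
```

Until this is done, both planners work from stale (pre-load) statistics and can choose poor access paths.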

In conclusion for this section, Oracle and Informix are very good.
Postgresql is good but is hampered by slow bulk load and slow sorting of
large datasets.
Mysql can get the data in fast but is poor at querying it.


Overall
-------

Informix performs mostly ok, but its locking scheme precludes it from serious
on line use (it would have been interesting to test this).
Oracle performs well.
Mysql's lack of transactions would have been ok for data warehousing, but it
could not do star queries. This rules it out.
Postgresql performed about 20% slower than Oracle, except for bulk loads and
large dataset sorts. This means that it is suitable for data warehousing and
on line operations, with the proviso that large data loads and some large
summary queries may be unpleasantly slow.



Appendix

( database schemas and source for programs )

Schema and queries for on line tests

/* -------------------------------------------------------------------
   file     : createupdattables.sql
   does     : create tables for updat example...
   notes    : master has 10000 rows
              detail has 1000000 rows 100 for each master row
   ------------------------------------------------------------------*/

create table master
  (
   mas_id                         int4,
   mas_name                       varchar(100),
   mas_amt                        float8
  );

create table detail
  (
   det_id                          int4,
   det_mas_id                      int4,
   det_date                        datetime,
   det_amt                         float8
  );

create table masterlog
  (
    log_mas_id                   int4,
    log_date                     datetime);



/* -----------------------------------------------------------------
   file     : createupdatindexes.sql
   does     : create indexes and keys for tables...
   ----------------------------------------------------------------- */

create unique index mas_pk on master using btree (mas_id);

create index det_pk on detail using btree (det_id);
create index det_fk on detail using btree (det_mas_id);



/* -----------------------------------------------------------------
   file     : queries1-3.sql
   does     : the queries...
   ----------------------------------------------------------------- */



select *
from master
where mas_id = 7000;


select *
from detail
where det_mas_id = 7000
and   det_amt    > 960;



select sum(mas_amt),count(*)
from detail,master
where mas_id > 8000
and   mas_id < 9000
and mas_id = det_mas_id;



#!/usr/bin/perl
# -----------------------------------------------------------------
# file     : updat.pm
# does     : on line transactions test...
#
# notes    : postgres version
# -----------------------------------------------------------------

sub  dbconnect()
{
  # connect with no autocommit...
  #
  $dbh = DBI->connect('DBI:Pg:dbname=dev1',$user,$pwd,{AutoCommit => 0})
              or die "Couldn't connect to database: " . DBI->errstr;
}


sub  dbdisconnect()
{
  #  disconnect...
  #
  $dbh->disconnect;
}


sub  dbparse()
{
   #  parse all statements here ( actually some databases do nothing here )...
   #

   #  master select and update...
   #
   $sthmsel = $dbh->prepare("select * from master where mas_id = ? for update")
                or die "Couldn't prepare statement: " . $dbh->errstr;

   $sthmupd = $dbh->prepare("update master set mas_amt = mas_amt + ? where mas_id = ?")
                or die "Couldn't prepare statement: " . $dbh->errstr;

   #  detail select and update...
   #
   $sthdsel = $dbh->prepare("select * from detail where det_mas_id = ? and det_date > '1999-12-31' for update")
                or die "Couldn't prepare statement: " . $dbh->errstr;

   $sthdupd = $dbh->prepare("update detail set det_amt = det_amt + ? where det_mas_id = ? and det_id = ?")
                or die "Couldn't prepare statement: " . $dbh->errstr;

   #  log insert...
   #
   $sthlins = $dbh->prepare("insert into masterlog values(?,datetime 'now')")
                or die "Couldn't prepare statement: " . $dbh->errstr;
}


sub  dbfree()
{
   # free master and detail and log statements...
   #
   $sthmsel->finish;
   $sthmupd->finish;
   $sthdsel->finish;
   $sthdupd->finish;
   $sthlins->finish;
}


sub  putlog()
{
   # execute the log insert...
   #
   $sthlins->execute($masid);


}


sub  getdetail()
{
   #  execute and fetch detail query...
   #
   $sthdsel->execute($masid);
   my @data = $sthdsel->fetchrow_array();
   my $detid = $data[0];

   #  execute detail update...
   #
   $sthdupd->execute($amt,$masid,$detid);

}


sub  getmaster()
{

   #  execute and fetch master query...
   #
   $sthmsel->execute($masid);

   my @data = $sthmsel->fetchrow_array();

   #  user typing allowance
   #
   if ( $sleep == 0 )
     {
     sleep(1);
     }


   # get and update the detail...
   #
   getdetail;


   #  execute master update...
   #
   $sthmupd->execute($amt,$masid);


   # log the transaction...
   #
   putlog;

   # commit...
   #
   my $rc = $dbh->commit
            or die $dbh->errstr;
}


sub  init()
{
  use DBI;

  #  two variables are "on" if set to zero
  #
  $debug    = 1;
  $sleep    = 0;

  $updatcnt = 0;
  $updatlim = 100;

  $amt      = 10;
  $maslim   = 10000;
  $masid    = 0;

  $begin    = 0;
  $end      = 0;
  $elapsed  = 0;

  $user     = "updat";
  $pwd      = "updat";
  $db       = "dev1";

  printf("  beginning $updatlim transactions...\n");
  $begin = time;

  #  connect ...
  #
  dbconnect;

  #  parse all statements...
  #
  dbparse;

  #  loop as required...
  #

  while ( $updatcnt < $updatlim )
  {
    #  get a random master id...
    #
    $masid     =  int ( rand $maslim ) + 1;
    if ( $debug == 0 )
      {
      printf("  processing master $masid \n");
      }
    getmaster;
    $updatcnt = $updatcnt + 1;

  }

  #  calculate stats...
  #
  $end = time;
  $elapsed = $end - $begin;
  printf("  transactions = $updatlim\n");
  printf("  elapsed      = $elapsed\n");

  #  free all statement handles...
  #
  dbfree;

  # disconnect...
  #
  dbdisconnect;
}

init;




Schema and queries for warehouse tests

/* -----------------------------------------------------------------
   file     : createwaretables.sql
   does     : create tables for ware example...
   notes    : dimension tables (dim0-2) have 900, 10000 and 1000
              rows resp.
              fact table (fact0) has 3000000 rows.
   ---------------------------------------------------------------- */

create table dim0
  (
   d0key                         int4,
   d0f1                          datetime,
   d0f2                          int4,
   d0f3                          int4,
   d0f4                          int4
  );

create table dim1
  (
   d1key                         int4,
   d1f1                          varchar(20),
   d1f2                          varchar(20)
  );

create table dim2
  (
   d2key                         int4,
   d2f1                          varchar(20),
   d2f2                          varchar(20)
  );

create table fact0
  (
  d0key                          int4,
  d1key                          int4,
  d2key                          int4,
  f1                             int4,
  f2                             int4
  );



/* -----------------------------------------------------------------
   file     : createwareindexes.sql
   does     : create indexes for ware example...
   ----------------------------------------------------------------- */

create unique index dim0_pk on dim0 using btree (d0key);
create unique index dim1_pk on dim1 using btree (d1key);
create unique index dim2_pk on dim2 using btree (d2key);
create index fact0_q1 on fact0 using btree (d0key);
create index fact0_q2 on fact0 using btree (d0key,d1key);
create index fact0_q3 on fact0 using btree (d0key,d1key,d2key);


/* -----------------------------------------------------------------
   file     : queries1-3.sql
   does     : star queries for warehouse...
   notes    : query 1 scans 100000 rows from fact0 and 30 rows
              from dim0

              query 2 scans 10000 rows from fact0 and
              30 rows from dim0 and 10 rows from dim2

              query 3 scans 10000 rows from fact0 and 30 rows
              from dim0 and 10 rows from dim2 and 100 rows from dim1
   ----------------------------------------------------------------- */


select
       d0.d0f1,
       count(f.f1)
from dim0 d0,
     fact0 f
where d0.d0key = f.d0key
and   d0.d0f1 between '1996-05-01' and '1996-05-31'
group by d0.d0f1
;


select
       d0.d0f1,
       d2.d2f2,
       count(f.f1)
from dim0 d0,
     dim2 d2,
     fact0 f
where d0.d0key = f.d0key
and   d2.d2key = f.d2key
and   d2.d2f2  = '90th att type'
and   d0.d0f1  between  '1996-05-01' and '1996-05-31'
group by d0.d0f1,d2.d2f2
;


select
       d0.d0f1,
       d1.d1f2,
       d2.d2f2,
       count(f.f1)
from dim0 d0,
     dim1 d1,
     dim2 d2,
     fact0 f
where d0.d0key = f.d0key
and   d1.d1key = f.d1key
and   d2.d2key = f.d2key
and   d2.d2f2 between '80th att type' and '90th att type'
and   d1.d1f2 between '10th att type' and '50th att type'
and   d0.d0f1 between '1996-05-01' and '1996-05-31'
group by d0.d0f1,d1.d1f2,d2.d2f2
;


Scripts to populate tables

Data Warehouse

#!/usr/bin/perl
#  --------------------------------------------------------------------------
#  file     : popdim0.pm
#  does     : populate the dim0 timelike dimension...
#  --------------------------------------------------------------------------


sub  adddim()
{
  #  parse and execute the insert...
  #
  $sthins = $dbh->prepare("insert into dim0 values (?,('1999-05-01'::datetime + ?)::datetime,date_part('DAY',('1999-05-01'::datetime + ?)::datetime),date_part('MONTH',('1999-05-01'::datetime + ?)::datetime))")
            or die "  could not prepare \n";

  $sthins->execute($dimcnt,$dimcnt,$dimcnt,$dimcnt)
           or die "  failed to execute for $dimcnt\n";
}


sub  attachdb()
{
  #  connect to the database...
  #
  $dbh = DBI->connect($datasource, $user, $pwd,{ AutoCommit => 0 })
         or die "cannot connect to $datasource\n";

}


sub  unattachdb()
{
  #  commit and disconnect...
  #
  $dbh->commit;
  $dbh->disconnect();
}


sub  init()
{
  #  setup and insert $dimlim rows for the timelike
  #  dimension...
  #
  #
  use DBI;

  $datasource  = "DBI:Pg:dbname=dev1";
  $user        = "ware";
  $pwd         = "ware";
  $dbh         = 0;
  $sthins      = 0;

  $dimlim      = 900;
  $dimcnt      = 0;

  attachdb();

  while ( $dimcnt < $dimlim )
    {
    if ( $dimcnt%100 == 0 )
      {
      printf("processing ...$dimcnt\n");
      }
    adddim();
    $dimcnt = $dimcnt + 1;
    }

  unattachdb();
}

init();

#!/usr/bin/perl
#  --------------------------------------------------------------------------
#  file     : popdim1.pm
#  does     : populate the dim1 dimension...
#  --------------------------------------------------------------------------


sub  adddim()
{
  #  parse and execute the insert...
  #
  $sthins = $dbh->prepare("insert into dim1 values (?,? ||'th attribute',?%100 ||'th att type' )")
            or die "  could not prepare \n";

  $sthins->execute($dimcnt,$dimcnt,$dimcnt)
           or die "  failed to execute for $dimcnt\n";
}


sub  attachdb()
{
  #  connect to the database...
  #
  $dbh = DBI->connect($datasource, $user, $pwd,{ AutoCommit => 0 })
         or die "cannot connect to $datasource\n";

}


sub  unattachdb()
{
  #  commit and disconnect...
  #
  $dbh->commit;
  $dbh->disconnect();
}


sub  init()
{
  #  setup and insert $dimlim rows for the timelike
  #  dimension...
  #
  #
  use DBI;

  $datasource  = "DBI:Pg:dbname=dev1";
  $user        = "ware";
  $pwd         = "ware";
  $dbh         = 0;
  $sthins      = 0;

  $dimlim      = 10000;
  $dimcnt      = 0;

  attachdb();

  while ( $dimcnt < $dimlim )
    {
    if ( $dimcnt%1000 == 0 )
      {
      printf("processing ...$dimcnt\n");
      }
    adddim();
    $dimcnt = $dimcnt + 1;
    }

  unattachdb();
}

init();

#!/usr/bin/perl
#  --------------------------------------------------------------------------
#  file     : popdim2.pm
#  does     : populate the dim2 dimension...
#  --------------------------------------------------------------------------


sub  adddim()
{
  #  parse and execute the insert...
  #
  $sthins = $dbh->prepare("insert into dim2 values (?,? ||'th attribute',?%100 ||'th att type' )")
            or die "  could not prepare \n";

  $sthins->execute($dimcnt,$dimcnt,$dimcnt)
           or die "  failed to execute for $dimcnt\n";
}


sub  attachdb()
{
  #  connect to the database...
  #
  $dbh = DBI->connect($datasource, $user, $pwd,{ AutoCommit => 0 })
         or die "cannot connect to $datasource\n";

}


sub  unattachdb()
{
  #  commit and disconnect...
  #
  $dbh->commit;
  $dbh->disconnect();
}


sub  init()
{
  #  setup and insert $dimlim rows for the timelike
  #  dimension...
  #
  #
  use DBI;

  $datasource  = "DBI:Pg:dbname=dev1";
  $user        = "ware";
  $pwd         = "ware";
  $dbh         = 0;
  $sthins      = 0;

  $dimlim      = 1000;
  $dimcnt      = 0;

  attachdb();

  while ( $dimcnt < $dimlim )
    {
    if ( $dimcnt%100 == 0 )
      {
      printf("processing ...$dimcnt\n");
      }
    adddim();
    $dimcnt = $dimcnt + 1;
    }

  unattachdb();
}

init();

#!/usr/bin/perl
#  --------------------------------------------------------------------------
#  file     : popfact0.pm
#  does     : populate the fact0 fact...
#  --------------------------------------------------------------------------


sub  addfact()
{
  #  parse and execute the insert...
  #
  $sthins = $dbh->prepare("insert into fact0 values (?,?,?,100,100)")
            or die "  could not prepare \n";

  $sthins->execute($d0key,$d1key,$d2key)
           or die "  failed to execute for $factcnt\n";
}


sub  attachdb()
{
  #  connect to the database...
  #
  $dbh = DBI->connect($datasource, $user, $pwd,{ AutoCommit => 0 })
         or die "cannot connect to $datasource\n";

}


sub  unattachdb()
{
  #  disconnect...
  #
  $dbh->disconnect();
}


sub  commitdb()
{
  #  commit...
  #
  $dbh->commit;
}


sub  init()
{
  #  setup and insert $dimlim rows for the timelike
  #  dimension...
  #
  #
  use DBI;

  $datasource  = "DBI:Pg:dbname=dev1";
  $user        = "ware";
  $pwd         = "ware";
  $dbh         = 0;
  $sthins      = 0;

  #  set up to add  $dim0lim *  $dim1lim *  $dim2lim rows...
  #
  $dim0lim      = 30;
  $dim1lim      = 1000;
  $dim2lim      = 100;
  $dim0cnt      = 1;
  $dim1cnt      = 1;
  $dim2cnt      = 1;
  $factcnt      = 1;
  $begin        = time();
  $now          = 0;
  $elapsed      = 0;

  attachdb();

  #  do all the loops...
  #
  while ( $dim0cnt <=  $dim0lim )
    {
    $d0key = 30 * $dim0cnt;
    while (  $dim1cnt <=  $dim1lim )
      {
      $d1key = 10 * $dim1cnt;
      while (  $dim2cnt <=  $dim2lim )
        {
        $d2key = 10 * $dim2cnt;

        if ( $factcnt%10000 == 0 )
          {
          $now = time();
          $elapsed = ($now - $begin)/ 60;
          printf("processed ...$factcnt in $elapsed min\n");
          commitdb();
          }
        addfact();
        $factcnt = $factcnt + 1;

        $dim2cnt = $dim2cnt + 1;
        }
      $dim1cnt = $dim1cnt + 1;
      $dim2cnt = 1;
      }
    $dim0cnt = $dim0cnt + 1;
    $dim1cnt = 1;
    $dim2cnt = 1;
    }

  commitdb();
  unattachdb();
}

init();



More scripts to populate tables

On line tables

<%
  include("posglobals.phtm");
%>
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<HTML>
 <HEAD>
  <TITLE>Example Database - Master</TITLE>
 </HEAD>
<!-- Background white, links blue (unvisited), navy (visited), red (active) -->
 <BODY
  BGCOLOR="#FFFFFF"
  TEXT="#000000"
  LINK="#0000FF"
  VLINK="#000080"
  ALINK="#FF0000"
 >
  <P>
  <FONT SIZE = 5 COLOR="red"><BOLD><CENTER>Populate Master on <%printf("%s",$dbstring)%></CENTER></BOLD></FONT>
  <HR>
  <BR>
  <%
   /* connect to the database and if successful then
      set up a cursor for data;
      do not show errors if logon fails...                                  */

    $con = @pg_pconnect("dbname=" . $dbstring . " user=" . $PHP_AUTH_USER . " password=" . $PHP_AUTH_PW);

   if ( $con > 0 )
     {
     $rows = 0;
     $rowlim = 10000;
     $begin  = time();

     /* start the transaction...                                            */

     $cur = pg_exec($con,"begin");

     /* insert the rows...                                                  */

     for ( $rows = 0; $rows < $rowlim; $rows++ )
       {
       $cur = pg_exec($con,"insert into master values (" . $rows . "," . "'master " . $rows . "',100)");
       }

     /* commit the transaction...                                           */

     $cur = pg_exec($con,"commit");
     pg_freeresult($cur);

     $end  = time();
     if ( $rows == $rowlim)
       {
       $res = "successful";
       }
     else
       {
       $res = "not successful";
       }

     /* set up the table for formatting data...                               */
     printf("<CENTER><TABLE BORDER=0 WIDTH=200>");
     printf("<TR><TH BGCOLOR=#00DDDD>Variable</th><TH BGCOLOR=#00DDDD>Value</TH></TR>");

     printf("<TR>");
     printf("<TD BGCOLOR=#999999 FGCOLOR=green>result</TD>");
     printf("<TD BGCOLOR=#DDDDDD FGCOLOR=blue>%s</TD>",$res);
     printf("</TR>");

     printf("<TR>");
     printf("<TD BGCOLOR=#999999 FGCOLOR=green>rows</TD>");
     printf("<TD BGCOLOR=#DDDDDD FGCOLOR=blue>%d</TD>",$rows);
     printf("</TR>");

     printf("<TR>");
     printf("<TD BGCOLOR=#999999 FGCOLOR=green>elapsed</TD>");
     printf("<TD BGCOLOR=#DDDDDD FGCOLOR=blue>%d</TD>",$end - $begin);
     printf("</TR>");

     printf("</TABLE></CENTER>");

     pg_close($con);
     }
   else
     {
     printf("<BR><CENTER><FONT SIZE = 4 COLOR=red>Unable to connect to the database</FONT><CENTER><BR>");
     }
   %>
  </P>
  <%
    include("postail.phtm");
  %>
 </BODY>
</HTML>

<%
  include("posglobals.phtm");
%>
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<HTML>
 <HEAD>
  <TITLE>Example Database - Master</TITLE>
 </HEAD>
<!-- Background white, links blue (unvisited), navy (visited), red (active) -->
 <BODY
  BGCOLOR="#FFFFFF"
  TEXT="#000000"
  LINK="#0000FF"
  VLINK="#000080"
  ALINK="#FF0000"
 >
  <P>
  <FONT SIZE = 5 COLOR="red"><BOLD><CENTER>Populate Detail on <%printf("%s",$dbstring)%></CENTER></BOLD></FONT>
  <HR>
  <BR>
  <%
   /* connect to the database and if successful then
      set up a cursor for data;
      do not show errors if logon fails...                                  */
    $con = @pg_pconnect("dbname=" . $dbstring . " user=" . $PHP_AUTH_USER . " password=" . $PHP_AUTH_PW);
   if ( $con > 0 )
     {
     $masrowlim =10000;
     $detrowlim = 100;
     $cnt       = 0;
     $begin     = time();
     for ( $masrows = 1; $masrows < $masrowlim + 1; $masrows++ )
       {
       for ( $detrows = 1; $detrows < $detrowlim + 1; $detrows++ )
         {
         /* start a transaction....                                         */

         if ( $cnt == 0 )
           {
           $cur = pg_exec($con,"begin");
           }

         /* do an insert and increment the counter....                      */

         $cur = pg_exec($con,"insert into detail values (" . $masrows * $detrows . ",". $masrows ."," . "datetime 'now'+" . $detrows . "," . 10 * $detrows .")");
         $cnt++;

         /* if done enough the commit and reset the counter and
            begin another one...                                            */

         if ( $cnt == 10000 )
           {
           $cnt = 1;
           $cur = pg_exec($con,"commit");
           $cur = pg_exec($con,"begin");
           }

         }
       }

     /* commit at the end...                                                */

     $cur = pg_exec($con,"commit");
     pg_freeresult($cur);

     $end = time();
     $masrows--;
     $detrows--;
     if ( $masrows == $masrowlim)
       {
       $res = "successful";
       }
     else
       {
       $res = "not successful";
       }

     /* set up the table for formatting data...                               */
     printf("<CENTER><TABLE BORDER=0 WIDTH=250>");
     printf("<TR><TH BGCOLOR=#00DDDD>Variable</th><TH BGCOLOR=#00DDDD>Value</TH></TR>");

     printf("<TR>");
     printf("<TD BGCOLOR=#999999 FGCOLOR=green>result</TD>");
     printf("<TD BGCOLOR=#DDDDDD FGCOLOR=blue>%s</TD>",$res);
     printf("</TR>");

     printf("<TR>");
     printf("<TD BGCOLOR=#999999 FGCOLOR=green>rows</TD>");
     printf("<TD BGCOLOR=#DDDDDD FGCOLOR=blue>%d</TD>",$masrows * $detrows);
     printf("</TR>");

     printf("<TR>");
     printf("<TD BGCOLOR=#999999 FGCOLOR=green>elapsed</TD>");
     printf("<TD BGCOLOR=#DDDDDD FGCOLOR=blue>%d</TD>",$end - $begin);
     printf("</TR>");

     printf("</TABLE></CENTER>");

     pg_close($con);
     }
   else
     {
     printf("<BR><CENTER><FONT SIZE = 4 COLOR=red>Unable to connect to the database</FONT><CENTER><BR>");
     }
   %>
  </P>
  <%
    include("postail.phtm");
  %>
 </BODY>
</HTML>


RE: a comparison of 4 databases

From
"Art S. Kagel"
Date:
As an, admittedly avid, and long time user of Informix database products I
felt I could not let this shoddy comparison stand without comment.  There
are many errors concerning the features and capabilities of Informix database
servers, and Mr. Kirkwood conducted several of his tests poorly, which did not
fairly represent this fine set of database server products, nor, likely, the
other fine databases included in the comparison.  Anyone interested in
his results should read on.  Since I have moused the original post I will
precede my own comments with 'ASK -    ' and indent to make them clear.

A Comparison Of 4 Databases
---------------------------

Intro
-----

This paper shows the results of an evaluation of 4 databases. I am posting it
to this mail group as I think Postgresql emerged quite favourably.

The evaluated databases were Oracle, Informix, Mysql and Postgresql.

Features and performance were examined.

Hardware And Operating System
-----------------------------

2 x HP Vectra VE 7 each with 160M RAM + 1x3.2G + 1x13G Quantum IDE Drives were
used.
Redhat 6.0 was used as the operating system. No kernel changes were made.


Initial Experiences
-------------------

Mysql was obtained in rpm format and was amazingly easy to install. The
installation process created and started a database. The version was 3.22.27

Documentation was supplied and was good.

Postgresql was similarly elementary to install, and again a database was
created and started. The product is bundled with the flavour of Linux used and
was in rpm format. The version was 6.5.2.

Documentation was supplied and was very good.

Informix was more cryptic to install. It was obtained in rpm format and
installed. However this merely installed an archive, and the real installation
process had to be run thereafter. Even then it was not obvious what was
required next - the install does not create a database.

ASK -    Indeed the install can be cryptic but there are detailed installation
    instructions that can be downloaded from the Informix site as well
    as an available How-To that explains how to make an initial
    configuration and bring up the engine for the first time.  All of
    which are also explained in detail in the Informix Administrators
    Guide.

Using some of the (not supplied) documentation it was discovered how to create
and configure a database. The version was 7.30 (this is old, but all they
are supplying on this platform - 9.x is current).

ASK -    I assume Mr. Kirkwood downloaded the Informix Dynamic Server v7.30
    which is NOT old.  The Informix server product line includes 4
    different servers and Informix is in the process of merging the code
    lines of some of these.  The Informix enterprise level engine for
    transaction processing IS the IDS 7.3x product, the latest release of
    which is 7.31, which IS available for Linux - but the link that Mr.
    Kirkwood followed took him to the older (by 3 months) 7.30 version.
    The IDS/UDO v9.1x product is known as the Universal Data Option and
    is Informix's Object Relational product descended ultimately, like
    PostgreSQL, from the Postgres project.  UDO was based on the older
    IDS 7.1x code base and some features needed for transaction oriented
    applications were missing.  In the last 3 weeks, indeed on the same
    day Mr. Kirkwood posted his report, Informix released IDS.2000 which
    is IDS v9.20 and is a merger of the 7.31 and 9.1x code bases in a new
    product promising Object Relational features, full transaction server
    features, and enterprise quality performance.  But, this is a brand
    new code base - and so - many users are waiting for IDS.2000 v9.21
    or at least the first few maintenance releases of v9.20.  It is
    important to note that Informix considers Linux a Tier I port and
    released the IDS.2000 product on Linux on the same day as it did its
    other Tier I ports like Sun/Solaris and HP/HP-UX.

Documentation was not supplied, it was available on the Informix web site. It is
ok.

ASK -    The documentation is available for perusal online at the Informix
    website from whence it can also be downloaded for local access or
    you can purchase the docs on CD-ROM for $45US or in hard copy (I
    think they are about $750US) from Informix.  Note that Informix's
    documentation has won numerous awards for quality and clarity and
    compared to Oracle's completely opaque docs I have always found it
    a pleasure to use.

Oracle was difficult to judge as the author frequently installs it. However
pretending to be coming to it new, it would be very difficult to install.
It does not come in rpm format. It is downloadable from the Oracle web site.
The small amount of included documentation is sufficient to enable someone
to work out how to start the installer program. This program is a rudimentary
wizard that asks questions and presents a list of components to choose....
a newcomer would suffer confusion here. The installer can create a database as
part of the install. The version was 8.0.5 (this is slightly old - 8.1.5 is
current but buggy; 8.0.5 is the latest stable release on this platform).

Documentation is not supplied, it is available from the Oracle web site. It is
ok.



Tests And Results
-----------------

Database Feature Comparison

Database   Cost    Trans   Row   Const   Program  Sec  Fail  Hot
                   actions Lock  raints  mable    ure  Safe  back

Mysql      0 /3000 No      No    No      Partial  Yes  No    No
Postgresql 0       Yes     Yes   Partial Yes      Yes  Yes   No
Oracle     3200    Yes     Yes   Yes     Yes      Yes  Yes   Yes
Informix   2000    Yes     No    Yes     Yes      Yes  No    No


ASK -    See below.  ALL OF THE 'No's above should be 'Yes'es.  I address
    each point below.

Cost

        NZ$ for a 10 user license. Two prices mean that the product is charged
        for if resold as part of an application (Mysql).
        Support is not included

Transactions

        Commit, rollback, isolation levels of at least read committed

ASK -    Informix permits the user to create a database in one of four modes.
    Three of these: Mode ANSI, Buffered Logged, and Unbuffered Logged
    support multiple statement transactions with BEGIN WORK (ANSI mode
    databases infer a BEGIN WORK with the first statement not in a
    transaction, ie following a COMMIT or ROLLBACK).  The fourth, which
    for historical reasons is the default mode, is non-logged which does
    NOT support multiple statement transactions, though individual SQL
    statements are treated as singleton transactions.  Based on the
    results for Fail Safe I'd say that the database was created in
    default mode without logging or perhaps in BUFFERED LOG mode and
    the COMMIT record was still in the log buffer and had not flushed
    to disk in which case Informix rightly rolled back the transaction
    (this would be noted in the engine's message log at the time the
    engine was restarted).  For this type of secure testing Mr. Kirkwood
    should have been using an UNBUFFERED LOG mode database which
    automatically flushes the log buffer when a commit record is written
    to it.  There is an extensive discussion of these issues and their
    ramifications for performance and safety in the Administrators
    Guide.  Note that most Informix DBAs run their databases in
    UNBUFFERED LOG mode.
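
The four modes Kagel describes are chosen when the database is created. A
sketch of the relevant clauses, from my recollection of the 7.3 syntax
(database and dbspace names are illustrative):

```sql
-- Non-logged (the historical default): no multi-statement transactions
CREATE DATABASE testdb IN rootdbs;

-- Buffered logging: COMMIT records can linger in the log buffer
CREATE DATABASE testdb IN rootdbs WITH BUFFERED LOG;

-- Unbuffered logging: the log buffer is flushed on every COMMIT
CREATE DATABASE testdb IN rootdbs WITH LOG;

-- Mode ANSI: implicit BEGIN WORK, unbuffered logging
CREATE DATABASE testdb IN rootdbs WITH LOG MODE ANSI;
```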

Row Locking

        select for update that locks only the rows selected and does not
        block reads

ASK -    Informix DOES indeed support row level locking though the default
    for tables is page level locking.  One need only include the LOCK
    MODE ROW clause in the CREATE TABLE statement or ALTER the table
    afterward to have row level locks.
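
Concretely, the clauses Kagel refers to look roughly like this (table
definition hypothetical, clause spelling from memory of the 7.3 syntax):

```sql
-- Request row-level locks when the table is created
CREATE TABLE account (id INTEGER, balance MONEY) LOCK MODE ROW;

-- Or switch an existing page-locked table afterwards
ALTER TABLE account LOCK MODE (ROW);
```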

Constraints

        primary and foreign key, with the ability to enable/disable or drop/add.
        Existence alone gives "Partial" and enable etc. gives "Yes"

Programmable

        create trigger, procedural language extensions to SQL
        A "Partial" means that external functions only are supported

Secure

        Requires username and password validation

Fail Safe

        delete of 100 out of 1000000 rows, commit (if supported) then power
        off and on, check after power on - 999900 rows gets a "Yes"
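
The test can be sketched as SQL, assuming a hypothetical table bigtab keyed
on id:

```sql
DELETE FROM bigtab WHERE id <= 100;  -- remove 100 rows
COMMIT WORK;                         -- the commit must survive a power cycle
-- ...power the machine off and on, restart the engine, then:
SELECT COUNT(*) FROM bigtab;         -- original count minus 100 earns a "Yes"
```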

ASK -    If Mr. Kirkwood had created his database with one of the logged
    transaction modes he would have had complete 'fail safe' behavior
    from the Informix server.  Informix cannot recover transactions
    that have not been checkpointed out to disk in a database without
    logging (the default) but it can for a logged database.

Hot Backup

        Can a consistent backup of the database be run while it is up,
        with transactions being performed against it?

ASK -    Informix was the first relational database to support online live
    consistent backup of its database servers.  Indeed Informix provides
    three archive products all of which can backup the server while being
    actively updated.  Only one of these, ontape, is available on Linux
    so far (due to some third party software the others, onarchive and
    onbar, depend on which was not available for Linux at release time)
    but that may soon change.

Database Performance Comparison - Data Warehouse

Database    Bulk Load  Summarize 3%  Summarize 0.3%  Summarize 0.3%
            1M         of 3M         of 3M           of 3M
                       1 dim 1 fact  2 dim 1 fact    3 dim 1 fact

Mysql       20         14            90              57
Postgresql  420        16            4               7
Oracle      65         5             3               3
Informix    170        8             5               6

Notes

Bulk Load

        elapsed seconds to load 1000000 rows via whatever bulk load tool is
        supplied.

ASK -    Informix has several bulk load utilities.  It would appear that Mr.
    Kirkwood either did not use the HPL loader (ipload) or did not run it in
    either of the higher speed modes, which include a mode that bypasses all
    logging and is several times faster than the logged modes.
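
For reference, the era's bulk load entry points, as I recall them (file paths
hypothetical; Oracle's equivalent is the external SQL*Loader utility rather
than a SQL statement):

```sql
-- PostgreSQL: server-side COPY
COPY bigtab FROM '/tmp/bigtab.unl';

-- MySQL
LOAD DATA INFILE '/tmp/bigtab.unl' INTO TABLE bigtab;

-- Informix: the dbaccess LOAD statement (HPL/ipload is the faster route)
LOAD FROM '/tmp/bigtab.unl' INSERT INTO bigtab;
```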

ASK -    On the performance tests, Informix performance is very dependent on
    the tuning of the engine and the presence of proper database
    statistics.  I cannot know how well or poorly Mr. Kirkwood tuned the
    engine or whether he ran the recommended suite of UPDATE STATISTICS
    commands after loading the database.  Without details of how well
    he prepared the various servers and databases so the optimizers
    could do their jobs properly one cannot evaluate the quality of
    his results.  Note that I state all of this even though Informix
    performed reasonably anyway.  I just wonder how much better ANY of
    the tested servers could have performed.
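
The statistics refresh Kagel mentions, plus rough equivalents for the other
engines (syntax from memory of the versions under test):

```sql
UPDATE STATISTICS MEDIUM FOR TABLE fact;   -- Informix
VACUUM ANALYZE fact;                       -- Postgresql
ANALYZE TABLE fact COMPUTE STATISTICS;     -- Oracle
```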

Summarize 3%
1 dim + 1 fact

        Measure elapsed seconds to display summarized count and one timelike
        dimension attribute for basic star query
        Access path can be via an index scan

Summarize 0.3%
2 dim + 1 fact

        Measure elapsed seconds to display summarized count and one non timelike
        dimension attribute for less basic star query
        Access path can be via index scans

Summarize 0.3%
3 dim + 1 fact

        Measure elapsed seconds to display a summarized count and 1 non timelike
        dimension attribute for more complex star query
        Access path can be via  index scans
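
A sketch of the shape of these star queries, with hypothetical fact and
dimension names (old-style join syntax, as the era's engines expected):

```sql
-- 2 dimensions + 1 fact, restricting to ~0.3% of the fact table
SELECT d1.attribute, COUNT(*)
FROM   fact f, dim1 d1, dim2 d2
WHERE  f.d1_key = d1.d1_key
AND    f.d2_key = d2.d2_key
AND    d2.filter_col = 'x'
GROUP BY d1.attribute;
```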



Database Performance Comparison - Online


Database    Create  Create  Query 1  Query 4  Query Summary
            10K     1M      of 10K   of 1M    10% of 1M

Mysql       7       891     0        0        1
Postgresql  21      4430    0        0        2
Oracle      31      3694    0        0        2
Informix                    0        0        10


Database    Tpm     Tpm     Tpm      Tpm
            1 sess. 2 sess. 4 sess.  8 sess.

Mysql       59/58   59/115  59/229   58/425
Postgresql  48      90      169      233
Oracle      55      108     180      291
Informix



Notes


        Unfortunately no Informix SDK was available, so there is no result for
        the first two or last tests.

ASK -    Informix's SDK is not included in the basic engine download but IS
    available for free download for Linux.  The SDK includes ESQL/C,
    CLI/ODBC, JDBC, a C++ interface, and more.  Also available in
    separate packages are I4GL (compiled version of Informix's award
    winning 4th Generation Language), R4GL (pcode version of I4GL which
    includes a source level pcode debugger), ISQL (Informix's forms,
    menus, and report writer package), and D4GL (a GUI aware version of
    I4GL).  All of these are available for download including free
    evaluation licenses and most are available with a free single user
    development license.  The SDK is free for all developers and the
    runtime package, iConnect, is free for all users.  With VERY little
    effort Mr. Kirkwood could have compiled and run his tests with
    Informix.

Create 10K

        elapsed seconds for Php app to insert 10000 rows one by one

Create 1M

        elapsed seconds for Php app to insert 1000000 rows one by one


Query 1

        0 seconds means that no delay was measurable
        Measure elapsed seconds to show master record 1 row located 70% through
        the table. Access path can be via an index scan

Query 4

        Measure elapsed seconds to display 4 detail rows for one of the master
        records in previous test. Access path can be via  index scans

Query Summary

        Measure elapsed seconds to display a summarized sum and count
        for 5% of 1000000 rows. This is a 2 table join. Access path can be via
        index scans

Tpm n sessions

        Transactions per minute for n sessions, achievable by a lightweight Perl
        DBI application. The two results for Mysql refer to two possible locking
        schemes
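
The transaction each session repeats would look something like this (schema
hypothetical; the 1 second think time sits inside the open transaction,
holding the row lock):

```sql
BEGIN WORK;
UPDATE account SET balance = balance + 10 WHERE account_id = 42;
-- ~1 second of simulated user think time here
INSERT INTO history (account_id, amount) VALUES (42, 10);
COMMIT WORK;
```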


Analysis
--------

Features

Oracle clearly has the most features. However it is Postgresql that is next in
line.

The features fail safe, transactions and security are seen as essential,
with row locking as highly desirable.

This means that at face value only Oracle and Postgresql satisfy the criteria.

ASK -    As does Informix if one takes the 10 seconds it takes to add the
    WITH LOG clause to the CREATE DATABASE and the LOCK MODE ROW clause to
    the CREATE TABLE statements.  In several cases below Mr. Kirkwood
    bothered to check the docs and even contact developers to find out
    whether some feature he wanted, which Oracle or PostgreSQL did not
    seem to have, could be made to work.  It is suspicious that he did
    not do the same for MySQL and Informix.  His Informix download came
    with a free support contract, so if reading the docs or posting
    queries to comp.databases.informix did not get the information he
    needed (and I assure you either the docs or the newsgroup would have
    given him the answers) he could have called tech support toll free.

Initially Postgresql did not force password entry and thus was not secure;
however, it was discovered that access to the database is configured at install
time as trusted and has to be changed to require passwords.
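
The setting in question lives in pg_hba.conf; in the 6.5-era format the change
is roughly:

```
# pg_hba.conf - as installed, local connections are trusted:
#   host  all  127.0.0.1  255.255.255.255  trust
# require passwords instead:
host    all  127.0.0.1  255.255.255.255  password
```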

It should be noted that Oracle needed a patch to be fail safe, and that this
was not advertised on the web site; thus it is possible that Informix and Mysql
can also be similarly patched.

Repeated tests with Informix page level locks showed that readers were blocked
at this point. This was viewed as extremely undesirable.

ASK -    In addition to setting row level locking mode for the tables, Mr.
    Kirkwood should note that Informix implements all of the ANSI
    isolation modes.  The default for a logged database is Committed Read.

Mysql's lack of transactions was a killer for it: the complexity of having to
programmatically undo an n-table operation was viewed as prohibitive.
(It was noted that the Mysql to-do list has atomic operations - pseudo
transactions - on it.)

In conclusion on the features front, Oracle and Postgresql emerge as the
leaders.



Performance

1 Online operations

Initially Postgresql refused to use indexes to help with queries. A mail to one
of the Postgresql Development Team traced this to an unusual pickiness with
respect to the implied datatypes of constants in the where clause.
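
A typical symptom in 6.5, assuming for illustration a column of type int2: a
bare integer constant is taken as int4, the types do not match, and the index
is ignored, while a quoted literal resolves to the column's type.

```sql
SELECT * FROM master WHERE id = 42;    -- constant is int4: sequential scan
SELECT * FROM master WHERE id = '42';  -- literal resolves to int2: index scan
```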


Informix had problems with the summary query: it would not go faster than 10s.
However, it seemed to perform ok for the 10000 and 1000000 row query searches.

Mysql demonstrates the overhead that multiple versioning and transactions
impose on the other products: it was obviously the fastest in raw speed terms,
in general 4-5 times faster than anything else.

Postgresql was in general about 20% slower than Oracle.

The multiple session transaction test is most interesting. The idea was to
mimic a Tpc-A style banking application with some user think time (1 second)
during the transaction.
Clearly Mysql with its table level locks could not scale at all. However it has
application locks that are nameable, so can mimic row locks - as long as every
access uses them. If these were turned on then a dramatic turnaround was
observed.
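
The nameable locks are MySQL's GET_LOCK()/RELEASE_LOCK() functions; used
consistently as a row-lock convention they look roughly like this (lock-naming
scheme hypothetical):

```sql
SELECT GET_LOCK('acct_42', 10);  -- 1 on success, 0 on timeout
UPDATE account SET balance = balance + 10 WHERE account_id = 42;
SELECT RELEASE_LOCK('acct_42');
```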

Thus Mysql is fastest, and most scalable, if application locks can be used.
Oracle and Postgresql are similar in scalability, but Postgresql begins to tail
off a little before Oracle does.


2 Data warehousing operations

Mysql has a very fast bulk load facility.
Oracle also has a very fast bulk load.
Informix has a reasonably fast one.
Postgresql is noticeably slower than any of the others.

Mysql performed extremely poorly at the star type queries.
Oracle and Informix performed very well.
Postgresql performed very well in all but the first query, which (ironically)
was the simplest - but it scanned the most data. This points to Oracle and
Informix having faster Nested Loop or Sort algorithms.

Both Oracle and Postgresql would experience a period of poor performance on a
table immediately after it was bulk loaded - typically until it was vacuumed or
analyzed.

In conclusion for this section, Oracle and Informix are very good.
Postgresql is good but is hampered by slow bulk load and sorting of large
datasets.
Mysql can get the data in fast but is poor at querying it.


Overall
-------

Informix performs mostly ok, but its locking scheme precludes it from serious
online use (it would have been interesting to test this).

ASK -    Does this make sense?  There are literally thousands of mission
    critical, speed dependent, multi-user applications running against
    Informix engines.  It makes no sense that Informix's locking
    prohibits reasonable performance.  Just this reasonableness test
    should have clued the author in that he had missed something.

Oracle performs well.
Mysql's lack of transactions would have been ok for data warehousing, but it
could not do star queries. This rules it out.
Postgresql performed about 20% slower than Oracle, except for bulk loads and
large dataset sorts. These things mean that it is suitable for data warehousing
and on line operations, with the proviso that large data loads and some large
summary queries may be unpleasantly slow.

ASK -    In brief, these tests need to be performed again, certainly for
    Informix but most likely for Oracle and the rest as well, with
    an eye to more accurately portraying the performance and features
    of each.  More care is needed.

[Benchmark code SNIPPED]

Art S. Kagel

Re: [GENERAL] RE: a comparison of 4 databases

From
Date:
Another example of why one should not trust any "comparison" that is done
by one party. As our beloved pg developer suggested, any real
comparison should be done on some reasonable, often-used, real-life
cases, and by at least 2 or 3 proven (certified -- for
Orakal -- or beloved -- for PG) EXPERTS on each target dbms,
using all possible approaches available for that database system.

This kind of thing, like religion/culture "comparison", should
be read VERY VERY carefully.


On Fri, 10 Dec 1999, Art S. Kagel wrote:

>
> As an, admittedly avid, and long time user of Informix database products I
> felt I could not leave this shoddy comparison stand without comment.  There
> are many errors concerning the features and capabilities of Informix database
> servers and Mr. Kirkwood conducted several of his tests poorly which did not
> fairly represent this fine set of database server products, nor, likely, the
> other fine databases included in the comparison either.  Anyone interested in
> his results should read on.  Since I have moused the original post I will
> preceed my own comments with 'ASK -    ' and indent to make them clear.
>
> A Comparison Of 4 Databases
> ---------------------------
>
> Intro
> -----
>
> This paper shows the results of an evaluation of 4 databases. I am posting it
> to this mail group as I think Postgresql emerged quite favourably.
>
> The evaluated databases were Oracle, Informix, Mysql and Postgresql.
>
> Features and performance were examined.
>
> Hardware And Operating System
> -----------------------------
>
> 2 x HP Vertra VE 7 each with 160M RAM + 1x3.2G + 1x13G Quantum IDE Drives were
> used.
> Redhat 6.0 was used as the operating system. No kernel changes were made.
>
>
> Initial Experiences
> -------------------
>
> Mysql was obtained in rpm format and was amazingly easy to install. The
> installation process created and started a database. The version was 3.22.27
>
> Documentation was supplied and was good.
>
> Postgresql was similarly elementary to install, and again a database was
> created and started. The product comes with the flavour of Linux used and
> was in rpm format. The version was 6.5.2
>
> Documentation was supplied and was very good.
>
> Informix was more cryptic to install. It was obtained in rpm format and
> installed. However this merely installed an archive and the real installation
> process had to be run thereafter. After this it had to be divined as to what
> was required next  - the install does not create a database.
>
> ASK -    Indeed the install can be cryptic but there are detailed installation
>     instructions that can be downloaded from the Informix site as well
>     as an available How-To that explains how to make an initial
>     configuration and bring up the engine for the first time.  All of
>     which are also explained in detail in the Informix Administrators
>     Guide.
>
> Using some of the (not supplied) documentation it was discovered how to create
> and configure a database. The version was 7.30 ( This is old, but all they
> are supplying on this platform - 9.x is current)
>
> ASK -    I assume Mr. Kirkwood downloaded the Informix Dynamic Server v7.30
>     which is NOT old.  The Informix server product line includes 4
>     different servers and Informix is in the process of merging the code
>     lines of some of these.  The Informix enterprise level engine for
>     transaction processing IS the IDS7.3x product the latest release of
>     which is 7.31 which IS available for Linux but the link that Mr.
>     Kirkwood followed took him to the older (by 3 months) 7.30 version.
>     The IDS/UDO v9.1x product is known as the Universal Data Option and
>     is Informix's Object Relational product descended ultimately, like
>     PostgreSQL, from the Postgres project.  UDO was based on the older
>     IDS 7.1x code base and some features needed for transaction oriented
>     applications were missing.  In the last 3 weeks, indeed on the same
>     day Mr. Kirkwood posted his report, Informix released IDS.2000 which
>     is IDS v9.20 and is a merger of the 7.31 and 9.1x code bases in a new
>     product promising Object Relational features, full transaction server
>     features, and enterprise quality performance.  But, this is a brand
>     new code base - and so - many users are waiting for IDS.2000 v9.21
>     or at least the first few maintenance releases of v9.20.  It is
>     important to note that Informix considers Linux a Tier I port and
>     released the IDS.2000 product on Linux on the same day as it did its
>     other Tier I ports like Sun/Solaris and HP/HP-UX.
>
> Documentation was not supplied, it was available on the Informix web site. It is
> ok.
>
> ASK -    The documentation is available for perusal online at the Informix
>     website from whence it can also be downloaded for local access or
>     you can purchase the docs on CD-ROM for $45US or in hard copy (I
>     think they are about $750US) from Informix.  Note that Informix's
>     documentation has won numerous awards for quality and clarity and
>     compared to Oracle's completely opaque docs I have always found it
>     a pleasure to use.
>
> Oracle was difficult to judge as the author frequently installs it. However
> pretending to be coming to it new, it would be very difficult to install.
> It does not come in rpm format. It is downloadable from the Oracle web site.
> The small amount of included documentation is sufficient to enable someone
> to work out how to start the installer program. This program is a rudimentary
> wizard that asks questions and presents a list of components to choose....
> a newcomer would suffer confusion here. The installer can create a database as
> part of the install. The version was 8.0.5 (this is slightly old - 8.1.5 is
> Current but buggy, 8.0.5 is the latest stable release on this platform).
>
> Documentation is not supplied, it is available from the Oracle web site. It is
> ok.
>
>
>
> Tests And results
> -----------------
>
> Database Feature
> Comparison
>
> Database   Cost    Trans   Row   Const   Program  Sec  Fail  Hot
>                    actions Lock  raints  mable    ure  Safe  back
>
> Mysql      0 /3000 No      No    No      Partial  Yes  No    No
> Postgresql 0       Yes     Yes   Partial Yes      Yes  Yes   No
> Oracle     3200    Yes     Yes   Yes     Yes      Yes  Yes   Yes
> Informix   2000    Yes     No    Yes     Yes      Yes  No    No
>
>
> ASK -    See below.  ALL OF THE 'No's above should be 'Yes'es.  I address
>     each point below.
>
> Cost
>
>         NZ$ for 10 user license. Two prices mean that the product is charged
>         for if resold as part of an application ( Mysql )
>         Support is not included
>
> Transactions
>
>         Commit, rollback, isolation levels of at least read commited
>
> ASK -    Informix permits the user to create a database in one of four modes.
>     Three of these: Mode ANSI, Buffered Logged, and Unbuffered Logged
>     support multiple statement transactions with BEGIN WORK (ANSI mode
>     databases infer a BEGIN WORK with the first statement not in a
>     transaction, ie following a COMMIT or ROLLBACK).  The fourth, which
>     for historical reasons is the default mode, is non-logged which does
>     NOT support multiple statement transactions, though individual SQL
>     statements are treated as singleton transactions.  Based on the
>     results for Fail Safe I'd say that the database was created in
>     default mode without logging or perhaps in BUFFERED LOG mode and
>     the COMMIT record was still in the log buffer and had not flushed
>     to disk in which case Informix rightly rolled back the transaction
>     (this would be noted in the engine's message log at the time the
>     engine was restarted).  For this type of secure testing Mr. Kirkwood
>     should have beenusing an UNBUFFERED LOG mode database which
>     automatically flushes the log buffer when a commit record is written
>     to it.  There is an extensive discussion of these issues and their
>     ramifications for performance and safety in the Administrators
>     Guide.  Note that most Informix DBAs run their databases in
>     UNBUFFERED LOG mode.
>
> Row Locking
>
>         select for update that locks only the rows selected and does not
>         block reads
>
> ASK -    Informix DOES indeed support row level locking though the default
>     for tables is page level locking.  One need only include the LOCK
>     MODE ROW clause in the CREATE TABLE statement or ALTER the table
>     afterward to have row level locks.
>
> Constraints
>
>         primary and foreign key, with ability to enable/ disable or drop / add
>         existence will give ""Partial"" and enable etc will give "Yes"
>
> Programmable
>
>         create trigger, procedural language extensions to SQL
>         A "Partial" means that external functions only are supported
>
> Secure
>
>         Requires username and password validation
>
> Fail Safe
>
>         delete of 100 out of 100000 rows, commit ( if have it ) then power
>         off and on, check after power on - 999900 rows gets a "Yes"
>
> ASK -    If Mr. Kirkwood had created his database with one of the logged
>     transaction modes he would have had complete 'fail safe' behavior
>     from the Informix server.  Informix cannot recover transactions
>     that have not been checkpointed out to disk in a database without
>     logging (the default) but it can for a logged database.
>
> Hot Backup
>
>         Can a consistent backup of the database be run backup while it is up
>         with transactions performed against it.
>
> ASK -    Informix was the first relational database to support online live
>     consistent backup of its database servers.  Indeed Informix provides
>     three archive products all of which can backup the server while being
>     actively updated.  Only one of these, ontape, is available on Linux
>     so far (due to some third party software the others, onarchive and
>     onbar, depend on which was not available for Linux at release time)
>     but that may soon change.
>
> Database Performance Comparison - Data
> Warehouse
>
> Database    Bulk Load  Summarize 3%  Summarize 0.3%  Summarize 0.3%
>             1M         of 3M         of 3M           of 3M
>                        1 dim 1 fact  2 dim 1 fact    3 dim 1 fact
>
> Mysql       20         14            90              57
> Postgresql  420        16            4               7
> Oracle      65         5             3               3
> Informix    170        8             5               6
>
> Notes
>
> Bulk Load
>
>         elapsed seconds to load 1000000 rows via whatever bulk load tool is
>         supplied.
>
> ASK -    Informix has several bulk load utilities.  It would appear that Mr.
>     Kirkwood either did not use the ipoader or did not run it in either
>     of the higher speed modes which includes a mode which bypasses all
>     logging and is several times faster than logged modes.
>
> ASK -    On the performance tests, Informix performance is very dependent on
>     the tuning of the engine and the presence of proper database
>     statistics.  I cannot know how well or poorly Mr. Kirkwood tuned the
>     engine or whether he ran the recommended suite of UPDATE STATISTICS
>     commands after loading the database.  Without details of how well
>     he prepared the various servers and databases so the optimizers
>     could do their jobs properly one cannot evaluate the quality of
>     his results.  Note that I state all of this even though Informix
>     performed reasonably anyway.  I just wonder how much better ANY of
>     the tested server could have performed.
>
> Summarize 3%
> 1 dim + 1 fact
>
>         Measure elapsed seconds to display summarized count and one timelike
>         dimension attribute for basic star query
>         Access path can be via an index scan
>
> Summarize 0.3%
> 2 dim + 1 fact
>
>         Measure elapsed seconds to display summarized count and one non timelike
>         dimesion attribute for less basic star query
>         Access path can be via  index scans
>
> Summarize 0.3%
> 3 dim + 1 fact
>
>         Measure elapsed seconds to display a summarized count and 1 non timelike
>         dimension attribute for more complex star query
>         Access path can be via  index scans
>
>
>
> Database Performance Comparison -
> Online
>
>
> Database    Create  Create  Query 1  Query 4  Query Summary
>             10K     1M      of 10K   of 1M    10% of 1M
>
> Mysql       7       891     0        0        1
> Postgresql  21      4430    0        0        2
> Oracle      31      3694    0        0        2
> Informix                    0        0        10
>
>
> Database    Tpm     Tpm     Tpm      Tpm
>             1 sess. 2 sess. 4 sess.  8 sess.
>
> Mysql       59/58   59/115  59/229   58/425
> Postgresql  48      90      169      233
> Oracle      55      108     180      291
> Informix
>
>
>
> Notes
>
>
>         Unfortunately no Informix SDK was available, so there is no result for
>         the first two or last tests.
>
> ASK -    Informix's SDK is not included in the basic engine download but IS
>     available for free download for Linux.  The SDK includes ESQL/C,
>     CLI/ODBC, JDBC, a C++ interface, and more.  Also available in
>     separate packages are I4GL (compiled version of Informix's award
>     winning 4th Generation Language), R4GL (pcode version of I4GL which
>     includes a source level pcode debugger), ISQL (Informix's forms,
>     menus, and report writer package), and D4GL (a GUI aware version of
>     I4GL).  All of these are available for download including free
>     evaluation licenses and most are available with a free single user
>     development license.  The SDK is free for all developers and the
>     runtime package, iConnect, is free for all users.  With VERY little
>     effort Mr. Kirkwood could have compiled and run his tests with
>     Informix.
>
> Create 10K
>
>         elapsed seconds for Php app to insert 10000 rows one by one
>
> Create 1M
>
>         elapsed seconds for Php app to insert 1000000 rows one by one
>
>
> Query 1
>
>         0 seconds means that no delay was measurable
>         Measure elapsed seconds to show a master record (1 row) located 70% through
>         the table. Access path can be via an index scan
>
> Query 4
>
>         Measure elapsed seconds to display 4 detail rows for one of the master
>         records in previous test. Access path can be via index scans
>
> Query Summary
>
>         Measure elapsed seconds to display a summarized sum and count
>         for 5% of 1000000 rows. This is a 2 table join. Access path can be via
>         index scans
>
> Tpm n sessions
>
>         Transactions per minute for n sessions, achievable for lightweight Perl
>         DBI application. Two results for Mysql refer to two possible locking
>         schemes
>
>
> Analysis
> --------
>
> Features
>
> Oracle clearly has the most features. However it is Postgresql that is next in
> line.
>
> Fail-safe operation, transactions, and security are seen as essential features,
> with row-level locking as highly desirable.
>
> This means that  at face value only Oracle and Postgresql satisfy the criteria.
>
> ASK -    As does Informix if one takes the 10 seconds it takes to add the
>     WITH LOG clause to the CREATE DATABASE and LOCK MODE ROW clause to
>     the CREATE TABLE statements.  In several cases below Mr. Kirkwood
>     bothered to check the docs and even contact developers to find out
>     if or how some feature he wanted, which Oracle or PostgreSQL did
>     not seem to have, could be made to work.  It is suspicious that he
>     did not do the same for MySQL and Informix.  His Informix download
>     came with a free support contract, so if reading the docs or
>     posting queries to comp.databases.informix did not get the
>     information he needed (and I assure you either the docs or the
>     newsgroup would have given him the answers) he could have called
>     tech support toll free.
>
> Initially Postgresql did not force password entry and thus was not secure.
> However, it was discovered that access to the database is configured as
> trusted at install time and must be changed to require passwords.
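>
> For illustration, that change is made in pg_hba.conf; a sketch in the
> 6.5-era format (the address and mask shown are hypothetical):
>
>         # pg_hba.conf - change the authentication method from "trust"
>         # to "password" on each host line
>         host    all    192.168.1.0    255.255.255.0    password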
>
> It should be noted that Oracle needed a patch to be fail safe, and that this
> was not advertised on the web site; thus it is possible that Informix and Mysql
> can also be similarly patched.
>
> Repeated tests with Informix page level locks showed that readers were blocked
> by concurrent writers. This was viewed as extremely undesirable.
>
> ASK -    In addition to setting row level locking mode for the tables Mr.
>     Kirkwood should note that Informix implements all of the ANSI
>     Isolation Modes.  The default for a logged database
>
> Mysql's lack of transactions was a killer for it: the complexity of having to
> programmatically undo an n-table operation was viewed as prohibitive.
> ( It was noted that the Mysql to do list had atomic operations - pseudo
> transactions - on it. )
>
> In conclusion on the features front, Oracle and Postgresql emerge as the
> leaders.
>
>
>
> Performance
>
> 1 Online operations
>
> Initially Postgresql refused to use indexes to help with queries. A mail to one
> of the Postgresql Development Team traced this to an unusual pickiness with
> respect to the implied datatypes of constants in the where clause.
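>
> For example, the workaround at the time was to quote (or cast) the constant
> so the parser could match it to the indexed column's type; a sketch with a
> hypothetical int2 column:
>
>         -- sequential scan: the bare constant is typed int4, the column int2
>         SELECT * FROM master WHERE id2 = 7000;
>
>         -- index scan: the quoted constant is coerced to int2
>         SELECT * FROM master WHERE id2 = '7000';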
>
>
> Informix had problems with the summary query, which would not go faster than
> 10s; however it seemed to perform ok for the 10000 and 1000000 query searches.
>
> Mysql demonstrates the overhead that multiple versioning and transactions
> impose on the other products. It was obviously the fastest in raw speed terms,
> in general 4-5 times faster than anything else.
>
> Postgresql was in general about 20% slower than Oracle.
>
> The multiple session transaction test is most interesting. The idea was to
> mimic a Tpc A style banking application with some user think time during the
> transaction (1 second ).
> Clearly Mysql with its table level locks could not scale at all. However it has
> nameable application locks that can mimic row locks - as long as every access
> uses them. When these were used, a dramatic turnaround was observed.
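>
> The nameable application locks are Mysql's GET_LOCK()/RELEASE_LOCK()
> functions; a sketch of using one to stand in for a row lock, assuming every
> session follows the same convention (lock name and table are made up):
>
>         SELECT GET_LOCK('account_42', 10);    -- wait up to 10 seconds
>         UPDATE account SET balance = balance - 100 WHERE id = 42;
>         SELECT RELEASE_LOCK('account_42');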
>
> Thus Mysql is fastest and most scalable, if application locks can be used.
> Oracle and Postgresql are similar in scalability, but Postgresql begins to tail
> off a little before Oracle does.
>
>
> 2 Data warehousing operations
>
> Mysql has a very fast bulk load facility.
> Oracle also has a very fast bulk load.
> Informix has a reasonably fast one.
> Postgresql is noticeably slower than any of the others.
>
> Mysql performed extremely poorly at the star type queries.
> Oracle and Informix performed very well.
> Postgresql performed very well in all but the first query - ironically the
> simplest, though it scanned the most data. This points to Oracle and Informix
> having faster Nested Loop or Sort algorithms.
>
> Both Oracle and Postgresql would experience a period of poor performance on a
> table immediately after it was bulk loaded - typically until it was vacuumed or
> analyzed.
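>
> For reference, the post-load statistics refresh is a single statement in each
> product; a sketch (the table name is hypothetical):
>
>         VACUUM ANALYZE fact;                       -- Postgresql
>         ANALYZE TABLE fact COMPUTE STATISTICS;     -- Oracle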
>
> In conclusion for this section Oracle, Informix are very good.
> Postgresql is good but is hampered by slow bulk load and sorting of large
> datasets.
> Mysql can get the data in fast but is poor at querying it.
>
>
> Overall
> -------
>
> Informix performs mostly ok, but its locking scheme precludes it from serious
> online use (it would have been interesting to test this).
>
> ASK -    Does this make sense?  There are literally thousands of mission
>     critical, speed dependent, multi-user applications running against
>     Informix engines.  It makes no sense that Informix's locking
>     prohibits reasonable performance.  Just this reasonableness test
>     should have clued the author in that he had missed something.
>
> Oracle performs well.
> Mysql's lack of transactions would have been ok for data warehousing, but it
> could not do star queries. This rules it out.
> Postgresql performed about 20% slower than Oracle, except for bulk loads and
> large dataset sorts. These things mean that it is suitable for data warehousing
> and on line operations, with the proviso that large data loads and some large
> summary queries may be unpleasantly slow.
>
> ASK -    In brief, these tests need to be performed again, certainly for
>     Informix but most likely for Oracle and the rest as well, with
>     an eye to more accurately portraying the performance and features
>     of each.  More care is needed.
>
> [Benchmark code SNIPPED]
>
> Art S. Kagel
>
> ************
>