Re: Load a csv or a avro? - Mailing list pgsql-general

From Muhammad Ikram
Subject Re: Load a csv or a avro?
Date
Msg-id CAGeimVo_EOJO+BViDnLoxEdFoFKQkeHU=gniEQ9e2GbjAUvUHg@mail.gmail.com
In response to Re: Load a csv or a avro?  (Josef Šimánek <josef.simanek@gmail.com>)
List pgsql-general
Hi,

Performance Considerations

    Avro files are typically smaller because the binary format is compact and can be compressed, so they need less I/O time. CSV files are simpler but larger, so reading and writing them takes longer.
    The COPY command works very well with CSV files, whereas Avro cannot be read by COPY directly and needs an ETL step to decode it before loading.
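
To illustrate the CSV path, here is a minimal sketch (not a tuned loader) that serializes already-decoded records into an in-memory CSV buffer and builds a matching COPY statement. The helper names `rows_to_csv_buffer` and `copy_statement` and the `events` table are hypothetical, and the identifiers are assumed to be trusted since they are not quoted.

```python
import csv
import io


def rows_to_csv_buffer(rows, columns):
    """Serialize an iterable of dicts into an in-memory CSV buffer with a header row.

    Keys missing from a row are written as empty fields, which COPY in
    csv format reads as NULL by default.
    """
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=columns, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    buf.seek(0)  # rewind so the caller can read from the start
    return buf


def copy_statement(table, columns):
    """Build a COPY ... FROM STDIN statement for CSV input with a header row.

    Assumption: table and column names are trusted; they are not quoted here.
    """
    cols = ", ".join(columns)
    return f"COPY {table} ({cols}) FROM STDIN WITH (FORMAT csv, HEADER true)"


# Hypothetical table and sample rows, for illustration only.
columns = ["id", "msg"]
buf = rows_to_csv_buffer([{"id": 1, "msg": "a,b"}, {"id": 2}], columns)
sql = copy_statement("events", columns)
print(sql)
print(buf.getvalue(), end="")
```

With a driver such as psycopg 3, the buffer could then be streamed through `cursor.copy(sql)` and `copy.write(...)`. That part is left out here because it needs a live database connection.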

Regards,
Muhammad Ikram


On Fri, Jul 5, 2024 at 3:03 PM Josef Šimánek <josef.simanek@gmail.com> wrote:
On Fri, Jul 5, 2024 at 11:08 AM sud <suds1434@gmail.com> wrote:
>
> Hello all,
>
> It's a Postgres database. We have the option of receiving messages from another system as CSV files and/or in Avro format to load into our Postgres database. The volume will be 300 million messages per day, spread across many files in batches.
>
> My question is: which format should we choose for faster data loading? And are there other aspects we should consider apart from loading performance?

We are able to load ~300 million rows per day using CSV and the COPY
functions (https://www.postgresql.org/docs/current/libpq-copy.html#LIBPQ-COPY-SEND).
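
For a sense of scale, the figure above can be turned into a sustained rate. This is a rough sketch that assumes the load is spread evenly over a full 24 hours. That is an assumption, since the thread describes batches of files, so real peaks would be higher. The function name is hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def sustained_rows_per_second(rows_per_day, load_window_seconds=SECONDS_PER_DAY):
    """Average rows per second needed to load rows_per_day within the window."""
    return rows_per_day / load_window_seconds


rate = sustained_rows_per_second(300_000_000)
print(f"{rate:,.0f} rows/s")  # prints "3,472 rows/s"
```

Passing a shorter `load_window_seconds` shows how the required rate grows if the files only arrive during part of the day.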




