Chapter 27. Metadata Tables

The metadata stored in metadata tables of pgpro_metastore is required for the following functionality:

  • Creating analytical views.

  • Querying analytical tables that use partitioning and indexes of Parquet files.

  • Supporting basic ETL operations:

    • performing ETL operations

    • scheduling ETL operations to be performed at a certain date and time, as well as sequentially (one after another)

    • limiting resources available for ETL operations to avoid impacting the OLTP workload

  • Working with the OLAP data:

    • creating Parquet files with the OLAP data from Postgres Pro tables and importing this data to analytical tables

    • exporting the OLAP data from analytical tables to Parquet, CSV, and JSON files

    • creating reports with filtered OLAP data using SQL queries

  • Making transaction updates to pgpro_metastore objects, as well as providing consistent updates of analytical tables when reading and writing to these tables.

  • Checking data integrity and restoring Parquet files from backups or by using S3 storage mirrors.

  • Monitoring pgpro_metastore and notifying administrators about events requiring their immediate attention.

  • Storing Parquet files in separate storages to avoid competition with the OLTP workload for I/O throughput.

The schema of metadata tables is shown in the diagram below.