38.9. pga_data_file #

This metadata table contains information about Parquet files.

Column Name

Column Type

Constraint

Description

data_file_id

BIGINT

Primary key

The Parquet file ID assigned by increasing the next_file_id value of the pga_snapshot metadata table.

begin_snapshot

BIGINT

Foreign key, NOT NULL

The snapshot associated with creating the Parquet file using the metastore.add_files or metastore.copy_table operation, from the pga_snapshot metadata table.

end_snapshot

BIGINT

Foreign key

The snapshot associated with marking snapshots as expired, from the pga_snapshot metadata table.

If the value is NULL, the Parquet file is not deleted.

table_id

BIGINT

Foreign key, NOT NULL

The ID of the analytical table that contains the Parquet file, from the pga_table metadata table.

file_order

BIGINT

NOT NULL

The order of the Parquet file in the analytical table.

file_order values must be unique within the analytical table, but the ordered sequence can have gaps.

path

VARCHAR

NOT NULL, UNIQUE

The path to the Parquet file, for example, my_file.parquet.

path_is_relative

BOOLEAN

NOT NULL

Specifies whether the path to the Parquet file is relative to the path to the analytical table or absolute.

Default value: false.

file_format

VARCHAR

NOT NULL

The file format.

Currently, only the Parquet format is supported.

record_count

BIGINT

NOT NULL

The number of rows in the Parquet file.

file_size_bytes

BIGINT

NOT NULL

The Parquet file size, in bytes.

row_id_start

BIGINT

NOT NULL

The row_id value for the first row in the Parquet file.

It indicates the starting position of the file within the analytical table.

checksum_crc32

BIGINT

NOT NULL

The current checksum value of the Parquet file.

It is used for data integrity checks.