Calculate Vacuum Metrics - Mailing list pgsql-general

From Venkat Balaji
Subject Calculate Vacuum Metrics
Date
Msg-id CAFrxt0gkVnq1EyRpj4z+2Uz2OT4uSF51aEurvjiQ7dnjnMEXBg@mail.gmail.com
Whole thread Raw
List pgsql-general
Hello Everyone,

I am in the process of scheduling a VACUUM FULL for our production databases where in downtime is extremely critical.

Can someone please help me calculate the amount of free space (or free pages) in the Table and Index (even after regular autovacuum or vacuum analyze is performed).

This will help me calculate the amount of space that will be claimed after the vacuum full is performed.

I am using the below query (which i got from Google) to calculate the wasted space and bloats in the Table and Index.

SELECT 
  current_database(), schemaname, tablename, /*reltuples::bigint, relpages::bigint, otta,*/ 
  ROUND(CASE WHEN otta=0 THEN 0.0 ELSE sml.relpages/otta::numeric END,1) AS tbloat, 
  CASE WHEN relpages < otta THEN 0 ELSE bs*(sml.relpages-otta)::bigint END AS wastedbytes, 
  iname, /*ituples::bigint, ipages::bigint, iotta,*/ 
  ROUND(CASE WHEN iotta=0 OR ipages=0 THEN 0.0 ELSE ipages/iotta::numeric END,1) AS ibloat, 
  CASE WHEN ipages < iotta THEN 0 ELSE bs*(ipages-iotta) END AS wastedibytes 
FROM ( 
  SELECT 
    schemaname, tablename, cc.reltuples, cc.relpages, bs, 
    CEIL((cc.reltuples*((datahdr+ma- 
      (CASE WHEN datahdr%ma=0 THEN ma ELSE datahdr%ma END))+nullhdr2+4))/(bs-20::float)) AS otta, 
    COALESCE(c2.relname,'?') AS iname, COALESCE(c2.reltuples,0) AS ituples, COALESCE(c2.relpages,0) AS ipages, 
    COALESCE(CEIL((c2.reltuples*(datahdr-12))/(bs-20::float)),0) AS iotta -- very rough approximation, assumes all cols 
  FROM ( 
    SELECT 
      ma,bs,schemaname,tablename, 
      (datawidth+(hdr+ma-(case when hdr%ma=0 THEN ma ELSE hdr%ma END)))::numeric AS datahdr, 
      (maxfracsum*(nullhdr+ma-(case when nullhdr%ma=0 THEN ma ELSE nullhdr%ma END))) AS nullhdr2 
    FROM ( 
      SELECT 
        schemaname, tablename, hdr, ma, bs, 
        SUM((1-null_frac)*avg_width) AS datawidth, 
        MAX(null_frac) AS maxfracsum, 
        hdr+( 
          SELECT 1+count(*)/8 
          FROM pg_stats s2 
          WHERE null_frac<>0 AND s2.schemaname = s.schemaname AND s2.tablename = s.tablename 
        ) AS nullhdr 
      FROM pg_stats s, ( 
        SELECT 
          (SELECT current_setting('block_size')::numeric) AS bs, 
          CASE WHEN substring(v,12,3) IN ('8.0','8.1','8.2') THEN 27 ELSE 23 END AS hdr, 
          CASE WHEN v ~ 'mingw32' THEN 8 ELSE 4 END AS ma 
        FROM (SELECT version() AS v) AS foo 
      ) AS constants 
      GROUP BY 1,2,3,4,5 
    ) AS foo 
  ) AS rs 
  JOIN pg_class cc ON cc.relname = rs.tablename 
  JOIN pg_namespace nn ON cc.relnamespace = nn.oid AND nn.nspname = rs.schemaname AND nn.nspname <> 'information_schema' 
  LEFT JOIN pg_index i ON indrelid = cc.oid 
  LEFT JOIN pg_class c2 ON c2.oid = i.indexrelid 
) AS sml 
ORDER BY wastedbytes DESC ;

Can i continue to use it ? or is there any other better way to get the info ?

Please help !

Thanks
Venkat

pgsql-general by date:

Previous
From: Szymon Guz
Date:
Subject: Re: postgis and pgpool
Next
From: Henry Drexler
Date:
Subject: Re: General guidance if there is an in dadabase solution or should stay as excel vba solution.