Where the open source community meets: Secure your spot for Percona Live Amsterdam! - Register

Downloads

Blog

Tuning Autovacuum in PostgreSQL and Autovacuum Internals

August 10, 2018

Author

Avinash Vallarapu

Insight for DBAs

PostgreSQL

Share this Post:

The performance of a PostgreSQL database can be compromised by dead tuples since they continue to occupy space and can lead to bloat. We provided an introduction to VACUUM and bloat in an earlier blog post. Now, though, it’s time to look at autovacuum for postgres, and the internals you to know to maintain a high-performance PostgreSQL database needed by demanding applications.

What is autovacuum?

Autovacuum is one of the background utility processes that starts automatically when you start PostgreSQL. As you see in the following log, the postmaster (parent PostgreSQL process) with pid 2862 has started the autovacuum launcher process with pid 2868. To start autovacuum, you must have the parameter autovacuum set to ON. In fact, you should not set it to OFF in a production system unless you are 100% sure about what you are doing and its implications.

avi@percona:~$ps -eaf | egrep "/post|autovacuum"
postgres  2862     1  0 Jun17 pts/0    00:00:11 /usr/pgsql-10/bin/postgres -D /var/lib/pgsql/10/data
postgres  2868  2862  0 Jun17 ?        00:00:10 postgres: autovacuum launcher process   
postgres 15427  4398  0 18:35 pts/1    00:00:00 grep -E --color=auto /post|autovacuum

avi@percona:~$ps -eaf | egrep "/post|autovacuum"

postgres 2862 1 0 Jun17 pts/0 00:00:11 /usr/pgsql-10/bin/postgres -D /var/lib/pgsql/10/data

postgres 2868 2862 0 Jun17 ? 00:00:10 postgres: autovacuum launcher process

postgres 15427 4398 0 18:35 pts/1 00:00:00 grep -E --color=auto /post|autovacuum

Why is autovacuum needed?

We need VACUUM to remove dead tuples so that the space occupied by dead tuples can be re-used by the table for future inserts/updates. To know more about dead tuples and bloat, please read our previous blog post. We also need ANALYZE on the table that updates the table statistics, so that the optimizer can choose optimal execution plans for an SQL statement. It is the autovacuum in postgres that is responsible for performing both vacuum and analyze on tables.

There exists another background process in postgres called Stats Collector that tracks the usage and activity information. The information collected by this process is used by autovacuum launcher to identify the list of candidate tables for autovacuum. PostgreSQL identifies the tables needing vacuum or analyze automatically, but only when autovacuum is enabled. This ensures that postgres heals itself and stops the database from developing more bloat/fragmentation.

Parameters needed to enable autovacuum in PostgreSQL are :

autovacuum = on  # ( ON by default )
track_counts = on # ( ON by default )

1 2	autovacuum = on # ( ON by default ) track_counts = on # ( ON by default )

track_counts is used by the stats collector. Without that in place, autovacuum cannot access the candidate tables.

Logging autovacuum

Eventually, you may want to log the tables on which autovacuum spends more time. In that case, set the parameter log_autovacuum_min_duration to a value (defaults to milliseconds), so that any autovacuum that runs for more than this value is logged to the PostgreSQL log file. This may help tune your table level autovacuum settings appropriately.

# Setting this parameter to 0 logs every autovacuum to the log file.
log_autovacuum_min_duration = '250ms' # Or 1s, 1min, 1h, 1d

1 2	# Setting this parameter to 0 logs every autovacuum to the log file. log_autovacuum_min_duration = '250ms' # Or 1s, 1min, 1h, 1d

Here is an example log of autovacuum vacuum and analyze

< 2018-08-06 07:22:35.040 EDT > LOG: automatic vacuum of table "vactest.scott.employee": index scans: 0
pages: 0 removed, 1190 remain, 0 skipped due to pins, 0 skipped frozen
tuples: 110008 removed, 110008 remain, 0 are dead but not yet removable
buffer usage: 2402 hits, 2 misses, 0 dirtied
avg read rate: 0.057 MB/s, avg write rate: 0.000 MB/s
system usage: CPU 0.00s/0.02u sec elapsed 0.27 sec
< 2018-08-06 07:22:35.199 EDT > LOG: automatic analyze of table "vactest.scott.employee" system usage: CPU 0.00s/0.02u sec elapsed 0.15 sec

< 2018-08-06 07:22:35.040 EDT > LOG: automatic vacuum of table "vactest.scott.employee": index scans: 0

pages: 0 removed, 1190 remain, 0 skipped due to pins, 0 skipped frozen

tuples: 110008 removed, 110008 remain, 0 are dead but not yet removable

buffer usage: 2402 hits, 2 misses, 0 dirtied

avg read rate: 0.057 MB/s, avg write rate: 0.000 MB/s

system usage: CPU 0.00s/0.02u sec elapsed 0.27 sec

< 2018-08-06 07:22:35.199 EDT > LOG: automatic analyze of table "vactest.scott.employee" system usage: CPU 0.00s/0.02u sec elapsed 0.15 sec

When does PostgreSQL run autovacuum on a table?

As discussed earlier, autovacuum in postgres refers to both automatic VACUUM and ANALYZE and not just VACUUM. An automatic vacuum or analyze runs on a table depending on the following mathematic equations.

The formula for calculating the effective table level autovacuum threshold is :

Autovacuum VACUUM thresold for a table = autovacuum_vacuum_scale_factor * number of tuples + autovacuum_vacuum_threshold

1	Autovacuum VACUUM thresold for a table = autovacuum_vacuum_scale_factor * number of tuples + autovacuum_vacuum_threshold

With the equation above, it is clear that if the actual number of dead tuples in a table exceeds this effective threshold, due to updates and deletes, that table becomes a candidate for autovacuum vacuum.

Autovacuum ANALYZE threshold for a table = autovacuum_analyze_scale_factor * number of tuples + autovacuum_analyze_threshold

1	Autovacuum ANALYZE threshold for a table = autovacuum_analyze_scale_factor * number of tuples + autovacuum_analyze_threshold

The above equation says that any table with a total number of inserts/deletes/updates exceeding this threshold—since last analyze—is eligible for an autovacuum analyze.

Let’s understand these parameters in detail.

- autovacuum_vacuum_scale_factor Or autovacuum_analyze_scale_factor : Fraction of the table records that will be added to the formula. For example, a value of 0.2 equals to 20% of the table records.

- autovacuum_vacuum_threshold Or autovacuum_analyze_threshold : Minimum number of obsolete records or dml’s needed to trigger an autovacuum.

Let’s consider a table: percona.employee with 1000 records and the following autovacuum parameters.

autovacuum_vacuum_scale_factor = 0.2
autovacuum_vacuum_threshold = 50
autovacuum_analyze_scale_factor = 0.1
autovacuum_analyze_threshold = 50

autovacuum_vacuum_scale_factor = 0.2

autovacuum_vacuum_threshold = 50

autovacuum_analyze_scale_factor = 0.1

autovacuum_analyze_threshold = 50

Using the above mentioned mathematical formulae as reference,

Table : percona.employee becomes a candidate for autovacuum Vacuum when,
Total number of Obsolete records = (0.2 * 1000) + 50 = 250

1 2	Table : percona.employee becomes a candidate for autovacuum Vacuum when, Total number of Obsolete records = (0.2 * 1000) + 50 = 250

Table : percona.employee becomes a candidate for autovacuum ANALYZE when,
Total number of Inserts/Deletes/Updates = (0.1 * 1000) + 50 = 150

1 2	Table : percona.employee becomes a candidate for autovacuum ANALYZE when, Total number of Inserts/Deletes/Updates = (0.1 * 1000) + 50 = 150

Tuning Autovacuum in PostgreSQL

We need to understand that these are global settings. These settings are applicable to all the databases in the instance. This means, regardless of the table size, if the above formula is reached, a table is eligible for autovacuum vacuum or analyze.

Is this a problem?

Consider a table with ten records versus a table with a million records. Even though the table with a million records may be involved in transactions far more often, the frequency at which a vacuum or an analyze runs automatically could be greater for the table with just ten records.

Consequently, PostgreSQL allows you to configure individual table level autovacuum settings that bypass global settings.

ALTER TABLE scott.employee SET (autovacuum_vacuum_scale_factor = 0, autovacuum_vacuum_threshold = 100);

1	ALTER TABLE scott.employee SET (autovacuum_vacuum_scale_factor = 0, autovacuum_vacuum_threshold = 100);

Output Log
----------
avi@percona:~$psql -d percona
psql (10.4)
Type "help" for help.

percona=# ALTER TABLE scott.employee SET (autovacuum_vacuum_scale_factor = 0, autovacuum_vacuum_threshold = 100);
ALTER TABLE

Output Log

----------

avi@percona:~$psql -d percona

psql (10.4)

Type "help" for help.

percona=# ALTER TABLE scott.employee SET (autovacuum_vacuum_scale_factor = 0, autovacuum_vacuum_threshold = 100);

ALTER TABLE

The above setting runs autovacuum vacuum on the table scott.employee only once there is more than 100 obsolete records.

How do we identify the tables that need their autovacuum settings tuned?

In order to tune autovacuum for tables individually, you must know the number of inserts/deletes/updates on a table for an interval. You can also view the postgres catalog view : pg_stat_user_tables to get that information.

percona=# SELECT n_tup_ins as "inserts",n_tup_upd as "updates",n_tup_del as "deletes", n_live_tup as "live_tuples", n_dead_tup as "dead_tuples"
FROM pg_stat_user_tables
WHERE schemaname = 'scott' and relname = 'employee';
 inserts | updates | deletes | live_tuples | dead_tuples 
---------+---------+---------+-------------+-------------
      30 |      40 |       9 |          21 |          39
(1 row)

percona=# SELECT n_tup_ins as "inserts",n_tup_upd as "updates",n_tup_del as "deletes", n_live_tup as "live_tuples", n_dead_tup as "dead_tuples"

FROM pg_stat_user_tables

WHERE schemaname = 'scott' and relname = 'employee';

inserts | updates | deletes | live_tuples | dead_tuples

---------+---------+---------+-------------+-------------

30 | 40 | 9 | 21 | 39

(1 row)

As observed in the above log, taking a snapshot of this data for a certain interval should help you understand the frequency of DMLs on each table. In turn, this should help you with tuning your autovacuum settings for individual tables.

How many autovacuum processes can run at a time?

There cannot be more than autovacuum_max_workers number of autovacuum processes running at a time, across the instance/cluster that may contain more than one database. Autovacuum launcher background process starts a worker process for a table that needs a vacuum or an analyze. If there are four databases with autovacuum_max_workers set to 3, then, the 4th database has to wait until one of the existing worker process gets free.

Before starting the next autovacuum, it waits for autovacuum_naptime, the default is 1 min on most of the versions. If you have three databases, the next autovacuum waits for 60/3 seconds. So, the wait time before starting next autovacuum is always (autovacuum_naptime/N) where N is the total number of databases in the instance.

Does increasing autovacuum_max_workers alone increase the number of autovacuum processes that can run in parallel?
NO. This is explained better in the next few lines.

Is VACUUM IO intensive?

Autovacuum can be considered as a cleanup. As discussed earlier, we have 1 worker process per table. Autovacuum reads 8KB (default block_size) pages of a table from disk and modifies/writes to the pages containing dead tuples. This involves both read and write IO. Thus, this could be an IO intensive operation, when there is an autovacuum running on a huge table with many dead tuples, during a peak transaction time. To avoid this issue, we have a few parameters that are set to minimize the impact on IO due to vacuum.

The following are the parameters used to tune autovacuum IO

- autovacuum_vacuum_cost_limit : total cost limit autovacuum could reach (combined by all autovacuum jobs).

- autovacuum_vacuum_cost_delay : autovacuum will sleep for these many milliseconds when a cleanup reaching autovacuum_vacuum_cost_limit cost is done.

- vacuum_cost_page_hit : Cost of reading a page that is already in shared buffers and doesn’t need a disk read.

- vacuum_cost_page_miss : Cost of fetching a page that is not in shared buffers.

- vacuum_cost_page_dirty : Cost of writing to each page when dead tuples are found in it.

Default Values for the parameters discussed above.
------------------------------------------------------
autovacuum_vacuum_cost_limit = -1 (So, it defaults to vacuum_cost_limit) = 200
autovacuum_vacuum_cost_delay = 20ms
vacuum_cost_page_hit = 1
vacuum_cost_page_miss = 10
vacuum_cost_page_dirty = 20

Default Values for the parameters discussed above.

------------------------------------------------------

autovacuum_vacuum_cost_limit = -1 (So, it defaults to vacuum_cost_limit) = 200

autovacuum_vacuum_cost_delay = 20ms

vacuum_cost_page_hit = 1

vacuum_cost_page_miss = 10

vacuum_cost_page_dirty = 20

Consider autovacuum VACUUM running on the table percona.employee.

Let’s imagine what can happen in 1 second. (1 second = 1000 milliseconds)

In a best-case scenario where read latency is 0 milliseconds, autovacuum can wake up and go for sleep 50 times (1000 milliseconds / 20 ms) because the delay between wake-ups needs to be 20 milliseconds.

1 second = 1000 milliseconds = 50 * autovacuum_vacuum_cost_delay

1	1 second = 1000 milliseconds = 50 * autovacuum_vacuum_cost_delay

Since the cost associated per reading a page in shared_buffers is 1, in every wake up 200 pages can be read, and in 50 wake-ups 50*200 pages can be read.

If all the pages with dead tuples are found in shared buffers, with an autovacuum_vacuum_cost_delay of 20ms, then it can read: ((200 / vacuum_cost_page_hit) * 8) KB in each round that needs to wait forautovacuum_vacuum_cost_delay amount of time.

Thus, at the most, an autovacuum can read : 50 * 200 * 8 KB = 78.13 MB per second (if blocks are already found in shared_buffers), considering the block_size as 8192 bytes.

If the blocks are not in shared buffers and need to be fetched from disk, an autovacuum can read : 50 * ((200 / vacuum_cost_page_miss) * 8) KB = 7.81 MB per second.

All the information we have seen above is for read IO.

Now, in order to delete dead tuples from a page/block, the cost of a write operation is : vacuum_cost_page_dirty, set to 20 by default.

At the most, an autovacuum can write/dirty : 50 * ((200 / vacuum_cost_page_dirty) * 8) KB = 3.9 MB per second.

Generally, this cost is equally divided to all the autovacuum_max_workers number of autovacuum processes running in the Instance. So, increasing the autovacuum_max_workers may delay the autovacuum execution for the currently running autovacuum workers. And increasing the autovacuum_vacuum_cost_limit may cause IO bottlenecks. An important point to note is that this behavior can be overridden by setting the storage parameters of individual tables, which would subsequently ignore the global settings.

postgres=# alter table percona.employee set (autovacuum_vacuum_cost_limit = 500);
ALTER TABLE
postgres=# alter table percona.employee set (autovacuum_vacuum_cost_delay = 10);
ALTER TABLE
postgres=# 
postgres=# d+ percona.employee
Table "percona.employee"
Column | Type | Collation | Nullable | Default | Storage | Stats target | Description 
--------+---------+-----------+----------+---------+---------+--------------+-------------
id | integer | | | | plain | | 
Options: autovacuum_vacuum_threshold=10000, autovacuum_vacuum_cost_limit=500, autovacuum_vacuum_cost_delay=10

postgres=# alter table percona.employee set (autovacuum_vacuum_cost_limit = 500);

ALTER TABLE

postgres=# alter table percona.employee set (autovacuum_vacuum_cost_delay = 10);

ALTER TABLE

postgres=#

postgres=# d+ percona.employee

Table "percona.employee"

--------+---------+-----------+----------+---------+---------+--------------+-------------

Options: autovacuum_vacuum_threshold=10000, autovacuum_vacuum_cost_limit=500, autovacuum_vacuum_cost_delay=10

Thus, on a busy OLTP database, always have a strategy to implement manual VACUUM on tables that are frequently hit with DMLs, during a low peak window. You may have as many parallel vacuum jobs as possible when you run it manually after setting relevant autovacuum_* settings. For this reason, a scheduled manual Vacuum Job is always recommended alongside finely tuned autovacuum settings.

The value and importance of the DBA isn’t diminished by a migration to the cloud. While some tasks are being automated, other aspects of the job (e.g. data modeling and data security) will only grow. Our white paper discusses how your company’s DBA staff needs to adapt to their new cloud database environment and more.

Learn how to vacuum your PostgreSQL databases without negatively impacting performance in our free webinar, “Using Vacuum to Clean Up PostgreSQL for Performance”

0 0 votes

Article Rating

21 Comments

Oldest

Newest Most Voted

Shaik

7 years ago

Great

Admin

Peter Zaitsev

7 years ago

The overhead in terms of additional IO resource usage is clear. Couple of other questions

– Does autovacuum uses the cache (and hence can cause cache pollution) or does it use some different IO path ?

– What is about locking ? Are there any locks which are being set by AutoVacum which can impact parallel queries beyond competition for IO resources ?

Andrey

7 years ago

Reply to Peter Zaitsev

Autovacuum acquire AccessExclusiveLock or ShareUpdateExclusiveLock on a vacuumed relation according to vacuum options.

Editor

Jobin Augustine

7 years ago

Reply to Peter Zaitsev

Regarding the IO overhead, PostgreSQL is just delaying the I/O.
Few databases move old snapshots/records to Undo area as part of transaction causing extra I/O as part of the transaction. But PostgreSQL takes an approach “Leave it there. don’t move/copy anywhere”. So PostgreSQL saves some IO there. But that saving is not permanent and ultimately it needs to do the garbage collection which is called vacuum.

– Does autovacuum uses the cache (and hence can cause cache pollution) or does it use some different IO path ?
Yes, it can cause cache pollution just like any garbage collector. However, The autovacuum algorithms are improved continuously from version to version and newer versions are able to avoid scanning pages wherever possible. The community effort is to bring it down to scan only those pages which are really modified. There is a high chance that such modified pages are there in the cache already.

– What is about locking? Are there any locks which are being set by AutoVacum which can impact parallel queries beyond competition for IO resources?
Autovacuum won’t acquire locks which can interfere with other sessions doing SELECT/INSERT/UPDATE/DELETE. The lightweight “cleanup lock” it acquires can potentially can affect DDLs.
However, Autovacuum is designed to be so gentle on the system, It has mechanisms to cancel itself, skip blocks or entire table itself,
Running a newer version PostgreSQL is very important to get the best of it.

Having said, Autovacuum is far away from perfectness. More efforts are underway to improve it further and address all edge cases.

Ujang Jaenudin

7 years ago

Reply to Jobin Augustine

“Yes, it can cause cache pollution just like any garbage collector.”
I heard there is “ring buffer” for special purposes i.e. vacuuming and also scanning big tables which 1/4 of shared buffer; could you please elaborate?

Editor

Jobin Augustine

7 years ago

Reply to Ujang Jaenudin

I think that is a related topic but extension to what we discussed in this blog. Yes, you are correct, there are separate ring buffers to help autovacuum workers, let us consider what happens in a common workload.
Autovacuum is not an aggressive backend process, it scans through tables from the candidate list one by one. so the pages required by autovacuum could be quite different from the current workload demands. There is a chance of getting the pages from current cache. However, the pages dirtied by autovacuum may not be needed again.
To reduce the overall impact of autovacuum on shared buffers, the seperate ring buffer is maintained. There is a good chance that vacuum will almost always dirty pages once and never again, and flushing dirty pages back to the OS cache ASAP helps avoid a read-modify-write cycle if the page didn’t leave the OS cache.
vacuum related ring buffers are not currently tunable by parameters, But we can modify the source code and build the PostgreSQL. But we can expect an improved vacuum performance by doing so in servers with several GBs of RAM
Source file : /backend/storage/buffer/freelist.c
Function : GetAccessStrategy(BufferAccessStrategyType btype)
However, There is a counter-argument from the community that if we optimize autovacuum to make it run faster, it may take away more IO bandwidth and cache. Still, many community members reported overall better performance in their workload.
The takeaway is that same size won’t fit for every workload. Do modifications with good amount of testing.
When we work with PostgreSQL, always we have to keep in mind that we generally depends on OS level page cache to a good extent. General recommendation is to give 1/4 of RAM to shared buffers and rest to OS page cache. Any process accessing any memory page will have a side effect on the cache.

Ian Warford (@iwarford)

7 years ago

Reply to Jobin Augustine

>Autovacuum won’t acquire locks
>which can interfere with other
>sessions doing SELECT/INSERT/
>UPDATE/DELETE. The lightweight
>“cleanup lock” it acquires can
>potentially can affect DDLs.

Another question related to this. What are the long-term consequences from vacuum/autovacuum being unable to acquire that cleanup lock? Our experience showed that it was a brief exclusive lock on the table required to free up the space. However, on large busy tables, this wasn’t possible for autovacuum to ever get. Has this been your experience?

raghavendra

7 years ago

What is the role of “autovacuum_vacuum_cost_limit”?

Editor

Jobin Augustine

7 years ago

Reply to raghavendra

This is a mechanism to prevent autovacuum consuming excessive resource (I/O) on the server. By specifying this parameter we are saying to autovacuum workers to take a pause when the cost reaches a specific value.

Sreyas

7 years ago

Reply to Jobin Augustine

Hi Jobin, could you please give me your contact details, i have some queries regarding vacuum

Henry

7 years ago

is autovacuum starting its modification as a transaction? what if autovacuum failed in the middle of cleaning dead tuples (like a power outage sth).

Andrey

7 years ago

Reply to Henry

All VACUUM modifications are logging into WAL, block-by-block, and after crashing will be rollbacked

Henry

7 years ago

thanks, i tried to add a stack dump to each XLogInsert call, and check if there are any that originates from autovacuum, but failed to find any instance to prove autovacuum calls are logged. possible to show the code path that dumps to wal? thanks

Andrey

7 years ago

Reply to Henry

I do not understand your task in general, but you can use WAL record XLOG_BTREE_VACUUM as a strong prove – it is log of index cleanup during vacuum.

raviteja

7 years ago

We are facing the issue with dead tuples where it’s not getting deleted even vacuum invokes and there is no lock on that table

Carlos Carmona

7 years ago

You say “So, increasing the autovacuum_max_workers may delay the autovacuum execution for the currently running autovacuum workers. And increasing the autovacuum_vacuum_cost_limit may cause IO bottlenecks.”

So my question is, how can I tunning up my autovacuum process without create a bottlenecks or delay the autovacuum execution?

Steve N.

7 years ago

Are there any parameters that could be used to tell autovacuum to only operate during non peak hours? For example, we want autovacuum to be on, but not occur between 1PM and 4PM on the database? Or would we have to temporarily turn off autovacuum during that period?

dharshanr

7 years ago

Does Auto Vacuum clean up indexes as well or does it only work on the data files?

Andrey

7 years ago

Reply to dharshanr

It is clean up all indexes, associated with the heap relation: Vacuum scans all index page-by page, deletes dead and unused tuples, removes empty pages.

Andrey Lepikhov

7 years ago

Reply to dharshanr

It currently has no such settings.This can be explained by the following reasons: 1) most databases has 24/7 load with small oscillations; 2) Autovacuum is softly locks relations and cancel vacuuming if relation is used.
On the over hand: not very difficult to develop an extension, which will launch background worker and control autovacuum according to some rules.

Zeeshan moin

7 years ago

Hi Jobin,
can you please elaborate a little on autovacuum_vacuum_cost_limit. You had said it takes a default as 200.
I am trying to understand what does “cost” exactly mean here. Is that the no. of pages being scanned after every autovacuum wakeup by the worker process..? what does “cost” imply here.