Reduce WAL pressure #3055


Closed

benjamin-bergia opened this issue Feb 23, 2022 · 13 comments

@benjamin-bergia

I am currently struggling to run timescaledb on my instances. My storage has limited bandwidth, and it cannot keep up with the huge volume of WAL that my instances generate with wal_level = 'logical', despite using a separate PVC. My requirements are pretty lax: I do not need incremental backups, I can afford to lose the data between two full backups (one full backup per day), and I don't use any standby, replica, etc.

After trying to set wal_level in the cluster definition using patroni.dynamicConfiguration, I noticed that it is overwritten. Is it possible to have basic WAL archiving using the minimal wal_level and still have full backups?
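
The override I attempted looked roughly like this (a trimmed sketch; the wal_level value shown is illustrative, since the operator rewrites it either way):

apiVersion: postgres-operator.crunchydata.com/v1beta1
kind: PostgresCluster
metadata:
  name: metrics-postgres
spec:
  patroni:
    dynamicConfiguration:
      postgresql:
        parameters:
          # What I set; PGO overwrites this value.
          wal_level: minimal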

@tjmoore4
Contributor

Hello @benjamin-bergia. It appears you are using PGO v5, but other details would be helpful as well. Your exact PGO version, your PostgresCluster definition, and environment details (Kubernetes version, etc.) would give a better idea of what options may be available.

tjmoore4 added the v5 label Feb 25, 2022
@benjamin-bergia
Author

Sure,

I am using:

And here is my cluster definition:

apiVersion: postgres-operator.crunchydata.com/v1beta1
kind: PostgresCluster
metadata:
  name: metrics-postgres
spec:
  image: myregistry/myproject/crunchy-postgres@sha256:94c1162fe12cde697049168a0c8474a7983bb28404c025c7d358cb021c451090
  postgresVersion: 13
  instances:
    - name: primary
      dataVolumeClaimSpec:
        storageClassName: 'io'
        accessModes:
          - 'ReadWriteOnce'
        resources:
          requests:
            storage: '2000Gi'
      walVolumeClaimSpec:
        storageClassName: 'io'
        accessModes:
          - 'ReadWriteOnce'
        resources:
          requests:
            storage: '300Gi'
      tolerations:
        - key: 'reservedFor'
          value: 'database'
          effect: 'NoSchedule'

  backups:
    pgbackrest:
      image: registry.developers.crunchydata.com/crunchydata/crunchy-pgbackrest:centos8-2.35-0
      global:
        archive-async: 'y'
        spool-path: '/pgwal/pgbackrest/spool'
        archive-push-queue-max: 200GB
        process-max: '4'
        repo1-path: '/pgbackrest/metrics-db/metrics-db'
        repo1-retention-full: '3'
        repo1-retention-full-type: time
        repo1-retention-archive-type: 'full'
        repo1-retention-archive: '1'
      configuration:
        - secret:
            name: ***
      repos:
        - name: repo1
          s3:
            endpoint: '***'
            region: '***'
            bucket: '***'
          schedules:
            full: '0 0/12 * * *'

  monitoring:
    pgmonitor:
      exporter:
        image: registry.developers.crunchydata.com/crunchydata/crunchy-postgres-exporter:ubi8-5.0.4-0

  users:
    - name: promscale
      options: 'SUPERUSER'

  patroni:
    dynamicConfiguration:
      postgresql:
        parameters:
          shared_preload_libraries: timescaledb,promscale
          timescaledb.license: timescale
          # Generated by timescaledb-tune
          shared_buffers: 14714MB
          effective_cache_size: 44143MB
          maintenance_work_mem: 2047MB
          work_mem: 4708kB
          timescaledb.max_background_workers: 8
          max_worker_processes: 43
          max_parallel_maintenance_workers: 12
          max_parallel_workers_per_gather: 16
          max_parallel_workers: 32
          wal_buffers: 16MB
          min_wal_size: 512MB
          max_wal_size: 4GB
          default_statistics_target: 500
          random_page_cost: 1.1
          checkpoint_completion_target: 0.9
          max_connections: 100
          max_locks_per_transaction: 512
          autovacuum_max_workers: 10
          autovacuum_naptime: 10
          effective_io_concurrency: 256
          log_checkpoints: t
          wal_level: replica

As explained here, I am running a custom image based on the Crunchy one, just to switch the license and add the promscale extension.

@cbandy
Member

cbandy commented Mar 9, 2022

Is it possible to have basic WAL archiving using the minimal wal_level and still have full backups?

Quoting pgBackRest:

No. wal_level > minimal is absolutely required for online backups.

@benjaminjb
Contributor

benjaminjb commented Jun 6, 2022

@benjamin-bergia I hope that ^ answered your question re: the requirement that wal_level > minimal in order to have a full backup.

I hope you have solved the issue you were experiencing -- perhaps wal_level = replica would help? -- and if this is still an issue and you would like to discuss some other possible solution, feel free to reopen this issue.

@ThommyH

ThommyH commented Oct 18, 2022

Setting wal_level = replica in spec.patroni.dynamicConfiguration is being ignored. Is wal_level = logical necessary for PGO to work properly?

@MSandro

MSandro commented Sep 13, 2024

I have the same issue; I have no idea how to reduce the WAL size.

@ThommyH

ThommyH commented Sep 13, 2024

Try these pgBackRest settings:

spool-path: /pgdata/pgbackrest-spool
archive-async: 'y'
archive-push-queue-max: 100GiB

edit: added spool path
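
In a PostgresCluster spec these go under spec.backups.pgbackrest.global, the same place they appear in the cluster definition posted above (a sketch; the path and queue size are examples to adapt):

spec:
  backups:
    pgbackrest:
      global:
        # Archive WAL segments asynchronously, in batches.
        archive-async: 'y'
        # Working directory for the async archiver; should sit on a persistent volume.
        spool-path: '/pgdata/pgbackrest-spool'
        # Drop queued WAL past this size instead of filling the volume;
        # this sacrifices archive continuity (and PITR) under pressure.
        archive-push-queue-max: '100GiB'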

@andrewlecuyer
Collaborator

Concur with @ThommyH's advice for async archiving. This will actually be a default in upcoming CPK releases: #3962.

@dberardo-com

@benjamin-bergia have you tried wal_compression? If so, any luck with it?

I am also facing issues with very high WAL generation, and I'm wondering whether it is possible to instruct pgBackRest to expire WAL between backups, as I don't care about PITR.

Also, did you see any benefit from async archiving, or was that just an attempt to improve the situation that led nowhere?
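
For what it's worth, wal_compression is an ordinary PostgreSQL parameter, so it can be set alongside the others in spec.patroni.dynamicConfiguration (a sketch; on PostgreSQL 13, as used in this thread, the value is simply on or off):

spec:
  patroni:
    dynamicConfiguration:
      postgresql:
        parameters:
          # Compress full-page images written to WAL; trades CPU for WAL volume.
          wal_compression: 'on'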

@benjamin-bergia
Author

@dberardo-com I haven't tried it. I'm not using this operator anymore, but at the time the storage available to me couldn't handle the amount of I/O from the WAL, so all these settings were just attempts to mitigate that.

@dberardo-com

Thanks for your comment. Hopefully the Crunchy team will come up with some official guidance on how to reduce this WAL burden.

@benjamin-bergia
Author

AFAIK it's a limitation of pgBackRest and, as such, pretty much out of Crunchy's hands.

@dberardo-com

Is there any way to change wal_level to replica without having to pause the cluster? Currently I have a cluster running with wal_level = replica and it seems to do just fine...
