Fixed postgresql>=10 secondary server lag always 0, SuperQ proposed a… #977

ARPABoy · 2023-11-29T09:24:32Z

Fixed postgresql>=10 secondary server lag always 0, SuperQ proposed a more clean code solution :), pg_replication_test modified to test pgReplicationQueryBeforeVersion10 or pgReplicationQueryAfterVersion10 depending of the postgresql version

… more clean code solution :), pg_replication_test modified to test pgReplicationQueryBeforeVersion10 or pgReplicationQueryAfterVersion10 depending of the postgresql version Signed-off-by: kr0m <kr0m@Garrus.alfaexploit.com>

SuperQ

LGTM, Thanks!

SuperQ · 2023-11-29T10:04:12Z

To add some more information here. It was discovered that if replication is failed, say blocked with an iptables firewall rule, pg_last_wal_receive_lsn() and pg_last_wal_replay_lsn() will remain the same. This masks the lag value.

IMO, we should probably expose separate metrics for each of these things rather than compute it in the exporter.

SuperQ · 2023-11-29T10:06:14Z

This is a fix to the bug introduced by #895.

CC @IamLuksha.

IamLuksha · 2023-11-29T10:34:21Z

Maybe I'm wrong
There is no way to check and test right now
But in versions above 9.3 pg_last_xact_replay_timestamp does not work correctly. And in the proposed commit it is used for versions above 10

The case of channel blocking must be considered separately. Check connection between devices

SuperQ · 2023-11-29T13:30:14Z

@IamLuksha Are there recommended metrics/alerts for monitoring replication? Server side? Client side?

I think I am beginning to understand some of how this is being done. I'm going to propose an additional metric that will will help with this.

Maybe we should add some example alerts to the mixin.

IamLuksha · 2023-11-29T15:03:55Z

@SuperQ

Are there recommended metrics/alerts for monitoring replication? Server side? Client side?

Myabe reply_time. I need some time for test. Maybe next week. I don't have a working cluster right now

An additional metric is a great idea. Because it will clearly indicate the problem.
Since this commit will return issue #895

ARPABoy · 2023-11-30T07:17:22Z

Are there recommended metrics/alerts for monitoring replication? Server-side? Client-side?
If you want to monitor lag looking at the Primary server I think it would be better to monitor replay_lag but if there are more than one synchronized Secondary servers you have to know the IP/FQDN of the Secondary to assign an identification tag to the metric.

You can detect broken synchronization on Secondary servers, if you know It's IP/FQDN:
SELECT COUNT(*) FROM pg_stat_replication WHERE client_addr='SLAVE_IP' AND state = 'streaming';

And you can get the LAG using replay_lag metric in that way:
SELECT COALESCE(EXTRACT(EPOCH FROM replay_lag)::bigint, 0) AS replay_lag FROM pg_stat_replication WHERE client_addr='SLAVE_IP';

Anyway, with the current Secondary metric monitoring, we can monitor lag if we apply the proposed patch, there are two queries, one for PostgreSQL>=10 and another for PostgreSQL<10.

The old versions will execute:
pgReplicationQueryBeforeVersion10 = SELECT
CASE
WHEN NOT pg_is_in_recovery() THEN 0
WHEN pg_last_wal_receive_lsn () = pg_last_wal_replay_lsn () THEN 0
ELSE GREATEST (0, EXTRACT(EPOCH FROM (now() - pg_last_xact_replay_timestamp())))
END AS lag,
CASE
WHEN pg_is_in_recovery() THEN 1
ELSE 0
END as is_replica

And the new versions will execute:
pgReplicationQueryAfterVersion10 = SELECT
CASE
WHEN NOT pg_is_in_recovery() THEN 0
ELSE GREATEST (0, EXTRACT(EPOCH FROM (now() - pg_last_xact_replay_timestamp())))
END AS lag,
CASE
WHEN pg_is_in_recovery() THEN 1
ELSE 0
END as is_replica

I think that by monitoring pg_up(Primary/Secondary) and pg_replication_lag_seconds(Secondary) metric we have all the troubleshooting conflicts covered.

IamLuksha · 2023-11-30T07:22:48Z

Anyway, with the current Secondary metric monitoring, we can monitor lag if we apply the proposed patch, there are two queries, one for PostgreSQL>=10 and another for PostgreSQL<10.

This method does not work and was abandoned earlier
If there are no changes in the database for a long time and this is normal
We will get a replication error

ARPABoy · 2023-11-30T07:36:13Z

@IamLuksha This method does not work in postgresql>=10 or postgresql<10 or both?

IamLuksha · 2023-11-30T07:45:16Z

@IamLuksha This method does not work in postgresql>=10 or postgresql<10 or both?

both

		WHEN NOT pg_is_in_recovery() THEN 0
                WHEN pg_last_wal_receive_lsn () = pg_last_wal_replay_lsn () THEN 0
		ELSE GREATEST (0, EXTRACT(EPOCH FROM (now() - pg_last_xact_replay_timestamp())))

This code for version > 9.3. But you propose to use it for <10

And your 'postgresql>=10' will give a replication error with an empty database or when it is rarely used.
In the end, having solved your problem, you return the old ones

In your case you need to check the connection between the servers

ARPABoy · 2023-11-30T08:23:49Z

@IamLuksha Correct me if I am wrong.

It can be two kind of lag:

Local-Secondary server lag, the information was received by the Secondary server and written to disk but it hasn't been already dumped to DB.
Network lag between Primary and Secondary servers.

https://www.postgresql.org/docs/current/functions-admin.html

For Local lag we can check:
pg_last_wal_receive_lsn (): Returns the last write-ahead log location that has been received and synced to disk by streaming replication.
pg_last_wal_replay_lsn (): Returns the last write-ahead log location that has been replayed during recovery.

For Network lag:
pg_last_xact_replay_timestamp (): Returns the time stamp of the last transaction replayed during recovery. This is the time at which the commit or abort WAL record for that transaction was generated on the primary.

The problem with using pg_last_xact_replay_timestamp is that it remains the same value when there's no activity in the Primary server.

Have I understood the whole problem correctly?

IamLuksha · 2023-11-30T08:31:34Z

Have I understood the whole problem correctly?

Yes!

So we need to check pg_last_wal_receive_lsn () = pg_last_wal_replay_lsn ()

ARPABoy · 2023-11-30T08:33:19Z

When you say: In your case you need to check the connection between the servers

Check the connection using node_exporter or any other external monitoring system? Or querying some PosqtgreSQL data?

IamLuksha · 2023-11-30T08:36:58Z

Check the connection using node_exporter or any other external monitoring system? Or querying some PosqtgreSQL data?

Every method can be used

I don't remember if Postgres has a method to check the connection.
I won't be able to check until next week.

ARPABoy · 2023-12-04T08:10:20Z

Hello @IamLuksha , what do you think about monitoring lag and availability from Primary server?

You can detect broken synchronizations, if you know It's IP/FQDN:
SELECT COUNT(*) FROM pg_stat_replication WHERE client_addr='SECONDARY_IP' AND state = 'streaming';

And you can get the LAG using replay_lag metric in this way:
SELECT COALESCE(EXTRACT(EPOCH FROM replay_lag)::bigint, 0) AS replay_lag FROM pg_stat_replication WHERE client_addr='SECONDARY_IP';

The only problem that I can detect in this way is that when the Secondary server is unavailable from Primary, pg_stat_replication returns no results, so it simply disappears, the only solution that I have thought is saving previous watched Secondary servers in a list file, and if someone of them disappears, then trigger an alarm.

Do you think it's a worthy approximation? Any suggestion or solution?

The exported replication lag does not handle all failure modes, and can report 0 for replicas that are out of sync and incapable of recovery. A proper replacement for that metric would require a different approach (see e.g. prometheus-community#1007), but for a lot of folks, simply exporting the age of the last replay can provide a pretty strong signal for something being amiss. I think this solution might be preferrable to prometheus-community#977, though the lag metric needs to be fixed or abandoned eventually. Signed-off-by: Conrad Hoffmann <ch@bitfehler.net>

The exported replication lag does not handle all failure modes, and can report 0 for replicas that are out of sync and incapable of recovery. A proper replacement for that metric would require a different approach (see e.g. prometheus-community#1007), but for a lot of folks, simply exporting the age of the last replay can provide a pretty strong signal for something being amiss. I think this solution might be preferable to prometheus-community#977, though the lag metric needs to be fixed or abandoned eventually. Signed-off-by: Conrad Hoffmann <ch@bitfehler.net>

SuperQ approved these changes Nov 29, 2023

View reviewed changes

SuperQ requested a review from sysadmind November 29, 2023 10:04

bitfehler mentioned this pull request Nov 6, 2024

Export last replay age in replication collector #1085

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixed postgresql>=10 secondary server lag always 0, SuperQ proposed a… #977

Fixed postgresql>=10 secondary server lag always 0, SuperQ proposed a… #977

ARPABoy commented Nov 29, 2023

SuperQ left a comment

SuperQ commented Nov 29, 2023

SuperQ commented Nov 29, 2023

IamLuksha commented Nov 29, 2023

SuperQ commented Nov 29, 2023

IamLuksha commented Nov 29, 2023 •

edited

Loading

ARPABoy commented Nov 30, 2023 •

edited

Loading

IamLuksha commented Nov 30, 2023

ARPABoy commented Nov 30, 2023

IamLuksha commented Nov 30, 2023

ARPABoy commented Nov 30, 2023

IamLuksha commented Nov 30, 2023

ARPABoy commented Nov 30, 2023

IamLuksha commented Nov 30, 2023

ARPABoy commented Dec 4, 2023 •

edited

Loading

Fixed postgresql>=10 secondary server lag always 0, SuperQ proposed a… #977

Are you sure you want to change the base?

Fixed postgresql>=10 secondary server lag always 0, SuperQ proposed a… #977

Conversation

ARPABoy commented Nov 29, 2023

SuperQ left a comment

Choose a reason for hiding this comment

SuperQ commented Nov 29, 2023

SuperQ commented Nov 29, 2023

IamLuksha commented Nov 29, 2023

SuperQ commented Nov 29, 2023

IamLuksha commented Nov 29, 2023 • edited Loading

ARPABoy commented Nov 30, 2023 • edited Loading

IamLuksha commented Nov 30, 2023

ARPABoy commented Nov 30, 2023

IamLuksha commented Nov 30, 2023

ARPABoy commented Nov 30, 2023

IamLuksha commented Nov 30, 2023

ARPABoy commented Nov 30, 2023

IamLuksha commented Nov 30, 2023

ARPABoy commented Dec 4, 2023 • edited Loading

IamLuksha commented Nov 29, 2023 •

edited

Loading

ARPABoy commented Nov 30, 2023 •

edited

Loading

ARPABoy commented Dec 4, 2023 •

edited

Loading