Tracking IST Progress in Percona XtraDB Cluster

In this blog post, we’ll look at how Percona XtraDB Cluster uses IST.

Introduction

Percona XtraDB Cluster uses the concept of an Incremental State Transfer (IST). When a node of the cluster leaves the cluster for a short period of time, it can rejoin the cluster by getting the delta set of missing changes from any active node in the cluster.

This process of getting the delta set of changes is called IST in Percona XtraDB Cluster.

Tracking IST Progress

The number of write-sets/changes that the joining node needs to catch up on when rejoining the cluster is dictated by:

The duration the node was not present in the cluster
The workload of the cluster during that time frame

This catch-up process can be time-consuming. Until this process is complete, the rejoining node is not ready to process any active workloads.

We believe that any process that is time-consuming should have a progress monitor attached to it. This is exactly what we have done.

In the latest release of Percona XtraDB Cluster, 5.7.17-29.20, we added an IST progress monitor that is exposed through SHOW STATUS. This helps you monitor the percentage of write-sets that have been applied by the rejoining node.

Let’s see this in a working example:

Start a two-node cluster
Process some basic workloads, allow cluster replication
Shut down Node-2
Node-1 then continues to process more workloads (the workload fits the allocated gcache)
Restart Node-2, causing it to trigger an IST

mysql> show status like 'wsrep_ist_receive_status';
+--------------------------+--------------------------------------------------------+
| Variable_name            | Value                                                  |
+--------------------------+--------------------------------------------------------+
| wsrep_ist_receive_status | 3% complete, received seqno 1421771 of 1415410-1589676 |
+--------------------------+--------------------------------------------------------+
1 row in set (0.00 sec)

....

mysql> show status like 'wsrep_ist_receive_status';
+--------------------------+---------------------------------------------------------+
| Variable_name            | Value                                                   |
+--------------------------+---------------------------------------------------------+
| wsrep_ist_receive_status | 52% complete, received seqno 1506799 of 1415410-1589676 |
+--------------------------+---------------------------------------------------------+
1 row in set (0.00 sec)

....

mysql> show status like 'wsrep_ist_receive_status';
+--------------------------+---------------------------------------------------------+
| Variable_name            | Value                                                   |
+--------------------------+---------------------------------------------------------+
| wsrep_ist_receive_status | 97% complete, received seqno 1585923 of 1415410-1589676 |
+--------------------------+---------------------------------------------------------+
1 row in set (0.00 sec)

mysql> show status like 'wsrep_ist_receive_status';
+--------------------------+-------+
| Variable_name            | Value |
+--------------------------+-------+
| wsrep_ist_receive_status |       |
+--------------------------+-------+
1 row in set (0.00 sec)

As you can see, the wsrep_ist_receive_status monitoring string shows three things: the percentage completed, the sequence number (seqno) of the most recently received write-set, and the range of write-sets covered by the IST.
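
If you want to feed this value into a script or dashboard, the string can be split into its parts. The following is a minimal Python sketch, not an official API: the pattern is inferred only from the sample output in this post, and the names parse_ist_status and IstProgress are hypothetical helpers introduced for illustration.

```python
import re
from typing import NamedTuple, Optional

# Pattern inferred from the sample output shown in this post. This is an
# assumption, not a documented or guaranteed format.
_IST_STATUS = re.compile(
    r"(?P<percent>\d+)% complete, received seqno (?P<received>\d+) "
    r"of (?P<first>\d+)-(?P<last>\d+)"
)


class IstProgress(NamedTuple):
    """Hypothetical container for the fields of wsrep_ist_receive_status."""
    percent: int   # percentage reported by the server
    received: int  # seqno of the most recently received write-set
    first: int     # first seqno in the IST range
    last: int      # last seqno in the IST range

    @property
    def remaining(self) -> int:
        """Write-sets still to be received, as a plain seqno difference."""
        return self.last - self.received


def parse_ist_status(value: str) -> Optional[IstProgress]:
    """Return the parsed progress, or None if the value is empty or unrecognized."""
    match = _IST_STATUS.fullmatch(value.strip())
    if match is None:
        return None
    fields = ("percent", "received", "first", "last")
    return IstProgress(*(int(match[name]) for name in fields))


if __name__ == "__main__":
    sample = "52% complete, received seqno 1506799 of 1415410-1589676"
    progress = parse_ist_status(sample)
    print(progress.percent, progress.remaining)  # prints: 52 82877
```

Returning None for anything that does not match keeps the caller safe if the format differs in another release.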

Once the IST activity is complete, the variable shows an empty string.
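
A script can use this behavior to wait for the IST to finish. Below is a hedged Python sketch of such a polling loop. The function wait_for_ist and its parameters are hypothetical, and fetch_status stands in for whatever code you use to run SHOW STATUS LIKE 'wsrep_ist_receive_status' and return the Value column. Keep in mind that an empty value by itself does not distinguish "IST finished" from "IST never started", so in practice you may want to also check a node-state variable such as wsrep_local_state_comment.

```python
import time
from typing import Callable


def wait_for_ist(fetch_status: Callable[[], str],
                 poll_seconds: float = 5.0,
                 timeout_seconds: float = 3600.0) -> bool:
    """Poll until the status value is empty.

    fetch_status is a caller-supplied function (an assumption of this sketch)
    that returns the current value of wsrep_ist_receive_status.
    Returns True when the value becomes empty, False if the timeout expires.
    """
    deadline = time.monotonic() + timeout_seconds
    while True:
        value = fetch_status()
        if value.strip() == "":
            return True
        print(f"IST in progress: {value}")
        if time.monotonic() >= deadline:
            return False
        time.sleep(poll_seconds)


if __name__ == "__main__":
    # Simulated status values taken from the example output in this post.
    samples = iter([
        "3% complete, received seqno 1421771 of 1415410-1589676",
        "97% complete, received seqno 1585923 of 1415410-1589676",
        "",
    ])
    finished = wait_for_ist(lambda: next(samples), poll_seconds=0.0)
    print("IST finished" if finished else "timed out")
```

Passing the fetch function in as a parameter keeps the loop independent of any particular MySQL client library and makes it easy to test with canned values.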

Closing Comments

I hope you enjoy this newly added feature. Percona Engineering would be happy to hear from you about other features that would help you make effective use of Percona XtraDB Cluster. We will try our best to include them in our future plans (based on feasibility).

Note: Special thanks to Kenn Takara and Roel Van de Paar for helping me edit this post.

2 Comments

tommymcneely

6 years ago

I like the idea! Is it reasonable to believe that the end number (1589676) will not change throughout the process (outside of a lab)? Let's assume this is a production environment with a 3+ node cluster, as a “two-node” cluster is not really reasonable with WSREP because the “donor” is desynced as well. A proper healthcheck should probably not be sending database connections to the “donor” node, right? So the rest of the nodes, besides the donor and the “down” node, are still receiving transactions. Does this increment the ending number? Or do they have another incremental sync when they get done with their lengthy sync?

I would also really like to see SST progress, which we inevitably end up doing every time the cluster crashes. 🙁

~tommy

Krunal Bauskar

Author

Reply to tommymcneely

6 years ago

Other cluster nodes continue to receive traffic, which is replicated on the group channel and consumed by the DONOR and JOINER nodes. Once the IST apply action is complete, the JOINER node proceeds to apply this traffic.
