Monitoring partition percentages

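Replication Server operates on 1MB partition segments. Whenever it allocates or deallocates a partition segment, it calculates two statistics: the percentage of total partition segments in use, and the percentage of total partition segments in use by each stable queue.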

If the percentage of partition segments in use rises above the percentage specified by sqm_warning_thr1 or sqm_warning_thr2, a message like the following is written to the log file:

WARNING: Stable Storage Use is Above <threshold> percent

If you see this message often, you may need to add partitions to the Replication Server or correct a recurring failure that causes the queues to fill.

When the percentage of partition segments in use drops below the percentage specified by sqm_warning_thr1 or sqm_warning_thr2, a message like the following is written to the log file to note that the condition that caused the original warning no longer exists:

WARNING CANCEL: Stable Storage Use is Below <threshold> percent
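
The warning and cancel behavior for a single threshold can be modeled with a short Python sketch. This is an illustration only, not Replication Server's implementation: the function names, the sample segment counts, and the threshold value of 75 are assumptions made for the example.

```python
def percent_in_use(used_segments, total_segments):
    """Percentage of 1MB partition segments currently allocated."""
    return 100.0 * used_segments / total_segments


def threshold_events(used_percentages, threshold):
    """Return the warning and cancel messages produced as usage crosses one threshold."""
    events = []
    above = False  # True while a warning is outstanding for this threshold
    for pct in used_percentages:
        if not above and pct > threshold:
            events.append(f"WARNING: Stable Storage Use is Above {threshold} percent")
            above = True
        elif above and pct < threshold:
            events.append(f"WARNING CANCEL: Stable Storage Use is Below {threshold} percent")
            above = False
    return events


# Hypothetical data: 200 segments in total, sampled after each allocation change.
samples = [percent_in_use(n, 200) for n in (100, 160, 170, 140)]
for line in threshold_events(samples, threshold=75):
    print(line)
```

In this model a warning is raised once when usage first exceeds the threshold and is cancelled once when usage falls back below it. Running the same function with a second, higher threshold mimics the separate sqm_warning_thr2 check.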

When the percentage of the total partition space used by a single stable queue exceeds the percentage specified by sqm_warning_thr_ind, a message like the following is written to the log file:

WARNING: Stable Storage Use by <queue name> is Above <threshold> percent

This warning alerts you to problems that cause a particular stable queue to fill until it is using a disproportionate share of the total partition space. For example, if a route is suspended for a length of time, its stable queue may fill until it occupies enough partition space to trigger a warning.
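
As a rough illustration of the per-queue check, the following Python sketch computes each queue's share of the total partition space and reports the queues whose share exceeds a threshold. The queue names, segment counts, and the threshold of 70 are hypothetical, and the function is a model of the comparison, not the server's own code.

```python
def queues_over_share(segments_by_queue, total_segments, thr_ind):
    """Return {queue: percent} for queues using more than thr_ind percent of all segments."""
    shares = {
        queue: 100.0 * used / total_segments
        for queue, used in segments_by_queue.items()
    }
    return {queue: pct for queue, pct in shares.items() if pct > thr_ind}


# Hypothetical example: 1000 segments of partition space in total.
# The queue for a suspended route has grown far beyond the others.
usage = {"route_to_RS2": 720, "pubs2_outbound": 40, "pubs2_inbound": 15}
print(queues_over_share(usage, total_segments=1000, thr_ind=70))
```

Note that each queue's share is measured against the total partition space, not against the space currently in use.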

When the percentage of the total partition space used by a stable queue drops below the sqm_warning_thr_ind percentage, Replication Server writes a cancel message like the following to the log file:

WARNING CANCEL: Stable Storage Use by <queue name> is Below <threshold> percent.
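
To find warnings that have not yet been cancelled, the log file can be scanned for the four message forms shown above. The Python sketch below assumes that the threshold appears in the log as a plain number and that each message fits on one line. The sample lines, including the queue name 101:1, are invented for illustration.

```python
import re

# Matches the server-wide and per-queue forms, with or without CANCEL.
PATTERN = re.compile(
    r"WARNING(?P<cancel> CANCEL)?: Stable Storage Use"
    r"(?: by (?P<queue>.+?))? is (?:Above|Below) "
    r"(?P<threshold>\d+(?:\.\d+)?) percent"
)


def outstanding_warnings(log_lines):
    """Return the set of (queue, threshold) warnings with no later cancel message.

    queue is None for server-wide warnings.
    """
    active = set()
    for line in log_lines:
        match = PATTERN.search(line)
        if match is None:
            continue  # not a stable storage message
        key = (match.group("queue"), match.group("threshold"))
        if match.group("cancel"):
            active.discard(key)
        else:
            active.add(key)
    return active


# Hypothetical log excerpt.
sample_log = [
    "WARNING: Stable Storage Use is Above 75 percent",
    "WARNING: Stable Storage Use by 101:1 is Above 70 percent",
    "WARNING CANCEL: Stable Storage Use is Below 75 percent",
]
print(outstanding_warnings(sample_log))
```

Because the pattern is applied with search, any timestamp or other prefix that precedes the message on a log line is ignored.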