You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
|The lag of the shadow partition against the source partition, calculated as source partition LSO (Last Stable Offset) minus shadow partition HWM (High Watermark). Monitor by `shadow_link_name`, `topic`, and `partition` to understand replication lag for each partition.
|The total number of bytes fetched by a sharded replicator (bytes received by the client). Labeled by `shadow_link_name` and `shard` to track data transfer volume from the source cluster.
|The total number of bytes written by a sharded replicator (bytes written to the write_at_offset_stm). Uses `shadow_link_name` and `shard` labels to monitor data written to the shadow cluster.
70
70
71
-
|`redpanda_shadow_link_client_errors`
72
-
|Count
73
-
|The number of errors seen by the client. Track by `shadow_link_name` and `shard` to identify connection or protocol issues between clusters.
|Number of shadow topics in the respective states. Labeled by `shadow_link_name` and `state` to monitor topic state distribution across your shadow links.
|The total number of records fetched by the sharded replicator (records received by the client). Monitor by `shadow_link_name` and `shard` to track message throughput from the source.
|The total number of records written by a sharded replicator (records written to the write_at_offset_stm). Uses `shadow_link_name` and `shard` labels to monitor message throughput to the shadow cluster.
86
82
|===
87
83
88
-
See also: xref:reference:public-metrics-reference.adoc[]
84
+
For detailed descriptions of each metric, including usage examples and label definitions, see xref:reference:public-metrics-reference.adoc#shadow-link-metrics[Shadow Link metrics reference].
85
+
86
+
See also: xref:reference:public-metrics-reference.adoc#shadow-link-metrics[Shadow Link metrics reference]
Configure monitoring alerts for following conditions, which indicate problems with Shadowing:
108
106
109
-
* **High replication lag**: When `redpanda_shadow_link_shadow_lag` exceeds your RPO requirements
110
-
* **Connection errors**: When `redpanda_shadow_link_client_errors` increases rapidly
107
+
* **High replication lag**: When xref:reference:public-metrics-reference.adoc#redpanda_shadow_link_shadow_lag[`redpanda_shadow_link_shadow_lag`] exceeds your RPO requirements
111
108
* **Topic state changes**: When topics move to `FAULTED` state
112
109
* **Task failures**: When replication tasks enter `FAULTED` or `NOT_RUNNING` states
113
110
* **Throughput drops**: When bytes/records fetched drops significantly
Copy file name to clipboardExpand all lines: modules/reference/pages/public-metrics-reference.adoc
+81Lines changed: 81 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2343,6 +2343,87 @@ Total number of bytes uploaded for the topic to object storage.
2343
2343
- `redpanda_namespace`
2344
2344
- `redpanda_topic`
2345
2345
2346
+
---
2347
+
2348
+
== Shadow Link metrics
2349
+
2350
+
=== redpanda_shadow_link_shadow_lag
2351
+
2352
+
The lag of the shadow partition against the source partition, calculated as source partition LSO (Last Stable Offset) minus shadow partition HWM (High Watermark). Monitor this metric to understand replication lag for each partition and ensure your RPO requirements are being met.
2353
+
2354
+
*Type*: gauge
2355
+
2356
+
*Labels*:
2357
+
2358
+
- `shadow_link_name` - Name of the shadow link
2359
+
- `topic` - Topic name
2360
+
- `partition` - Partition identifier
2361
+
2362
+
---
2363
+
2364
+
=== redpanda_shadow_link_shadow_topic_state
2365
+
2366
+
Number of shadow topics in the respective states. Monitor this metric to track the health and status distribution of shadow topics across your shadow links.
Total number of bytes fetched by a sharded replicator (bytes received by the client). Use this metric to track data transfer volume from the source cluster.
2380
+
2381
+
*Type*: counter
2382
+
2383
+
*Labels*:
2384
+
2385
+
- `shadow_link_name` - Name of the shadow link
2386
+
- `shard` - Shard identifier
2387
+
2388
+
---
2389
+
2390
+
=== redpanda_shadow_link_total_bytes_written
2391
+
2392
+
Total number of bytes written by a sharded replicator (bytes written to the write_at_offset_stm). Use this metric to monitor data written to the shadow cluster.
2393
+
2394
+
*Type*: counter
2395
+
2396
+
*Labels*:
2397
+
2398
+
- `shadow_link_name` - Name of the shadow link
2399
+
- `shard` - Shard identifier
2400
+
2401
+
---
2402
+
2403
+
=== redpanda_shadow_link_total_records_fetched
2404
+
2405
+
Total number of records fetched by the sharded replicator (records received by the client). Monitor this metric to track message throughput from the source cluster.
2406
+
2407
+
*Type*: counter
2408
+
2409
+
*Labels*:
2410
+
2411
+
- `shadow_link_name` - Name of the shadow link
2412
+
- `shard` - Shard identifier
2413
+
2414
+
---
2415
+
2416
+
=== redpanda_shadow_link_total_records_written
2417
+
2418
+
Total number of records written by a sharded replicator (records written to the write_at_offset_stm). Use this metric to monitor message throughput to the shadow cluster.
2419
+
2420
+
*Type*: counter
2421
+
2422
+
*Labels*:
2423
+
2424
+
- `shadow_link_name` - Name of the shadow link
2425
+
- `shard` - Shard identifier
2426
+
2346
2427
== Related topics
2347
2428
2348
2429
* xref:manage:monitoring.adoc[Learn how to monitor Redpanda]
0 commit comments