| 1 | +.. _health-managers-overview: |
| 2 | + |
| 3 | +.. include:: /includes/health-manager-short-names.rst |
| 4 | + |
| 5 | +================================================== |
| 6 | +Manage Sharded Cluster Health with Health Managers |
| 7 | +================================================== |
| 8 | + |
| 9 | +.. default-domain:: mongodb |
| 10 | + |
| 11 | +.. contents:: On this page |
| 12 | + :local: |
| 13 | + :backlinks: none |
| 14 | + :depth: 1 |
| 15 | + :class: singlecol |
| 16 | + |
| 17 | +This document describes how to use |HMS| to monitor and manage sharded |
| 18 | +cluster health issues. |
| 19 | + |
| 20 | +Overview |
| 21 | +-------- |
| 22 | + |
| 23 | +A |HM| runs health checks on a :term:`health manager facet` |
| 24 | +at a specified :ref:`intensity level |
| 25 | +<health-managers-intensity-levels>`. |HM| checks |
| 26 | +run at specified time intervals. A |HM| can be configured to |
| 27 | +move a failing :ref:`mongos <mongos>` out of a cluster automatically. |
| 28 | +:ref:`Progress Monitor <health-managers-progress-monitor>` ensures |
| 29 | +that |HM| checks do not become stuck or unresponsive. |
| 30 | + |
| 31 | +.. _health-managers-facets: |
| 32 | + |
| 33 | +Health Manager Facets |
| 34 | +~~~~~~~~~~~~~~~~~~~~~ |
| 35 | + |
| 36 | +The following table shows the available |HM| facets: |
| 37 | + |
| 38 | +.. include:: /includes/fact-health-manager-facets.rst |
| 39 | + |
| 40 | +.. _health-managers-intensity-levels: |
| 41 | + |
| 42 | +Health Manager Intensity Levels |
| 43 | +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ |
| 44 | + |
| 45 | +The following table shows the available |HM| intensity levels: |
| 46 | + |
| 47 | +.. include:: /includes/fact-health-manager-intensities.rst |
| 48 | + |
| 49 | +.. _health-managers-active-fault: |
| 50 | + |
| 51 | +Active Fault Duration |
| 52 | +--------------------- |
| 53 | + |
| 54 | +When a failure is detected and the |HM| intensity level |
| 55 | +is set to ``critical``, the |HM| waits for the number of seconds specified by |
| 56 | +:parameter:`activeFaultDurationSecs` before stopping the |
| 57 | +:ref:`mongos <mongos>` and moving it out of the cluster automatically. |
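
The waiting behavior described above can be modeled in a few lines. The following Python sketch is a simplified illustration, not the server's implementation. The function name ``should_remove_mongos`` and its arguments are hypothetical, the ``non-critical`` intensity name is assumed from the intensity table, and treating the threshold as inclusive is also an assumption.

```python
def should_remove_mongos(intensity: str,
                         fault_age_secs: float,
                         active_fault_duration_secs: float) -> bool:
    """Return True when the modeled Health Manager would remove the mongos.

    Hypothetical model: only a facet set to "critical" triggers removal,
    and only once the fault has stayed active for the configured duration.
    """
    if intensity != "critical":
        # Assumption: other intensity levels never lead to automatic removal.
        return False
    # Assumption: the threshold is inclusive; the source does not say.
    return fault_age_secs >= active_fault_duration_secs


if __name__ == "__main__":
    # A critical fault that has been active for 120 s with a 300 s duration.
    print(should_remove_mongos("critical", 120, 300))
```

In this model, a shorter ``activeFaultDurationSecs`` removes a failing ``mongos`` sooner, while a longer value tolerates transient failures.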
| 58 | + |
| 59 | +.. _health-managers-progress-monitor: |
| 60 | + |
| 61 | +Progress Monitor |
| 62 | +---------------- |
| 63 | + |
| 64 | +.. include:: /includes/fact-progressMonitor.rst |
| 65 | + |
| 66 | +``progressMonitor`` Fields |
| 67 | +~~~~~~~~~~~~~~~~~~~~~~~~~~ |
| 68 | + |
| 69 | +.. include:: /includes/fact-progress-monitor-fields.rst |
| 70 | + |
| 71 | +Examples |
| 72 | +-------- |
| 73 | + |
| 74 | +The following examples show how to configure |HMS|. For |
| 75 | +information on |HM| parameters, see :ref:`health-manager-parameters`. |
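
As a complement to the included examples, the following Python sketch builds ``setParameter`` command documents for the intensity and interval parameters. It is a minimal illustration under stated assumptions: the helper names ``intensity_command`` and ``interval_command`` are hypothetical, the facet names (``configServer``, ``dns``, ``ldap``) and intensity names (``critical``, ``non-critical``, ``off``) are assumed to match the included tables, and intervals are assumed to be in milliseconds.

```python
import json

# Assumed to match the facet and intensity tables included on this page.
FACETS = {"configServer", "dns", "ldap"}
INTENSITIES = {"critical", "non-critical", "off"}


def intensity_command(settings: dict) -> dict:
    """Build a setParameter document for healthMonitoringIntensities."""
    values = []
    for facet, intensity in settings.items():
        if facet not in FACETS:
            raise ValueError(f"unknown facet: {facet}")
        if intensity not in INTENSITIES:
            raise ValueError(f"unknown intensity: {intensity}")
        values.append({"type": facet, "intensity": intensity})
    # "setParameter" is inserted first so it stays the first key of the command.
    return {"setParameter": 1, "healthMonitoringIntensities": {"values": values}}


def interval_command(settings: dict) -> dict:
    """Build a setParameter document for healthMonitoringIntervals."""
    values = []
    for facet, interval_ms in settings.items():
        if facet not in FACETS:
            raise ValueError(f"unknown facet: {facet}")
        if not isinstance(interval_ms, int) or interval_ms <= 0:
            raise ValueError("interval must be a positive integer")
        # Assumption: the interval unit is milliseconds.
        values.append({"type": facet, "interval": interval_ms})
    return {"setParameter": 1, "healthMonitoringIntervals": {"values": values}}


if __name__ == "__main__":
    print(json.dumps(intensity_command({"dns": "critical"}), indent=2))
    print(json.dumps(interval_command({"ldap": 30000}), indent=2))
```

The resulting dictionaries could be passed to a driver's admin command helper, or the printed JSON adapted for ``db.adminCommand()`` in the shell. Validating facet and intensity names before sending the command is a design choice of this sketch, not a requirement stated in the source.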
| 76 | + |
| 77 | +Intensity |
| 78 | +~~~~~~~~~ |
| 79 | + |
| 80 | +.. include:: /includes/example-healthMonitoringIntensities.rst |
| 81 | + |
| 82 | +.. include:: /includes/fact-healthMonitoringIntensities-values-array.rst |
| 83 | + |
| 84 | +See :parameter:`healthMonitoringIntensities` for details. |
| 85 | + |
| 86 | +Intervals |
| 87 | +~~~~~~~~~ |
| 88 | + |
| 89 | +.. include:: /includes/example-healthMonitoringIntervals.rst |
| 90 | + |
| 91 | +.. include:: /includes/fact-healthMonitoringIntervals-values-array.rst |
| 92 | + |
| 93 | +See :parameter:`healthMonitoringIntervals` for details. |
| 94 | + |
| 95 | +Active Fault Duration |
| 96 | +~~~~~~~~~~~~~~~~~~~~~ |
| 97 | + |
| 98 | +.. include:: /includes/example-activeFaultDurationSecs.rst |
| 99 | + |
| 100 | +See :parameter:`activeFaultDurationSecs` for details. |
| 101 | + |
| 102 | +Progress Monitor |
| 103 | +~~~~~~~~~~~~~~~~ |
| 104 | + |
| 105 | +.. include:: /includes/fact-progressMonitor.rst |
| 106 | + |
| 107 | +.. include:: /includes/example-progress-monitor.rst |
| 108 | + |
| 109 | +See :parameter:`progressMonitor` for details. |