OpsGuide/Maintenance, Failures, and Debugging
Revision as of 02:33, 14 November 2017 by David.desrosiers (talk | contribs) (David.desrosiers moved page Maintenance, Failures, and Debugging to OpsGuide/Maintenance, Failures, and Debugging without leaving a redirect)
- Cloud Controller and Storage Proxy Failures and Maintenance
- Compute Node Failures and Maintenance
- Storage Node Failures and Maintenance
- Handling a Complete Failure
- Configuration Management
- Working with Hardware
- Databases
- RabbitMQ troubleshooting
- HDWMY (Hourly, Daily, Weekly, Monthly, and Yearly Tasks)
- Determining Which Component Is Broken
- What to do when things are running slowly
- Uninstalling
Downtime, whether planned or unscheduled, is a certainty when running a cloud. This chapter provides information for dealing with these occurrences, both proactively and reactively.