Jump to: navigation, search

Large Scale SIG/ScaleUp

The third stage in the Scaling Journey is Scale Up.

As you monitor your cluster at scale, you will see that it hits scaling limits within one cluster. All hope is not lost, though! There are things you can put in place push back how much a single cluster can handle, before having to resort to setting up a more complex deployment configuration. This page aims to help answer those questions.

Once you are past that stage, you are ready to proceed to next stage of the Scaling Journey: Scale Out.


Q: Cleaning up deleted entries in my database is a bit of a hassle. is there a tool I could use to help me with that?

A: The OSarchiver tool, developed by OVH, can help you there: see https://github.com/ovh/osarchiver/ . We are working on making it maintained upstream as part of the OSops tooling.

Q: How many compute nodes can a typical OpenStack cluster contain ?

A: It is hard to give this question a singular answer, as it depends on the power of each compute node and the usage pattern in your cloud (amount of churn, oversubscription, I/O vs. CPU...). The answer ranges from 100 nodes (in case of high density nodes with lots of churn) to 2,000 nodes (simpler nodes with longer-lived workloads).

Q: How do you decide to add a new node for control plane

A: If you found out that your rabbitmq queue keep piling up for a certain service, it usually means that it's time to add more control plane workers to those service to consume the queue.


Other SIG work on that stage