Jump to: navigation, search

Difference between revisions of "Large Scale SIG"

Line 4: Line 4:
 
'''Chairs''':  
 
'''Chairs''':  
 
   Belmiro Moreira <belmiro.moreira@cern.ch>
 
   Belmiro Moreira <belmiro.moreira@cern.ch>
   Pengju Jiao <jiaopengju@cmss.chinamobile.com>
+
   Gene Kuo <ching.kuo@linecorp.com>
 
   Thierry Carrez <thierry@openstack.org>
 
   Thierry Carrez <thierry@openstack.org>
  
 
The aim of the group is to facilitate running OpenStack at large scale, answer questions that OpenStack operators have as they need to scale up and scale out, and help address some of the limitations operators encounter in large OpenStack clusters.
 
The aim of the group is to facilitate running OpenStack at large scale, answer questions that OpenStack operators have as they need to scale up and scale out, and help address some of the limitations operators encounter in large OpenStack clusters.
  
=== Goals ===
 
==== Scaling with one cluster ====
 
Document the typical scaling issues. Push back the limits preventing scaling within one single cluster. Instrument the bottlenecks.
 
  
Resources:
+
== The Scaling Journey ==
* Work tracker: https://etherpad.openstack.org/p/large-scale-sig-cluster-scaling
+
The work of the group is organized along the various stages in the scaling journey for someone growing an OpenStack deployment. That path was successfully traveled by many before, . The job of the SIG is to extract that knowledge and provide answers for those who come next.
* [[Large Scale Scaling Stories]]
 
* Oslo-metrics spec: https://specs.openstack.org/openstack/oslo-specs/specs/ussuri/oslo-metrics.html
 
  
==== Meaningful monitoring ====
+
For each step the SIG will collect frequently-asked questions and answers, articles, presentations. When documentation or tools are missing, we help to produce them.  
How to get meaningful and actionable information about the state of your OpenStack deployments.
 
  
Resources:
+
=== Stage 1: Configure ===
* Work tracker: https://etherpad.openstack.org/p/large-scale-sig-meaningful-monitoring
+
Tune configuration options and optimize the parameters for your OpenStack cluster, so that it can handle additional load.
  
==== Document large scale configuration ====
+
See [[Large_Scale_SIG/Configure]] for more details !
Produce a set of documentation and guidelines on how to configure OpenStack for large scale usage.
 
  
Resources:
+
=== Stage 2: Monitor ===
* Work tracker: https://etherpad.openstack.org/p/large-scale-sig-documentation
+
Meaningful monitoring of your cluster to detect strain and limits.
* [[Large Scale Configuration Guidelines]]
 
  
=== Communications ===
+
See [[Large_Scale_SIG/Monitor]] for more details !
 +
 
 +
=== Stage 3: Scale up ===
 +
As you reach those limits, what can be done to handle more load within one cluster.
 +
 
 +
See [[Large_Scale_SIG/ScaleUp]] for more details !
 +
 
 +
=== Stage 4: Scaling out ===
 +
Past a given scale, you will have to scale out to multiple clusters, regions, cells or zones. What are the available options?
 +
 
 +
See [[Large_Scale_SIG/ScaleOut]] for more details !
 +
 
 +
 
 +
== Join the SIG! ==
 
The Large Scale SIG will use mostly asynchronous communications means: discussions on the [http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-discuss openstack-discuss mailing-list] using the [largescale-sig] prefix, and various etherpads. Occasionally we may leverage the #openstack-operators IRC channel for synchronous discussion.
 
The Large Scale SIG will use mostly asynchronous communications means: discussions on the [http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-discuss openstack-discuss mailing-list] using the [largescale-sig] prefix, and various etherpads. Occasionally we may leverage the #openstack-operators IRC channel for synchronous discussion.
  
=== Meeting ===
+
=== IRC Meeting ===
The Large Scale SIG meets on IRC typically every two weeks. We rotate between an APAC/EU-friendly meeting time (8 UTC) and an EU/US-friendly meeting time (16 UTC).
+
The Large Scale SIG meets on IRC typically every two weeks:
See https://etherpad.openstack.org/p/large-scale-sig-meeting for details on the next meeting(s), and to propose meeting agenda items.
+
* Current [http://eavesdrop.openstack.org/meetings/large_scale_sig/ IRC meeting schedule]
 +
* Propose [https://etherpad.openstack.org/p/large-scale-sig-meeting agenda items] for our next meeting
 +
* Past meetings [http://eavesdrop.openstack.org/meetings/large_scale_sig/ summary and logs]
  
=== Past meetings summary and logs ===
+
=== Future events ===
See http://eavesdrop.openstack.org/meetings/large_scale_sig/
+
* tbd
  
=== Past and future events ===
+
=== Past events ===
* Project Teams Gathering meetings Wednesday Oct 28 7UTC-8UTC and 16UTC-17UTC
+
* Project Teams Gathering meetings Wednesday Oct 28 7UTC-8UTC and 16UTC-17UTC: see https://etherpad.opendev.org/p/wallaby-ptg-largescale-sig
** Notes at https://etherpad.opendev.org/p/w-forum-scaling-stories
+
* Open Infrastructure Summit, virtual, Oct 20, 2020 (forum session): see https://etherpad.opendev.org/p/vSummit2020_OpenStackScalingStory
* Open Infrastructure Summit, virtual, Oct 20, 2020 (forum session)
 
 
* OpenDev (virtual), June 2020
 
* OpenDev (virtual), June 2020
* Open Infrastructure Summit, Shanghai, Nov 4, 2019 (forum session)
+
* Open Infrastructure Summit, Shanghai, Nov 4, 2019 (forum session): see https://etherpad.openstack.org/p/PVG-large-scale-SIG
** Notes at https://etherpad.openstack.org/p/PVG-large-scale-SIG
 

Revision as of 13:23, 24 November 2020

Status: Active

Chairs:

 Belmiro Moreira <belmiro.moreira@cern.ch>
 Gene Kuo <ching.kuo@linecorp.com>
 Thierry Carrez <thierry@openstack.org>

The aim of the group is to facilitate running OpenStack at large scale, answer questions that OpenStack operators have as they need to scale up and scale out, and help address some of the limitations operators encounter in large OpenStack clusters.


The Scaling Journey

The work of the group is organized along the various stages in the scaling journey for someone growing an OpenStack deployment. That path was successfully traveled by many before, . The job of the SIG is to extract that knowledge and provide answers for those who come next.

For each step the SIG will collect frequently-asked questions and answers, articles, presentations. When documentation or tools are missing, we help to produce them.

Stage 1: Configure

Tune configuration options and optimize the parameters for your OpenStack cluster, so that it can handle additional load.

See Large_Scale_SIG/Configure for more details !

Stage 2: Monitor

Meaningful monitoring of your cluster to detect strain and limits.

See Large_Scale_SIG/Monitor for more details !

Stage 3: Scale up

As you reach those limits, what can be done to handle more load within one cluster.

See Large_Scale_SIG/ScaleUp for more details !

Stage 4: Scaling out

Past a given scale, you will have to scale out to multiple clusters, regions, cells or zones. What are the available options?

See Large_Scale_SIG/ScaleOut for more details !


Join the SIG!

The Large Scale SIG will use mostly asynchronous communications means: discussions on the openstack-discuss mailing-list using the [largescale-sig] prefix, and various etherpads. Occasionally we may leverage the #openstack-operators IRC channel for synchronous discussion.

IRC Meeting

The Large Scale SIG meets on IRC typically every two weeks:

Future events

  • tbd

Past events