
Difference between revisions of "Meetings/InfraTeamMeeting"

(Agenda for next meeting)
(Weekly Project Infrastructure team meeting)
(7 intermediate revisions by 2 users not shown)
Line 20: Line 20:
 
*** Zuul as CD engine
 
*** Zuul as CD engine
 
** OpenDev
 
** OpenDev
*** Service Coordinator position nominations
 
**** http://lists.opendev.org/pipermail/service-discuss/2021-January/000161.html Only nomination appears to be from Clark.
 
 
*** Gerrit account and group inconsistencies
 
*** Gerrit account and group inconsistencies
 
**** https://etherpad.opendev.org/p/gerrit-user-consistency-2021 High level notes.
 
**** https://etherpad.opendev.org/p/gerrit-user-consistency-2021 High level notes.
**** Group problems and 81 accounts with preferred emails missing external ids have been fixed.
+
**** We have 17 accounts with preferred email addresses that don't have a matching external id
**** We have 28 accounts with preferred email addresses that don't have a matching external id
 
**** We have ~642 accounts with conflicting emails in their external ids. This needs more investigating to better understand the fix for.
 
 
**** Need to correct the ~642 external id issues before we can push updates to refs/meta/external-ids with Gerrit online.
 
**** Need to correct the ~642 external id issues before we can push updates to refs/meta/external-ids with Gerrit online.
**** Workaround is we can stop gerrit, push to external ids directly, reindex accounts (and groups?), start gerrit, then clear accounts caches (and groups caches?)
+
**** Workaround is we can stop Gerrit, push to external ids directly, reindex accounts (and groups?), start gerrit, then clear accounts caches (and groups caches?)
 
**** Next steps
 
**** Next steps
***** Classify users further into situation groups
+
***** Identify accounts that are inactive and can be more forcefully retired. Retire these to fix those errors.
***** Decide on next steps for users depending on their situation group.
+
***** Identify accounts that are unlikely to be used anymore based on activity and more forcefully retire those to fix these errors. (We can always undo specific updates to these accounts if necessary)
***** Fix the preferred email issue if possible as this can be done with gerrit online
+
***** Work with remaining accounts to figure out how to best resolve the account conflicts. This may take some time.
***** Start a refs/meta/external-ids checkout in a shared location and begin committing fixes to it. If we can't push all the fixes as separate commits we can squash them together and then push.
+
***** https://review.opendev.org/c/opendev/system-config/+/777846 Collecting scripting efforts here
*** WIP changes (ianw 20210105)
 
**** Zuul should now support these properly. We need to retest.
 
*** Gerrit 3.3.1 includes the fix for Zuul and Zuul has the fixes too.
 
**** https://review.opendev.org/c/opendev/system-config/+/765021 Build 3.3 images, currently appears to need some work.
 
 
*** Configuration tuning
 
*** Configuration tuning
 
**** Using strong refs for jgit caches
 
**** Using strong refs for jgit caches
Line 43: Line 35:
  
 
* General topics
 
* General topics
** OpenAFS cluster status (clarkb 20210202)
+
** OpenAFS cluster status (clarkb 20210302)
*** What is server cluster status? Have they all been upgraded to 1.8.6?
+
*** Upgrading servers to Bionic then Focal next.
**** Upgrading servers to bionic then focal in place is next?
+
*** New third db server for proper quorum.
** Bup and Borg Backups (clarkb 20210202)
+
** Bup and Borg Backups (clarkb 20210302)
*** wiki backup status
+
*** gitea sql db backup issues.
*** borg disk consumption workarounds
+
** Picking up steam on Puppet -> Ansible rewrites (clarkb 20210302)
** Picking up steam on Puppet -> Ansible rewrites (clarkb 20210202)
 
 
*** Enable Xenial -> Bionic/Focal system upgrades
 
*** Enable Xenial -> Bionic/Focal system upgrades
*** Clarkb to write up an etherpad that captures the rough TODO list.
+
*** https://etherpad.opendev.org/p/infra-puppet-conversions-and-xenial-upgrades Start capturing TODO list here
** Deploy a new refstack.openstack.org server (kopecmartin 20210202)
+
*** Zuul service host updates in progress now. Mergers are done. Executors in progress.
*** https://review.opendev.org/c/opendev/system-config/+/705258
+
** Deploy a new refstack.openstack.org server (kopecmartin 20210302)
*** Help with deploying the server is needed
+
*** Ready for testing?
*** kopecmartin will help with testing the new instance
+
** Bridge disk space (clarkb 20210302)
 +
*** This appears at least partially related to ansible and python caching. Should we just clear those caches then profile them?
  
 
* Open discussion
 
* Open discussion

Revision as of 00:53, 2 March 2021

Weekly Project Infrastructure team meeting

The OpenDev Team holds public weekly meetings in #opendev-meeting, Tuesdays at 1900 UTC. Everyone interested in infrastructure and process surrounding automated testing and deployment is encouraged to attend.

Please feel free to add agenda items (and your IRC nick in parentheses).

Agenda for next meeting

  • Announcements
  • Actions from last meeting
  • Specs approval
  • Priority Efforts (Standing meeting agenda items. Please expand if you have subtopics.)
    • Update Config Management
      • topic:update-cfg-mgmt
      • Zuul as CD engine
    • OpenDev
      • Gerrit account and group inconsistencies
        • https://etherpad.opendev.org/p/gerrit-user-consistency-2021 High level notes.
        • We have 17 accounts with preferred email addresses that don't have a matching external id
        • Need to correct the ~642 external id issues before we can push updates to refs/meta/external-ids with Gerrit online.
        • Workaround: stop Gerrit, push to external ids directly, reindex accounts (and groups?), start Gerrit, then clear the accounts caches (and groups caches?)
        • Next steps
          • Identify accounts that are inactive and can be more forcefully retired. Retire these to fix those errors.
          • Identify accounts that are unlikely to be used anymore based on activity and more forcefully retire those to fix these errors. (We can always undo specific updates to these accounts if necessary)
          • Work with remaining accounts to figure out how to best resolve the account conflicts. This may take some time.
          • https://review.opendev.org/c/opendev/system-config/+/777846 Collecting scripting efforts here
      • Configuration tuning
        • Using strong refs for jgit caches
        • Batch user groups and threads
  • General topics
    • OpenAFS cluster status (clarkb 20210302)
      • Upgrading servers to Bionic then Focal next.
      • New third db server for proper quorum.
    • Bup and Borg Backups (clarkb 20210302)
      • Gitea SQL DB backup issues.
    • Picking up steam on Puppet -> Ansible rewrites (clarkb 20210302)
    • Deploy a new refstack.openstack.org server (kopecmartin 20210302)
      • Ready for testing?
    • Bridge disk space (clarkb 20210302)
      • This appears at least partially related to Ansible and Python caching. Should we just clear those caches and then profile them?
  • Open discussion
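
For the Gerrit account consistency item above, the two checks under discussion (preferred emails with no matching external id, and emails claimed by more than one account) can be sketched roughly as follows. This is only an illustration, not the scripting collected in the system-config change: the `ExternalId` record, both function names, and the in-memory input shapes are hypothetical, and emails are compared as exact strings, which may not match how Gerrit itself compares them.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable, List, Optional


@dataclass(frozen=True)
class ExternalId:
    """Hypothetical, simplified view of one external id record."""
    key: str                      # e.g. "mailto:alice@example.org" or "username:alice"
    account_id: int
    email: Optional[str] = None   # not every external id carries an email


def missing_preferred_email(
    preferred: Dict[int, str], external_ids: Iterable[ExternalId]
) -> List[int]:
    """Return account ids whose preferred email matches none of their external ids."""
    emails_by_account = defaultdict(set)
    for ext in external_ids:
        if ext.email:
            emails_by_account[ext.account_id].add(ext.email)
    return sorted(
        account_id
        for account_id, email in preferred.items()
        if email not in emails_by_account.get(account_id, set())
    )


def conflicting_emails(external_ids: Iterable[ExternalId]) -> Dict[str, List[int]]:
    """Return {email: account ids} for emails attached to more than one account."""
    accounts_by_email = defaultdict(set)
    for ext in external_ids:
        if ext.email:
            accounts_by_email[ext.email].add(ext.account_id)
    return {
        email: sorted(accounts)
        for email, accounts in accounts_by_email.items()
        if len(accounts) > 1
    }
```

Keeping the two checks separate mirrors the agenda: the preferred email problem can be fixed with Gerrit online, while the conflicting external ids cannot.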

Upcoming Project Renames

(any additions should mention original->new full names and link to the corresponding project-config rename change in Gerrit)

Previous meetings

Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/