Jump to: navigation, search

Difference between revisions of "Meetings/InfraTeamMeeting"

(Agenda for next meeting)
(Agenda for next meeting)
(430 intermediate revisions by 28 users not shown)
Line 3: Line 3:
 
= Weekly Project Infrastructure team meeting =
 
= Weekly Project Infrastructure team meeting =
  
The OpenStack Project Infrastructure Team holds public weekly meetings in <code><nowiki>#openstack-meeting</nowiki></code>, Tuesdays at 1900 UTC. Everyone interested in infrastructure and process surrounding automated testing and deployment is encouraged to attend.
+
The OpenDev Team holds public weekly meetings in <code><nowiki>#opendev-meeting</nowiki></code>, Tuesdays at 1900 UTC. Everyone interested in infrastructure and process surrounding automated testing and deployment is encouraged to attend.
  
 
Please feel free to add agenda items (and your IRC nick in parenthesis).
 
Please feel free to add agenda items (and your IRC nick in parenthesis).
Line 15: Line 15:
 
* Specs approval
 
* Specs approval
  
* Priority Efforts
+
* Priority Efforts (Standing meeting agenda items. Please expand if you have subtopics.)
** [http://specs.openstack.org/openstack-infra/infra-specs/specs/task-tracker.html A Task Tracker for OpenStack]
+
** [http://specs.openstack.org/openstack-infra/infra-specs/specs/update-config-management.html Update Config Management]
** [http://specs.openstack.org/openstack-infra/infra-specs/specs/zuulv3.html Zuul v3]
+
*** topic:update-cfg-mgmt
 
+
*** Zuul as CD engine
 +
** OpenDev
 +
*** Gerrit account and group inconsistencies
 +
**** https://etherpad.opendev.org/p/gerrit-user-consistency-2021 High level notes.
 +
**** Group problems and 92 accounts with preferred emails missing external ids have been fixed.
 +
**** We have 17 accounts with preferred email addresses that don't have a matching external id
 +
**** We have ~642 accounts with conflicting emails in their external ids. This needs more investigating to better understand the fix for.
 +
**** Need to correct the ~642 external id issues before we can push updates to refs/meta/external-ids with Gerrit online.
 +
**** Workaround is we can stop Gerrit, push to external ids directly, reindex accounts (and groups?), start gerrit, then clear accounts caches (and groups caches?)
 +
**** Next steps
 +
***** Classify users further into situation groups
 +
***** Decide on next steps for users depending on their situation group.
 +
***** Fix the preferred email issue if possible as this can be done with gerrit online
 +
***** Start a refs/meta/external-ids checkout in a shared location and begin committing fixes to it. If we can't push all the fixes as separate commits we can squash them together and then push.
 +
***** Fungi suggests we simply identify the active accounts then retire the rest for simplicity and speed. Clarkb likes this idea.
 +
***** Could really use a second or third set of eyes to review my notes and decisions. Will help ensure that the next steps I've described for specific accounts are good.
 +
*** Configuration tuning
 +
**** Using strong refs for jgit caches
 +
**** Batch user groups and threads
 +
*** Gitea OOMs
 +
**** https://review.opendev.org/c/opendev/system-config/+/774023 Rate limiting framework change for haproxy.
 +
**** https://review.opendev.org/c/opendev/system-config/+/775051 Dstat stat gathering in our system-config-run jobs to measure relative performance impacts.
  
 
* General topics
 
* General topics
** rax-ord clean up for nodepool (pabelanger)
+
** OpenAFS cluster status (clarkb 20210223)
*** 2 x fg-test, 1 x pypi.slave.openstack.org can we delete?
+
*** Upgrading servers to Bionic then Focal next.
** Backup server (ianw 2017-11-14)
+
** Bup and Borg Backups (clarkb 20210223)
*** retire ci-backup-rs-ord bup server
+
*** wiki backup status
*** https://review.openstack.org/516159 move other hosts
+
*** borg disk consumption workarounds
*** turn off ci-backup-rs-ord & attach old volumes to new host
+
** Picking up steam on Puppet -> Ansible rewrites (clarkb 20210223)
** puppetmaster health (ianw 2017-11-14)
+
*** Enable Xenial -> Bionic/Focal system upgrades
*** very small vm for it's job
+
*** https://etherpad.opendev.org/p/infra-puppet-conversions-and-xenial-upgrades Start capturing TODO list here
*** OOM'd and had to get rax to reboot it (credentials to reboot it on the host)
+
*** Zuul service host updates in progress now.
*** migration plan?
+
** Deploy a new refstack.openstack.org server (kopecmartin 20210223)
** bindep & external repositories (ianw 2017-11-14)
+
*** Ready for testing?
*** related to removal of centos-release-openstack-* repo (https://review.openstack.org/#/c/519533/)
+
** Bridge disk space (clarkb 20210223)
*** should bindep have a concept of being able to enable a repo as part of bindep.txt?
+
*** Our ansible logging is consuming a fair bit but user homedirs and /opt are other major consumers.
*** e.g. liberasurecode-devel [platform:centos enable_repo:epel] -> yum install --enablerepo=epel liberasurecode-devel ...
 
*** alternative is to insert a "- enable_blah_repo" role early in your job before bindep. But then your bindep.txt file really has an unexpressed dependency on that
 
** Jobs requiring IPv6 (bswartz)
 
*** Is it possible to create a job that requires a routeable IPv6 address to exist on the node?
 
** Zanata upgrade (to 4.x?) (clarkb/aeng)
 
*** Does not require newer java so this is purely a config and war update
 
  
 
* Open discussion
 
* Open discussion
Line 46: Line 61:
 
(any additions should mention original->new full names and link to the corresponding project-config rename change in Gerrit)
 
(any additions should mention original->new full names and link to the corresponding project-config rename change in Gerrit)
  
* collectd-ceilometer-plugin->collectd-openstack-plugins  https://review.openstack.org/#/c/500768
+
* foo/example -> bar/example: https://review.opendev.org/#/c/123456
  
 
== Previous meetings ==
 
== Previous meetings ==
 
Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/
 
Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/

Revision as of 21:46, 22 February 2021

Weekly Project Infrastructure team meeting

The OpenDev Team holds public weekly meetings in #opendev-meeting, Tuesdays at 1900 UTC. Everyone interested in infrastructure and process surrounding automated testing and deployment is encouraged to attend.

Please feel free to add agenda items (and your IRC nick in parenthesis).

Agenda for next meeting

  • Announcements
  • Actions from last meeting
  • Specs approval
  • Priority Efforts (Standing meeting agenda items. Please expand if you have subtopics.)
    • Update Config Management
      • topic:update-cfg-mgmt
      • Zuul as CD engine
    • OpenDev
      • Gerrit account and group inconsistencies
        • https://etherpad.opendev.org/p/gerrit-user-consistency-2021 High level notes.
        • Group problems and 92 accounts with preferred emails missing external ids have been fixed.
        • We have 17 accounts with preferred email addresses that don't have a matching external id
        • We have ~642 accounts with conflicting emails in their external ids. This needs more investigating to better understand the fix for.
        • Need to correct the ~642 external id issues before we can push updates to refs/meta/external-ids with Gerrit online.
        • Workaround is we can stop Gerrit, push to external ids directly, reindex accounts (and groups?), start gerrit, then clear accounts caches (and groups caches?)
        • Next steps
          • Classify users further into situation groups
          • Decide on next steps for users depending on their situation group.
          • Fix the preferred email issue if possible as this can be done with gerrit online
          • Start a refs/meta/external-ids checkout in a shared location and begin committing fixes to it. If we can't push all the fixes as separate commits we can squash them together and then push.
          • Fungi suggests we simply identify the active accounts then retire the rest for simplicity and speed. Clarkb likes this idea.
          • Could really use a second or third set of eyes to review my notes and decisions. Will help ensure that the next steps I've described for specific accounts are good.
      • Configuration tuning
        • Using strong refs for jgit caches
        • Batch user groups and threads
      • Gitea OOMs
  • General topics
    • OpenAFS cluster status (clarkb 20210223)
      • Upgrading servers to Bionic then Focal next.
    • Bup and Borg Backups (clarkb 20210223)
      • wiki backup status
      • borg disk consumption workarounds
    • Picking up steam on Puppet -> Ansible rewrites (clarkb 20210223)
    • Deploy a new refstack.openstack.org server (kopecmartin 20210223)
      • Ready for testing?
    • Bridge disk space (clarkb 20210223)
      • Our ansible logging is consuming a fair bit but user homedirs and /opt are other major consumers.
  • Open discussion

Upcoming Project Renames

(any additions should mention original->new full names and link to the corresponding project-config rename change in Gerrit)

Previous meetings

Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/