
Difference between revisions of "Meetings/InfraTeamMeeting"

(Agenda for next meeting)
(Upcoming Project Renames)
 
(91 intermediate revisions by 9 users not shown)
Line 3: Line 3:
 
= Weekly Project Infrastructure team meeting =
 
= Weekly Project Infrastructure team meeting =
  
The OpenDev Team holds public weekly meetings in <code><nowiki>#opendev-meeting</nowiki></code>, Tuesdays at 1900 UTC. Everyone interested in infrastructure and process surrounding automated testing and deployment is encouraged to attend.
+
The OpenDev Team holds public weekly meetings in <code><nowiki>#opendev-meeting</nowiki></code> on OFTC, Tuesdays at 1900 UTC. Everyone interested in infrastructure and process surrounding automated testing and deployment is encouraged to attend.
  
 
Please feel free to add agenda items (and your IRC nick in parentheses).
 
Please feel free to add agenda items (and your IRC nick in parentheses).
Line 10: Line 10:
  
 
* Announcements
 
* Announcements
 +
** PTG this week.
 +
*** OpenDev session Wednesday October 20, 2021 at 14:00 - 16:00 UTC in https://meetpad.opendev.org/oct2021-ptg-opendev
 +
*** Zuul session Thursday October 21, 2021 at 14:00 UTC in https://meetpad.opendev.org/zuul-2021-10-21
  
 
* Actions from last meeting
 
* Actions from last meeting
  
* Specs approval
+
* Specs Review
 +
** Mailman 3 spec https://review.opendev.org/810990
  
* Priority Efforts (Standing meeting agenda items. Please expand if you have subtopics.)
+
* Topics
** [http://specs.openstack.org/openstack-infra/infra-specs/specs/update-config-management.html Update Config Management]
+
** Improving OpenDev's CD throughput (clarkb 20211019)
*** topic:update-cfg-mgmt
+
*** We can run many of our jobs in parallel in all of our CD pipelines. But this requires we properly document/address dependencies
*** Zuul as CD engine
+
**** Need to understand our job dependencies and properly note them in Zuul config or address them by combining jobs.
** OpenDev
+
***** Example 1: Combine service-gitea-lb and service-gitea jobs.
*** Service Coordinator position nominations
+
***** Example 2: Combine letsencrypt and nameserver jobs
**** http://lists.opendev.org/pipermail/service-discuss/2021-January/000161.html less than a week remaining.
+
***** Example 3: Have all jobs with webserver config express a dependency on the letsencrypt job
*** Gerrit account and group inconsistencies
+
**** Suggest we document the known job dependencies in a human readable format, then encode this into zuul, then we can switch to parallel runs.
**** ~1 group has a subgroup membership loop.
+
**** https://review.opendev.org/c/opendev/system-config/+/807672
**** We have ~109 accounts with preferred email addresses that don't have a matching external id
+
***** should list dependencies for all jobs
***** Some of these are inactive accounts that we can more properly retire, others are accounts that can be set to inactive due to lack of external ids, some have email addresses we might convert to preferred email address, and others will need proper digging.
+
***** zuul doesn't trigger on this?  not sure on best approach to make it mergeable
**** We have ~642 accounts with conflicting emails in their external ids. This needs more investigating to better understand the fix for.
+
**** https://review.opendev.org/c/opendev/base-jobs/+/807807
**** Need to correct the ~642 external id issues before we can push updates to refs/meta/external-ids with Gerrit online.
+
***** currently every executor adds keys for bridge, then logs in and clones system-config before running playbooks
**** Workaround is we can stop gerrit, push to external ids directly, reindex accounts (and groups?), start gerrit, then clear accounts caches (and groups caches?)
+
***** this change makes split jobs to do this. however, production remains the same as both are called.
*** WIP changes (ianw 20210105)
+
**** https://review.opendev.org/c/opendev/system-config/+/807808
**** Zuul should now support these properly. We need to retest.
+
***** this is a follow-on that adds a base job to clone system-config, and stops the other production jobs re-cloning.
*** Gerrit 3.3.1 includes the fix for Zuul
+
***** this job must run first, but then all other jobs can run in parallel, as they are all in the same buildset and using the same "view" of system-config for that particular run
*** Configuration tuning
+
** Gerrit Account cleanups (clarkb 20211019)
**** Using strong refs for jgit caches
+
*** 33 conflicts remain. Clarkb has written notes on proposed plans for each user in the comments of review02:~clarkb/gerrit_user_cleanups/audit-results-annotated.yaml
**** Batch user groups and threads
+
** Gerrit project renames (clarkb 20211019)
 
+
*** Overall went well.
* General topics
+
*** Possible trouble renaming secrets and doing ZK secrets backups?
** OpenAFS cluster status (clarkb 20210119)
+
*** We accidentally updated all Gitea projects. Should we just do that by default: https://review.opendev.org/c/opendev/system-config/+/814443 ?
*** New packages built for CentOS
+
** Improve zuul restarts (frickler 20211014)
*** Need to sort out openafs-client installation on Debian Buster
+
*** Docs at https://docs.opendev.org/opendev/system-config/latest/zuul.html#restarting-the-scheduler need updating
*** Are we properly installing openafs-client on Xenial?
+
**** What to restart (scheduler,web,fingergw?) and how (docker restart vs. docker-compose)
*** https://review.opendev.org/c/opendev/system-config/+/771268
+
**** When to run the re-enqueue
** Picking up steam on Puppet -> Ansible rewrites (clarkb 20210119)
+
**** Collecting debug information
*** Enable Xenial -> Bionic/Focal system upgrades
+
*** Don't reenqueue periodic jobs
** Discuss infra-core (on behalf of OpenStack) (mnaser)
 
** two-review rule impact on low-activity projects (zbr 20210115)
 
*** projects like git-review rely on infra team and delay changes
 
*** risks for enabling exceptions? how could we expose these in gerrit?
 
*** apparently that issue is affecting multiple projects, maybe we can think about a generic solution?
 
** Bup and Borg Backups (clarkb 20210105)
 
  
 
* Open discussion
 
* Open discussion
Line 56: Line 54:
 
(any additions should mention original->new full names and link to the corresponding project-config rename change in Gerrit)
 
(any additions should mention original->new full names and link to the corresponding project-config rename change in Gerrit)
  
* foo/example -> bar/example: https://review.opendev.org/#/c/123456
+
* Rename foo/example -> bar/example: https://review.opendev.org/123456
  
 
== Previous meetings ==
 
== Previous meetings ==
 
Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/
 
Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/

Latest revision as of 22:56, 18 October 2021

Weekly Project Infrastructure team meeting

The OpenDev Team holds public weekly meetings in #opendev-meeting on OFTC, Tuesdays at 1900 UTC. Everyone interested in infrastructure and process surrounding automated testing and deployment is encouraged to attend.

Please feel free to add agenda items (and your IRC nick in parentheses).

Agenda for next meeting

  • Actions from last meeting
  • Topics
    • Improving OpenDev's CD throughput (clarkb 20211019)
      • We can run many of our jobs in parallel in all of our CD pipelines. But this requires we properly document/address dependencies
        • Need to understand our job dependencies and properly note them in Zuul config or address them by combining jobs.
          • Example 1: Combine service-gitea-lb and service-gitea jobs.
          • Example 2: Combine letsencrypt and nameserver jobs
          • Example 3: Have all jobs with webserver config express a dependency on the letsencrypt job
        • Suggest we document the known job dependencies in a human readable format, then encode this into zuul, then we can switch to parallel runs.
        • https://review.opendev.org/c/opendev/system-config/+/807672
          • should list dependencies for all jobs
          • zuul doesn't trigger on this? not sure on best approach to make it mergeable
        • https://review.opendev.org/c/opendev/base-jobs/+/807807
          • currently every executor adds keys for bridge, then logs in and clones system-config before running playbooks
          • this change makes split jobs to do this. however, production remains the same as both are called.
        • https://review.opendev.org/c/opendev/system-config/+/807808
          • this is a follow-on that adds a base job to clone system-config, and stops the other production jobs re-cloning.
          • this job must run first, but then all other jobs can run in parallel, as they are all in the same buildset and using the same "view" of system-config for that particular run
    • Gerrit Account cleanups (clarkb 20211019)
      • 33 conflicts remain. Clarkb has written notes on proposed plans for each user in the comments of review02:~clarkb/gerrit_user_cleanups/audit-results-annotated.yaml
    • Gerrit project renames (clarkb 20211019)
    • Improve zuul restarts (frickler 20211014)
  • Open discussion
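
As a rough illustration of the CD throughput topic above, the following sketch groups jobs into "waves" that could run in parallel once their dependencies are written down. It is only a sketch under stated assumptions: the job names and the dependency map are hypothetical placeholders loosely based on the examples in the agenda, not the actual system-config job graph, and it uses Python's standard `graphlib` module rather than anything Zuul provides.

```python
from graphlib import TopologicalSorter

# Hypothetical dependency map: job name -> set of jobs that must finish first.
# These names are illustrative only; the real dependencies still need to be
# documented, as the agenda item suggests.
DEPENDENCIES = {
    "clone-system-config": set(),
    "letsencrypt": {"clone-system-config"},
    "service-nameserver": {"clone-system-config"},
    "service-gitea": {"letsencrypt"},
    "service-zuul": {"letsencrypt"},
}


def parallel_waves(deps):
    """Return a list of waves; jobs within one wave have no unmet dependencies."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()  # raises graphlib.CycleError if the map contains a loop
    waves = []
    while sorter.is_active():
        # Every job whose dependencies have all completed is ready now.
        ready = sorted(sorter.get_ready())
        waves.append(ready)
        sorter.done(*ready)
    return waves


if __name__ == "__main__":
    for number, wave in enumerate(parallel_waves(DEPENDENCIES), start=1):
        print(f"wave {number}: {', '.join(wave)}")
```

In this hypothetical map the job that clones system-config runs alone first, and every later wave can run in parallel because all of its jobs see the same checkout. A human-readable map like this could be reviewed before the same relationships are encoded in the Zuul configuration.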

Upcoming Project Renames

(any additions should mention original->new full names and link to the corresponding project-config rename change in Gerrit)

Previous meetings

Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/