Difference between revisions of "Meetings/InfraTeamMeeting"

(Agenda for next meeting)
(Agenda for next meeting)
(11 intermediate revisions by 4 users not shown)
Line 14: Line 14:
  
 
* Specs Review
** Prometheus spec https://review.opendev.org/c/opendev/infra-specs/+/804122
+
** Mailman 3 spec https://review.opendev.org/810990
*** Need to decide on using node-exporter or snmp-exporter.
 
*** Seems like running node-exporter in docker is weird but doable. Using distro packages exposes us to changing metric names as they are all pre 1.0 release.
 
  
 
* Topics
** Mailman Ansible and Server Upgrades (clarkb 20210921)
+
** Improving OpenDev's CD throughput (clarkb 20211026)
*** lists.openstack.org is running focal now
 
**** Kernel packages have been pinned. If you update the kernel you need to decompress it.
 
**** Fungi is working on a mailman3 spec. We can decide the best route for replacing the server with that context.
 
** Improving OpenDev's CD throughput (clarkb 20210921)
 
 
*** We can run many of our jobs in parallel in all of our CD pipelines. But this requires we properly document/address dependencies

**** Need to understand our job dependencies and properly note them in Zuul config or address them by combining jobs.
Line 30: Line 24:
 
***** Example 3: Have all jobs with webserver config express a dependency on the letsencrypt job

**** Suggest we document the known job dependencies in a human readable format, then encode this into zuul, then we can switch to parallel runs.
**** https://review.opendev.org/c/opendev/system-config/+/807672/ starts to sketch this out.
+
**** https://review.opendev.org/c/opendev/system-config/+/807672
** Gerrit Account cleanups (clarkb 20210921)
+
***** should list dependencies for all jobs
 +
***** zuul doesn't trigger on this? not sure of the best approach to make it mergeable
 +
**** https://review.opendev.org/c/opendev/base-jobs/+/807807
 +
***** currently every executor adds keys for bridge, then logs in and clones system-config before running playbooks
 +
***** this change makes split jobs to do this. however, production remains the same as both are called.
 +
**** https://review.opendev.org/c/opendev/system-config/+/807808
 +
***** this is a follow-on that adds a base job to clone system-config, and stops the other production jobs re-cloning.
 +
***** this job must run first, but then all other jobs can run in parallel, as they are all in the same buildset and using the same "view" of system-config for that particular run
 +
** Gerrit Account cleanups (clarkb 20211026)
 
*** 33 conflicts remain. Clarkb has written notes on proposed plans for each user in the comments of review02:~clarkb/gerrit_user_cleanups/audit-results-annotated.yaml
** OpenDev logo hosting (clarkb 20210921)
+
** Fedora 34 test node booting problems (clarkb 20211026)
*** Next up would be updating paste and gerrit. Then we can safely upgrade the gitea server without impacting theming on other services.
+
*** Changes to Fedora's kernel packaging broke Xen
** InMotion cloud has been scaled up (clarkb 20210921)
+
*** Not yet sure if that may have also somehow broken OVH and iweb.
*** We need to work on tuning the over subscription ratios.
+
** Begin planning for Gerrit 3.4 upgrade (clarkb 20211026)
** Scheduling Gerrit project renames (clarkb 20210921)
+
*** Read through release notes and identify areas of concern.
*** Penciled in the week of October 11-15
+
**** https://www.gerritcodereview.com/3.4.html
** Gerrit replication "leaks" (clarkb 20210921)
+
*** Do we need to modify system-config-run-review-3.4 or system-config-upgrade-review jobs to test any specific behaviors?
*** We've added a 15 minute network timeout to the gerrit config. Need to restart the server to pick that up.
+
*** Probably worth holding a 3.4 test node and doing a skim for any unexpected behaviors.
  
 
* Open discussion
Line 47: Line 49:
 
(any additions should mention original->new full names and link to the corresponding project-config rename change in Gerrit)
  
* Venus project (below repos) moving from inspur to openstack namespace: https://review.opendev.org/808149
+
* Rename foo/example -> bar/example: https://review.opendev.org/123456
** inspur/python-venusclient -> openstack/python-venusclient
 
** inspur/venus -> openstack/venus
 
** inspur/venus-specs -> openstack/venus-specs
 
** inspur/venus-tempest-plugin -> openstack/venus-tempest-plugin
 
** inspur/venus-dashboard -> openstack/venus-dashboard
 
* osf to openinfra namespace move: https://review.opendev.org/808479
 
* ansible-role-refstack-client from x to openinfra: https://review.opendev.org/765787
 
  
 
== Previous meetings ==

Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/

Revision as of 23:40, 25 October 2021

Weekly Project Infrastructure team meeting

The OpenDev Team holds public weekly meetings in #opendev-meeting on OFTC, Tuesdays at 1900 UTC. Everyone interested in infrastructure and process surrounding automated testing and deployment is encouraged to attend.

Please feel free to add agenda items (and your IRC nick in parentheses).

Agenda for next meeting

  • Announcements
  • Actions from last meeting
  • Topics
    • Improving OpenDev's CD throughput (clarkb 20211026)
      • We can run many of our jobs in parallel in all of our CD pipelines. But this requires we properly document/address dependencies
        • Need to understand our job dependencies and properly note them in Zuul config or address them by combining jobs.
          • Example 1: Combine service-gitea-lb and service-gitea jobs.
          • Example 2: Combine letsencrypt and nameserver jobs
          • Example 3: Have all jobs with webserver config express a dependency on the letsencrypt job
        • Suggest we document the known job dependencies in a human readable format, then encode this into zuul, then we can switch to parallel runs.
        • https://review.opendev.org/c/opendev/system-config/+/807672
          • should list dependencies for all jobs
          • zuul doesn't trigger on this? not sure of the best approach to make it mergeable
        • https://review.opendev.org/c/opendev/base-jobs/+/807807
          • currently every executor adds keys for bridge, then logs in and clones system-config before running playbooks
          • this change splits that setup into separate jobs. however, production behaves the same, as both jobs are still called.
        • https://review.opendev.org/c/opendev/system-config/+/807808
          • this is a follow-on that adds a base job to clone system-config, and stops the other production jobs re-cloning.
          • this job must run first, but then all other jobs can run in parallel, as they are all in the same buildset and using the same "view" of system-config for that particular run
    • Gerrit Account cleanups (clarkb 20211026)
      • 33 conflicts remain. Clarkb has written notes on proposed plans for each user in the comments of review02:~clarkb/gerrit_user_cleanups/audit-results-annotated.yaml
    • Fedora 34 test node booting problems (clarkb 20211026)
      • Changes to Fedora's kernel packaging broke Xen
      • Not yet sure if that may have also somehow broken OVH and iweb.
    • Begin planning for Gerrit 3.4 upgrade (clarkb 20211026)
      • Read through release notes and identify areas of concern.
      • Do we need to modify system-config-run-review-3.4 or system-config-upgrade-review jobs to test any specific behaviors?
      • Probably worth holding a 3.4 test node and doing a skim for any unexpected behaviors.
  • Open discussion
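
The CD throughput topic above proposes writing down job dependencies first and only then switching to parallel runs. As a rough illustration of that idea (not how Zuul itself schedules jobs), the following Python sketch groups jobs into "waves" that could run concurrently once their dependencies have finished. The dependency edges are invented for the example, and the job name clone-system-config is hypothetical; only letsencrypt, nameserver, service-gitea and service-gitea-lb are names taken from the agenda.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def parallel_waves(deps):
    """Group jobs into waves. Every job in a wave has all of its
    dependencies satisfied by earlier waves, so the jobs within one
    wave could run in parallel.

    deps maps a job name to the set of jobs it depends on.
    Raises graphlib.CycleError if the dependencies form a cycle.
    """
    sorter = TopologicalSorter(deps)
    sorter.prepare()
    waves = []
    while sorter.is_active():
        # Jobs whose dependencies are all marked done.
        ready = sorted(sorter.get_ready())
        waves.append(ready)
        sorter.done(*ready)
    return waves


# Illustrative dependency map; the edges are assumptions, not the
# actual system-config job graph.
deps = {
    "clone-system-config": set(),          # hypothetical bootstrap job
    "letsencrypt": {"clone-system-config"},
    "nameserver": {"clone-system-config"},
    "service-gitea": {"letsencrypt"},      # webserver config needs certs
    "service-gitea-lb": {"letsencrypt"},
}

for number, wave in enumerate(parallel_waves(deps), start=1):
    print(f"wave {number}: {', '.join(wave)}")
```

With this made-up map, the bootstrap job runs alone first, then letsencrypt and nameserver together, then the two gitea jobs together. Documenting the real dependencies in a similar human-readable mapping would make it easier to check them before encoding them in Zuul.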

Upcoming Project Renames

(any additions should mention original->new full names and link to the corresponding project-config rename change in Gerrit)

Previous meetings

Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/