
Difference between revisions of "Meetings/InfraTeamMeeting"

(Agenda for next meeting)
(Agenda for next meeting)
 
(322 intermediate revisions by 22 users not shown)
Line 3: Line 3:
 
= Weekly Project Infrastructure team meeting =
 
  
The OpenStack Project Infrastructure Team holds public weekly meetings in <code><nowiki>#openstack-meeting</nowiki></code>, Tuesdays at 1900 UTC. Everyone interested in infrastructure and process surrounding automated testing and deployment is encouraged to attend.
+
The OpenDev Team holds public weekly meetings in <code><nowiki>#opendev-meeting</nowiki></code> on OFTC, Tuesdays at 1900 UTC. Everyone interested in infrastructure and process surrounding automated testing and deployment is encouraged to attend.
  
 
Please feel free to add agenda items (and your IRC nick in parentheses).
 
Line 10: Line 10:
  
 
* Announcements
 
 +
** Gerrit User Summit happening December 2&3 virtually.
 +
** clarkb out next week. Should we skip the meeting November 23?
  
 
* Actions from last meeting
 
  
* Specs approval
+
* Specs Review
  
* Priority Efforts (Standing meeting agenda items. Please expand if you have subtopics.)
+
* Topics
** [http://specs.openstack.org/openstack-infra/infra-specs/specs/task-tracker.html A Task Tracker for OpenStack]
+
** Improving OpenDev's CD throughput (clarkb 20211116)
** [http://specs.openstack.org/openstack-infra/infra-specs/specs/update-config-management.html Update Config Management]
+
*** We can run many of our jobs in parallel in all of our CD pipelines. But this requires we properly document/address dependencies
*** topic:puppet-4 and topic:update-cfg-mgmt
+
**** Need to understand our job dependencies and properly note them in Zuul config or address them by combining jobs.
*** Zuul as CD engine
+
***** Example 1: Combine service-gitea-lb and service-gitea jobs.
** OpenDev
+
***** Example 2: Combine letsencrypt and nameserver jobs
*** https://storyboard.openstack.org/#!/story/2004627
+
***** Example 3: Have all jobs with webserver config express a dependency on the letsencrypt job
*** Setting specific time for maintenance work to start
+
**** Suggest we document the known job dependencies in a human readable format, then encode this into zuul, then we can switch to parallel runs.
*** Are we ready? What do we need to do between now and Friday?
+
**** https://review.opendev.org/c/opendev/system-config/+/807672
 
+
***** should list dependencies for all jobs
* General topics
+
***** Zuul doesn't trigger on this? Not sure of the best approach to make it mergeable.
** LetsEncrypt Progress (clarkb 20190416)
+
**** https://review.opendev.org/c/opendev/base-jobs/+/807807
*** graphite.opendev.org is now LetsEncrypted. Next steps are restarting services when certs update and testing of that process.
+
***** currently every executor adds keys for bridge, then logs in and clones system-config before running playbooks
** Trusty Upgrade Progress (clarkb 20190416)
+
***** This change splits that work into separate jobs. However, production behavior remains the same because both jobs are still called.
*** lists.openstack.org has been upgraded.
+
**** https://review.opendev.org/c/opendev/system-config/+/807808
** Removal of ugo+rw chmod from default job definitions (zbr 20190415)
+
***** this is a follow-on that adds a base job to clone system-config, and stops the other production jobs re-cloning.
*** Mainly this insanely unsafe chmod does prevent ansible from loading ansible.cfg files from our repos. That is happening from "Change zuul-cloner permissions" task which is run even for basic jobs like openstack-tox-py* ones.
+
***** this job must run first, but then all other jobs can run in parallel, as they are all in the same buildset and using the same "view" of system-config for that particular run
 +
** Gerrit Account cleanups (clarkb 20211116)
 +
*** 33 conflicts remain. Clarkb has written notes on proposed plans for each user in the comments of review02:~clarkb/gerrit_user_cleanups/audit-results-annotated.yaml
 +
** Zuul multi scheduler setup (clarkb 20211116)
 +
*** Zuul is currently running with two schedulers (zuul01.o.o and zuul02.o.o with zuul02.o.o being "primary")
 +
*** Did first rolling restart of schedulers over the weekend.
 +
*** Zuul-web should return consistent results now as it talks to ZooKeeper directly.
 +
** User management on our systems (clarkb 20211116)
 +
*** Give gerritbot and matrix-gerritbot a shared user: https://review.opendev.org/c/opendev/system-config/+/816769/
 +
*** Eventually convert mariadb containers from uid 999 to something that makes more sense on the system.
 +
** Caching openstack/openstack on our DIB images (clarkb 20211116)
 +
*** There are semi-frequent errors when updating the DIB cache for openstack/openstack.
 +
*** Seems related to verifying or updating submodule content.
 +
*** Should we simply stop caching this repo entirely? It isn't really used for much.
  
 
* Open discussion
 
Line 38: Line 53:
 
(any additions should mention original->new full names and link to the corresponding project-config rename change in Gerrit)
 
  
* AJKavanagh [tinwood] - Rename charm-lxd to charm-nova-lxd, with a project-config change at https://review.openstack.org/644584 .
+
* Rename foo/example -> bar/example: https://review.opendev.org/123456
  
 
== Previous meetings ==
 
 
Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/
 

Latest revision as of 00:11, 16 November 2021

Weekly Project Infrastructure team meeting

The OpenDev Team holds public weekly meetings in #opendev-meeting on OFTC, Tuesdays at 1900 UTC. Everyone interested in infrastructure and process surrounding automated testing and deployment is encouraged to attend.

Please feel free to add agenda items (and your IRC nick in parentheses).

Agenda for next meeting

  • Announcements
    • Gerrit User Summit happening December 2&3 virtually.
    • clarkb out next week. Should we skip the meeting November 23?
  • Actions from last meeting
  • Specs Review
  • Topics
    • Improving OpenDev's CD throughput (clarkb 20211116)
      • We can run many of our jobs in parallel in all of our CD pipelines. But this requires we properly document/address dependencies
        • Need to understand our job dependencies and properly note them in Zuul config or address them by combining jobs.
          • Example 1: Combine service-gitea-lb and service-gitea jobs.
          • Example 2: Combine letsencrypt and nameserver jobs
          • Example 3: Have all jobs with webserver config express a dependency on the letsencrypt job
        • Suggest we document the known job dependencies in a human readable format, then encode this into zuul, then we can switch to parallel runs.
        • https://review.opendev.org/c/opendev/system-config/+/807672
          • should list dependencies for all jobs
          • Zuul doesn't trigger on this? Not sure of the best approach to make it mergeable.
        • https://review.opendev.org/c/opendev/base-jobs/+/807807
          • currently every executor adds keys for bridge, then logs in and clones system-config before running playbooks
          • This change splits that work into separate jobs. However, production behavior remains the same because both jobs are still called.
        • https://review.opendev.org/c/opendev/system-config/+/807808
          • this is a follow-on that adds a base job to clone system-config, and stops the other production jobs re-cloning.
          • this job must run first, but then all other jobs can run in parallel, as they are all in the same buildset and using the same "view" of system-config for that particular run
    • Gerrit Account cleanups (clarkb 20211116)
      • 33 conflicts remain. Clarkb has written notes on proposed plans for each user in the comments of review02:~clarkb/gerrit_user_cleanups/audit-results-annotated.yaml
    • Zuul multi scheduler setup (clarkb 20211116)
      • Zuul is currently running with two schedulers (zuul01.o.o and zuul02.o.o with zuul02.o.o being "primary")
      • Did first rolling restart of schedulers over the weekend.
      • Zuul-web should return consistent results now as it talks to ZooKeeper directly.
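    • User management on our systems (clarkb 20211116)
      • Give gerritbot and matrix-gerritbot a shared user: https://review.opendev.org/c/opendev/system-config/+/816769/
      • Eventually convert mariadb containers from uid 999 to something that makes more sense on the system.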
    • Caching openstack/openstack on our DIB images (clarkb 20211116)
      • There are semi-frequent errors when updating the DIB cache for openstack/openstack.
      • Seems related to verifying or updating submodule content.
      • Should we simply stop caching this repo entirely? It isn't really used for much.
  • Open discussion
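
As a rough illustration of the CD throughput topic above (document the job dependencies, then run everything else in parallel), the following Python sketch groups jobs into stages that could run concurrently once their dependencies have finished. It is not Zuul configuration and not how Zuul schedules jobs. The dependency map is an assumption made up for this example: bootstrap-bridge and service-graphite are hypothetical job names, and the edges shown are illustrative rather than the real system-config dependencies. It requires Python 3.9 or later for graphlib.

```python
from graphlib import TopologicalSorter

# Hypothetical dependency map: job name -> set of jobs that must finish first.
# "bootstrap-bridge" and "service-graphite" are made-up names for illustration;
# the edges are assumptions, not the actual system-config dependencies.
JOB_DEPENDENCIES = {
    "bootstrap-bridge": set(),             # assumed base job that clones system-config once
    "letsencrypt": {"bootstrap-bridge"},
    "nameserver": {"bootstrap-bridge"},
    "service-gitea-lb": {"bootstrap-bridge"},
    "service-gitea": {"letsencrypt"},      # assumed: webserver config needs certs first
    "service-graphite": {"letsencrypt"},   # assumed: webserver config needs certs first
}


def parallel_stages(dependencies):
    """Group jobs into ordered stages.

    Every job in a stage has all of its dependencies satisfied by earlier
    stages, so the jobs within one stage could run in parallel.
    """
    sorter = TopologicalSorter(dependencies)
    sorter.prepare()  # raises graphlib.CycleError if the dependencies are cyclic
    stages = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # sorted only to make the output stable
        stages.append(ready)
        sorter.done(*ready)
    return stages


if __name__ == "__main__":
    for number, stage in enumerate(parallel_stages(JOB_DEPENDENCIES), start=1):
        print(f"stage {number}: {', '.join(stage)}")
```

Writing the dependencies down as a plain mapping like this mirrors the suggestion to first document them in a human-readable format; encoding them in Zuul would be a separate step.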

Upcoming Project Renames

(any additions should mention original->new full names and link to the corresponding project-config rename change in Gerrit)

Previous meetings

Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/