
Difference between revisions of "Meetings/InfraTeamMeeting"

(Agenda for next meeting)
(Agenda for next meeting)
(26 intermediate revisions by 2 users not shown)
Line 10: Line 10:
  
 
* Announcements
 
** OpenInfra Live will feature OpenDev January 18, 2024.
+
** OpenStack Release next week then PTG the week after
 +
** Put your PTG agenda items on the etherpad: https://etherpad.opendev.org/p/apr2024-ptg-opendev
  
 
* Actions from last meeting
 
Line 20: Line 21:
 
*** https://etherpad.opendev.org/p/opendev-bionic-server-upgrades
 
 
*** https://review.opendev.org/q/topic:jitsi_meet-jammy-update
 
*** I'm about 80% certain this is working as expected.  I think the next steps are to actually launch and deploy "real servers" with LE certs for more complete testing.
 
 
*** Started looking at the wiki; there are rough notes at: https://etherpad.opendev.org/p/opendev-bionic-server-upgrades#L58
 
*** Hound looks to be a fairly simple service with indexed/cached data on disk. It's currently running Focal with "bookworm" containers. Any reason to stay there?
+
** MariaDB Upgrades (clarkb 20240220)
*** I think there was a suggestion that we could implement prometheus and, once we're happy with that, drop cacti. Is that correct?
+
*** Relying on the container image MARIADB_AUTO_UPGRADE flag
** Python container updates (tonyb 20230718)
+
*** Etherpad, Gitea, Gerrit, and Mailman could use upgrades.
*** https://review.opendev.org/c/opendev/system-config/+/905018 Drop Bullseye python3.11 images
+
*** https://review.opendev.org/c/opendev/system-config/+/911000 Upgrade etherpad mariadb to 10.11
*** zuul-operator is the last hold out now
+
** AFS Mirror cleanups (clarkb 20240220)
**** https://review.opendev.org/c/zuul/zuul-operator/+/881245 is the change we need to get landed.
+
*** Ubuntu Xenial is next, but we are currently busy with PTG, Release, and other tasks.
** Updating Zuul's database server (clarkb 20231121)
+
*** Can follow up with webserver log processing to determine which other mirrors may be dead.
*** Currently this is an older mysql 5.7 trove instance
+
** Rebuilding Gerrit Images (clarkb 20240312)
*** We can move it to a self-hosted instance (maybe on a dedicated host?) running out of docker, like many of our other services, and get it more up to date.
+
*** Gerrit 3.9.2 has finally been released.
*** Are there other services we should consider this for as well?
+
*** https://review.opendev.org/c/opendev/system-config/+/912470 Update our 3.9 image to 3.9.2
*** Research/Planning questions: https://etherpad.opendev.org/p/opendev-zuul-mysql-upgrade
+
**** This will also rebuild our 3.8.4 image so we should try and restart prod gerrit on the new 3.8.4 image when available.
** EMS discontinuing legacy/consumer hosting plans (fungi 20231219)
+
*** Sounds like there are a number of bugfixes that a rebuild will get us. May be worth doing this just after the openstack release completes?
*** Fungi engaged to renew under the new discounted terms.
+
** Review02 had an oops last night (clarkb 20240326)
** AFS quota issues (frickler 20231217)
+
*** Found the server was shut down. After giving it a few minutes to potentially resolve itself (mostly worried about cloud action), clarkb manually started the instance and then started the containers.
*** mirror.openeuler has reached its quota limit and the mirror job seems to have been failing for two weeks. I'm also a bit worried that they seem to have doubled their volume over the last 12 months.
+
*** mnaser reports it may have been an OOM event on the hosting side.
*** ubuntu mirrors are also getting close, but we might have another couple of months there
+
** Rackspace MFA Requirement (clarkb 20240312)
*** mirror.centos-stream seems to have a steep increase in the last two months and might also run into quota limits soon
+
*** MFA is enabled. Enforcement day is today. Please look out for any issues.
*** We should be able to clean up older arm64 ubuntu-ports and possibly also older debian arm mirroring.
+
** Project Renames (clarkb 20240227)
** Broken wheel build issues (frickler 20231217)
+
*** https://review.opendev.org/c/opendev/system-config/+/911622 Move gerrit replication queue aside during project renames.
*** For arm nodes this is due to the lack of working openafs packages for centos on arm. This should be resolvable now that we have working arm dib builds again.
+
*** Penciled in for April 19, 2024; submit your rename requests now.
*** There is also a publication issue in choosing the wrong afs volume to vos release that affects x86 and presumably arm once openafs on arm works again.
+
** Nodepool image delete after upload (clarkb 20240319)
** Gitea repo-archives filling server disk (clarkb 20240109)
+
*** Nodepool now has the ability to delete on-disk files for images after they are uploaded. We could potentially keep only small qcow2s using this functionality to save disk space.
*** https://review.opendev.org/c/opendev/system-config/+/904868 update robots.txt on upstream's suggestion
 
*** The cron job to clear archives weekly is configured but services need to be restarted to pick up the change. Maybe land https://review.opendev.org/c/opendev/system-config/+/905020 first and then restart things.
 
** OpenDev Service Coordinator Election happening in February (clarkb 20240109)
 
*** https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/TB2OFBIGWZEYC7L4MCYA46EXIX5T47TY/
 
  
 
* Open discussion
 
Line 55: Line 51:
 
Changes should have their topic set to project-rename.
 
  
* Rename foo/example -> bar/example: https://review.opendev.org/123456
+
* Rename vexxhost/ansible-role-frrouting -> openstack/ansible-role-frrouting: https://review.opendev.org/c/openstack/project-config/+/910018
  
 
== Previous meetings ==
 
 
Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/
 

Revision as of 16:08, 26 March 2024

Weekly Project Infrastructure team meeting

The OpenDev Team holds public weekly meetings in #opendev-meeting on OFTC, Tuesdays at 1900 UTC. Everyone interested in infrastructure and process surrounding automated testing and deployment is encouraged to attend.

Please feel free to add agenda items (and your IRC nick in parentheses).

Agenda for next meeting

  • Actions from last meeting
  • Specs Review
  • Topics
    • Upgrading Bionic servers to Focal/Jammy (clarkb 20230627)
    • MariaDB Upgrades (clarkb 20240220)
    • AFS Mirror cleanups (clarkb 20240220)
      • Ubuntu Xenial is next, but we are currently busy with PTG, Release, and other tasks.
      • Can follow up with webserver log processing to determine which other mirrors may be dead.
    • Rebuilding Gerrit Images (clarkb 20240312)
      • Gerrit 3.9.2 has finally been released.
      • https://review.opendev.org/c/opendev/system-config/+/912470 Update our 3.9 image to 3.9.2
        • This will also rebuild our 3.8.4 image so we should try and restart prod gerrit on the new 3.8.4 image when available.
      • Sounds like there are a number of bugfixes that a rebuild will get us. May be worth doing this just after the openstack release completes?
    • Review02 had an oops last night (clarkb 20240326)
      • Found the server was shut down. After giving it a few minutes to potentially resolve itself (mostly worried about cloud action), clarkb manually started the instance and then started the containers.
      • mnaser reports it may have been an OOM event on the hosting side.
    • Rackspace MFA Requirement (clarkb 20240312)
      • MFA is enabled. Enforcement day is today. Please look out for any issues.
    • Project Renames (clarkb 20240227)
    • Nodepool image delete after upload (clarkb 20240319)
      • Nodepool now has the ability to delete on-disk files for images after they are uploaded. We could potentially keep only small qcow2s using this functionality to save disk space.
  • Open discussion

Upcoming Project Renames

(any additions should mention original->new full names and link to the corresponding project-config rename change in Gerrit) Changes should have their topic set to project-rename.

Previous meetings

Previous meetings, with their notes and logs, can be found at http://eavesdrop.openstack.org/meetings/infra/ and earlier at http://eavesdrop.openstack.org/meetings/ci/