| 2024-09-26
      
      § | 
    
  | 07:32 | <kartik@deploy1003> | kartik, abi: Continuing with sync | [production] | 
            
  | 07:26 | <jelto@cumin1002> | START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: Upgrade GitLab Replica to new version | [production] | 
            
  | 07:26 | <kartik@deploy1003> | kartik, abi: Backport for [[gerrit:1075522|Revert^2 "Enable message group subscription feature for Test Wikipedia" (T372386)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 07:24 | <kartik@deploy1003> | Started scap sync-world: Backport for [[gerrit:1075522|Revert^2 "Enable message group subscription feature for Test Wikipedia" (T372386)]] | [production] | 
            
  | 07:20 | <elukey@cumin1002> | START - Cookbook sre.hosts.reimage for host registry1004.eqiad.wmnet with OS bookworm | [production] | 
            
  | 07:19 | <jmm@cumin2002> | START - Cookbook sre.puppet.migrate-host for host cloudcephosd1034.eqiad.wmnet | [production] | 
            
  | 07:18 | <kartik@deploy1003> | Finished scap sync-world: Backport for [[gerrit:1075521|Translate: Add VirtualDomainsMapping (T372287)]] (duration: 13m 25s) | [production] | 
            
  | 07:13 | <kartik@deploy1003> | abi, kartik: Continuing with sync | [production] | 
            
  | 07:07 | <kartik@deploy1003> | abi, kartik: Backport for [[gerrit:1075521|Translate: Add VirtualDomainsMapping (T372287)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 07:04 | <kartik@deploy1003> | Started scap sync-world: Backport for [[gerrit:1075521|Translate: Add VirtualDomainsMapping (T372287)]] | [production] | 
            
  | 07:00 | <elukey> | apt-get clean on grafana2001 to free some space in the root partition | [production] | 
            
  | 00:41 | <sukhe@cumin1002> | END (PASS) - Cookbook sre.dns.admin (exit_code=0) DNS admin: pool site eqiad [reason: repooling due to repeated port utilization alerts, T370962] | [production] | 
            
  | 00:41 | <sukhe@cumin1002> | START - Cookbook sre.dns.admin DNS admin: pool site eqiad [reason: repooling due to repeated port utilization alerts, T370962] | [production] | 
            
  
    | 2024-09-25
      
      § | 
    
  | 21:26 | <jforrester@deploy1003> | helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:25 | <dcaro@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1039.eqiad.wmnet with OS bullseye | [production] | 
            
  | 21:25 | <dcaro@cumin1002> | END (FAIL) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=99) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - dcaro@cumin1002" | [production] | 
            
  | 21:25 | <jforrester@deploy1003> | helmfile [codfw] START helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:25 | <jforrester@deploy1003> | helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:25 | <jforrester@deploy1003> | helmfile [eqiad] START helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:24 | <jforrester@deploy1003> | helmfile [staging] DONE helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:23 | <jforrester@deploy1003> | helmfile [staging] START helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:19 | <jforrester@deploy1003> | helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:18 | <jforrester@deploy1003> | helmfile [eqiad] START helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:18 | <jforrester@deploy1003> | helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:17 | <jforrester@deploy1003> | helmfile [codfw] START helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:16 | <jforrester@deploy1003> | helmfile [staging] DONE helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:16 | <jforrester@deploy1003> | helmfile [staging] START helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:12 | <jforrester@deploy1003> | helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:11 | <jforrester@deploy1003> | helmfile [eqiad] START helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:11 | <jforrester@deploy1003> | helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:10 | <jforrester@deploy1003> | helmfile [codfw] START helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:09 | <jforrester@deploy1003> | helmfile [staging] DONE helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 21:08 | <jforrester@deploy1003> | helmfile [staging] START helmfile.d/services/wikifunctions: apply | [production] | 
            
  | 18:34 | <aokoth@cumin1002> | END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts vrts2001.codfw.wmnet | [production] | 
            
  | 18:34 | <aokoth@cumin1002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 18:34 | <aokoth@cumin1002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: vrts2001.codfw.wmnet decommissioned, removing all IPs except the asset tag one - aokoth@cumin1002" | [production] | 
            
  | 18:14 | <brennen@deploy1003> | rebuilt and synchronized wikiversions files: group1 to 1.43.0-wmf.24  refs T373643 | [production] | 
            
  | 18:06 | <ejegg> | fundraising civicrm upgraded from 8a5ab89b to cf27c789 | [production] | 
            
  | 18:05 | <brennen> | 1.43.0-wmf.24 train status: no current blockers, logs clean-ish, rolling to group1 | [production] | 
            
  | 18:04 | <aokoth@cumin1002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: vrts2001.codfw.wmnet decommissioned, removing all IPs except the asset tag one - aokoth@cumin1002" | [production] | 
            
  | 18:02 | <moritzm> | installing gtk+3.0 security updates | [production] | 
            
  | 18:01 | <aokoth@cumin1002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 17:56 | <aokoth@cumin1002> | START - Cookbook sre.hosts.decommission for hosts vrts2001.codfw.wmnet | [production] | 
            
  | 17:55 | <aokoth@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on vrts2001.codfw.wmnet with reason: Decom | [production] | 
            
  | 17:55 | <aokoth@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on vrts2001.codfw.wmnet with reason: Decom | [production] | 
            
  | 17:35 | <dcaro@cumin1002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - dcaro@cumin1002" | [production] | 
            
  | 17:17 | <dcaro@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1039.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 17:13 | <dcaro@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1039.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 16:55 | <dcaro@cumin1002> | START - Cookbook sre.hosts.reimage for host cloudcephosd1039.eqiad.wmnet with OS bullseye | [production] | 
            
  | 16:47 | <ayounsi@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on cr3-ulsfo with reason: router broken ☠️ | [production] |