| 2025-01-06
      
      § | 
    
  | 06:08 | <marostegui@cumin1002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: es2021.codfw.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1002" | [production] | 
            
  | 06:05 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db1193 (re)pooling @ 25%: Repooling after schema change', diff saved to https://phabricator.wikimedia.org/P71787 and previous config saved to /var/cache/conftool/dbconfig/20250106-060534-root.json | [production] | 
            
  | 06:04 | <marostegui@cumin1002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 06:00 | <marostegui@cumin1002> | START - Cookbook sre.hosts.decommission for hosts es2021.codfw.wmnet | [production] | 
            
  | 05:57 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Remove es2021 from dbctl T382944', diff saved to https://phabricator.wikimedia.org/P71786 and previous config saved to /var/cache/conftool/dbconfig/20250106-055726-marostegui.json | [production] | 
            
  | 05:50 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db1193 (re)pooling @ 10%: Repooling after schema change', diff saved to https://phabricator.wikimedia.org/P71785 and previous config saved to /var/cache/conftool/dbconfig/20250106-055029-root.json | [production] | 
            
  | 03:39 | <tstarling@deploy2002> | Finished scap sync-world: Backport for [[gerrit:1101239|Enable canShellboxGetTempUrl everywhere (T292322)]] (duration: 21m 53s) | [production] | 
            
  | 03:31 | <tstarling@deploy2002> | tstarling: Continuing with sync | [production] | 
            
  | 03:31 | <tstarling@deploy2002> | tstarling: Backport for [[gerrit:1101239|Enable canShellboxGetTempUrl everywhere (T292322)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 03:17 | <tstarling@deploy2002> | Started scap sync-world: Backport for [[gerrit:1101239|Enable canShellboxGetTempUrl everywhere (T292322)]] | [production] | 
            
  
    | 2025-01-04
      
      § | 
    
  | 21:34 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud | [traffic] | 
            
  | 21:31 | <andrew@cloudcumin1001> | START - Cookbook wmcs.vps.refresh_puppet_certs on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud | [traffic] | 
            
  | 20:54 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud | [traffic] | 
            
  | 20:50 | <andrew@cloudcumin1001> | START - Cookbook wmcs.vps.refresh_puppet_certs on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud | [traffic] | 
            
  | 20:02 | <dzahn@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on doc1003.eqiad.wmnet with reason: dz | [production] | 
            
  | 20:01 | <dzahn@cumin2002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on doc1003.eqiad.wmnet with reason: dz | [production] | 
            
  | 19:33 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud | [traffic] | 
            
  | 19:29 | <andrew@cloudcumin1001> | START - Cookbook wmcs.vps.refresh_puppet_certs on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud | [traffic] | 
            
  | 18:30 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud | [traffic] | 
            
  | 18:27 | <andrew@cloudcumin1001> | START - Cookbook wmcs.vps.refresh_puppet_certs on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud | [traffic] | 
            
  | 18:22 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud | [traffic] | 
            
  | 18:18 | <andrew@cloudcumin1001> | START - Cookbook wmcs.vps.refresh_puppet_certs on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud | [traffic] | 
            
  | 18:02 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud | [traffic] | 
            
  | 17:59 | <andrew@cloudcumin1001> | START - Cookbook wmcs.vps.refresh_puppet_certs on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud | [traffic] | 
            
  | 15:45 | <wmbot~sakretsu@tools-bastion-13> | implemented health checks for draftbot | [tools.itwiki] | 
            
  
    | 2025-01-03
      
      § | 
    
  | 21:46 | <bd808@cloudcumin1001> | END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for tools-k8s-worker-nfs-69 | [tools] | 
            
  | 21:41 | <bd808@cloudcumin1001> | START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-69 | [tools] | 
            
  | 21:40 | <bd808@cloudcumin1001> | END (FAIL) - Cookbook wmcs.toolforge.k8s.worker.drain (exit_code=99) for node tools-k8s-worker-nfs-69 | [tools] | 
            
  | 21:35 | <bd808@cloudcumin1001> | START - Cookbook wmcs.toolforge.k8s.worker.drain for node tools-k8s-worker-nfs-69 | [tools] | 
            
  | 21:01 | <bd808> | `sudo service maintain-dbusers restart` on cloudcontrol1005. Report of missing replica.my.cnf and journalctl output empty due to log rotation. (T382962) | [admin] | 
            
  | 19:09 | <aokoth@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on doc2002.codfw.wmnet with reason: Disk Change | [production] | 
            
  | 19:09 | <aokoth@cumin1002> | START - Cookbook sre.hosts.downtime for 5 days, 0:00:00 on doc2002.codfw.wmnet with reason: Disk Change | [production] | 
            
  | 18:47 | <aokoth@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on doc2002.codfw.wmnet with reason: Disk Change | [production] | 
            
  | 18:46 | <aokoth@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on doc2002.codfw.wmnet with reason: Disk Change | [production] | 
            
  | 17:15 | <thcipriani@deploy2002> | Finished deploy [gerrit/gerrit@44854d4]: remove developer satisfaction survey banner (gerrit1003.wikimedia.org only) (duration: 00m 10s) | [production] | 
            
  | 17:15 | <thcipriani@deploy2002> | Started deploy [gerrit/gerrit@44854d4]: remove developer satisfaction survey banner (gerrit1003.wikimedia.org only) | [production] | 
            
  | 17:06 | <thcipriani@deploy2002> | Finished deploy [gerrit/gerrit@44854d4]: remove developer satisfaction survey banner (gerrit2002.wikimedia.org only) (duration: 00m 08s) | [production] | 
            
  | 17:06 | <thcipriani@deploy2002> | Started deploy [gerrit/gerrit@44854d4]: remove developer satisfaction survey banner (gerrit2002.wikimedia.org only) | [production] | 
            
  | 17:00 | <isaranto@deploy2002> | helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . | [production] | 
            
  | 16:55 | <thcipriani@deploy2002> | Finished deploy [gerrit/gerrit@44854d4]: remove developer satisfaction survey banner (gerrit2003.wikimedia.org only) (duration: 00m 08s) | [production] | 
            
  | 16:55 | <thcipriani@deploy2002> | Started deploy [gerrit/gerrit@44854d4]: remove developer satisfaction survey banner (gerrit2003.wikimedia.org only) | [production] | 
            
  | 12:10 | <moritzm> | renewed internal Ganeti certs in eqsin (would have expired in two days) T382873 | [production] | 
            
  | 12:01 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depool es2022 T381848', diff saved to https://phabricator.wikimedia.org/P71781 and previous config saved to /var/cache/conftool/dbconfig/20250103-120132-marostegui.json | [production] | 
            
  | 11:12 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depool es2020 T382945', diff saved to https://phabricator.wikimedia.org/P71780 and previous config saved to /var/cache/conftool/dbconfig/20250103-111255-marostegui.json | [production] | 
            
  | 11:05 | <marostegui> | Switchover es4 codfw master to es2043 dbmaint T381848 | [production] | 
            
  | 11:04 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Switchover es4 codfw master', diff saved to https://phabricator.wikimedia.org/P71779 and previous config saved to /var/cache/conftool/dbconfig/20250103-110440-marostegui.json | [production] | 
            
  | 10:35 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depool es2021 T381848', diff saved to https://phabricator.wikimedia.org/P71778 and previous config saved to /var/cache/conftool/dbconfig/20250103-103513-marostegui.json | [production] | 
            
  | 10:06 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db2236 (re)pooling @ 100%: Repooling after upgrade', diff saved to https://phabricator.wikimedia.org/P71777 and previous config saved to /var/cache/conftool/dbconfig/20250103-100603-root.json | [production] | 
            
  | 09:50 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db2236 (re)pooling @ 75%: Repooling after upgrade', diff saved to https://phabricator.wikimedia.org/P71776 and previous config saved to /var/cache/conftool/dbconfig/20250103-095057-root.json | [production] |