| 2024-09-19 |
  | 09:07 | <aborrero@cloudcumin1001> | START - Cookbook wmcs.openstack.tofu running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/50 | [admin] | 
            
  | 09:07 | <aborrero@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/50 | [admin] | 
            
  | 09:07 | <aborrero@cloudcumin1001> | START - Cookbook wmcs.openstack.tofu running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/50 | [admin] | 
            
  | 09:03 | <hashar> | Cleared deprecated approvals from CI Jenkins # T375160 | [releng] | 
            
  | 09:02 | <hashar> | CI Jenkins: approved 3 scripts we wrote which were pending approval at https://integration.wikimedia.org/ci/manage/scriptApproval/ # T375160 | [releng] | 
            
  | 09:01 | <hashar> | CI Jenkins: approved 3 scripts we wrote which were pending approval at https://integration.wikimedia.org/ci/manage/scriptApproval/ | [releng] | 
            
  | 08:51 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.reimage for host db1246.eqiad.wmnet with OS bookworm | [production] | 
            
  | 08:50 | <arnaudb@cumin1002> | END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host db1246.eqiad.wmnet with OS bookworm | [production] | 
            
  | 08:45 | <hashar> | Restarting CI Jenkins with Java 17 # T359795 | [production] | 
            
  | 08:31 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.reimage for host db1246.eqiad.wmnet with OS bookworm | [production] | 
            
  | 08:31 | <_joe_> | deployed conftool 3.2.4 T375059 T373449 | [production] | 
            
  | 08:30 | <ayounsi@cumin1002> | END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox | [production] | 
            
  | 08:29 | <ayounsi@cumin1002> | START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox | [production] | 
            
  | 08:16 | <jnuche@deploy1003> | rebuilt and synchronized wikiversions files: group2 to 1.43.0-wmf.23  refs T373642 | [production] | 
            
  | 08:04 | <ayounsi@cumin1002> | END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox-canary | [production] | 
            
  | 08:04 | <ayounsi@cumin1002> | START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary | [production] | 
            
  | 07:59 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2018.codfw.wmnet | [production] | 
            
  | 07:48 | <jmm@cumin2002> | START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2018.codfw.wmnet | [production] | 
            
  | 07:43 | <ayounsi@cumin1002> | END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 16509 | [production] | 
            
  | 07:38 | <ayounsi@cumin1002> | START - Cookbook sre.network.peering with action 'configure' for AS: 16509 | [production] | 
            
  | 07:35 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2229 (re)pooling @ 100%: post maintenance', diff saved to https://phabricator.wikimedia.org/P69318 and previous config saved to /var/cache/conftool/dbconfig/20240919-073543-arnaudb.json | [production] | 
            
  | 07:20 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2229 (re)pooling @ 75%: post maintenance', diff saved to https://phabricator.wikimedia.org/P69317 and previous config saved to /var/cache/conftool/dbconfig/20240919-072037-arnaudb.json | [production] | 
            
  | 07:19 | <ayounsi@cumin1002> | START - Cookbook sre.network.peering with action 'configure' for AS: 16509 | [production] | 
            
  | 07:05 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2229 (re)pooling @ 50%: post maintenance', diff saved to https://phabricator.wikimedia.org/P69316 and previous config saved to /var/cache/conftool/dbconfig/20240919-070532-arnaudb.json | [production] | 
            
  | 06:53 | <moritzm> | adding Tiziano to pwstore | [production] | 
            
  | 06:50 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2229 (re)pooling @ 25%: post maintenance', diff saved to https://phabricator.wikimedia.org/P69315 and previous config saved to /var/cache/conftool/dbconfig/20240919-065026-arnaudb.json | [production] | 
            
  | 06:47 | <moritzm> | cleanup some old Bacula restores (4G) on seaborgium | [production] | 
            
  | 06:35 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2229 (re)pooling @ 15%: post maintenance', diff saved to https://phabricator.wikimedia.org/P69314 and previous config saved to /var/cache/conftool/dbconfig/20240919-063521-arnaudb.json | [production] | 
            
  | 06:20 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2229 (re)pooling @ 10%: post maintenance', diff saved to https://phabricator.wikimedia.org/P69313 and previous config saved to /var/cache/conftool/dbconfig/20240919-062016-arnaudb.json | [production] | 
            
  | 06:05 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2229 (re)pooling @ 5%: post maintenance', diff saved to https://phabricator.wikimedia.org/P69312 and previous config saved to /var/cache/conftool/dbconfig/20240919-060510-arnaudb.json | [production] | 
            
  | 05:01 | <eileen> | civicrm upgraded from ac29ff45 to 8af371aa | [production] | 
            
  | 01:25 | <pt1979@cumin2002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 01:25 | <pt1979@cumin2002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update dns name for   frack new switches - pt1979@cumin2002" | [production] | 
            
  | 01:24 | <pt1979@cumin2002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update dns name for   frack new switches - pt1979@cumin2002" | [production] | 
            
  | 01:21 | <pt1979@cumin2002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 00:46 | <sukhe> | sudo cumin 'puppetserver1003* or puppetserver2003*' 'systemctl start sync-puppet-volatile.service' | [production] | 
            
  | 00:45 | <sukhe> | sukhe@puppetserver1002:~$ sudo systemctl start sync-puppet-volatile.service | [production] | 
            
  | 00:41 | <swfrench-wmf> | force-reboot of puppetserver1001 via ipmitool (unresponsive for over 30m) | [production] | 
            
  
  | 2024-09-18 |
  | 22:43 | <swfrench@deploy1003> | helmfile [eqiad] DONE helmfile.d/services/eventstreams: sync | [production] | 
            
  | 22:43 | <swfrench@deploy1003> | helmfile [eqiad] START helmfile.d/services/eventstreams: sync | [production] | 
            
  | 22:19 | <jynus> | inserting without binlog missing heartbeat record on x1 codfw hosts | [production] | 
            
  | 22:11 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.switchdc.databases.prepare (exit_code=0) for the switch from eqiad to codfw | [production] | 
            
  | 21:55 | <mutante> | seaborgium - apt-get clean (disk space before: 98% used, now: 76% used, was alerting) | [production] | 
            
  | 20:59 | <ladsgroup@cumin1002> | START - Cookbook sre.switchdc.databases.prepare for the switch from eqiad to codfw | [production] | 
            
  | 20:45 | <toyofuku@deploy1003> | Finished scap sync-world: Backport for [[gerrit:1073836|Enable dark mode for all logged in users on all projects (T370099)]], [[gerrit:1073835|Deploy Vector 2022 on several Wikimedia wikis (T374255)]], [[gerrit:1073839|Limit quick surveys to wikis with messages defined (T374654)]] (duration: 12m 52s) | [production] | 
            
  | 20:40 | <toyofuku@deploy1003> | toyofuku, jdlrobson: Continuing with sync | [production] | 
            
  | 20:35 | <toyofuku@deploy1003> | toyofuku, jdlrobson: Backport for [[gerrit:1073836|Enable dark mode for all logged in users on all projects (T370099)]], [[gerrit:1073835|Deploy Vector 2022 on several Wikimedia wikis (T374255)]], [[gerrit:1073839|Limit quick surveys to wikis with messages defined (T374654)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 20:32 | <toyofuku@deploy1003> | Started scap sync-world: Backport for [[gerrit:1073836|Enable dark mode for all logged in users on all projects (T370099)]], [[gerrit:1073835|Deploy Vector 2022 on several Wikimedia wikis (T374255)]], [[gerrit:1073839|Limit quick surveys to wikis with messages defined (T374654)]] | [production] | 
            
  | 20:02 | <wmbot~lucaswerkmeister@tools-bastion-13> | deployed b9a658f45e (health check for background runner, T374152) | [tools.quickcategories] | 
            
  | 19:47 | <wmbot~lucaswerkmeister@tools-bastion-13> | mv www/ www-unused-since-2024-09-18/ | [tools.quickcategories] |