| 2025-10-09
      
      ยง | 
    
  | 10:09 | <cgoubert@deploy2002> | helmfile [staging] START helmfile.d/services/rest-gateway: apply | [production] | 
            
  | 10:09 | <cgoubert@deploy2002> | helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply | [production] | 
            
  | 10:08 | <cgoubert@deploy2002> | helmfile [eqiad] START helmfile.d/services/api-gateway: apply | [production] | 
            
  | 10:08 | <cgoubert@deploy2002> | helmfile [staging] DONE helmfile.d/services/api-gateway: apply | [production] | 
            
  | 10:08 | <cgoubert@deploy2002> | helmfile [staging] START helmfile.d/services/api-gateway: apply | [production] | 
            
  | 10:02 | <elukey@cumin1003> | START - Cookbook sre.hosts.provision for host ms-be2078.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL | [production] | 
            
  | 10:01 | <elukey@cumin1003> | END (ERROR) - Cookbook sre.hosts.provision (exit_code=97) for host ms-be2078.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL | [production] | 
            
  | 10:01 | <elukey@cumin1003> | START - Cookbook sre.hosts.provision for host ms-be2078.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL | [production] | 
            
  | 09:58 | <marostegui@cumin1003> | dbctl commit (dc=all): 'db1252 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P83715 and previous config saved to /var/cache/conftool/dbconfig/20251009-095839-root.json | [production] | 
            
  | 09:44 | <kharlan@deploy2002> | Finished scap sync-world: Backport for [[gerrit:1194890|Check against correct key in sortEntitiesByTimestamp (T406707)]] (duration: 11m 18s) | [production] | 
            
  | 09:43 | <marostegui@cumin1003> | dbctl commit (dc=all): 'db1252 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P83713 and previous config saved to /var/cache/conftool/dbconfig/20251009-094333-root.json | [production] | 
            
  | 09:40 | <klausman@deploy2002> | helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 09:39 | <kharlan@deploy2002> | kharlan: Continuing with sync | [production] | 
            
  | 09:39 | <klausman@deploy2002> | helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. | [production] | 
            
  | 09:38 | <klausman@deploy2002> | helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 09:37 | <klausman@deploy2002> | helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 09:37 | <klausman@deploy2002> | helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 09:36 | <kharlan@deploy2002> | kharlan: Backport for [[gerrit:1194890|Check against correct key in sortEntitiesByTimestamp (T406707)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. | [production] | 
            
  | 09:36 | <klausman@deploy2002> | helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. | [production] | 
            
  | 09:32 | <kharlan@deploy2002> | Started scap sync-world: Backport for [[gerrit:1194890|Check against correct key in sortEntitiesByTimestamp (T406707)]] | [production] | 
            
  | 09:31 | <marostegui@cumin1003> | dbctl commit (dc=all): 'db2179 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P83711 and previous config saved to /var/cache/conftool/dbconfig/20251009-093131-root.json | [production] | 
            
  | 09:30 | <elukey@cumin2002> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['cp2046.codfw.wmnet'] | [production] | 
            
  | 09:28 | <marostegui@cumin1003> | dbctl commit (dc=all): 'db1252 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P83709 and previous config saved to /var/cache/conftool/dbconfig/20251009-092827-root.json | [production] | 
            
  | 09:24 | <elukey@cumin2002> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cp2046.codfw.wmnet'] | [production] | 
            
  | 09:23 | <elukey@cumin2002> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['cp2045.codfw.wmnet'] | [production] | 
            
  | 09:23 | <elukey@cumin2002> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cp2045.codfw.wmnet'] | [production] | 
            
  | 09:21 | <elukey@cumin2002> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['cp2044.codfw.wmnet'] | [production] | 
            
  | 09:21 | <elukey@cumin2002> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cp2044.codfw.wmnet'] | [production] | 
            
  | 09:16 | <marostegui@cumin1003> | dbctl commit (dc=all): 'db2179 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P83708 and previous config saved to /var/cache/conftool/dbconfig/20251009-091626-root.json | [production] | 
            
  | 09:13 | <marostegui@cumin1003> | dbctl commit (dc=all): 'db1252 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P83707 and previous config saved to /var/cache/conftool/dbconfig/20251009-091322-root.json | [production] | 
            
  | 09:05 | <marostegui@cumin1003> | dbctl commit (dc=all): 'Depool db1252 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P83706 and previous config saved to /var/cache/conftool/dbconfig/20251009-090516-marostegui.json | [production] | 
            
  | 09:05 | <marostegui@cumin1003> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1252.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 09:01 | <marostegui@cumin1003> | dbctl commit (dc=all): 'db2179 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P83705 and previous config saved to /var/cache/conftool/dbconfig/20251009-090120-root.json | [production] | 
            
  | 08:53 | <elukey@cumin2002> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['cp2044.codfw.wmnet'] | [production] | 
            
  | 08:53 | <elukey@cumin2002> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cp2044.codfw.wmnet'] | [production] | 
            
  | 08:52 | <elukey@cumin2002> | END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cp2050.codfw.wmnet'] | [production] | 
            
  | 08:52 | <elukey@cumin2002> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cp2050.codfw.wmnet'] | [production] | 
            
  | 08:52 | <elukey@cumin2002> | END (ERROR) - Cookbook sre.hardware.upgrade-firmware (exit_code=97) upgrade firmware for hosts ['cp2050.codfw.wmnet'] | [production] | 
            
  | 08:48 | <elukey@cumin1003> | END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ms-be2078.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL | [production] | 
            
  | 08:46 | <marostegui@cumin1003> | dbctl commit (dc=all): 'db2179 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P83704 and previous config saved to /var/cache/conftool/dbconfig/20251009-084614-root.json | [production] | 
            
  | 08:44 | <elukey@cumin1003> | START - Cookbook sre.hosts.provision for host ms-be2078.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL | [production] | 
            
  | 08:38 | <marostegui@cumin1003> | dbctl commit (dc=all): 'Depool db2179 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P83703 and previous config saved to /var/cache/conftool/dbconfig/20251009-083801-marostegui.json | [production] | 
            
  | 08:37 | <marostegui@cumin1003> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2179.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 08:34 | <marostegui@cumin1003> | dbctl commit (dc=all): 'db2147 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P83702 and previous config saved to /var/cache/conftool/dbconfig/20251009-083432-root.json | [production] | 
            
  | 08:26 | <btullis@deploy2002> | helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 08:26 | <btullis@deploy2002> | helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. | [production] | 
            
  | 08:22 | <jnuche@deploy2002> | rebuilt and synchronized wikiversions files: group2 to 1.45.0-wmf.22  refs T405678 | [production] | 
            
  | 08:19 | <btullis@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 08:19 | <elukey@cumin2002> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['cp2044.codfw.wmnet'] | [production] | 
            
  | 08:19 | <marostegui@cumin1003> | dbctl commit (dc=all): 'db2147 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P83701 and previous config saved to /var/cache/conftool/dbconfig/20251009-081926-root.json | [production] |