| 
      
        2025-10-22
      
      ยง
     | 
  
    
  | 11:50 | 
  <marostegui@cumin1003> | 
  dbctl commit (dc=all): 'db1196 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P84249 and previous config saved to /var/cache/conftool/dbconfig/20251022-115027-root.json | 
  [production] | 
            
  | 11:48 | 
  <cmooney@cumin1003> | 
  END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | 
  [production] | 
            
  | 11:47 | 
  <marostegui@cumin1003> | 
  dbctl commit (dc=all): 'db1263 (re)pooling @ 20%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84248 and previous config saved to /var/cache/conftool/dbconfig/20251022-114749-root.json | 
  [production] | 
            
  | 11:46 | 
  <cmooney@cumin1003> | 
  START - Cookbook sre.hosts.provision for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | 
  [production] | 
            
  | 11:46 | 
  <marostegui@cumin1003> | 
  dbctl commit (dc=all): 'db2146 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P84247 and previous config saved to /var/cache/conftool/dbconfig/20251022-114629-root.json | 
  [production] | 
            
  | 11:40 | 
  <mvernon@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on ms-be[1089-1090].eqiad.wmnet with reason: awaiting controller swap | 
  [production] | 
            
  | 11:35 | 
  <marostegui@cumin1003> | 
  dbctl commit (dc=all): 'db1196 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P84246 and previous config saved to /var/cache/conftool/dbconfig/20251022-113521-root.json | 
  [production] | 
            
  | 11:32 | 
  <marostegui@cumin1003> | 
  dbctl commit (dc=all): 'db1263 (re)pooling @ 10%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84245 and previous config saved to /var/cache/conftool/dbconfig/20251022-113243-root.json | 
  [production] | 
            
  | 11:31 | 
  <marostegui@cumin1003> | 
  dbctl commit (dc=all): 'db2146 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P84244 and previous config saved to /var/cache/conftool/dbconfig/20251022-113123-root.json | 
  [production] | 
            
  | 11:30 | 
  <mvolz@deploy1003> | 
  helmfile [eqiad] DONE helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:30 | 
  <mvolz@deploy1003> | 
  helmfile [eqiad] START helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:30 | 
  <mvolz@deploy1003> | 
  helmfile [codfw] DONE helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:29 | 
  <mvolz@deploy1003> | 
  helmfile [codfw] START helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:28 | 
  <mvolz@deploy1003> | 
  helmfile [staging] DONE helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:27 | 
  <mvolz@deploy1003> | 
  helmfile [staging] START helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:27 | 
  <marostegui@cumin1003> | 
  dbctl commit (dc=all): 'Depool db1196 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P84243 and previous config saved to /var/cache/conftool/dbconfig/20251022-112732-marostegui.json | 
  [production] | 
            
  | 11:27 | 
  <marostegui@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1196.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 11:26 | 
  <cgoubert@deploy2002> | 
  helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply | 
  [production] | 
            
  | 11:26 | 
  <cgoubert@deploy2002> | 
  helmfile [codfw] START helmfile.d/services/rest-gateway: apply | 
  [production] | 
            
  | 11:26 | 
  <marostegui@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 14 hosts with reason: Upgrading | 
  [production] | 
            
  | 11:25 | 
  <cmooney@cumin1003> | 
  END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host sretest1006 | 
  [production] | 
            
  | 11:25 | 
  <cmooney@cumin1003> | 
  START - Cookbook sre.network.configure-switch-interfaces for host sretest1006 | 
  [production] | 
            
  | 11:25 | 
  <dreamyjazz@deploy2002> | 
  Finished scap sync-world: Backport for [[gerrit:1198021|EventStreamConfig: Don't collect user-agent for suggested_investigations_interaction (T404177)]] (duration: 08m 48s) | 
  [production] | 
            
  | 11:24 | 
  <mvolz@deploy1003> | 
  helmfile [staging] DONE helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:24 | 
  <mvolz@deploy1003> | 
  helmfile [staging] START helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:20 | 
  <dreamyjazz@deploy2002> | 
  kharlan, dreamyjazz: Continuing with sync | 
  [production] | 
            
  | 11:20 | 
  <dreamyjazz@deploy2002> | 
  kharlan, dreamyjazz: Backport for [[gerrit:1198021|EventStreamConfig: Don't collect user-agent for suggested_investigations_interaction (T404177)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. | 
  [production] | 
            
  | 11:20 | 
  <cgoubert@deploy2002> | 
  helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply | 
  [production] | 
            
  | 11:19 | 
  <cgoubert@deploy2002> | 
  helmfile [eqiad] START helmfile.d/services/rest-gateway: apply | 
  [production] | 
            
  | 11:18 | 
  <cgoubert@deploy2002> | 
  helmfile [staging] DONE helmfile.d/services/rest-gateway: apply | 
  [production] | 
            
  | 11:17 | 
  <marostegui@cumin1003> | 
  dbctl commit (dc=all): 'db1263 (re)pooling @ 7%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84242 and previous config saved to /var/cache/conftool/dbconfig/20251022-111736-root.json | 
  [production] | 
            
  | 11:16 | 
  <cmooney@cumin1003> | 
  END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host sretest1006 | 
  [production] | 
            
  | 11:16 | 
  <cmooney@cumin1003> | 
  START - Cookbook sre.network.configure-switch-interfaces for host sretest1006 | 
  [production] | 
            
  | 11:16 | 
  <marostegui@cumin1003> | 
  dbctl commit (dc=all): 'db2146 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P84241 and previous config saved to /var/cache/conftool/dbconfig/20251022-111617-root.json | 
  [production] | 
            
  | 11:16 | 
  <dreamyjazz@deploy2002> | 
  Started scap sync-world: Backport for [[gerrit:1198021|EventStreamConfig: Don't collect user-agent for suggested_investigations_interaction (T404177)]] | 
  [production] | 
            
  | 11:16 | 
  <cgoubert@deploy2002> | 
  helmfile [staging] START helmfile.d/services/rest-gateway: apply | 
  [production] | 
            
  | 11:15 | 
  <cgoubert@deploy2002> | 
  helmfile [codfw] DONE helmfile.d/services/api-gateway: apply | 
  [production] | 
            
  | 11:15 | 
  <cmooney@cumin1003> | 
  END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | 
  [production] | 
            
  | 11:15 | 
  <cmooney@cumin1003> | 
  START - Cookbook sre.hosts.provision for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | 
  [production] | 
            
  | 11:15 | 
  <cgoubert@deploy2002> | 
  helmfile [codfw] START helmfile.d/services/api-gateway: apply | 
  [production] | 
            
  | 11:14 | 
  <cgoubert@deploy2002> | 
  helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply | 
  [production] | 
            
  | 11:14 | 
  <cgoubert@deploy2002> | 
  helmfile [eqiad] START helmfile.d/services/api-gateway: apply | 
  [production] | 
            
  | 11:09 | 
  <kamila@cumin1003> | 
  END (FAIL) - Cookbook sre.k8s.pool-depool-node (exit_code=99) depool for host wikikube-worker2203.codfw.wmnet | 
  [production] | 
            
  | 11:08 | 
  <dreamyjazz@deploy2002> | 
  Finished scap sync-world: Backport for [[gerrit:1198017|Fix abuse_filter_log index in TempUserIPLookup (T400280)]], [[gerrit:1198018|Fix abuse_filter_log index in TempUserIPLookup (T400280)]] (duration: 10m 01s) | 
  [production] | 
            
  | 11:08 | 
  <cgoubert@deploy2002> | 
  helmfile [staging] DONE helmfile.d/services/api-gateway: apply | 
  [production] | 
            
  | 11:07 | 
  <cgoubert@deploy2002> | 
  helmfile [staging] START helmfile.d/services/api-gateway: apply | 
  [production] | 
            
  | 11:06 | 
  <kamila@cumin1003> | 
  START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker2203.codfw.wmnet | 
  [production] | 
            
  | 11:05 | 
  <cmooney@cumin1003> | 
  END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | 
  [production] | 
            
  | 11:05 | 
  <cmooney@cumin1003> | 
  START - Cookbook sre.hosts.provision for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | 
  [production] | 
            
  | 11:04 | 
  <jelto@cumin1003> | 
  START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: Upgrade GitLab | 
  [production] |