| 
      
        2025-09-10
      
      ยง
     | 
  
    
  | 15:26 | 
  <andrew@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephmon2004-dev.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 15:26 | 
  <fceratto@deploy1003> | 
  helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . | 
  [production] | 
            
  | 15:22 | 
  <pt1979@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mr1-eqsin with reason: router upgrade | 
  [production] | 
            
  | 15:22 | 
  <papaul> | 
  disable OSPF on mr1-eqsin to test BGP | 
  [production] | 
            
  | 15:21 | 
  <ejegg> | 
  standalone SmashPig upgraded from 0fccf147 to d73c3d9a | 
  [production] | 
            
  | 15:19 | 
  <swfrench@deploy1003> | 
  helmfile [eqiad] DONE helmfile.d/services/proton: apply | 
  [production] | 
            
  | 15:18 | 
  <btullis@cumin1003> | 
  START - Cookbook sre.hadoop.roll-restart-masters restart masters for Hadoop analytics cluster: Restart of jvm daemons. | 
  [production] | 
            
  | 15:16 | 
  <swfrench@cumin2002> | 
  conftool action : set/pooled=true; selector: dnsdisc=rest-gateway,name=codfw [reason: Repooling codfw while investigating provisioning of proton service - T400131] | 
  [production] | 
            
  | 15:12 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2155 (T402763)', diff saved to https://phabricator.wikimedia.org/P83164 and previous config saved to /var/cache/conftool/dbconfig/20250910-151216-ladsgroup.json | 
  [production] | 
            
  | 15:10 | 
  <andrew@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host cloudcephmon2004-dev.codfw.wmnet with OS trixie | 
  [production] | 
            
  | 15:09 | 
  <swfrench@deploy1003> | 
  helmfile [eqiad] START helmfile.d/services/proton: apply | 
  [production] | 
            
  | 15:08 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Set db2204 and db2207 weights after flip', diff saved to https://phabricator.wikimedia.org/P83163 and previous config saved to /var/cache/conftool/dbconfig/20250910-150800-fceratto.json | 
  [production] | 
            
  | 14:57 | 
  <fceratto@deploy1003> | 
  helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . | 
  [production] | 
            
  | 14:57 | 
  <fceratto@cumin1002> | 
  END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db2212 gradually with 4 steps - pooling in | 
  [production] | 
            
  | 14:56 | 
  <fceratto@cumin1002> | 
  START - Cookbook sre.mysql.pool db2212 gradually with 4 steps - pooling in | 
  [production] | 
            
  | 14:55 | 
  <swfrench@cumin2002> | 
  conftool action : set/pooled=false; selector: dnsdisc=rest-gateway,name=codfw [reason: Depooling codfw ahead of switch to active-passive - T400131] | 
  [production] | 
            
  | 14:55 | 
  <jforrester@deploy1003> | 
  Finished scap sync-world: Backport for [[gerrit:1186517|Improve performance of preferred labels subquery]] (duration: 13m 44s) | 
  [production] | 
            
  | 14:50 | 
  <jforrester@deploy1003> | 
  jforrester: Continuing with sync | 
  [production] | 
            
  | 14:49 | 
  <jhathaway@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART | 
  [production] | 
            
  | 14:49 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2161 (T401906)', diff saved to https://phabricator.wikimedia.org/P83162 and previous config saved to /var/cache/conftool/dbconfig/20250910-144932-fceratto.json | 
  [production] | 
            
  | 14:47 | 
  <jforrester@deploy1003> | 
  jforrester: Backport for [[gerrit:1186517|Improve performance of preferred labels subquery]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. | 
  [production] | 
            
  | 14:47 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Depooling db2155 (T402763)', diff saved to https://phabricator.wikimedia.org/P83161 and previous config saved to /var/cache/conftool/dbconfig/20250910-144732-ladsgroup.json | 
  [production] | 
            
  | 14:47 | 
  <ladsgroup@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2155.codfw.wmnet with reason: Maintenance | 
  [production] | 
            
  | 14:47 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2147 (T402763)', diff saved to https://phabricator.wikimedia.org/P83160 and previous config saved to /var/cache/conftool/dbconfig/20250910-144710-ladsgroup.json | 
  [production] | 
            
  | 14:46 | 
  <jhathaway@cumin1002> | 
  START - Cookbook sre.hosts.provision for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART | 
  [production] | 
            
  | 14:45 | 
  <fceratto@deploy1003> | 
  helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . | 
  [production] | 
            
  | 14:41 | 
  <jforrester@deploy1003> | 
  Started scap sync-world: Backport for [[gerrit:1186517|Improve performance of preferred labels subquery]] | 
  [production] | 
            
  | 14:36 | 
  <jforrester@deploy1003> | 
  helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:36 | 
  <jforrester@deploy1003> | 
  helmfile [eqiad] START helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:35 | 
  <jforrester@deploy1003> | 
  helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:35 | 
  <jforrester@deploy1003> | 
  helmfile [codfw] START helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:34 | 
  <jforrester@deploy1003> | 
  helmfile [staging] DONE helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:34 | 
  <jgiannelos@deploy1003> | 
  helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply | 
  [production] | 
            
  | 14:34 | 
  <jforrester@deploy1003> | 
  helmfile [staging] START helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:34 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2161', diff saved to https://phabricator.wikimedia.org/P83159 and previous config saved to /var/cache/conftool/dbconfig/20250910-143424-fceratto.json | 
  [production] | 
            
  | 14:34 | 
  <jgiannelos@deploy1003> | 
  helmfile [eqiad] START helmfile.d/services/mobileapps: apply | 
  [production] | 
            
  | 14:33 | 
  <jgiannelos@deploy1003> | 
  helmfile [codfw] DONE helmfile.d/services/mobileapps: apply | 
  [production] | 
            
  | 14:32 | 
  <jgiannelos@deploy1003> | 
  helmfile [codfw] START helmfile.d/services/mobileapps: apply | 
  [production] | 
            
  | 14:32 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2147', diff saved to https://phabricator.wikimedia.org/P83158 and previous config saved to /var/cache/conftool/dbconfig/20250910-143202-ladsgroup.json | 
  [production] | 
            
  | 14:30 | 
  <jgiannelos@deploy1003> | 
  helmfile [staging] DONE helmfile.d/services/mobileapps: apply | 
  [production] | 
            
  | 14:30 | 
  <jgiannelos@deploy1003> | 
  helmfile [staging] START helmfile.d/services/mobileapps: apply | 
  [production] | 
            
  | 14:26 | 
  <jforrester@deploy1003> | 
  helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:25 | 
  <jforrester@deploy1003> | 
  helmfile [eqiad] START helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:25 | 
  <jforrester@deploy1003> | 
  helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:25 | 
  <jforrester@deploy1003> | 
  helmfile [codfw] START helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:24 | 
  <jforrester@deploy1003> | 
  helmfile [staging] DONE helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:24 | 
  <jforrester@deploy1003> | 
  helmfile [staging] START helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:20 | 
  <jforrester@deploy1003> | 
  helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:19 | 
  <jforrester@deploy1003> | 
  helmfile [eqiad] START helmfile.d/services/wikifunctions: apply | 
  [production] | 
            
  | 14:19 | 
  <jforrester@deploy1003> | 
  helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply | 
  [production] |