| 
      
        2025-09-12
      
      ยง
     | 
  
    
  | 15:20 | 
  <herron@cumin1003> | 
  START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling restart_daemons on A:kafka-logging-eqiad | 
  [production] | 
            
  | 15:12 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1178', diff saved to https://phabricator.wikimedia.org/P83272 and previous config saved to /var/cache/conftool/dbconfig/20250912-151216-ladsgroup.json | 
  [production] | 
            
  | 15:03 | 
  <fceratto@deploy1003> | 
  helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . | 
  [production] | 
            
  | 14:57 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1178 (T402763)', diff saved to https://phabricator.wikimedia.org/P83271 and previous config saved to /var/cache/conftool/dbconfig/20250912-145709-ladsgroup.json | 
  [production] | 
            
  | 14:49 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Depooling db1178 (T402763)', diff saved to https://phabricator.wikimedia.org/P83270 and previous config saved to /var/cache/conftool/dbconfig/20250912-144934-ladsgroup.json | 
  [production] | 
            
  | 14:49 | 
  <ladsgroup@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1178.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 14:49 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1177 (T402763)', diff saved to https://phabricator.wikimedia.org/P83269 and previous config saved to /var/cache/conftool/dbconfig/20250912-144911-ladsgroup.json | 
  [production] | 
            
  | 14:45 | 
  <btullis@cumin1003> | 
  START - Cookbook sre.hosts.reimage for host an-worker1236.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 14:40 | 
  <btullis@cumin1003> | 
  END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1235.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 14:34 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1177', diff saved to https://phabricator.wikimedia.org/P83268 and previous config saved to /var/cache/conftool/dbconfig/20250912-143404-ladsgroup.json | 
  [production] | 
            
  | 14:18 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1177', diff saved to https://phabricator.wikimedia.org/P83267 and previous config saved to /var/cache/conftool/dbconfig/20250912-141856-ladsgroup.json | 
  [production] | 
            
  | 14:11 | 
  <btullis@cumin1003> | 
  START - Cookbook sre.hosts.reimage for host an-worker1235.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 14:03 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1177 (T402763)', diff saved to https://phabricator.wikimedia.org/P83266 and previous config saved to /var/cache/conftool/dbconfig/20250912-140348-ladsgroup.json | 
  [production] | 
            
  | 13:57 | 
  <jmm@dns1004> | 
  END - running authdns-update | 
  [production] | 
            
  | 13:56 | 
  <jmm@dns1004> | 
  START - running authdns-update | 
  [production] | 
            
  | 13:53 | 
  <jmm@dns1004> | 
  END - running authdns-update | 
  [production] | 
            
  | 13:53 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Depooling db1177 (T402763)', diff saved to https://phabricator.wikimedia.org/P83265 and previous config saved to /var/cache/conftool/dbconfig/20250912-135309-ladsgroup.json | 
  [production] | 
            
  | 13:53 | 
  <ladsgroup@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1177.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 13:52 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1172 (T402763)', diff saved to https://phabricator.wikimedia.org/P83264 and previous config saved to /var/cache/conftool/dbconfig/20250912-135246-ladsgroup.json | 
  [production] | 
            
  | 13:52 | 
  <jmm@dns1004> | 
  START - running authdns-update | 
  [production] | 
            
  | 13:37 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P83262 and previous config saved to /var/cache/conftool/dbconfig/20250912-133739-ladsgroup.json | 
  [production] | 
            
  | 13:34 | 
  <taavi> | 
  an allow-all ipv6 egress rule in bastion default security group | 
  [bastion] | 
            
  | 13:22 | 
  <hashar> | 
  Attempt #4: Started running MediaWiki jobs with non recursive dependencies injected, excluding selenium jobs and using 9 workers. From contint1002 in a screen. Log written to /home/hashar/run.log # T389998 | 
  [releng] | 
            
  | 13:22 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P83261 and previous config saved to /var/cache/conftool/dbconfig/20250912-132231-ladsgroup.json | 
  [production] | 
            
  | 13:21 | 
  <btullis@cumin1003> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1235.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 13:07 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1172 (T402763)', diff saved to https://phabricator.wikimedia.org/P83260 and previous config saved to /var/cache/conftool/dbconfig/20250912-130724-ladsgroup.json | 
  [production] | 
            
  | 12:59 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Depooling db1172 (T402763)', diff saved to https://phabricator.wikimedia.org/P83259 and previous config saved to /var/cache/conftool/dbconfig/20250912-125949-ladsgroup.json | 
  [production] | 
            
  | 12:59 | 
  <ladsgroup@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1172.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 12:53 | 
  <ladsgroup@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1171.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 12:53 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1167 (T402763)', diff saved to https://phabricator.wikimedia.org/P83256 and previous config saved to /var/cache/conftool/dbconfig/20250912-125309-ladsgroup.json | 
  [production] | 
            
  | 12:38 | 
  <logmsgbot> | 
  jforrester Deployed security patch for T404392 | 
  [production] | 
            
  | 12:38 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1167', diff saved to https://phabricator.wikimedia.org/P83254 and previous config saved to /var/cache/conftool/dbconfig/20250912-123801-ladsgroup.json | 
  [production] | 
            
  | 12:29 | 
  <arnaudb@cumin1003> | 
  END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1004.wikimedia.org with reason: Security Update | 
  [production] | 
            
  | 12:22 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1167', diff saved to https://phabricator.wikimedia.org/P83253 and previous config saved to /var/cache/conftool/dbconfig/20250912-122254-ladsgroup.json | 
  [production] | 
            
  | 12:07 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1167 (T402763)', diff saved to https://phabricator.wikimedia.org/P83252 and previous config saved to /var/cache/conftool/dbconfig/20250912-120746-ladsgroup.json | 
  [production] | 
            
  | 12:01 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Depooling db1167 (T402763)', diff saved to https://phabricator.wikimedia.org/P83251 and previous config saved to /var/cache/conftool/dbconfig/20250912-120106-ladsgroup.json | 
  [production] | 
            
  | 12:00 | 
  <ladsgroup@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 12:00 | 
  <ladsgroup@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1167.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 11:59 | 
  <ladsgroup@cumin1003> | 
  START - Cookbook sre.mysql.pool db1160* gradually with 4 steps - Work done | 
  [production] | 
            
  | 11:58 | 
  <brouberol@deploy1003> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply | 
  [production] | 
            
  | 11:58 | 
  <brouberol@deploy1003> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply | 
  [production] | 
            
  | 11:51 | 
  <btullis@cumin1003> | 
  START - Cookbook sre.hosts.reimage for host an-worker1235.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 11:48 | 
  <btullis@cumin1003> | 
  END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1235.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 11:46 | 
  <brouberol@deploy1003> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply | 
  [production] | 
            
  | 11:44 | 
  <brouberol@deploy1003> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply | 
  [production] | 
            
  | 11:35 | 
  <brouberol@deploy1003> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply | 
  [production] | 
            
  | 11:35 | 
  <brouberol@deploy1003> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply | 
  [production] | 
            
  | 11:29 | 
  <jynus@cumin1003> | 
  END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host db2202.codfw.wmnet | 
  [production] | 
            
  | 11:27 | 
  <brouberol@deploy1003> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply | 
  [production] | 
            
  | 11:26 | 
  <brouberol@deploy1003> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply | 
  [production] |