| 
      
        2024-11-27
      
     | 
  
    
  | 13:20 | 
  <jclark@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host wdqs1026.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 13:20 | 
  <jclark@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host wdqs1025.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 13:20 | 
  <jclark@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host wdqs1027.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 13:16 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.ganeti.changedisk for changing disk type of durum7002.magru.wmnet to drbd | 
  [production] | 
            
  | 13:15 | 
  <kartik@deploy2002> | 
  helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . | 
  [production] | 
            
  | 13:15 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of doh7002.wikimedia.org to drbd | 
  [production] | 
            
  | 13:05 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.ganeti.changedisk for changing disk type of doh7002.wikimedia.org to drbd | 
  [production] | 
            
  | 13:05 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of prometheus7001.magru.wmnet to drbd | 
  [production] | 
            
  | 12:56 | 
  <jiji@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on kafka-main[1002,1007].eqiad.wmnet with reason: Hardware refresh | 
  [production] | 
            
  | 12:56 | 
  <jiji@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on kafka-main[1002,1007].eqiad.wmnet with reason: Hardware refresh | 
  [production] | 
            
  | 12:50 | 
  <moritzm> | 
  installing ghostscript security updates | 
  [production] | 
            
  | 12:39 | 
  <kartik@deploy2002> | 
  helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . | 
  [production] | 
            
  | 12:38 | 
  <effie> | 
  start replacing kafka-main1002 with kafka-main1007 - T363214 | 
  [production] | 
            
  | 12:24 | 
  <mvolz@deploy2002> | 
  helmfile [staging] DONE helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 12:24 | 
  <mvolz@deploy2002> | 
  helmfile [staging] START helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 12:24 | 
  <kart_> | 
  Updated cxserver to 2024-11-20-121713-production (T377966, T357950) | 
  [production] | 
            
  | 12:22 | 
  <kartik@deploy2002> | 
  helmfile [eqiad] DONE helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 12:22 | 
  <kartik@deploy2002> | 
  helmfile [eqiad] START helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 12:20 | 
  <kartik@deploy2002> | 
  helmfile [codfw] DONE helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 12:20 | 
  <kartik@deploy2002> | 
  helmfile [codfw] START helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 12:18 | 
  <moritzm> | 
  installing python-cryptography security updates | 
  [production] | 
            
  | 12:14 | 
  <kartik@deploy2002> | 
  helmfile [staging] DONE helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 12:13 | 
  <kartik@deploy2002> | 
  helmfile [staging] START helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 12:12 | 
  <moritzm> | 
  installing openssl security updates | 
  [production] | 
            
  | 12:08 | 
  <mvolz@deploy2002> | 
  helmfile [eqiad] DONE helmfile.d/services/zotero: apply | 
  [production] | 
            
  | 12:07 | 
  <mvolz@deploy2002> | 
  helmfile [eqiad] START helmfile.d/services/zotero: apply | 
  [production] | 
            
  | 12:06 | 
  <mvolz@deploy2002> | 
  helmfile [codfw] DONE helmfile.d/services/zotero: apply | 
  [production] | 
            
  | 12:06 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.ganeti.changedisk for changing disk type of prometheus7001.magru.wmnet to drbd | 
  [production] | 
            
  | 12:06 | 
  <mvolz@deploy2002> | 
  helmfile [codfw] START helmfile.d/services/zotero: apply | 
  [production] | 
            
  | 12:05 | 
  <mvolz@deploy2002> | 
  helmfile [staging] DONE helmfile.d/services/zotero: apply | 
  [production] | 
            
  | 12:05 | 
  <mvolz@deploy2002> | 
  helmfile [staging] START helmfile.d/services/zotero: apply | 
  [production] | 
            
  | 12:05 | 
  <kartik@deploy2002> | 
  helmfile [staging] DONE helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 12:05 | 
  <kartik@deploy2002> | 
  helmfile [staging] START helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 12:03 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on ganeti2042.codfw.wmnet with reason: broken CPU | 
  [production] | 
            
  | 12:03 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on ganeti2042.codfw.wmnet with reason: broken CPU | 
  [production] | 
            
  | 11:53 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of bast7001.wikimedia.org to drbd | 
  [production] | 
            
  | 11:45 | 
  <ladsgroup@deploy2002> | 
  Finished scap sync-world: Backport for [[gerrit:1098484|Bump ratio of new parsercache key spec to 3 (T373037)]] (duration: 12m 51s) | 
  [production] | 
            
  | 11:38 | 
  <ladsgroup@deploy2002> | 
  ladsgroup: Continuing with sync | 
  [production] | 
            
  | 11:38 | 
  <ladsgroup@deploy2002> | 
  ladsgroup: Backport for [[gerrit:1098484|Bump ratio of new parsercache key spec to 3 (T373037)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 11:34 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.ganeti.changedisk for changing disk type of bast7001.wikimedia.org to drbd | 
  [production] | 
            
  | 11:32 | 
  <ladsgroup@deploy2002> | 
  Started scap sync-world: Backport for [[gerrit:1098484|Bump ratio of new parsercache key spec to 3 (T373037)]] | 
  [production] | 
            
  | 11:29 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of ncredir7002.magru.wmnet to drbd | 
  [production] | 
            
  | 11:21 | 
  <fabfur@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dns7002.wikimedia.org with reason: T376737 | 
  [production] | 
            
  | 11:21 | 
  <fabfur@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 4:00:00 on dns7002.wikimedia.org with reason: T376737 | 
  [production] | 
            
  | 11:21 | 
  <fabfur@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dns7001.wikimedia.org with reason: T376737 | 
  [production] | 
            
  | 11:20 | 
  <fabfur@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 4:00:00 on dns7001.wikimedia.org with reason: T376737 | 
  [production] | 
            
  | 11:19 | 
  <fabfur@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on lvs[7001-7003].magru.wmnet with reason: T376737 | 
  [production] | 
            
  | 11:19 | 
  <fabfur@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 4:00:00 on lvs[7001-7003].magru.wmnet with reason: T376737 | 
  [production] | 
            
  | 11:19 | 
  <fabfur@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 16 hosts with reason: T376737 | 
  [production] | 
            
  | 11:19 | 
  <fabfur@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 4:00:00 on 16 hosts with reason: T376737 | 
  [production] |