| 2024-11-05 |
  | 15:53 | <jmm@cumin2002> | END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1042.eqiad.wmnet to cluster eqiad and group B | [production] | 
            
  | 15:51 | <jmm@cumin2002> | START - Cookbook sre.ganeti.addnode for new host ganeti1042.eqiad.wmnet to cluster eqiad and group B | [production] | 
            
  | 15:51 | <jmm@cumin2002> | END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1041.eqiad.wmnet to cluster eqiad and group B | [production] | 
            
  | 15:50 | <jmm@cumin2002> | START - Cookbook sre.ganeti.addnode for new host ganeti1041.eqiad.wmnet to cluster eqiad and group B | [production] | 
            
  | 15:48 | <moritzm> | remove ganeti1013 from active ganeti nodes T378921 | [production] | 
            
  | 15:47 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1013.eqiad.wmnet | [production] | 
            
  | 15:43 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance es1033', diff saved to https://phabricator.wikimedia.org/P70934 and previous config saved to /var/cache/conftool/dbconfig/20241105-154326-ladsgroup.json | [production] | 
            
  | 15:40 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc1017.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 15:37 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on pc1017.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 15:32 | <hashar> | Switched PCC workers to Java 17 via https://horizon.wikimedia.org/project/prefixpuppet/?tab=prefix_puppet__puppet-pcc-worker  # T359795 | [production] | 
            
  | 15:28 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance es1033 (T376905)', diff saved to https://phabricator.wikimedia.org/P70933 and previous config saved to /var/cache/conftool/dbconfig/20241105-152819-ladsgroup.json | [production] | 
            
  | 15:27 | <hashar> | Switched deployment-deploy04.deployment-prep.eqiad1.wikimedia.cloud to Java 17 # T359795 | [production] | 
            
  | 15:21 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Depooling es1033 (T376905)', diff saved to https://phabricator.wikimedia.org/P70932 and previous config saved to /var/cache/conftool/dbconfig/20241105-152139-ladsgroup.json | [production] | 
            
  | 15:21 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1033.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:21 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es1033.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:21 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance es1026 (T376905)', diff saved to https://phabricator.wikimedia.org/P70931 and previous config saved to /var/cache/conftool/dbconfig/20241105-152114-ladsgroup.json | [production] | 
            
  | 15:20 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.reimage for host pc1017.eqiad.wmnet with OS bookworm | [production] | 
            
  | 15:18 | <hashar> | Switched WMCS integration instances from Java 11 to Java 17 via Horizon project wide config. That was forgotten in T359795 and blocks today's Jenkins upgrade (T379059) | [production] | 
            
  | 15:15 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host pc1017.eqiad.wmnet with OS bookworm | [production] | 
            
  | 15:06 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance es1026', diff saved to https://phabricator.wikimedia.org/P70929 and previous config saved to /var/cache/conftool/dbconfig/20241105-150607-ladsgroup.json | [production] | 
            
  | 15:02 | <cdanis@deploy2002> | helmfile [eqiad] DONE helmfile.d/services/chart-renderer: apply | [production] | 
            
  | 15:02 | <cdanis@deploy2002> | helmfile [eqiad] START helmfile.d/services/chart-renderer: apply | [production] | 
            
  | 15:02 | <cdanis@deploy2002> | helmfile [codfw] DONE helmfile.d/services/chart-renderer: apply | [production] | 
            
  | 15:01 | <cdanis@deploy2002> | helmfile [codfw] START helmfile.d/services/chart-renderer: apply | [production] | 
            
  | 15:01 | <hashar> | Upgrading CI Jenkins | T379059 | [production] | 
            
  | 14:53 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc1017.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:51 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance es1026', diff saved to https://phabricator.wikimedia.org/P70928 and previous config saved to /var/cache/conftool/dbconfig/20241105-145059-ladsgroup.json | [production] | 
            
  | 14:50 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on pc1017.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:48 | <jnuche@deploy2002> | rebuilt and synchronized wikiversions files: group0 to 1.44.0-wmf.2  refs T375661 | [production] | 
            
  | 14:44 | <cdanis@deploy2002> | helmfile [staging] DONE helmfile.d/services/chart-renderer: apply | [production] | 
            
  | 14:44 | <cdanis@deploy2002> | helmfile [staging] START helmfile.d/services/chart-renderer: apply | [production] | 
            
  | 14:35 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance es1026 (T376905)', diff saved to https://phabricator.wikimedia.org/P70927 and previous config saved to /var/cache/conftool/dbconfig/20241105-143552-ladsgroup.json | [production] | 
            
  | 14:34 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.reimage for host pc1017.eqiad.wmnet with OS bookworm | [production] | 
            
  | 14:33 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host pc1017.eqiad.wmnet with OS bookworm | [production] | 
            
  | 14:32 | <tgr|away> | UTC afternoon deploys done | [production] | 
            
  | 14:30 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Depooling es1026 (T376905)', diff saved to https://phabricator.wikimedia.org/P70926 and previous config saved to /var/cache/conftool/dbconfig/20241105-142959-ladsgroup.json | [production] | 
            
  | 14:29 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1026.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 14:29 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es1026.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 14:29 | <vgutierrez> | upload liberica 0.3 to apt.wm.o (bookworm-wikimedia) | [production] | 
            
  | 14:28 | <tgr@deploy2002> | Finished scap sync-world: Backport for [[gerrit:1087455|JsonConfig: Disable TrackGlobalJsonLinks to avoid missing table errors (T379067)]] (duration: 17m 24s) | [production] | 
            
  | 14:24 | <tgr@deploy2002> | tgr: Continuing with sync | [production] | 
            
  | 14:16 | <tgr@deploy2002> | tgr: Backport for [[gerrit:1087455|JsonConfig: Disable TrackGlobalJsonLinks to avoid missing table errors (T379067)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 14:12 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc1017.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:11 | <tgr@deploy2002> | Started scap sync-world: Backport for [[gerrit:1087455|JsonConfig: Disable TrackGlobalJsonLinks to avoid missing table errors (T379067)]] | [production] | 
            
  | 14:10 | <akosiaris@deploy2002> | helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply | [production] | 
            
  | 14:10 | <akosiaris@deploy2002> | helmfile [eqiad] START helmfile.d/services/rest-gateway: apply | [production] | 
            
  | 14:09 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on pc1017.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:08 | <moritzm> | installing PHP 7.4 security updates on bullseye (as packaged in Debian) | [production] | 
            
  | 14:08 | <akosiaris@deploy2002> | helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply | [production] | 
            
  | 14:07 | <akosiaris@deploy2002> | helmfile [codfw] START helmfile.d/services/rest-gateway: apply | [production] |