| 2024-11-28
      
      ยง | 
    
  | 16:19 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host ms-be2081.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART | [production] | 
            
  | 16:17 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2205.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 16:17 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2205.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 16:08 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2194.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 16:07 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2194.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 16:01 | <gmodena@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-dump-rev-content-reconcile-enrich: apply | [production] | 
            
  | 16:01 | <gmodena@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-dump-rev-content-reconcile-enrich: apply | [production] | 
            
  | 15:58 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2190.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:58 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2190.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:48 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2177.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:48 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2177.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:46 | <jmm@cumin2002> | END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host idp-test2004.wikimedia.org | [production] | 
            
  | 15:39 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2186.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:39 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db2186.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:39 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2156.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:39 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2156.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:37 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host idp-test2004.wikimedia.org | [production] | 
            
  | 15:36 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host idp-test2005.wikimedia.org | [production] | 
            
  | 15:32 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host idp-test2005.wikimedia.org | [production] | 
            
  | 15:32 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance es2032 (T376905)', diff saved to https://phabricator.wikimedia.org/P71371 and previous config saved to /var/cache/conftool/dbconfig/20241128-153202-ladsgroup.json | [production] | 
            
  | 15:30 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2149.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:29 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2149.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:27 | <gmodena@deploy2002> | Finished deploy [analytics/refinery@ac87303] (hadoop-test): Gobblin config changes [analytics/refinery@ac873037] (duration: 00m 26s) | [production] | 
            
  | 15:26 | <gmodena@deploy2002> | Started deploy [analytics/refinery@ac87303] (hadoop-test): Gobblin config changes [analytics/refinery@ac873037] | [production] | 
            
  | 15:25 | <gmodena@deploy2002> | Finished deploy [analytics/refinery@ac87303] (thin): Gobblin config changes THIN [analytics/refinery@ac873037] (duration: 00m 30s) | [production] | 
            
  | 15:25 | <gmodena@deploy2002> | Started deploy [analytics/refinery@ac87303] (thin): Gobblin config changes THIN [analytics/refinery@ac873037] | [production] | 
            
  | 15:21 | <moritzm> | removing ganeti1018 from active Ganeti nodes T378921 | [production] | 
            
  | 15:20 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2139.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:20 | <elukey@deploy2002> | helmfile [staging] DONE helmfile.d/services/tegola-vector-tiles: sync | [production] | 
            
  | 15:20 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2139.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:19 | <gmodena@deploy2002> | Finished deploy [analytics/refinery@ac87303]: Gobblin config changes [analytics/refinery@ac873037] (duration: 03m 05s) | [production] | 
            
  | 15:19 | <elukey@deploy2002> | helmfile [staging] START helmfile.d/services/tegola-vector-tiles: sync | [production] | 
            
  | 15:18 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:18 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:16 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance es2032', diff saved to https://phabricator.wikimedia.org/P71370 and previous config saved to /var/cache/conftool/dbconfig/20241128-151655-ladsgroup.json | [production] | 
            
  | 15:16 | <gmodena@deploy2002> | Started deploy [analytics/refinery@ac87303]: Gobblin config changes [analytics/refinery@ac873037] | [production] | 
            
  | 15:15 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1018.eqiad.wmnet | [production] | 
            
  | 15:15 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1240.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:15 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db1240.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:13 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1223.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:12 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db1223.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:10 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1013,1017].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:10 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1013,1017].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:10 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1212.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:10 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db1212.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:08 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1198.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:08 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db1198.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:06 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1175.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:06 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db1175.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:04 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1166.eqiad.wmnet with reason: Maintenance | [production] |