| 2024-09-12
      
      ยง | 
    
  | 12:25 | <akosiaris@cumin1002> | START - Cookbook sre.k8s.pool-depool-node depool for host mw2394.codfw.wmnet | [production] | 
            
  | 12:25 | <akosiaris@cumin1002> | END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host mw2390.codfw.wmnet | [production] | 
            
  | 12:24 | <akosiaris@cumin1002> | START - Cookbook sre.k8s.pool-depool-node depool for host mw2390.codfw.wmnet | [production] | 
            
  | 12:19 | <btullis@cumin1002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1085.eqiad.wmnet | [production] | 
            
  | 12:11 | <btullis@cumin1002> | START - Cookbook sre.hosts.reboot-single for host an-worker1085.eqiad.wmnet | [production] | 
            
  | 12:06 | <aborrero@cloudcumin1001> | END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for tools-k8s-worker-nfs-16, tools-k8s-worker-nfs-33 (T374612) | [tools] | 
            
  | 11:59 | <aborrero@cloudcumin1001> | START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-16, tools-k8s-worker-nfs-33 (T374612) | [tools] | 
            
  | 11:54 | <aborrero@cloudcumin1001> | END (FAIL) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=99) for tools-k8s-worker-nfs-23, tools-k8s-worker-16, tools-k8s-worker-nfs-33 (T374612) | [tools] | 
            
  | 11:48 | <aborrero@cloudcumin1001> | START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-23, tools-k8s-worker-16, tools-k8s-worker-nfs-33 (T374612) | [tools] | 
            
  | 11:47 | <urbanecm@deploy1003> | Finished scap sync-world: Backport for [[gerrit:1071916|Babel: Set BabelUseCommunityConfiguration to false (T374611)]] (duration: 11m 28s) | [production] | 
            
  | 11:46 | <jynus> | restarting db1171:s7 mysql process T374610 | [production] | 
            
  | 11:42 | <aborrero@cloudcumin1001> | END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for tools-k8s-worker-nfs-28 (T374612) | [tools] | 
            
  | 11:38 | <jelto@cumin1002> | END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab2002.wikimedia.org with reason: Upgrade GitLab to new version | [production] | 
            
  | 11:37 | <aborrero@cloudcumin1001> | START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-28 (T374612) | [tools] | 
            
  | 11:35 | <urbanecm@deploy1003> | Started scap sync-world: Backport for [[gerrit:1071916|Babel: Set BabelUseCommunityConfiguration to false (T374611)]] | [production] | 
            
  | 11:20 | <damilare> | SmashPig upgraded from eb7807f8 to 08c79f4f | [production] | 
            
  | 11:14 | <hnowlan@deploy1003> | helmfile [eqiad] DONE helmfile.d/services/shellbox-video: apply | [production] | 
            
  | 11:13 | <hnowlan@deploy1003> | helmfile [eqiad] START helmfile.d/services/shellbox-video: apply | [production] | 
            
  | 11:06 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2029.codfw.wmnet | [production] | 
            
  | 11:06 | <cgoubert@cumin1002> | START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2029.codfw.wmnet | [production] | 
            
  | 11:05 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2093.codfw.wmnet | [production] | 
            
  | 11:05 | <cgoubert@cumin1002> | START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2093.codfw.wmnet | [production] | 
            
  | 11:03 | <hnowlan@deploy1003> | helmfile [codfw] DONE helmfile.d/services/shellbox-video: apply | [production] | 
            
  | 11:03 | <hnowlan@deploy1003> | helmfile [codfw] START helmfile.d/services/shellbox-video: apply | [production] | 
            
  | 10:59 | <hnowlan@deploy1003> | helmfile [staging] DONE helmfile.d/services/shellbox-video: apply | [production] | 
            
  | 10:59 | <hnowlan@deploy1003> | helmfile [staging] START helmfile.d/services/shellbox-video: apply | [production] | 
            
  | 10:51 | <arturo> | merging change to keystone wmf hooks https://gerrit.wikimedia.org/r/c/operations/puppet/+/1071230 (T374020) | [admin] | 
            
  | 10:43 | <valerio-bozzolan> | "Not reachable since 3AM looking at Grafana. Soft reboot from Horizon" | [wlm-it-visual] | 
            
  | 10:34 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2216 (T371742)', diff saved to https://phabricator.wikimedia.org/P69039 and previous config saved to /var/cache/conftool/dbconfig/20240912-103434-ladsgroup.json | [production] | 
            
  | 10:33 | <cmooney@cumin1002> | END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) 2.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.8.0.e.f.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa. on all recursors | [production] | 
            
  | 10:32 | <cmooney@cumin1002> | START - Cookbook sre.dns.wipe-cache 2.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.8.0.e.f.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa. on all recursors | [production] | 
            
  | 10:25 | <btullis> | stopping envoyproxy on cephosd1001 | [production] | 
            
  | 10:25 | <cmooney@cumin1002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 10:25 | <cmooney@cumin1002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update dns entries for et-0-0-31-100.ssw1-f1-eqiad.eqiad.wmnet - cmooney@cumin1002" | [production] | 
            
  | 10:25 | <cmooney@cumin1002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update dns entries for et-0-0-31-100.ssw1-f1-eqiad.eqiad.wmnet - cmooney@cumin1002" | [production] | 
            
  | 10:19 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2216', diff saved to https://phabricator.wikimedia.org/P69038 and previous config saved to /var/cache/conftool/dbconfig/20240912-101927-ladsgroup.json | [production] | 
            
  | 10:08 | <cgoubert@deploy1003> | helmfile [eqiad] START helmfile.d/services/mw-wikifunctions: apply | [production] | 
            
  | 10:08 | <cgoubert@deploy1003> | helmfile [codfw] DONE helmfile.d/services/mw-wikifunctions: apply | [production] | 
            
  | 10:08 | <cgoubert@deploy1003> | helmfile [codfw] START helmfile.d/services/mw-wikifunctions: apply | [production] | 
            
  | 10:08 | <btullis> | restarted envoyproxy on cephosd1001 | [production] | 
            
  | 10:08 | <claime> | Increasing mw-wikifunctions replicas to 6 | [production] | 
            
  | 09:59 | <btullis> | stopping envoyproxy on cephosd1001 | [production] | 
            
  | 09:53 | <arnaudb@cumin1002> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db1246.eqiad.wmnet with OS bookworm | [production] | 
            
  | 09:53 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2129 (re)pooling @ 75%: post db2229 bootstrap', diff saved to https://phabricator.wikimedia.org/P69034 and previous config saved to /var/cache/conftool/dbconfig/20240912-095335-arnaudb.json | [production] | 
            
  | 09:49 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2216 (T371742)', diff saved to https://phabricator.wikimedia.org/P69033 and previous config saved to /var/cache/conftool/dbconfig/20240912-094912-ladsgroup.json | [production] | 
            
  | 09:41 | <dcausse@deploy1003> | helmfile [codfw] DONE helmfile.d/services/rdf-streaming-updater: apply | [production] | 
            
  | 09:41 | <dcausse@deploy1003> | helmfile [codfw] START helmfile.d/services/rdf-streaming-updater: apply | [production] | 
            
  | 09:38 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2129 (re)pooling @ 50%: post db2229 bootstrap', diff saved to https://phabricator.wikimedia.org/P69032 and previous config saved to /var/cache/conftool/dbconfig/20240912-093829-arnaudb.json | [production] | 
            
  | 09:35 | <elukey@cumin1002> | END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest2001.codfw.wmnet with OS bookworm | [production] | 
            
  | 09:32 | <elukey@cumin1002> | START - Cookbook sre.hosts.reimage for host sretest2001.codfw.wmnet with OS bookworm | [production] |