| 2024-10-22 |
  | 10:29 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1184.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 10:28 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1184.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 10:28 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1169 (T376905)', diff saved to https://phabricator.wikimedia.org/P70466 and previous config saved to /var/cache/conftool/dbconfig/20241022-102843-ladsgroup.json | [production] | 
            
  | 10:22 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2149 (re)pooling @ 25%: post clone', diff saved to https://phabricator.wikimedia.org/P70465 and previous config saved to /var/cache/conftool/dbconfig/20241022-102227-arnaudb.json | [production] | 
            
  | 10:13 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1169', diff saved to https://phabricator.wikimedia.org/P70464 and previous config saved to /var/cache/conftool/dbconfig/20241022-101336-ladsgroup.json | [production] | 
            
  | 10:12 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub: sync on production | [production] | 
            
  | 10:08 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub: apply on production | [production] | 
            
  | 10:07 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub-next: sync on staging | [production] | 
            
  | 10:04 | <hnowlan@deploy1003> | helmfile [codfw] DONE helmfile.d/services/shellbox-video: sync | [production] | 
            
  | 10:04 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply on staging | [production] | 
            
  | 10:03 | <hnowlan@deploy1003> | helmfile [codfw] START helmfile.d/services/shellbox-video: sync | [production] | 
            
  | 10:03 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.mysql.clone (exit_code=0) of db2149.codfw.wmnet onto db2205.codfw.wmnet | [production] | 
            
  | 10:03 | <andrewtavis-wmde@deploy2002> | Finished deploy [airflow-dags/wmde@dcf019d]: (no justification provided) (duration: 00m 11s) | [production] | 
            
  | 10:02 | <andrewtavis-wmde@deploy2002> | Started deploy [airflow-dags/wmde@dcf019d]: (no justification provided) | [production] | 
            
  | 09:58 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1169', diff saved to https://phabricator.wikimedia.org/P70463 and previous config saved to /var/cache/conftool/dbconfig/20241022-095829-ladsgroup.json | [production] | 
            
  | 09:51 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply on staging | [production] | 
            
  | 09:43 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1169 (T376905)', diff saved to https://phabricator.wikimedia.org/P70461 and previous config saved to /var/cache/conftool/dbconfig/20241022-094322-ladsgroup.json | [production] | 
            
  | 09:36 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub: apply on production | [production] | 
            
  | 09:33 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Depooling db1169 (T376905)', diff saved to https://phabricator.wikimedia.org/P70460 and previous config saved to /var/cache/conftool/dbconfig/20241022-093345-ladsgroup.json | [production] | 
            
  | 09:33 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1169.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 09:33 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1169.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 09:32 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply on staging | [production] | 
            
  | 09:28 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be2082.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 09:27 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host ms-be2082.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 09:22 | <hashar> | Restarting CI Jenkins | [production] | 
            
  | 09:06 | <hashar> | Restarting Gerrit | [production] | 
            
  | 09:05 | <arturo> | restart puppetserver service for T377803 | [tools] | 
            
  | 09:04 | <arturo> | restart puppetserver service in T377803 | [cloudinfra] | 
            
  | 08:57 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on pc1013.eqiad.wmnet with reason: maintenance | [production] | 
            
  | 08:57 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on pc1013.eqiad.wmnet with reason: maintenance | [production] | 
            
  | 08:37 | <arnaudb@cumin1002> | START - Cookbook sre.mysql.clone of db2149.codfw.wmnet onto db2205.codfw.wmnet | [production] | 
            
  | 08:35 | <elukey@cumin1002> | END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ms-be2082.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 08:34 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host ms-be2082.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 08:33 | <elukey@cumin1002> | END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ms-be2082.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 08:33 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host ms-be2082.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 08:33 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be2083.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 08:32 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host ms-be2083.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 08:25 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2240 (re)pooling @ 100%: post clone', diff saved to https://phabricator.wikimedia.org/P70459 and previous config saved to /var/cache/conftool/dbconfig/20241022-082545-arnaudb.json | [production] | 
            
  | 08:24 | <moritzm> | irc.wikimedia.org has been switched to ircstream T376014 | [production] | 
            
  | 08:10 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2240 (re)pooling @ 75%: post clone', diff saved to https://phabricator.wikimedia.org/P70457 and previous config saved to /var/cache/conftool/dbconfig/20241022-081040-arnaudb.json | [production] | 
            
  | 08:08 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host irc1002.wikimedia.org | [production] | 
            
  | 08:04 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host irc1002.wikimedia.org | [production] | 
            
  | 08:03 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be2081.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 08:03 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host ms-be2081.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 08:00 | <elukey@cumin1002> | END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ms-be2081.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 07:59 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db[2149,2205].codfw.wmnet with reason: db2205 reclone | [production] | 
            
  | 07:59 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db[2149,2205].codfw.wmnet with reason: db2205 reclone | [production] | 
            
  | 07:58 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host ms-be2081.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 07:58 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'T377718', diff saved to https://phabricator.wikimedia.org/P70456 and previous config saved to /var/cache/conftool/dbconfig/20241022-075830-arnaudb.json | [production] | 
            
  | 07:55 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db2240 (re)pooling @ 50%: post clone', diff saved to https://phabricator.wikimedia.org/P70455 and previous config saved to /var/cache/conftool/dbconfig/20241022-075534-arnaudb.json | [production] |