| 
      
        2024-09-05
      
      ยง
     | 
  
    
  | 17:08 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'db2169 (re)pooling @ 25%: T370852', diff saved to https://phabricator.wikimedia.org/P68717 and previous config saved to /var/cache/conftool/dbconfig/20240905-170801-arnaudb.json | 
  [production] | 
            
  | 17:08 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'db2191 (re)pooling @ 25%: T370852', diff saved to https://phabricator.wikimedia.org/P68716 and previous config saved to /var/cache/conftool/dbconfig/20240905-170800-arnaudb.json | 
  [production] | 
            
  | 17:08 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'db2150 (re)pooling @ 25%: T370852', diff saved to https://phabricator.wikimedia.org/P68715 and previous config saved to /var/cache/conftool/dbconfig/20240905-170800-arnaudb.json | 
  [production] | 
            
  | 17:06 | 
  <fabfur@cumin1002> | 
  conftool action : set/pooled=yes; selector: name=cp2036.ulsfo.wmnet | 
  [production] | 
            
  | 17:06 | 
  <fabfur@cumin1002> | 
  conftool action : set/pooled=yes; selector: name=cp2035.ulsfo.wmnet | 
  [production] | 
            
  | 17:05 | 
  <Emperor> | 
  pool moss-fe2001 ms-fe2011 T373096 | 
  [production] | 
            
  | 17:04 | 
  <Emperor> | 
  moss-be2003 exit maintenance mode T373096 | 
  [production] | 
            
  | 17:03 | 
  <fabfur@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp2036.codfw.wmnet | 
  [production] | 
            
  | 17:03 | 
  <fabfur@cumin1002> | 
  START - Cookbook sre.hosts.remove-downtime for cp2036.codfw.wmnet | 
  [production] | 
            
  | 17:03 | 
  <fabfur@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp2035.codfw.wmnet | 
  [production] | 
            
  | 17:03 | 
  <fabfur@cumin1002> | 
  START - Cookbook sre.hosts.remove-downtime for cp2035.codfw.wmnet | 
  [production] | 
            
  | 16:59 | 
  <topranks> | 
  move server uplinks codfw rack c3 from asw-c3-codfw to lsw1-c3-codfw T373096 | 
  [production] | 
            
  | 16:58 | 
  <kamila@cumin1002> | 
  END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2091.codfw.wmnet | 
  [production] | 
            
  | 16:58 | 
  <kamila@cumin1002> | 
  START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2091.codfw.wmnet | 
  [production] | 
            
  | 16:58 | 
  <cmooney@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on 34 hosts with reason: Move server uplinks codfw racks C3 | 
  [production] | 
            
  | 16:57 | 
  <cmooney@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 0:30:00 on 34 hosts with reason: Move server uplinks codfw racks C3 | 
  [production] | 
            
  | 16:55 | 
  <kamila@cumin1002> | 
  END (PASS) - Cookbook sre.k8s.renumber-node (exit_code=0) Renumbering for host wikikube-worker2091.codfw.wmnet | 
  [production] | 
            
  | 16:55 | 
  <kamila@cumin1002> | 
  END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2091.codfw.wmnet | 
  [production] | 
            
  | 16:55 | 
  <kamila@cumin1002> | 
  START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2091.codfw.wmnet | 
  [production] | 
            
  | 16:51 | 
  <cmooney@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 0:30:00 on 35 hosts with reason: Move server uplinks codfw racks C3 | 
  [production] | 
            
  | 16:51 | 
  <cmooney@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 0:30:00 on 35 hosts with reason: Move server uplinks codfw racks C3 | 
  [production] | 
            
  | 16:51 | 
  <kamila@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2091.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 16:49 | 
  <dancy@deploy1003> | 
  Installing scap version "4.101.1" for 211 hosts | 
  [production] | 
            
  | 16:49 | 
  <kamila@cumin1002> | 
  END (FAIL) - Cookbook sre.k8s.renumber-node (exit_code=99) Renumbering for host wikikube-worker2092.codfw.wmnet | 
  [production] | 
            
  | 16:49 | 
  <kamila@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2092.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 16:48 | 
  <dcaro@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.ceph.osd.drain_node (exit_code=0) (T373986) | 
  [admin] | 
            
  | 16:43 | 
  <andrew@cloudcumin1001> | 
  START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1032.eqiad.wmnet' (T374043) | 
  [admin] | 
            
  | 16:42 | 
  <topranks> | 
  move server uplinks codfw rack c2 from asw-c2-codfw to lsw1-c2-codfw T373096 | 
  [production] | 
            
  | 16:42 | 
  <andrew@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=0) on host 'cloudvirt1031.eqiad.wmnet' (T374043) | 
  [admin] | 
            
  | 16:42 | 
  <jhathaway> | 
  disabling puppet on cp nodes to test puppet change | 
  [production] | 
            
  | 16:39 | 
  <cmooney@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on 23 hosts with reason: Move server uplinks codfw racks C2 | 
  [production] | 
            
  | 16:38 | 
  <cmooney@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 0:30:00 on 23 hosts with reason: Move server uplinks codfw racks C2 | 
  [production] | 
            
  | 16:38 | 
  <dancy@deploy1003> | 
  rebuilt and synchronized wikiversions files: group1 to 1.43.0-wmf.21  refs T373640 | 
  [production] | 
            
  | 16:30 | 
  <Emperor> | 
  depool moss-fe2001 ms-fe2011 T373096 | 
  [production] | 
            
  | 16:30 | 
  <Emperor> | 
  moss-be2003 to maintenance mode T373096 | 
  [production] | 
            
  | 16:27 | 
  <hnowlan@puppetmaster1001> | 
  conftool action : set/pooled=no; selector: name=maps2007.codfw.wmnet | 
  [production] | 
            
  | 16:21 | 
  <arnaudb@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on 6 hosts with reason: network maintenance T370852 | 
  [production] | 
            
  | 16:21 | 
  <arnaudb@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 0:30:00 on 6 hosts with reason: network maintenance T370852 | 
  [production] | 
            
  | 16:21 | 
  <andrew@cloudcumin1001> | 
  START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1031.eqiad.wmnet' (T374043) | 
  [admin] | 
            
  | 16:20 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'depool db2141 db2144 db2150 db2169 db2186 db2191 - T370852', diff saved to https://phabricator.wikimedia.org/P68714 and previous config saved to /var/cache/conftool/dbconfig/20240905-162057-arnaudb.json | 
  [production] | 
            
  | 16:10 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'db2125 (re)pooling @ 100%: post clone repool', diff saved to https://phabricator.wikimedia.org/P68713 and previous config saved to /var/cache/conftool/dbconfig/20240905-161051-arnaudb.json | 
  [production] | 
            
  | 16:02 | 
  <claime> | 
  homer cr*-codfw* commit 'T372878' | 
  [production] | 
            
  | 15:55 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'db2125 (re)pooling @ 75%: post clone repool', diff saved to https://phabricator.wikimedia.org/P68712 and previous config saved to /var/cache/conftool/dbconfig/20240905-155545-arnaudb.json | 
  [production] | 
            
  | 15:54 | 
  <ebernhardson@deploy1003> | 
  helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply | 
  [production] | 
            
  | 15:54 | 
  <ebernhardson@deploy1003> | 
  helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply | 
  [production] | 
            
  | 15:49 | 
  <dancy> | 
  Updating scap to v4.101.0 in beta cluster | 
  [releng] | 
            
  | 15:48 | 
  <cmooney@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on backup2006.codfw.wmnet with reason: Move backup2006 uplink to lsw1-c2-codfw | 
  [production] | 
            
  | 15:48 | 
  <cmooney@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 0:30:00 on backup2006.codfw.wmnet with reason: Move backup2006 uplink to lsw1-c2-codfw | 
  [production] | 
            
  | 15:43 | 
  <hnowlan@cumin1002> | 
  END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host mw2322.codfw.wmnet | 
  [production] | 
            
  | 15:42 | 
  <hnowlan@cumin1002> | 
  START - Cookbook sre.k8s.pool-depool-node depool for host mw2322.codfw.wmnet | 
  [production] |