| 
      
        2025-09-09
      
     | 
  
    
  | 12:43 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Depooling db2213 (T402925)', diff saved to https://phabricator.wikimedia.org/P82945 and previous config saved to /var/cache/conftool/dbconfig/20250909-124309-ladsgroup.json | 
  [production] | 
            
  | 12:43 | 
  <ladsgroup@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2213.codfw.wmnet with reason: Maintenance | 
  [production] | 
            
  | 12:41 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2213', diff saved to https://phabricator.wikimedia.org/P82944 and previous config saved to /var/cache/conftool/dbconfig/20250909-124114-fceratto.json | 
  [production] | 
            
  | 12:39 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Depooling db2240 (T402925)', diff saved to https://phabricator.wikimedia.org/P82943 and previous config saved to /var/cache/conftool/dbconfig/20250909-123859-ladsgroup.json | 
  [production] | 
            
  | 12:38 | 
  <ladsgroup@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2240.codfw.wmnet with reason: Maintenance | 
  [production] | 
            
  | 12:38 | 
  <ladsgroup@cumin1003> | 
  DONE (ERROR) - Cookbook sre.hosts.downtime (exit_code=97) for 12:00:00 on db2240.codfw.wmnet with reason: Maintenance | 
  [production] | 
            
  | 12:33 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P82941 and previous config saved to /var/cache/conftool/dbconfig/20250909-123309-fceratto.json | 
  [production] | 
            
  | 12:26 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2213', diff saved to https://phabricator.wikimedia.org/P82939 and previous config saved to /var/cache/conftool/dbconfig/20250909-122607-fceratto.json | 
  [production] | 
            
  | 12:21 | 
  <mvernon@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest2010.codfw.wmnet with OS bookworm | 
  [production] | 
            
  | 12:18 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P82938 and previous config saved to /var/cache/conftool/dbconfig/20250909-121802-fceratto.json | 
  [production] | 
            
  | 12:12 | 
  <ladsgroup@cumin1002> | 
  START - Cookbook sre.mysql.pool db1172 gradually with 4 steps - Pool db1172.eqiad.wmnet in after cloning | 
  [production] | 
            
  | 12:11 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2213 (T401906)', diff saved to https://phabricator.wikimedia.org/P82936 and previous config saved to /var/cache/conftool/dbconfig/20250909-121059-fceratto.json | 
  [production] | 
            
  | 12:08 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Depooling db2213 (T401906)', diff saved to https://phabricator.wikimedia.org/P82935 and previous config saved to /var/cache/conftool/dbconfig/20250909-120825-fceratto.json | 
  [production] | 
            
  | 12:08 | 
  <fceratto@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2213.codfw.wmnet with reason: Maintenance | 
  [production] | 
            
  | 12:04 | 
  <mvernon@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest2010.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 12:02 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2172 (T401906)', diff saved to https://phabricator.wikimedia.org/P82934 and previous config saved to /var/cache/conftool/dbconfig/20250909-120254-fceratto.json | 
  [production] | 
            
  | 12:00 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Depooling db2172 (T401906)', diff saved to https://phabricator.wikimedia.org/P82933 and previous config saved to /var/cache/conftool/dbconfig/20250909-120043-fceratto.json | 
  [production] | 
            
  | 12:00 | 
  <fceratto@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2172.codfw.wmnet with reason: Maintenance | 
  [production] | 
            
  | 12:00 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2155 (T401906)', diff saved to https://phabricator.wikimedia.org/P82932 and previous config saved to /var/cache/conftool/dbconfig/20250909-120021-fceratto.json | 
  [production] | 
            
  | 11:58 | 
  <mvernon@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on sretest2010.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 11:53 | 
  <btullis@cumin1003> | 
  END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-analytics cluster: Roll restart of jvm daemons. | 
  [production] | 
            
  | 11:52 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Set dbctl values for db2213 T404067', diff saved to https://phabricator.wikimedia.org/P82929 and previous config saved to /var/cache/conftool/dbconfig/20250909-115245-fceratto.json | 
  [production] | 
            
  | 11:47 | 
  <btullis@cumin1003> | 
  START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-analytics cluster: Roll restart of jvm daemons. | 
  [production] | 
            
  | 11:47 | 
  <mvernon@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS bookworm | 
  [production] | 
            
  | 11:46 | 
  <mvernon@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2010.codfw.wmnet with OS bookworm | 
  [production] | 
            
  | 11:45 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2155', diff saved to https://phabricator.wikimedia.org/P82928 and previous config saved to /var/cache/conftool/dbconfig/20250909-114514-fceratto.json | 
  [production] | 
            
  | 11:43 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 11:43 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: remove ganeti02 VIP in esams - jmm@cumin2002" | 
  [production] | 
            
  | 11:43 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: remove ganeti02 VIP in esams - jmm@cumin2002" | 
  [production] | 
            
  | 11:39 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 11:39 | 
  <btullis@cumin1003> | 
  END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-flink-codfw cluster: Roll restart of jvm daemons. | 
  [production] | 
            
  | 11:37 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Promote db2192 to s5 primary T404067', diff saved to https://phabricator.wikimedia.org/P82927 and previous config saved to /var/cache/conftool/dbconfig/20250909-113740-fceratto.json | 
  [production] | 
            
  | 11:36 | 
  <jmm@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti3008.esams.wmnet with OS bookworm | 
  [production] | 
            
  | 11:35 | 
  <federico3> | 
  Starting s5 codfw failover from db2213 to db2192 - T404067 | 
  [production] | 
            
  | 11:35 | 
  <moritzm> | 
  update bookworm d-i image T403852 | 
  [production] | 
            
  | 11:33 | 
  <vriley@cumin1003> | 
  START - Cookbook sre.hosts.reimage for host es1056.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 11:32 | 
  <btullis@cumin1003> | 
  START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-flink-codfw cluster: Roll restart of jvm daemons. | 
  [production] | 
            
  | 11:30 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2155', diff saved to https://phabricator.wikimedia.org/P82926 and previous config saved to /var/cache/conftool/dbconfig/20250909-113006-fceratto.json | 
  [production] | 
            
  | 11:28 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Remove db2192 from API/vslow/dump T404067', diff saved to https://phabricator.wikimedia.org/P82925 and previous config saved to /var/cache/conftool/dbconfig/20250909-112828-fceratto.json | 
  [production] | 
            
  | 11:27 | 
  <fceratto@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 23 hosts with reason: Primary switchover s5 T404067 | 
  [production] | 
            
  | 11:15 | 
  <elukey@cumin1003> | 
  END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host ml-serve1008.eqiad.wmnet | 
  [production] | 
            
  | 11:15 | 
  <elukey@cumin1003> | 
  START - Cookbook sre.k8s.pool-depool-node pool for host ml-serve1008.eqiad.wmnet | 
  [production] | 
            
  | 11:15 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2155 (T401906)', diff saved to https://phabricator.wikimedia.org/P82924 and previous config saved to /var/cache/conftool/dbconfig/20250909-111459-fceratto.json | 
  [production] | 
            
  | 11:14 | 
  <elukey@cumin1003> | 
  END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-lab1002.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART | 
  [production] | 
            
  | 11:14 | 
  <elukey@cumin1003> | 
  START - Cookbook sre.hosts.provision for host ml-lab1002.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART | 
  [production] | 
            
  | 11:12 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Depooling db2155 (T401906)', diff saved to https://phabricator.wikimedia.org/P82923 and previous config saved to /var/cache/conftool/dbconfig/20250909-111250-fceratto.json | 
  [production] | 
            
  | 11:12 | 
  <fceratto@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2155.codfw.wmnet with reason: Maintenance | 
  [production] | 
            
  | 11:12 | 
  <fceratto@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2147 (T401906)', diff saved to https://phabricator.wikimedia.org/P82922 and previous config saved to /var/cache/conftool/dbconfig/20250909-111226-fceratto.json | 
  [production] | 
            
  | 11:10 | 
  <elukey@cumin1003> | 
  END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ml-serve1008.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL | 
  [production] | 
            
  | 11:09 | 
  <elukey@cumin1003> | 
  START - Cookbook sre.hosts.provision for host ml-serve1008.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL | 
  [production] |