| 2025-08-22
      
      § | 
    
  | 19:37 | <jhathaway@cumin1002> | START - Cookbook sre.hosts.provision for host an-test-coord1002.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART | [production] | 
            
  | 19:35 | <jhathaway@cumin1002> | END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-test-coord1002.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART | [production] | 
            
  | 19:35 | <jhathaway@cumin1002> | START - Cookbook sre.hosts.provision for host an-test-coord1002.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART | [production] | 
            
  | 19:29 | <andrew@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1045.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 19:25 | <andrew@cumin2002> | START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1045.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 19:04 | <Krinkle> | Updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/1181162 | [releng] | 
            
  | 18:57 | <andrew@cumin2002> | START - Cookbook sre.hosts.reimage for host cloudcephosd1045.eqiad.wmnet with OS bullseye | [production] | 
            
  | 18:57 | <andrew@cumin2002> | END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcephosd1045.eqiad.wmnet with OS bullseye | [production] | 
            
  | 18:57 | <krinkle@deploy1003> | Finished deploy [integration/docroot@2d9ffad]: (no justification provided) (duration: 00m 11s) | [production] | 
            
  | 18:56 | <krinkle@deploy1003> | Started deploy [integration/docroot@2d9ffad]: (no justification provided) | [production] | 
            
  | 18:51 | <krinkle@deploy1003> | Finished deploy [integration/docroot@5918d5e]: Add support for lcov (duration: 00m 21s) | [production] | 
            
  | 18:51 | <jhathaway@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-test-coord1002.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART | [production] | 
            
  | 18:51 | <krinkle@deploy1003> | Started deploy [integration/docroot@5918d5e]: Add support for lcov | [production] | 
            
  | 18:46 | <andrew@cumin2002> | START - Cookbook sre.hosts.reimage for host cloudcephosd1045.eqiad.wmnet with OS bullseye | [production] | 
            
  | 18:41 | <jhathaway@cumin1002> | START - Cookbook sre.hosts.provision for host an-test-coord1002.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART | [production] | 
            
  | 18:31 | <jhathaway@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-test-coord1002.eqiad.wmnet with reason: supermicro | [production] | 
            
  | 18:28 | <vriley@cumin1003> | END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudcephosd1045 | [production] | 
            
  | 18:28 | <vriley@cumin1003> | START - Cookbook sre.network.configure-switch-interfaces for host cloudcephosd1045 | [production] | 
            
  | 18:22 | <vriley@cumin1003> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1045.eqiad.wmnet with OS bookworm | [production] | 
            
  | 18:22 | <vriley@cumin1003> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1003" | [production] | 
            
  | 17:53 | <jhathaway@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-test-coord1002.eqiad.wmnet with OS bookworm | [production] | 
            
  | 17:39 | <jhathaway@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-test-coord1002.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 17:35 | <jhathaway@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on an-test-coord1002.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 17:32 | <dzahn@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on people1005.eqiad.wmnet with reason: T402596 | [production] | 
            
  | 17:25 | <rzl> | sudo -i docker-registryctl delete-tags docker-registry.discovery.wmnet/envoy-future:1.26.8-1  # T402584 | [production] | 
            
  | 17:15 | <jhathaway@cumin1002> | START - Cookbook sre.hosts.reimage for host an-test-coord1002.eqiad.wmnet with OS bookworm | [production] | 
            
  | 17:14 | <jhathaway@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-test-coord1002.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART | [production] | 
            
  | 17:14 | <jhathaway@cumin1002> | START - Cookbook sre.hosts.provision for host an-test-coord1002.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART | [production] | 
            
  | 17:13 | <sfaci@deploy1003> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply | [production] | 
            
  | 17:12 | <sfaci@deploy1003> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply | [production] | 
            
  | 15:51 | <fceratto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1245.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:51 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1243 (T399249)', diff saved to https://phabricator.wikimedia.org/P81711 and previous config saved to /var/cache/conftool/dbconfig/20250822-155124-fceratto.json | [production] | 
            
  | 15:36 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1243', diff saved to https://phabricator.wikimedia.org/P81710 and previous config saved to /var/cache/conftool/dbconfig/20250822-153617-fceratto.json | [production] | 
            
  | 15:26 | <andrew@cloudcumin1001> | START - Cookbook wmcs.ceph.osd.undrain_node | [admin] | 
            
  | 15:21 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1243', diff saved to https://phabricator.wikimedia.org/P81709 and previous config saved to /var/cache/conftool/dbconfig/20250822-152109-fceratto.json | [production] | 
            
  | 15:06 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1243 (T399249)', diff saved to https://phabricator.wikimedia.org/P81708 and previous config saved to /var/cache/conftool/dbconfig/20250822-150602-fceratto.json | [production] | 
            
  | 14:57 | <vriley@cumin1003> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1003" | [production] | 
            
  | 14:37 | <vriley@cumin1003> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1045.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:34 | <vriley@cumin1003> | START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1045.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:31 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.ceph.osd.drain_node (exit_code=0) | [admin] | 
            
  | 14:29 | <andrew@cloudcumin1001> | START - Cookbook wmcs.ceph.osd.drain_node | [admin] | 
            
  | 14:29 | <andrew@cloudcumin1001> | END (ERROR) - Cookbook wmcs.ceph.osd.drain_node (exit_code=97) | [admin] | 
            
  | 14:29 | <andrew@cloudcumin1001> | START - Cookbook wmcs.ceph.osd.drain_node | [admin] | 
            
  | 14:27 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=0) (T395910) | [admin] | 
            
  | 14:24 | <andrew@cloudcumin1001> | START - Cookbook wmcs.ceph.osd.bootstrap_and_add (T395910) | [admin] | 
            
  | 14:20 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=99) (T395910) | [admin] | 
            
  | 14:15 | <andrew@cloudcumin1001> | START - Cookbook wmcs.ceph.osd.bootstrap_and_add (T395910) | [admin] | 
            
  | 14:05 | <vriley@cumin1003> | START - Cookbook sre.hosts.reimage for host cloudcephosd1045.eqiad.wmnet with OS bookworm | [production] | 
            
  | 14:04 | <vriley@cumin1003> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudcephosd1045.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | [production] | 
            
  | 13:54 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.mysql.sanitarium_restart (exit_code=0) | [production] |