2026-03-10
12:56 <jiji@cumin1003> START - Cookbook sre.memcached.roll-reboot-restart rolling reboot on A:memcached-gutter-eqiad [production]
12:51 <jclark@cumin1003> START - Cookbook sre.hosts.reimage for host ml-serve1014.eqiad.wmnet with OS bookworm [production]
12:50 <jclark@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ml-serve1014 [production]
12:50 <jclark@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host ml-serve1014 [production]
12:49 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:49 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:49 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:49 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:48 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:48 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:48 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:48 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:47 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:45 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:44 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:42 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:42 <jiji@cumin1003> END (PASS) - Cookbook sre.memcached.roll-reboot-restart (exit_code=0) rolling restart_daemons on A:memcached-canary [production]
12:42 <jiji@cumin1003> START - Cookbook sre.memcached.roll-reboot-restart rolling restart_daemons on A:memcached-canary [production]
12:31 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:31 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:24 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be1095.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
12:10 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host ms-be1095.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
12:10 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be1095.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
11:59 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host ms-be1095.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
11:34 <ayounsi@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-fe2024.codfw.wmnet with OS bullseye [production]
11:34 <ayounsi@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - ayounsi@cumin1003" [production]
11:17 <ayounsi@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - ayounsi@cumin1003" [production]
11:15 <Emperor> rebalance codfw swift rings T354872 [production]
10:53 <ayounsi@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-fe2024.codfw.wmnet with reason: host reimage [production]
10:47 <ayounsi@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-fe2024.codfw.wmnet with reason: host reimage [production]
10:31 <ayounsi@cumin1003> START - Cookbook sre.hosts.reimage for host ms-fe2024.codfw.wmnet with OS bullseye [production]
10:30 <ayounsi@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ms-fe2024.codfw.wmnet with OS bullseye [production]
10:20 <ayounsi@cumin1003> START - Cookbook sre.hosts.reimage for host ms-fe2024.codfw.wmnet with OS bullseye [production]
10:17 <gkyziridis@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
09:32 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr2-eqdfw [production]
09:31 <ayounsi@cumin1003> START - Cookbook sre.network.tls for network device cr2-eqdfw [production]
09:22 <derick@deploy2002> mwscript-k8s job started: extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=loginwiki --logwiki=metawiki TMPRI1975 FondueFanatic # T419499 [production]
09:00 <arnaudb@dns1005> END - running authdns-update [production]
09:00 <godog> restore all host interfaces - T417393 [production]
08:58 <arnaudb@dns1005> START - running authdns-update [production]
08:30 <godog> disabled interface for cloudcephmon1004 - T417393 [production]
08:22 <godog> disabled interfaces for cloudcephosd1021 cloudcephosd1042 cloudcephosd1043 cloudcephosd1018 cloudcephosd1022 - T417393 [production]
08:18 <godog> disabled interfaces for cloudcephosd1016 cloudcephosd1017 cloudcephosd1016 cloudcephosd1018 cloudcephosd1017 cloudcephosd1035 - T417393 [production]
08:05 <godog> start disabling cloudcephosd interfaces - T417393 [production]
07:49 <godog> prep cloudsw reboot tests 'ceph osd set noout' - T417393 [production]
07:41 <filippo@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 19 hosts with reason: switch down tests [production]
06:14 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs2009.codfw.wmnet with OS bookworm [production]
04:09 <pt1979@cumin2002> END (FAIL) - Cookbook sre.network.tls (exit_code=99) for network device asw1-23-ulsfo [production]
04:08 <pt1979@cumin2002> START - Cookbook sre.network.tls for network device asw1-23-ulsfo [production]
04:01 <mwpresync@deploy2002> Pruned MediaWiki: 1.46.0-wmf.16 (duration: 01m 48s) [production]