|
2026-03-10
§
|
| 12:56 |
<jiji@cumin1003> |
START - Cookbook sre.memcached.roll-reboot-restart rolling reboot on A:memcached-gutter-eqiad |
[production] |
| 12:51 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host ml-serve1014.eqiad.wmnet with OS bookworm |
[production] |
| 12:50 |
<jclark@cumin1003> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ml-serve1014 |
[production] |
| 12:50 |
<jclark@cumin1003> |
START - Cookbook sre.network.configure-switch-interfaces for host ml-serve1014 |
[production] |
| 12:49 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:49 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:49 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:49 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:48 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:48 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:48 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:48 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:47 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:45 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:44 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:42 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:42 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.memcached.roll-reboot-restart (exit_code=0) rolling restart_daemons on A:memcached-canary |
[production] |
| 12:42 |
<jiji@cumin1003> |
START - Cookbook sre.memcached.roll-reboot-restart rolling restart_daemons on A:memcached-canary |
[production] |
| 12:31 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:31 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 12:24 |
<elukey@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be1095.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 12:10 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host ms-be1095.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 12:10 |
<elukey@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be1095.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 11:59 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host ms-be1095.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 11:34 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-fe2024.codfw.wmnet with OS bullseye |
[production] |
| 11:34 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - ayounsi@cumin1003" |
[production] |
| 11:17 |
<ayounsi@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - ayounsi@cumin1003" |
[production] |
| 11:15 |
<Emperor> |
rebalance codfw swift rings T354872 |
[production] |
| 10:53 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-fe2024.codfw.wmnet with reason: host reimage |
[production] |
| 10:47 |
<ayounsi@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ms-fe2024.codfw.wmnet with reason: host reimage |
[production] |
| 10:31 |
<ayounsi@cumin1003> |
START - Cookbook sre.hosts.reimage for host ms-fe2024.codfw.wmnet with OS bullseye |
[production] |
| 10:30 |
<ayounsi@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ms-fe2024.codfw.wmnet with OS bullseye |
[production] |
| 10:20 |
<ayounsi@cumin1003> |
START - Cookbook sre.hosts.reimage for host ms-fe2024.codfw.wmnet with OS bullseye |
[production] |
| 10:17 |
<gkyziridis@deploy2002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . |
[production] |
| 09:32 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr2-eqdfw |
[production] |
| 09:31 |
<ayounsi@cumin1003> |
START - Cookbook sre.network.tls for network device cr2-eqdfw |
[production] |
| 09:22 |
<derick@deploy2002> |
mwscript-k8s job started: extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=loginwiki --logwiki=metawiki TMPRI1975 FondueFanatic # T419499 |
[production] |
| 09:00 |
<arnaudb@dns1005> |
END - running authdns-update |
[production] |
| 09:00 |
<godog> |
restore all host interfaces - T417393 |
[production] |
| 08:58 |
<arnaudb@dns1005> |
START - running authdns-update |
[production] |
| 08:30 |
<godog> |
disabled interface for cloudcephmon1004 - T417393 |
[production] |
| 08:22 |
<godog> |
disabled interfaces for cloudcephosd1021 cloudcephosd1042 cloudcephosd1043 cloudcephosd1018 cloudcephosd1022 - T417393 |
[production] |
| 08:18 |
<godog> |
disabled interfaces for cloudcephosd1016 cloudcephosd1017 cloudcephosd1016 cloudcephosd1018 cloudcephosd1017 cloudcephosd1035 - T417393 |
[production] |
| 08:05 |
<godog> |
start disabling cloudcephosd interfaces - T417393 |
[production] |
| 07:49 |
<godog> |
prep cloudsw reboot tests 'ceph osd set noout' - T417393 |
[production] |
| 07:41 |
<filippo@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 19 hosts with reason: switch down tests |
[production] |
| 06:14 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs2009.codfw.wmnet with OS bookworm |
[production] |
| 04:09 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.network.tls (exit_code=99) for network device asw1-23-ulsfo |
[production] |
| 04:08 |
<pt1979@cumin2002> |
START - Cookbook sre.network.tls for network device asw1-23-ulsfo |
[production] |
| 04:01 |
<mwpresync@deploy2002> |
Pruned MediaWiki: 1.46.0-wmf.16 (duration: 01m 48s) |
[production] |