|
2026-03-17
ยง
|
| 16:14 |
<brett@cumin2002> |
cookbooks.sre.cdn.roll-reboot finished rebooting cp7005.magru.wmnet |
[production] |
| 16:10 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ms-be2062.codfw.wmnet |
[production] |
| 16:08 |
<dzahn@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on zuul2003.codfw.wmnet with reason: host reimage |
[production] |
| 16:07 |
<mvernon@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host ms-be1064.eqiad.wmnet |
[production] |
| 16:05 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp7014.magru.wmnet with OS trixie |
[production] |
| 16:05 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp7014.magru.wmnet with OS trixie |
[production] |
| 16:03 |
<brett@cumin2002> |
START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp7013.magru.wmnet,cp701[5-6].magru.wmnet} and A:cp |
[production] |
| 16:03 |
<brett@cumin2002> |
START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp700[5-8].magru.wmnet} and A:cp |
[production] |
| 15:54 |
<mvernon@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1009.eqiad.wmnet |
[production] |
| 15:54 |
<mutante> |
zuul2003 - reimaging with trixie |
[production] |
| 15:52 |
<samtar@deploy2002> |
mwscript-k8s job started: foreachwikiindblist group1 cleanupWatchlistLabelMember.php # T420328 |
[production] |
| 15:51 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2033.codfw.wmnet |
[production] |
| 15:51 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2033.codfw.wmnet |
[production] |
| 15:46 |
<dzahn@cumin2002> |
START - Cookbook sre.hosts.reimage for host zuul2003.codfw.wmnet with OS trixie |
[production] |
| 15:45 |
<mvernon@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host thanos-be1009.eqiad.wmnet |
[production] |
| 15:45 |
<mvernon@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1008.eqiad.wmnet |
[production] |
| 15:44 |
<samtar@deploy2002> |
mwscript-k8s job started: foreachwikiindblist group0 cleanupWatchlistLabelMember.php # T420328 |
[production] |
| 15:43 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti2033.codfw.wmnet |
[production] |
| 15:38 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2033.codfw.wmnet |
[production] |
| 15:38 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2048.codfw.wmnet |
[production] |
| 15:37 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2048.codfw.wmnet |
[production] |
| 15:36 |
<mvernon@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host thanos-be1008.eqiad.wmnet |
[production] |
| 15:36 |
<mvernon@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1007.eqiad.wmnet |
[production] |
| 15:34 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti2048.codfw.wmnet |
[production] |
| 15:34 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dse-k8s-worker1012.eqiad.wmnet with reason: host reimage |
[production] |
| 15:33 |
<samtar@deploy2002> |
mwscript-k8s job started: foreachwikiindblist testwikis cleanupWatchlistLabelMember.php # T420328 |
[production] |
| 15:32 |
<btullis@cumin1003> |
START - Cookbook sre.druid.reboot-workers for Druid analytics cluster: Reboot Druid nodes |
[production] |
| 15:28 |
<mvernon@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host thanos-be1007.eqiad.wmnet |
[production] |
| 15:28 |
<mvernon@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1006.eqiad.wmnet |
[production] |
| 15:27 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-worker1012.eqiad.wmnet with reason: host reimage |
[production] |
| 15:27 |
<samtar@deploy2002> |
mwscript-k8s job started: cleanupWatchlistLabelMember.php --wiki=testwiki # T420328 |
[production] |
| 15:27 |
<filippo@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudnet2008-dev.codfw.wmnet |
[production] |
| 15:25 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reimage for host dse-k8s-worker1015.eqiad.wmnet with OS bookworm |
[production] |
| 15:23 |
<jmm@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/thumbor: apply |
[production] |
| 15:22 |
<btullis@cumin1003> |
START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling restart_daemons on A:kafka-jumbo-eqiad |
[production] |
| 15:21 |
<jmm@deploy2002> |
helmfile [eqiad] START helmfile.d/services/thumbor: apply |
[production] |
| 15:20 |
<filippo@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host cloudnet2008-dev.codfw.wmnet |
[production] |
| 15:20 |
<mvernon@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host thanos-be1006.eqiad.wmnet |
[production] |
| 15:20 |
<mvernon@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1005.eqiad.wmnet |
[production] |
| 15:18 |
<jmm@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/thumbor: apply |
[production] |
| 15:18 |
<urbanecm@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1244723|cleanup: Growth: Remove temporary GrowthMentorList overrides (T418518)]] (duration: 06m 32s) |
[production] |
| 15:16 |
<jmm@deploy2002> |
helmfile [codfw] START helmfile.d/services/thumbor: apply |
[production] |
| 15:16 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 16509 |
[production] |
| 15:14 |
<filippo@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudservices2005-dev.codfw.wmnet |
[production] |
| 15:14 |
<urbanecm@deploy2002> |
urbanecm: Continuing with sync |
[production] |
| 15:13 |
<urbanecm@deploy2002> |
urbanecm: Backport for [[gerrit:1244723|cleanup: Growth: Remove temporary GrowthMentorList overrides (T418518)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 15:13 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reimage for host dse-k8s-worker1012.eqiad.wmnet with OS bookworm |
[production] |
| 15:12 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2048.codfw.wmnet |
[production] |
| 15:11 |
<urbanecm@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1244723|cleanup: Growth: Remove temporary GrowthMentorList overrides (T418518)]] |
[production] |
| 15:10 |
<brennen@deploy2002> |
Finished deploy [phabricator/deployment@e845707]: deploy phab1004 for T420366 (duration: 01m 02s) |
[production] |