201-250 of 10000 results (114ms)
2026-03-17 ยง
16:14 <brett@cumin2002> cookbooks.sre.cdn.roll-reboot finished rebooting cp7005.magru.wmnet [production]
16:10 <mvernon@cumin2002> START - Cookbook sre.hosts.reboot-single for host ms-be2062.codfw.wmnet [production]
16:08 <dzahn@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on zuul2003.codfw.wmnet with reason: host reimage [production]
16:07 <mvernon@cumin1003> START - Cookbook sre.hosts.reboot-single for host ms-be1064.eqiad.wmnet [production]
16:05 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp7014.magru.wmnet with OS trixie [production]
16:05 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp7014.magru.wmnet with OS trixie [production]
16:03 <brett@cumin2002> START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp7013.magru.wmnet,cp701[5-6].magru.wmnet} and A:cp [production]
16:03 <brett@cumin2002> START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp700[5-8].magru.wmnet} and A:cp [production]
15:54 <mvernon@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1009.eqiad.wmnet [production]
15:54 <mutante> zuul2003 - reimaging with trixie [production]
15:52 <samtar@deploy2002> mwscript-k8s job started: foreachwikiindblist group1 cleanupWatchlistLabelMember.php # T420328 [production]
15:51 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2033.codfw.wmnet [production]
15:51 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2033.codfw.wmnet [production]
15:46 <dzahn@cumin2002> START - Cookbook sre.hosts.reimage for host zuul2003.codfw.wmnet with OS trixie [production]
15:45 <mvernon@cumin1003> START - Cookbook sre.hosts.reboot-single for host thanos-be1009.eqiad.wmnet [production]
15:45 <mvernon@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1008.eqiad.wmnet [production]
15:44 <samtar@deploy2002> mwscript-k8s job started: foreachwikiindblist group0 cleanupWatchlistLabelMember.php # T420328 [production]
15:43 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2033.codfw.wmnet [production]
15:38 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2033.codfw.wmnet [production]
15:38 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2048.codfw.wmnet [production]
15:37 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2048.codfw.wmnet [production]
15:36 <mvernon@cumin1003> START - Cookbook sre.hosts.reboot-single for host thanos-be1008.eqiad.wmnet [production]
15:36 <mvernon@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1007.eqiad.wmnet [production]
15:34 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2048.codfw.wmnet [production]
15:34 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dse-k8s-worker1012.eqiad.wmnet with reason: host reimage [production]
15:33 <samtar@deploy2002> mwscript-k8s job started: foreachwikiindblist testwikis cleanupWatchlistLabelMember.php # T420328 [production]
15:32 <btullis@cumin1003> START - Cookbook sre.druid.reboot-workers for Druid analytics cluster: Reboot Druid nodes [production]
15:28 <mvernon@cumin1003> START - Cookbook sre.hosts.reboot-single for host thanos-be1007.eqiad.wmnet [production]
15:28 <mvernon@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1006.eqiad.wmnet [production]
15:27 <btullis@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-worker1012.eqiad.wmnet with reason: host reimage [production]
15:27 <samtar@deploy2002> mwscript-k8s job started: cleanupWatchlistLabelMember.php --wiki=testwiki # T420328 [production]
15:27 <filippo@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudnet2008-dev.codfw.wmnet [production]
15:25 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1015.eqiad.wmnet with OS bookworm [production]
15:23 <jmm@deploy2002> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
15:22 <btullis@cumin1003> START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling restart_daemons on A:kafka-jumbo-eqiad [production]
15:21 <jmm@deploy2002> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]
15:20 <filippo@cumin1003> START - Cookbook sre.hosts.reboot-single for host cloudnet2008-dev.codfw.wmnet [production]
15:20 <mvernon@cumin1003> START - Cookbook sre.hosts.reboot-single for host thanos-be1006.eqiad.wmnet [production]
15:20 <mvernon@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1005.eqiad.wmnet [production]
15:18 <jmm@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
15:18 <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1244723|cleanup: Growth: Remove temporary GrowthMentorList overrides (T418518)]] (duration: 06m 32s) [production]
15:16 <jmm@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: apply [production]
15:16 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 16509 [production]
15:14 <filippo@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudservices2005-dev.codfw.wmnet [production]
15:14 <urbanecm@deploy2002> urbanecm: Continuing with sync [production]
15:13 <urbanecm@deploy2002> urbanecm: Backport for [[gerrit:1244723|cleanup: Growth: Remove temporary GrowthMentorList overrides (T418518)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
15:13 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1012.eqiad.wmnet with OS bookworm [production]
15:12 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2048.codfw.wmnet [production]
15:11 <urbanecm@deploy2002> Started scap sync-world: Backport for [[gerrit:1244723|cleanup: Growth: Remove temporary GrowthMentorList overrides (T418518)]] [production]
15:10 <brennen@deploy2002> Finished deploy [phabricator/deployment@e845707]: deploy phab1004 for T420366 (duration: 01m 02s) [production]