2026-05-21
08:00 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1256.eqiad.wmnet with OS trixie [production]
07:52 <slyngshede@cumin1003> START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp600[3-4].drmrs.wmnet} and A:cp [production]
07:51 <marostegui@dns1004> END - running authdns-update [production]
07:50 <marostegui@dns1004> START - running authdns-update [production]
07:48 <marostegui> Failover m3-master T426633 [production]
07:47 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1023.eqiad.wmnet [production]
07:46 <slyngshede@cumin1003> END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on P{cp6010.drmrs.wmnet} and A:cp [production]
07:46 <slyngshede@cumin1003> cookbooks.sre.cdn.roll-reboot finished rebooting cp6010.drmrs.wmnet [production]
07:45 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of kubestagemaster1005.eqiad.wmnet to plain [production]
07:44 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of kubestagemaster1005.eqiad.wmnet to plain [production]
07:43 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1256.eqiad.wmnet with reason: host reimage [production]
07:42 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of kubestagemaster1005.eqiad.wmnet to drbd [production]
07:38 <cwilliams@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db1256.eqiad.wmnet with reason: host reimage [production]
07:35 <slyngshede@cumin1003> START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp6010.drmrs.wmnet} and A:cp [production]
07:35 <slyngshede@cumin1003> END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on P{cp6002.drmrs.wmnet} and A:cp [production]
07:35 <slyngshede@cumin1003> cookbooks.sre.cdn.roll-reboot finished rebooting cp6002.drmrs.wmnet [production]
07:27 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of kubestagemaster1005.eqiad.wmnet to drbd [production]
07:24 <slyngshede@cumin1003> START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp6002.drmrs.wmnet} and A:cp [production]
07:24 <cwilliams@cumin1003> START - Cookbook sre.hosts.reimage for host db1256.eqiad.wmnet with OS trixie [production]
07:22 <cwilliams@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1256: Upgrading db1256.eqiad.wmnet [production]
07:21 <cwilliams@cumin1003> START - Cookbook sre.mysql.depool depool db1256: Upgrading db1256.eqiad.wmnet [production]
07:21 <cwilliams@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
07:20 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of dse-k8s-etcd1001.eqiad.wmnet to plain [production]
07:18 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of dse-k8s-etcd1001.eqiad.wmnet to plain [production]
07:17 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on dbproxy1025.eqiad.wmnet with reason: Rebooting [production]
07:04 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of dse-k8s-etcd1001.eqiad.wmnet to drbd [production]
06:54 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of dse-k8s-etcd1001.eqiad.wmnet to drbd [production]
06:53 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of aux-k8s-etcd1003.eqiad.wmnet to plain [production]
06:52 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of aux-k8s-etcd1003.eqiad.wmnet to plain [production]
06:49 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of aux-k8s-etcd1003.eqiad.wmnet to drbd [production]
06:42 <arnaudb@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lists1004.wikimedia.org [production]
06:40 <arnaudb@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host gitlab1004.wikimedia.org [production]
06:39 <arnaudb@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host vrts1003.eqiad.wmnet [production]
06:34 <arnaudb@cumin1003> START - Cookbook sre.hosts.reboot-single for host gitlab1004.wikimedia.org [production]
06:34 <arnaudb@cumin1003> START - Cookbook sre.hosts.reboot-single for host lists1004.wikimedia.org [production]
06:33 <arnaudb@cumin1003> START - Cookbook sre.hosts.reboot-single for host vrts1003.eqiad.wmnet [production]
06:24 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of aux-k8s-etcd1003.eqiad.wmnet to drbd [production]
06:23 <arnaudb@cumin1003> END (FAIL) - Cookbook sre.gerrit.reboot-gerrit (exit_code=99) Rebooting Gerrit on gerrit2003 [production]
06:22 <arnaudb@cumin1003> START - Cookbook sre.gerrit.reboot-gerrit Rebooting Gerrit on gerrit2003 [production]
06:15 <marostegui@dns1004> END - running authdns-update [production]
06:14 <marostegui> Failover m2-master T426633 [production]
06:13 <marostegui@dns1004> START - running authdns-update [production]
05:38 <marostegui@cumin1003> dbctl commit (dc=all): 'Remove pc1012 from dbctl T426930', diff saved to https://phabricator.wikimedia.org/P92728 and previous config saved to /var/cache/conftool/dbconfig/20260521-053858-marostegui.json [production]
05:30 <marostegui@cumin1003> dbctl commit (dc=all): 'Repool pc2 T418973', diff saved to https://phabricator.wikimedia.org/P92727 and previous config saved to /var/cache/conftool/dbconfig/20260521-053000-marostegui.json [production]
05:29 <marostegui@cumin1003> dbctl commit (dc=all): 'Add pc1022 to pc2 master T418973', diff saved to https://phabricator.wikimedia.org/P92726 and previous config saved to /var/cache/conftool/dbconfig/20260521-052905-marostegui.json [production]
05:21 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on pc1012.eqiad.wmnet with reason: Cloning [production]
04:13 <eileen> civicrm upgraded from cae2ab0e to dbafc0b4 [fundraising]
02:41 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on planet1003.eqiad.wmnet with reason: debug wip [production]
02:11 <bking@cumin2002> END (ERROR) - Cookbook sre.elasticsearch.rolling-operation (exit_code=97) Operation.REBOOT (3 nodes at a time) for ElasticSearch cluster search_codfw: T426560 - bking@cumin2002 [production]
02:07 <mwpresync@deploy1003> Finished scap build-images: Publishing wmf/next image (duration: 06m 29s) [production]