451-500 of 10000 results (121ms)
2026-03-19 §
08:43 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of install4003.wikimedia.org to plain [production]
08:42 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of install4003.wikimedia.org to plain [production]
08:40 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti4007.ulsfo.wmnet [production]
08:39 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti4007.ulsfo.wmnet [production]
08:38 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host doh4003.wikimedia.org [production]
08:38 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) doh4003.wikimedia.org on all recursors [production]
08:38 <jmm@cumin2002> START - Cookbook sre.dns.wipe-cache doh4003.wikimedia.org on all recursors [production]
08:38 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:38 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove records for VM doh4003.wikimedia.org - jmm@cumin2002" [production]
08:38 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove records for VM doh4003.wikimedia.org - jmm@cumin2002" [production]
08:35 <jmm@cumin2002> START - Cookbook sre.dns.netbox [production]
08:34 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) doh4003.wikimedia.org on all recursors [production]
08:34 <jmm@cumin2002> START - Cookbook sre.dns.wipe-cache doh4003.wikimedia.org on all recursors [production]
08:34 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:34 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM doh4003.wikimedia.org - jmm@cumin2002" [production]
08:34 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM doh4003.wikimedia.org - jmm@cumin2002" [production]
08:31 <moritzm> installing python-apt security updates [production]
08:29 <jmm@cumin2002> START - Cookbook sre.dns.netbox [production]
08:29 <jmm@cumin2002> START - Cookbook sre.ganeti.makevm for new host doh4003.wikimedia.org [production]
08:26 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-sre: apply [production]
08:25 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-sre: apply [production]
08:21 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
08:21 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
08:14 <moritzm> installing imagemagick security updates on Bullseye [production]
08:13 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1048.eqiad.wmnet [production]
08:13 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1048.eqiad.wmnet [production]
08:12 <aklapper@deploy2002> rebuilt and synchronized wikiversions files: group2 to 1.46.0-wmf.20 refs T413811 [production]
08:08 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1048.eqiad.wmnet [production]
07:36 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1048.eqiad.wmnet [production]
07:17 <aokoth@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
07:17 <aokoth@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
07:16 <aokoth@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
07:16 <aokoth@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
07:14 <aokoth@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
07:14 <aokoth@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
04:53 <kevinbazira@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
00:06 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kafkamon2003.codfw.wmnet [production]
00:02 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host kafkamon2003.codfw.wmnet [production]
00:01 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kafkamon1003.eqiad.wmnet [production]
2026-03-18 §
23:58 <mutante> releases2003 - kill 782 (stunnel4) - systemctl start stunnel4 - fix T420246 T420388 T420411 [production]
23:57 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host kafkamon1003.eqiad.wmnet [production]
23:49 <eevans@cumin1003> END (PASS) - Cookbook sre.cassandra.roll-reboot (exit_code=0) rolling reboot on A:cassandra-dev [production]
23:23 <eevans@cumin1003> START - Cookbook sre.cassandra.roll-reboot rolling reboot on A:cassandra-dev [production]
23:08 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp5017.* [production]
23:02 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp5020.* [production]
23:01 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp5028.* [production]
22:40 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5028.eqsin.wmnet with OS trixie [production]
22:16 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5020.eqsin.wmnet with OS trixie [production]
22:08 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5028.eqsin.wmnet with reason: host reimage [production]
22:04 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5028.eqsin.wmnet with reason: host reimage [production]