2026-06-17 §
11:23 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-wdqs2002.codfw.wmnet with OS bookworm [production]
11:23 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1163.eqiad.wmnet with reason: host reimage [production]
11:22 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1172.eqiad.wmnet with reason: upgrading [production]
11:22 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.dhcp (exit_code=0) for host dse-k8s-wdqs2001.codfw.wmnet [production]
11:21 <marostegui@cumin1003> DONE (ERROR) - Cookbook sre.hosts.downtime (exit_code=97) for 1:00:00 on db1171.eqiad.wmnet with reason: upgrading [production]
11:19 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1190.eqiad.wmnet with reason: upgrading [production]
11:18 <cwilliams@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db1163.eqiad.wmnet with reason: host reimage [production]
11:18 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
11:17 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool es2044: repool after maintenance es2044 [production]
11:17 <marostegui@cumin1003> END (FAIL) - Cookbook sre.mysql.major-upgrade (exit_code=99) [production]
11:16 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es2044.codfw.wmnet with OS trixie [production]
11:12 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dse-k8s-wdqs1003.eqiad.wmnet with OS bookworm [production]
11:12 <btullis@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1003" [production]
11:11 <mvolz@deploy1003> helmfile [eqiad] DONE helmfile.d/services/citoid: apply [production]
11:11 <mvolz@deploy1003> helmfile [eqiad] START helmfile.d/services/citoid: apply [production]
11:10 <gkyziridis@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
11:09 <mvolz@deploy1003> helmfile [codfw] DONE helmfile.d/services/citoid: apply [production]
11:09 <mvolz@deploy1003> helmfile [codfw] START helmfile.d/services/citoid: apply [production]
11:08 <btullis@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1003" [production]
11:08 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.major-upgrade (exit_code=0) [production]
11:08 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool es1038: Migration of es1038.eqiad.wmnet completed [production]
11:04 <cwilliams@cumin1003> START - Cookbook sre.hosts.reimage for host db1163.eqiad.wmnet with OS trixie [production]
11:02 <mvolz@deploy1003> helmfile [staging] DONE helmfile.d/services/citoid: apply [production]
11:02 <mvolz@deploy1003> helmfile [staging] START helmfile.d/services/citoid: apply [production]
11:01 <moritzm> The Debian mirror on mirrors.wikimedia.org has been disabled T416707 [production]
11:00 <cwilliams@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1163: Upgrading db1163.eqiad.wmnet [production]
10:59 <btullis@cumin1003> START - Cookbook sre.hosts.dhcp for host dse-k8s-wdqs2001.codfw.wmnet [production]
10:59 <cwilliams@cumin1003> START - Cookbook sre.mysql.depool depool db1163: Upgrading db1163.eqiad.wmnet [production]
10:59 <cwilliams@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
10:59 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es2044.codfw.wmnet with reason: host reimage [production]
10:53 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on es2044.codfw.wmnet with reason: host reimage [production]
10:50 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dse-k8s-wdqs1003.eqiad.wmnet with reason: host reimage [production]
10:48 <cwilliams@cumin1003> END (PASS) - Cookbook sre.mysql.major-upgrade (exit_code=0) [production]
10:48 <cwilliams@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db2203: Migration of db2203.codfw.wmnet completed [production]
10:43 <btullis@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-wdqs1003.eqiad.wmnet with reason: host reimage [production]
10:38 <btullis@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dse-k8s-wdqs2002.codfw.wmnet with OS bookworm [production]
10:37 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host es2044.codfw.wmnet with OS trixie [production]
10:36 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool es2044: Upgrading es2044.codfw.wmnet [production]
10:35 <marostegui@cumin1003> START - Cookbook sre.mysql.depool depool es2044: Upgrading es2044.codfw.wmnet [production]
10:35 <cgoubert@deploy1003> helmfile [eqiad] DONE helmfile.d/services/ratelimit: apply [production]
10:35 <marostegui@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
10:35 <cgoubert@deploy1003> helmfile [eqiad] START helmfile.d/services/ratelimit: apply [production]
10:35 <cgoubert@deploy1003> helmfile [codfw] DONE helmfile.d/services/ratelimit: apply [production]
10:34 <cgoubert@deploy1003> helmfile [codfw] START helmfile.d/services/ratelimit: apply [production]
10:34 <cgoubert@deploy1003> helmfile [staging] DONE helmfile.d/services/ratelimit: apply [production]
10:34 <cgoubert@deploy1003> helmfile [staging] START helmfile.d/services/ratelimit: apply [production]
10:31 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-wdqs1003.eqiad.wmnet with OS bookworm [production]
10:29 <moritzm> installing git-lfs security updates [production]
10:28 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-wdqs2002.codfw.wmnet with OS bookworm [production]
10:28 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dse-k8s-wdqs1002.eqiad.wmnet with OS bookworm [production]