2026-04-21
13:09 <aude@deploy1003> Started scap sync-world: Backport for [[gerrit:1275562|Opt-in new accounts to the ReadingLists beta feature on enwiki (T420881)]] [production]
13:09 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host tcp-proxy5003.eqsin.wmnet [production]
13:09 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host tcp-proxy5003.eqsin.wmnet with OS trixie [production]
13:08 <kevinbazira@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
13:06 <jayme@cumin1003> START - Cookbook sre.hosts.decommission for hosts wikikube-worker[1029,1089,1092,1098-1099,1106,1112].eqiad.wmnet [production]
13:03 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host thanos-be1008.eqiad.wmnet with OS bullseye [production]
13:03 <jmm@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts puppetserver1002.eqiad.wmnet [production]
13:03 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host puppetserver1002.eqiad.wmnet [production]
12:56 <jiji@deploy1003> Unlocked for deployment [ALL REPOSITORIES]: Upgrading mw-mcrouter - effie (duration: 33m 37s) [production]
12:54 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host puppetserver1002.eqiad.wmnet [production]
12:53 <moritzm> update firmware on puppetserver1002: NIC from 22.31.6 to 23.21.6 T423282 [production]
12:52 <jmm@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts puppetserver1002.eqiad.wmnet [production]
12:49 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on tcp-proxy5003.eqsin.wmnet with reason: host reimage [production]
12:49 <jiji@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-mcrouter: apply [production]
12:47 <jiji@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-mcrouter: apply [production]
12:45 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on tcp-proxy5003.eqsin.wmnet with reason: host reimage [production]
12:45 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on thanos-be1008.eqiad.wmnet with reason: host reimage [production]
12:44 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host phab2003.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
12:43 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host phab2003.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
12:41 <jiji@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-mcrouter: apply [production]
12:40 <jmm@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts puppetserver1002.eqiad.wmnet [production]
12:40 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host puppetserver1002.eqiad.wmnet [production]
12:39 <jiji@deploy1003> helmfile [codfw] START helmfile.d/services/mw-mcrouter: apply [production]
12:38 <mvernon@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on thanos-be1008.eqiad.wmnet with reason: host reimage [production]
12:29 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host puppetserver1002.eqiad.wmnet [production]
12:29 <moritzm> update firmware on puppetserver1002: BIOS from 1.9.2 to 1.20.2 T423282 [production]
12:28 <moritzm> update firmware on puppetserver1002: idrac from 6.10.30.20 to 7.20.80.50 T423282 [production]
12:23 <jiji@deploy1003> Locking from deployment [ALL REPOSITORIES]: Upgrading mw-mcrouter - effie [production]
12:22 <kevinbazira@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
12:15 <jmm@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts puppetserver1002.eqiad.wmnet [production]
12:15 <mvernon@cumin2002> START - Cookbook sre.hosts.reimage for host thanos-be1008.eqiad.wmnet with OS bullseye [production]
12:06 <gkyziridis@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
12:02 <marostegui@cumin1003> dbctl commit (dc=all): 'Pool back pc1 but with pc2021 replacing pc2011', diff saved to https://phabricator.wikimedia.org/P91287 and previous config saved to /var/cache/conftool/dbconfig/20260421-120206-marostegui.json [production]
11:58 <jiji@deploy1003> Unlocked for deployment [ALL REPOSITORIES]: Upgrading mw-mcrouter - effie (duration: 68m 02s) [production]
11:57 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool pc1011: Pool pc2021 into pc [production]
11:57 <marostegui@cumin1003> END (FAIL) - Cookbook sre.mysql.parsercache (exit_code=99) [production]
11:57 <marostegui@cumin1003> START - Cookbook sre.mysql.parsercache [production]
11:57 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool pc1011: Pool pc2021 into pc [production]
11:57 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool pc2021: Pool pc2021 into pc [production]
11:57 <marostegui@cumin1003> END (FAIL) - Cookbook sre.mysql.parsercache (exit_code=99) [production]
11:57 <marostegui@cumin1003> START - Cookbook sre.mysql.parsercache [production]
11:57 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool pc2021: Pool pc2021 into pc [production]
11:56 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool pc2021: Pool pc2021 into pc [production]
11:56 <marostegui@cumin1003> END (FAIL) - Cookbook sre.mysql.parsercache (exit_code=99) [production]
11:56 <marostegui@cumin1003> START - Cookbook sre.mysql.parsercache [production]
11:56 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool pc2021: Pool pc2021 into pc [production]
11:53 <gkyziridis@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
11:52 <marostegui@cumin1003> dbctl commit (dc=all): 'add pc2021 to pc1', diff saved to https://phabricator.wikimedia.org/P91286 and previous config saved to /var/cache/conftool/dbconfig/20260421-115209-marostegui.json [production]
11:50 <moritzm> installing Tornado security updates [production]
11:48 <kevinbazira@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]