2025-12-02
16:43 <jhathaway@cumin1003> START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:43 <inflatador> bking@wmf3062 restart WDQS codfw to resolve lag/possible deadlocks [production]
16:40 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1040.eqiad.wmnet}' [admin]
16:39 <ihurbain@deploy2002> ihurbain: Continuing with sync [production]
16:39 <jhathaway@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:38 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/analytics-test: apply [production]
16:38 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/analytics-test: apply [production]
16:37 <ihurbain@deploy2002> ihurbain: Backport for [[gerrit:1214069|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]], [[gerrit:1214070|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
16:36 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2193 (T410589)', diff saved to https://phabricator.wikimedia.org/P86319 and previous config saved to /var/cache/conftool/dbconfig/20251202-163612-ladsgroup.json [production]
16:36 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db1251 gradually with 4 steps - Pool db1251.eqiad.wmnet in after cloning [production]
16:35 <ihurbain@deploy2002> Started scap sync-world: Backport for [[gerrit:1214069|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]], [[gerrit:1214070|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]] [production]
16:34 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm e6796dbd-2511-4bf6-bdee-4a14a7414d5f (cluster eqiad1) [admin]
16:33 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 5e5c3bad-f1c7-49e5-b846-edaf111af83c (cluster eqiad1) [admin]
16:33 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm e6796dbd-2511-4bf6-bdee-4a14a7414d5f (cluster eqiad1) [admin]
16:33 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 5e5c3bad-f1c7-49e5-b846-edaf111af83c (cluster eqiad1) [admin]
16:32 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 1d74fc9a-0ddd-41d6-a0fd-5bba5e455c32 (cluster eqiad1) [admin]
16:32 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 55ed5d49-43db-4f62-8c40-5cb0431dfce2 (cluster eqiad1) [admin]
16:31 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 1d74fc9a-0ddd-41d6-a0fd-5bba5e455c32 (cluster eqiad1) [admin]
16:31 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 55ed5d49-43db-4f62-8c40-5cb0431dfce2 (cluster eqiad1) [admin]
16:30 <jhathaway@cumin1003> START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:27 <brett> import varnish 7.1.1-2~bpo13+wmf2 into trixie-wikimedia - T401832 [production]
16:24 <jhathaway@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:23 <jhathaway@cumin1003> START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:20 <jhathaway@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:19 <jhathaway@cumin1003> START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:18 <swfrench-wmf> restarted navtiming on webperf1003 - T352245 [production]
16:14 <swfrench-wmf> begin rolling restarts of eqiad-associated confds - T352245 [production]
16:12 <moritzm> installing nodejs security updates [production]
16:12 <swfrench@deploy2002> Unlocked for deployment [MediaWiki]: Hold deployments during etcd certificate change - T352245 (duration: 03m 45s) [production]
16:12 <jhathaway@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:10 <jhathaway@cumin1003> START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:08 <swfrench@deploy2002> Locking from deployment [MediaWiki]: Hold deployments during etcd certificate change - T352245 [production]
16:08 <swfrench-wmf> migrating etcd to PKI certs on conf1008 - T352245 [production]
16:08 <jhathaway@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest1005.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
16:02 <moritzm> installing libsndfile security updates [production]
16:01 <jhathaway@cumin1003> START - Cookbook sre.hosts.provision for host sretest1005.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
16:00 <gehel> restarting wdqs@codfw - system overloaded [production]
15:58 <jhathaway@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on sretest1005.eqiad.wmnet with reason: ipxe [production]
15:50 <marostegui@cumin1003> START - Cookbook sre.mysql.pool db1251 gradually with 4 steps - Pool db1251.eqiad.wmnet in after cloning [production]
15:48 <moritzm> upgrade Envoy on Yarn T405808 [production]
15:48 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 8030caca-e1e8-4f1d-bce1-04afd22adb3a (cluster eqiad1) [admin]
15:48 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 8030caca-e1e8-4f1d-bce1-04afd22adb3a (cluster eqiad1) [admin]
15:48 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 9ccf684d-c6ea-45ee-83db-ee3af5de3dfe (cluster eqiad1) [admin]
15:47 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 9ccf684d-c6ea-45ee-83db-ee3af5de3dfe (cluster eqiad1) [admin]
15:47 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 02bf16d5-5e10-470b-b05d-341673a284de (cluster eqiad1) [admin]
15:47 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 02bf16d5-5e10-470b-b05d-341673a284de (cluster eqiad1) [admin]
15:47 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 106a0f58-3276-4754-93cd-a7ae20fddc75 (cluster eqiad1) [admin]
15:46 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 106a0f58-3276-4754-93cd-a7ae20fddc75 (cluster eqiad1) [admin]
15:46 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm d9e4c884-82f1-4c2e-8b35-70bfeb5292cf (cluster eqiad1) [admin]
15:46 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm d9e4c884-82f1-4c2e-8b35-70bfeb5292cf (cluster eqiad1) [admin]