|
2025-12-02
ยง
|
| 16:43 |
<jhathaway@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:43 |
<inflatador> |
bking@wmf3062 restart WDQS codfw to resolve lag/possible deadlocks |
[production] |
| 16:40 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1040.eqiad.wmnet}' |
[admin] |
| 16:39 |
<ihurbain@deploy2002> |
ihurbain: Continuing with sync |
[production] |
| 16:39 |
<jhathaway@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:38 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/analytics-test: apply |
[production] |
| 16:38 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/analytics-test: apply |
[production] |
| 16:37 |
<ihurbain@deploy2002> |
ihurbain: Backport for [[gerrit:1214069|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]], [[gerrit:1214070|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 16:36 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2193 (T410589)', diff saved to https://phabricator.wikimedia.org/P86319 and previous config saved to /var/cache/conftool/dbconfig/20251202-163612-ladsgroup.json |
[production] |
| 16:36 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db1251 gradually with 4 steps - Pool db1251.eqiad.wmnet in after cloning |
[production] |
| 16:35 |
<ihurbain@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1214069|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]], [[gerrit:1214070|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]] |
[production] |
| 16:34 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm e6796dbd-2511-4bf6-bdee-4a14a7414d5f (cluster eqiad1) |
[admin] |
| 16:33 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 5e5c3bad-f1c7-49e5-b846-edaf111af83c (cluster eqiad1) |
[admin] |
| 16:33 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.vps.instance.stop_start vm e6796dbd-2511-4bf6-bdee-4a14a7414d5f (cluster eqiad1) |
[admin] |
| 16:33 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.vps.instance.stop_start vm 5e5c3bad-f1c7-49e5-b846-edaf111af83c (cluster eqiad1) |
[admin] |
| 16:32 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 1d74fc9a-0ddd-41d6-a0fd-5bba5e455c32 (cluster eqiad1) |
[admin] |
| 16:32 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 55ed5d49-43db-4f62-8c40-5cb0431dfce2 (cluster eqiad1) |
[admin] |
| 16:31 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.vps.instance.stop_start vm 1d74fc9a-0ddd-41d6-a0fd-5bba5e455c32 (cluster eqiad1) |
[admin] |
| 16:31 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.vps.instance.stop_start vm 55ed5d49-43db-4f62-8c40-5cb0431dfce2 (cluster eqiad1) |
[admin] |
| 16:30 |
<jhathaway@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:27 |
<brett> |
import varnish 7.1.1-2~bpo13+wmf2 into trixie-wikimedia - T401832 |
[production] |
| 16:24 |
<jhathaway@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:23 |
<jhathaway@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:20 |
<jhathaway@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:19 |
<jhathaway@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:18 |
<swfrench-wmf> |
restarted navtiming on webperf1003 - T352245 |
[production] |
| 16:14 |
<swfrench-wmf> |
begin rolling restarts of eqiad-associated confds - T352245 |
[production] |
| 16:12 |
<moritzm> |
installing nodejs security updates |
[production] |
| 16:12 |
<swfrench@deploy2002> |
Unlocked for deployment [MediaWiki]: Hold deployments during etcd certificate change - T352245 (duration: 03m 45s) |
[production] |
| 16:12 |
<jhathaway@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:10 |
<jhathaway@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:08 |
<swfrench@deploy2002> |
Locking from deployment [MediaWiki]: Hold deployments during etcd certificate change - T352245 |
[production] |
| 16:08 |
<swfrench-wmf> |
migrating etcd to PKI certs on conf1008 - T352245 |
[production] |
| 16:08 |
<jhathaway@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest1005.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 16:02 |
<moritzm> |
installing libsndfile security updates |
[production] |
| 16:01 |
<jhathaway@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest1005.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 16:00 |
<gehel> |
restarting wdqs@codfw - system overloaded |
[production] |
| 15:58 |
<jhathaway@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on sretest1005.eqiad.wmnet with reason: ipxe |
[production] |
| 15:50 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.pool db1251 gradually with 4 steps - Pool db1251.eqiad.wmnet in after cloning |
[production] |
| 15:48 |
<moritzm> |
upgrade Envoy on Yarn T405808 |
[production] |
| 15:48 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 8030caca-e1e8-4f1d-bce1-04afd22adb3a (cluster eqiad1) |
[admin] |
| 15:48 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.vps.instance.stop_start vm 8030caca-e1e8-4f1d-bce1-04afd22adb3a (cluster eqiad1) |
[admin] |
| 15:48 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 9ccf684d-c6ea-45ee-83db-ee3af5de3dfe (cluster eqiad1) |
[admin] |
| 15:47 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.vps.instance.stop_start vm 9ccf684d-c6ea-45ee-83db-ee3af5de3dfe (cluster eqiad1) |
[admin] |
| 15:47 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 02bf16d5-5e10-470b-b05d-341673a284de (cluster eqiad1) |
[admin] |
| 15:47 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.vps.instance.stop_start vm 02bf16d5-5e10-470b-b05d-341673a284de (cluster eqiad1) |
[admin] |
| 15:47 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 106a0f58-3276-4754-93cd-a7ae20fddc75 (cluster eqiad1) |
[admin] |
| 15:46 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.vps.instance.stop_start vm 106a0f58-3276-4754-93cd-a7ae20fddc75 (cluster eqiad1) |
[admin] |
| 15:46 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm d9e4c884-82f1-4c2e-8b35-70bfeb5292cf (cluster eqiad1) |
[admin] |
| 15:46 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.vps.instance.stop_start vm d9e4c884-82f1-4c2e-8b35-70bfeb5292cf (cluster eqiad1) |
[admin] |