|
2025-11-20
ยง
|
| 22:46 |
<bking@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host relforge1009.eqiad.wmnet with OS trixie |
[production] |
| 22:46 |
<bking@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host relforge1010.eqiad.wmnet with OS trixie |
[production] |
| 22:45 |
<ryankemper@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REBOOT (2 nodes at a time) for ElasticSearch cluster search_codfw: codfw cluster reboot (apply updates) - ryankemper@cumin2002 - T390860 |
[production] |
| 22:43 |
<bking@cumin2002> |
START - Cookbook sre.hosts.reimage for host relforge1010.eqiad.wmnet with OS trixie |
[production] |
| 22:43 |
<sbassett@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1208021|ActionApi: Remove the xslt option (T401987 T401995)]] |
[production] |
| 22:43 |
<bking@cumin2002> |
START - Cookbook sre.hosts.reimage for host relforge1009.eqiad.wmnet with OS trixie |
[production] |
| 22:42 |
<bking@cumin2002> |
START - Cookbook sre.hosts.reimage for host relforge1008.eqiad.wmnet with OS trixie |
[production] |
| 22:39 |
<bking@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on relforge[1008-1010].eqiad.wmnet with reason: T410681 |
[production] |
| 22:24 |
<logmsgbot> |
mstyles Deployed security patch for T407157 |
[production] |
| 22:23 |
<James_F> |
Zuul: [mediawiki/extensions/WikimediaEditorTasks] Unmark as production, for T376954 |
[releng] |
| 22:00 |
<reedy@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1208003|Fix wgMediaViewerThumbnailBucketSizes on prod (T372165)]], [[gerrit:1208005|AccountRecovery: Allow temp users to access Special:AccountRecovery]] (duration: 12m 06s) |
[production] |
| 21:54 |
<reedy@deploy2002> |
bvibber, reedy: Continuing with sync |
[production] |
| 21:54 |
<reedy@deploy2002> |
bvibber, reedy: Backport for [[gerrit:1208003|Fix wgMediaViewerThumbnailBucketSizes on prod (T372165)]], [[gerrit:1208005|AccountRecovery: Allow temp users to access Special:AccountRecovery]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 21:48 |
<reedy@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1208003|Fix wgMediaViewerThumbnailBucketSizes on prod (T372165)]], [[gerrit:1208005|AccountRecovery: Allow temp users to access Special:AccountRecovery]] |
[production] |
| 21:42 |
<bvibber@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1207273|Fix wgMediaViewerThumbnailBucketSizes to match wgThumbnailSteps (T372165)]], [[gerrit:1207865|Undeploy the WikimediaEditorTasks extension (T376954)]] (duration: 39m 55s) |
[production] |
| 21:37 |
<tappof> |
/srv/thanos-store cleanup on titan2001 (start) |
[production] |
| 21:30 |
<bvibber@deploy2002> |
bvibber, jforrester: Continuing with sync |
[production] |
| 21:29 |
<bvibber@deploy2002> |
bvibber, jforrester: Backport for [[gerrit:1207273|Fix wgMediaViewerThumbnailBucketSizes to match wgThumbnailSteps (T372165)]], [[gerrit:1207865|Undeploy the WikimediaEditorTasks extension (T376954)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 21:03 |
<bvibber@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1207273|Fix wgMediaViewerThumbnailBucketSizes to match wgThumbnailSteps (T372165)]], [[gerrit:1207865|Undeploy the WikimediaEditorTasks extension (T376954)]] |
[production] |
| 20:48 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db1187 (T410589)', diff saved to https://phabricator.wikimedia.org/P85426 and previous config saved to /var/cache/conftool/dbconfig/20251120-204852-ladsgroup.json |
[production] |
| 20:48 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1187.eqiad.wmnet with reason: Maintenance |
[production] |
| 20:48 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1180 (T410589)', diff saved to https://phabricator.wikimedia.org/P85425 and previous config saved to /var/cache/conftool/dbconfig/20251120-204827-ladsgroup.json |
[production] |
| 20:33 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P85424 and previous config saved to /var/cache/conftool/dbconfig/20251120-203320-ladsgroup.json |
[production] |
| 20:24 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host hcaptcha-proxy6002.wikimedia.org with OS bookworm |
[production] |
| 20:18 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P85423 and previous config saved to /var/cache/conftool/dbconfig/20251120-201812-ladsgroup.json |
[production] |
| 20:15 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host hcaptcha-proxy7001.wikimedia.org with OS bookworm |
[production] |
| 20:12 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host hcaptcha-proxy7002.wikimedia.org with OS bookworm |
[production] |
| 20:11 |
<reedy@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1207963|AccountRecovery: Log more data for account recovery submissions]] (duration: 09m 42s) |
[production] |
| 20:07 |
<reedy@deploy2002> |
reedy: Continuing with sync |
[production] |
| 20:07 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host hcaptcha-proxy6001.wikimedia.org with OS bookworm |
[production] |
| 20:06 |
<reedy@deploy2002> |
reedy: Backport for [[gerrit:1207963|AccountRecovery: Log more data for account recovery submissions]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 20:04 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on hcaptcha-proxy6002.wikimedia.org with reason: host reimage |
[production] |
| 20:03 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1180 (T410589)', diff saved to https://phabricator.wikimedia.org/P85422 and previous config saved to /var/cache/conftool/dbconfig/20251120-200304-ladsgroup.json |
[production] |
| 20:01 |
<reedy@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1207963|AccountRecovery: Log more data for account recovery submissions]] |
[production] |
| 19:57 |
<swfrench@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1204948|De-configure cookie-based enrollment in PHP 8.3 (T405955)]] (duration: 10m 03s) |
[production] |
| 19:57 |
<dduvall> |
deploying buildkitd v0.26.2 upgrade (w/ worker resource limits) to gitlab-cloud-runners cluster (T410049) |
[releng] |
| 19:56 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on hcaptcha-proxy7001.wikimedia.org with reason: host reimage |
[production] |
| 19:55 |
<wfan> |
donorwiki upgraded from 6388fb1f to 40f6f252 |
[production] |
| 19:53 |
<swfrench@deploy2002> |
swfrench: Continuing with sync |
[production] |
| 19:52 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on hcaptcha-proxy7002.wikimedia.org with reason: host reimage |
[production] |
| 19:52 |
<swfrench@deploy2002> |
swfrench: Backport for [[gerrit:1204948|De-configure cookie-based enrollment in PHP 8.3 (T405955)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 19:48 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on hcaptcha-proxy6001.wikimedia.org with reason: host reimage |
[production] |
| 19:47 |
<swfrench@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1204948|De-configure cookie-based enrollment in PHP 8.3 (T405955)]] |
[production] |
| 19:44 |
<sukhe@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on hcaptcha-proxy6002.wikimedia.org with reason: host reimage |
[production] |
| 19:44 |
<sukhe@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on hcaptcha-proxy7001.wikimedia.org with reason: host reimage |
[production] |
| 19:43 |
<sukhe@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on hcaptcha-proxy7002.wikimedia.org with reason: host reimage |
[production] |
| 19:43 |
<sukhe@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on hcaptcha-proxy6001.wikimedia.org with reason: host reimage |
[production] |
| 19:37 |
<sukhe> |
sudo cumin 'A:lvs-eqiad or A:lvs-codfw' 'run-puppet-agent --enable "set druid-coordinator to state lvs_setup"' |
[production] |
| 19:35 |
<sukhe> |
sukhe@lvs1020:~$ sudo systemctl restart pybal.service |
[production] |
| 19:29 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host hcaptcha-proxy5002.wikimedia.org with OS bookworm |
[production] |