| 2025-11-20 |
| 08:36 |
<kharlan@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1207788|hCaptcha: Log the risk score for null edits differently (T410550)]] |
[production] |
| 08:17 |
<dcausse@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1207744|Revert "cirrus: start A/B test on completion with default_sort" (T404858)]] (duration: 12m 54s) |
[production] |
| 08:13 |
<moritzm> |
installing squid security updates |
[production] |
| 08:12 |
<dcausse@deploy2002> |
dcausse: Continuing with sync |
[production] |
| 08:09 |
<dcausse@deploy2002> |
dcausse: Backport for [[gerrit:1207744|Revert "cirrus: start A/B test on completion with default_sort" (T404858)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 08:04 |
<dcausse@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1207744|Revert "cirrus: start A/B test on completion with default_sort" (T404858)]] |
[production] |
| 07:21 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repool ms2 T410480', diff saved to https://phabricator.wikimedia.org/P85407 and previous config saved to /var/cache/conftool/dbconfig/20251120-072110-marostegui.json |
[production] |
| 06:00 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db1168 (T410589)', diff saved to https://phabricator.wikimedia.org/P85406 and previous config saved to /var/cache/conftool/dbconfig/20251120-060041-ladsgroup.json |
[production] |
| 06:00 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1168.eqiad.wmnet with reason: Maintenance |
[production] |
| 06:00 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1165 (T410589)', diff saved to https://phabricator.wikimedia.org/P85405 and previous config saved to /var/cache/conftool/dbconfig/20251120-060017-ladsgroup.json |
[production] |
| 05:45 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1165', diff saved to https://phabricator.wikimedia.org/P85404 and previous config saved to /var/cache/conftool/dbconfig/20251120-054509-ladsgroup.json |
[production] |
| 05:30 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1165', diff saved to https://phabricator.wikimedia.org/P85403 and previous config saved to /var/cache/conftool/dbconfig/20251120-053002-ladsgroup.json |
[production] |
| 05:14 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1165 (T410589)', diff saved to https://phabricator.wikimedia.org/P85402 and previous config saved to /var/cache/conftool/dbconfig/20251120-051454-ladsgroup.json |
[production] |
| 01:45 |
<ladsgroup@cumin1003> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db1185* gradually with 4 steps - Work done |
[production] |
| 01:45 |
<wfan> |
payments-wiki upgraded from d72930e6 to 36d362c6 |
[production] |
| 01:40 |
<ladsgroup@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/mw-experimental: apply |
[production] |
| 01:39 |
<ladsgroup@deploy2002> |
helmfile [eqiad] START helmfile.d/services/mw-experimental: apply |
[production] |
| 01:34 |
<wfan> |
civicrm upgraded from f471a3ec to e4748b9f |
[production] |
| 01:14 |
<mwpresync@deploy2002> |
Finished scap build-images: Publishing wmf/next image (duration: 13m 32s) |
[production] |
| 01:03 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db1165 (T410589)', diff saved to https://phabricator.wikimedia.org/P85397 and previous config saved to /var/cache/conftool/dbconfig/20251120-010322-ladsgroup.json |
[production] |
| 01:03 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1165.eqiad.wmnet with reason: Maintenance |
[production] |
| 01:00 |
<mwpresync@deploy2002> |
Started scap build-images: Publishing wmf/next image |
[production] |
| 00:59 |
<ladsgroup@cumin1003> |
START - Cookbook sre.mysql.pool db1185* gradually with 4 steps - Work done |
[production] |
| 00:59 |
<ladsgroup@cumin1003> |
DONE (ERROR) - Cookbook sre.hosts.downtime (exit_code=97) for 3 days, 0:00:00 on db1159.eqiad.wmnet with reason: Maintenance |
[production] |
| 00:41 |
<brett@dns1006> |
END - running authdns-update |
[production] |
| 00:40 |
<brett@dns1006> |
START - running authdns-update |
[production] |
| 2025-11-19 |
| 23:43 |
<wmftkbot> |
Test Kitchen mw-user experiment (poll 1) - adds: we-3-3-4-reading-list-test1, growthexperiments-get-started-notification, we-3-3-4-reading-list-test1-en; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD |
[analytics] |
| 23:43 |
<wmftkbot> |
Test Kitchen edge-unique experiments (poll 1) - adds: fy25-26-we-4-2-hcaptcha-editing, logged-out-retention-round2, image-browsing-enwiki, hcaptcha-on-french-wikipedia, fy2025-26-we3.1-image-browsing-ab-test; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD |
[analytics] |
| 22:49 |
<ryankemper@cumin2002> |
END (FAIL) - Cookbook sre.wdqs.reboot (exit_code=99) |
[production] |
| 22:48 |
<aaron@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1203191|Sandbox cleanup for the Wikimedia REST APIs (T409776 T402426)]] (duration: 13m 43s) |
[production] |
| 22:44 |
<aaron@deploy2002> |
aaron: Continuing with sync |
[production] |
| 22:43 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) (T410563, transfer main graph to lagged host) xfer wikidata_main from wdqs1015.eqiad.wmnet -> wdqs1011.eqiad.wmnet, repooling both afterwards |
[production] |
| 22:39 |
<aaron@deploy2002> |
aaron: Backport for [[gerrit:1203191|Sandbox cleanup for the Wikimedia REST APIs (T409776 T402426)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 22:34 |
<aaron@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1203191|Sandbox cleanup for the Wikimedia REST APIs (T409776 T402426)]] |
[production] |
| 22:28 |
<ryankemper@cumin2002> |
START - Cookbook sre.wdqs.reboot |
[production] |
| 22:22 |
<ryankemper@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REBOOT (2 nodes at a time) for ElasticSearch cluster search_codfw: codfw cluster reboot (apply updates) - ryankemper@cumin2002 - T390860 |
[production] |
| 22:16 |
<kemayo@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1207201|TextMatchEditCheck: undo duplicate sub-type logging (T407286)]], [[gerrit:1207245|Remove action_context from page_load events in ReadingList A/B test (T410535)]], [[gerrit:1207246|Remove action_context from page_load events in ReadingList A/B test (T410535)]], [[gerrit:1207241|README: remove outdated advice about dblists]] (duration: 10m 57s) |
[production] |
| 22:16 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.REBOOT (1 nodes at a time) for ElasticSearch cluster search_codfw: codfw cluster reboot (apply updates) - ryankemper@cumin2002 - T390860 |
[production] |
| 22:16 |
<ryankemper@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REBOOT (1 nodes at a time) for ElasticSearch cluster search_codfw: codfw cluster reboot (apply updates) - ryankemper@cumin2002 - T390860 |
[production] |
| 22:12 |
<kemayo@deploy2002> |
aude, kemayo, novemlinguae: Continuing with sync |
[production] |
| 22:10 |
<kemayo@deploy2002> |
aude, kemayo, novemlinguae: Backport for [[gerrit:1207201|TextMatchEditCheck: undo duplicate sub-type logging (T407286)]], [[gerrit:1207245|Remove action_context from page_load events in ReadingList A/B test (T410535)]], [[gerrit:1207246|Remove action_context from page_load events in ReadingList A/B test (T410535)]], [[gerrit:1207241|README: remove outdated advice about dblists]] synced to the tests |
[production] |
| 22:09 |
<bking@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/opensearch-test: apply |
[production] |
| 22:08 |
<bking@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/opensearch-test: apply |
[production] |
| 22:05 |
<kemayo@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1207201|TextMatchEditCheck: undo duplicate sub-type logging (T407286)]], [[gerrit:1207245|Remove action_context from page_load events in ReadingList A/B test (T410535)]], [[gerrit:1207246|Remove action_context from page_load events in ReadingList A/B test (T410535)]], [[gerrit:1207241|README: remove outdated advice about dblists]] |
[production] |
| 22:03 |
<bking@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/opensearch-test: apply |
[production] |
| 22:03 |
<bking@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/opensearch-test: apply |
[production] |
| 22:02 |
<bking@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/opensearch-ipoid: apply |
[production] |
| 22:02 |
<bking@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/opensearch-ipoid: apply |
[production] |
| 22:00 |
<bking@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/opensearch-ipoid: apply |
[production] |
| 22:00 |
<bking@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/opensearch-ipoid: apply |
[production] |