1301-1350 of 10000 results (123ms)
2025-06-03 ยง
13:11 <dbrant@deploy1003> helmfile [staging] START helmfile.d/services/wikifeeds: apply [production]
13:11 <bwojtowicz@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
13:08 <bwojtowicz@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
13:04 <marostegui@cumin1002> dbctl commit (dc=all): 'es2038 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P76946 and previous config saved to /var/cache/conftool/dbconfig/20250603-130453-root.json [production]
13:04 <marostegui@cumin1002> dbctl commit (dc=all): 'es1039 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P76945 and previous config saved to /var/cache/conftool/dbconfig/20250603-130418-root.json [production]
13:04 <marostegui> Shutdown clouddb1016:x3 T390954 [production]
13:02 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on clouddb1016.eqiad.wmnet with reason: Setting up x3 T390954 [production]
12:58 <moritzm> uploaded wmf-laptop 1.0.2 to apt.wikimedia.org [production]
12:57 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2167', diff saved to https://phabricator.wikimedia.org/P76943 and previous config saved to /var/cache/conftool/dbconfig/20250603-125721-fceratto.json [production]
12:49 <marostegui@cumin1002> dbctl commit (dc=all): 'es2038 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P76942 and previous config saved to /var/cache/conftool/dbconfig/20250603-124948-root.json [production]
12:49 <marostegui@cumin1002> dbctl commit (dc=all): 'es1039 (re)pooling @ 20%: Repooling', diff saved to https://phabricator.wikimedia.org/P76941 and previous config saved to /var/cache/conftool/dbconfig/20250603-124913-root.json [production]
12:42 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2167 (T395241)', diff saved to https://phabricator.wikimedia.org/P76940 and previous config saved to /var/cache/conftool/dbconfig/20250603-124214-fceratto.json [production]
12:40 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2049.codfw.wmnet [production]
12:35 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host ganeti2049.codfw.wmnet [production]
12:34 <marostegui@cumin1002> dbctl commit (dc=all): 'es2038 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P76939 and previous config saved to /var/cache/conftool/dbconfig/20250603-123442-root.json [production]
12:34 <marostegui@cumin1002> dbctl commit (dc=all): 'es1039 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P76938 and previous config saved to /var/cache/conftool/dbconfig/20250603-123407-root.json [production]
12:33 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2167 (T395241)', diff saved to https://phabricator.wikimedia.org/P76937 and previous config saved to /var/cache/conftool/dbconfig/20250603-123357-fceratto.json [production]
12:33 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2167.codfw.wmnet with reason: Maintenance [production]
12:33 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2166 (T395241)', diff saved to https://phabricator.wikimedia.org/P76936 and previous config saved to /var/cache/conftool/dbconfig/20250603-123331-fceratto.json [production]
12:30 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1048.eqiad.wmnet with OS bullseye [production]
12:25 <marostegui@cumin1002> START - Cookbook sre.mysql.pool es2048 gradually with 4 steps - Pool es2048.codfw.wmnet in after cloning [production]
12:19 <marostegui@cumin1002> dbctl commit (dc=all): 'es2038 (re)pooling @ 20%: Repooling', diff saved to https://phabricator.wikimedia.org/P76935 and previous config saved to /var/cache/conftool/dbconfig/20250603-121937-root.json [production]
12:19 <marostegui@cumin1002> dbctl commit (dc=all): 'es1039 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P76934 and previous config saved to /var/cache/conftool/dbconfig/20250603-121902-root.json [production]
12:18 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2166', diff saved to https://phabricator.wikimedia.org/P76933 and previous config saved to /var/cache/conftool/dbconfig/20250603-121824-fceratto.json [production]
12:16 <marostegui@deploy1003> Finished scap sync-world: Backport for [[gerrit:1153135|Revert "db-production.php: Disable writes on es7"]] (duration: 09m 47s) [production]
12:15 <bwojtowicz@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revision-models' for release 'main' . [production]
12:12 <bwojtowicz@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revision-models' for release 'main' . [production]
12:10 <bwojtowicz@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'readability' for release 'main' . [production]
12:09 <marostegui@deploy1003> marostegui: Continuing with sync [production]
12:09 <marostegui@deploy1003> marostegui: Backport for [[gerrit:1153135|Revert "db-production.php: Disable writes on es7"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
12:08 <bwojtowicz@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'readability' for release 'main' . [production]
12:07 <claime> Launching manual run of recount-categories cronjob - T395745 [production]
12:06 <marostegui@deploy1003> Started scap sync-world: Backport for [[gerrit:1153135|Revert "db-production.php: Disable writes on es7"]] [production]
12:06 <cgoubert@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply [production]
12:05 <cgoubert@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-cron: apply [production]
12:04 <bwojtowicz@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'logo-detection' for release 'main' . [production]
12:04 <marostegui@cumin1002> dbctl commit (dc=all): 'es2038 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P76931 and previous config saved to /var/cache/conftool/dbconfig/20250603-120431-root.json [production]
12:03 <marostegui@cumin1002> dbctl commit (dc=all): 'es1039 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P76930 and previous config saved to /var/cache/conftool/dbconfig/20250603-120356-root.json [production]
12:03 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2166', diff saved to https://phabricator.wikimedia.org/P76929 and previous config saved to /var/cache/conftool/dbconfig/20250603-120316-fceratto.json [production]
12:02 <bwojtowicz@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'logo-detection' for release 'main' . [production]
12:00 <bwojtowicz@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
11:59 <bwojtowicz@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
11:56 <bwojtowicz@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
11:55 <bwojtowicz@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
11:54 <bwojtowicz@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . [production]
11:52 <bwojtowicz@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . [production]
11:50 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool es2038 T395785', diff saved to https://phabricator.wikimedia.org/P76927 and previous config saved to /var/cache/conftool/dbconfig/20250603-115026-marostegui.json [production]
11:49 <bwojtowicz@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'article-models' for release 'main' . [production]
11:49 <marostegui@cumin1002> dbctl commit (dc=all): 'Promote es2039 to es7 primary and set section read-write T395785', diff saved to https://phabricator.wikimedia.org/P76926 and previous config saved to /var/cache/conftool/dbconfig/20250603-114917-marostegui.json [production]
11:48 <marostegui> Starting es7 codfw failover from es2038 to es2039 - T395785 [production]