3651-3700 of 10000 results (116ms)
2025-03-03 ยง
13:01 <cgoubert@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
12:58 <cgoubert@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
12:58 <cgoubert@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
12:56 <cgoubert@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
12:55 <cgoubert@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
12:55 <cgoubert@deploy2002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
12:53 <cgoubert@deploy2002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
12:51 <marostegui@cumin1002> dbctl commit (dc=all): 'db1220 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P73951 and previous config saved to /var/cache/conftool/dbconfig/20250303-125156-root.json [production]
12:36 <marostegui@cumin1002> dbctl commit (dc=all): 'db1220 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P73950 and previous config saved to /var/cache/conftool/dbconfig/20250303-123651-root.json [production]
12:33 <marostegui@cumin1002> END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db1220.eqiad.wmnet [production]
12:29 <marostegui@cumin1002> START - Cookbook sre.mysql.upgrade for db1220.eqiad.wmnet [production]
12:26 <marostegui@cumin1002> dbctl commit (dc=all): 'db1233 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P73949 and previous config saved to /var/cache/conftool/dbconfig/20250303-122609-root.json [production]
12:24 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db1220 T387557', diff saved to https://phabricator.wikimedia.org/P73948 and previous config saved to /var/cache/conftool/dbconfig/20250303-122437-marostegui.json [production]
12:23 <marostegui@cumin1002> dbctl commit (dc=all): 'Promote db1237 to x1 primary T387557', diff saved to https://phabricator.wikimedia.org/P73947 and previous config saved to /var/cache/conftool/dbconfig/20250303-122304-root.json [production]
12:22 <marostegui> Starting x1 eqiad failover from db1220 to db1237 - T387557 [production]
12:17 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on (A:lvs-low-traffic-eqiad or A:lvs-secondary-eqiad) and A:bullseye and A:lvs [production]
12:17 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.migrate-service-ipip (exit_code=0) for role: restbase::production@eqiad [production]
12:16 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 13 hosts with reason: Primary switchover x1 T387557 [production]
12:16 <marostegui@cumin1002> dbctl commit (dc=all): 'Set db1237 with weight 0 T387557', diff saved to https://phabricator.wikimedia.org/P73946 and previous config saved to /var/cache/conftool/dbconfig/20250303-121623-root.json [production]
12:16 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-eqiad or A:lvs-secondary-eqiad) and A:bullseye and A:lvs [production]
12:11 <marostegui@cumin1002> dbctl commit (dc=all): 'db1233 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P73945 and previous config saved to /var/cache/conftool/dbconfig/20250303-121104-root.json [production]
12:09 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.migrate-service-ipip for role: restbase::production@eqiad [production]
12:08 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.migrate-service-ipip (exit_code=0) for role: wmcs::openstack::eqiad1::cloudweb@eqiad [production]
12:08 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on (A:lvs-low-traffic-eqiad or A:lvs-secondary-eqiad) and A:bullseye and A:lvs [production]
12:07 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-eqiad or A:lvs-secondary-eqiad) and A:bullseye and A:lvs [production]
12:03 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.migrate-service-ipip for role: wmcs::openstack::eqiad1::cloudweb@eqiad [production]
12:02 <jgiannelos@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
12:01 <jgiannelos@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
12:01 <jgiannelos@deploy2002> helmfile [codfw] DONE helmfile.d/services/mobileapps: apply [production]
12:00 <jgiannelos@deploy2002> helmfile [codfw] START helmfile.d/services/mobileapps: apply [production]
12:00 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on (A:lvs-low-traffic-codfw or A:lvs-secondary-codfw) and A:bullseye and A:lvs [production]
12:00 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.migrate-service-ipip (exit_code=0) for role: restbase::production@codfw [production]
12:00 <jgiannelos@deploy2002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
11:59 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-codfw or A:lvs-secondary-codfw) and A:bullseye and A:lvs [production]
11:59 <jayme> Imported helmfile 0.171.0-2 and helm-diff 3.10.0-1 to bullseye-wikimedia and bookworm-wikimedia - T341984 T387376 [production]
11:58 <jgiannelos@deploy2002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
11:57 <elukey@deploy2002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
11:56 <elukey@deploy2002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
11:56 <marostegui@cumin1002> dbctl commit (dc=all): 'db1233 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P73944 and previous config saved to /var/cache/conftool/dbconfig/20250303-115559-root.json [production]
11:52 <kevinbazira@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'article-models' for release 'main' . [production]
11:52 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.migrate-service-ipip for role: restbase::production@codfw [production]
11:49 <jayme@deploy1003> helmfile [staging] DONE helmfile.d/services/ipoid: apply [production]
11:49 <root@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2206.codfw.wmnet with reason: Index rebuild [production]
11:48 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1249.eqiad.wmnet with reason: Index rebuild [production]
11:48 <jayme@deploy1003> helmfile [staging] START helmfile.d/services/ipoid: apply [production]
11:48 <fceratto@cumin1002> START - Cookbook sre.mysql.clone of db2166.codfw.wmnet onto db2167.codfw.wmnet [production]
11:45 <marostegui@cumin1002> dbctl commit (dc=all): 'db1190 (re)pooling @ 100%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73943 and previous config saved to /var/cache/conftool/dbconfig/20250303-114500-root.json [production]
11:43 <fceratto@cumin1002> END (FAIL) - Cookbook sre.mysql.clone (exit_code=99) of db2166.codfw.wmnet onto db2167.codfw.wmnet [production]
11:42 <fceratto@cumin1002> START - Cookbook sre.mysql.clone of db2166.codfw.wmnet onto db2167.codfw.wmnet [production]
11:42 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.migrate-service-ipip (exit_code=0) for role: mediawiki::jobrunner@eqiad [production]