2024-11-05
12:35 <fnegri@cumin1002> START - Cookbook sre.wikireplicas.update-views [production]
12:34 <fnegri@cumin1002> END (FAIL) - Cookbook sre.wikireplicas.update-views (exit_code=93) [production]
12:34 <fnegri@cumin1002> START - Cookbook sre.wikireplicas.update-views [production]
12:33 <urbanecm> eswiki,x1: `delete from growthexperiments_link_recommendations where gelr_page=10598298;` (to verify updates are flowing in; T378983) [production]
12:33 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1013.eqiad.wmnet [production]
12:33 <urbanecm> mwmaint2002: kill all instances of refreshLinkRecommendation (T378983) [production]
12:32 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1013.eqiad.wmnet [production]
12:28 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1013.eqiad.wmnet [production]
12:23 <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1087407|CirrusSearch: Disable updating weighted tags via EventBus (T378983 T377150)]] (duration: 07m 39s) [production]
12:18 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db1125.eqiad.wmnet with reason: testing [production]
12:18 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db1125.eqiad.wmnet with reason: testing [production]
12:18 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db2230.codfw.wmnet with reason: testing [production]
12:17 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db2230.codfw.wmnet with reason: testing [production]
12:16 <urbanecm@deploy2002> Started scap sync-world: Backport for [[gerrit:1087407|CirrusSearch: Disable updating weighted tags via EventBus (T378983 T377150)]] [production]
12:10 <jnuche@deploy2002> Finished scap sync-world: testwikis to 1.44.0-wmf.2 refs T375661 (duration: 07m 43s) [production]
12:04 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1040.eqiad.wmnet to cluster eqiad and group B [production]
12:02 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti1040.eqiad.wmnet to cluster eqiad and group B [production]
12:02 <jnuche@deploy2002> Started scap sync-world: testwikis to 1.44.0-wmf.2 refs T375661 [production]
12:01 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1040.eqiad.wmnet [production]
11:57 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1040.eqiad.wmnet [production]
11:53 <jmm@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ganeti1042 [production]
11:53 <jnuche@deploy2002> rebuilt and synchronized wikiversions files: group0 to 1.44.0-wmf.2 refs T375661 [production]
11:53 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance es1029 (T376905)', diff saved to https://phabricator.wikimedia.org/P70922 and previous config saved to /var/cache/conftool/dbconfig/20241105-115301-ladsgroup.json [production]
11:52 <jmm@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host ganeti1042 [production]
11:49 <jmm@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ganeti1041 [production]
11:47 <jmm@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host ganeti1041 [production]
11:47 <jmm@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ganeti1040 [production]
11:46 <jmm@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host ganeti1040 [production]
11:38 <jnuche@deploy2002> Finished scap sync-world: testwikis to 1.44.0-wmf.2 refs T375661 (duration: 36m 28s) [production]
11:37 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance es1029', diff saved to https://phabricator.wikimedia.org/P70921 and previous config saved to /var/cache/conftool/dbconfig/20241105-113754-ladsgroup.json [production]
11:22 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance es1029', diff saved to https://phabricator.wikimedia.org/P70920 and previous config saved to /var/cache/conftool/dbconfig/20241105-112246-ladsgroup.json [production]
11:07 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance es1029 (T376905)', diff saved to https://phabricator.wikimedia.org/P70919 and previous config saved to /var/cache/conftool/dbconfig/20241105-110739-ladsgroup.json [production]
11:02 <jnuche@deploy2002> Started scap sync-world: testwikis to 1.44.0-wmf.2 refs T375661 [production]
11:01 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling es1029 (T376905)', diff saved to https://phabricator.wikimedia.org/P70918 and previous config saved to /var/cache/conftool/dbconfig/20241105-110139-ladsgroup.json [production]
11:01 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1029.eqiad.wmnet with reason: Maintenance [production]
11:01 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es1029.eqiad.wmnet with reason: Maintenance [production]
11:01 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance es1032 (T376905)', diff saved to https://phabricator.wikimedia.org/P70917 and previous config saved to /var/cache/conftool/dbconfig/20241105-110115-ladsgroup.json [production]
10:46 <jnuche@deploy2002> Installing scap version "4.121.0" for 209 hosts [production]
10:46 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance es1032', diff saved to https://phabricator.wikimedia.org/P70916 and previous config saved to /var/cache/conftool/dbconfig/20241105-104608-ladsgroup.json [production]
10:44 <jnuche@deploy2002> install-world aborted: (no justification provided) (duration: 03m 09s) [production]
10:41 <jnuche@deploy2002> Installing scap version "4.121.0" for 209 hosts [production]
10:41 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
10:40 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
10:31 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance es1032', diff saved to https://phabricator.wikimedia.org/P70915 and previous config saved to /var/cache/conftool/dbconfig/20241105-103101-ladsgroup.json [production]
10:15 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance es1032 (T376905)', diff saved to https://phabricator.wikimedia.org/P70914 and previous config saved to /var/cache/conftool/dbconfig/20241105-101553-ladsgroup.json [production]
10:11 <elukey> set proxy timeouts of docker registry's nginx instances from 300s to 180s - T378618 [production]
10:09 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling es1032 (T376905)', diff saved to https://phabricator.wikimedia.org/P70913 and previous config saved to /var/cache/conftool/dbconfig/20241105-100953-ladsgroup.json [production]
10:09 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1032.eqiad.wmnet with reason: Maintenance [production]
10:09 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es1032.eqiad.wmnet with reason: Maintenance [production]
10:07 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs1013.eqiad.wmnet with OS bookworm [production]