2026-03-23
20:35 <sukhe@cumin1003> START - Cookbook sre.dns.netbox [production]
20:34 <dani@deploy2002> Started scap sync-world: Backport for [[gerrit:1254448|Undeploy participant recruitment survey on ptwiki (T419275)]], [[gerrit:1254450|Undeploy participant recruitment survey on trwiki (T419275)]], [[gerrit:1254452|Undeploy participant recruitment survey on frwiki (T419778)]], [[gerrit:1255763|testKitchen: Add custom stream name (T417050)]], [[gerrit:1259120|Enable wgCampaignEventsEnableEventGoals in [production]
20:31 <sukhe@cumin1003> START - Cookbook sre.hosts.decommission for hosts hcaptcha-proxy4001.wikimedia.org [production]
20:28 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1103.eqiad.wmnet with reason: host reimage [production]
20:24 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp1103.eqiad.wmnet with reason: host reimage [production]
20:23 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1115.eqiad.wmnet with OS trixie [production]
20:19 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1102.eqiad.wmnet with reason: host reimage [production]
20:17 <alexsanford@deploy2002> Finished scap sync-world: Backport for [[gerrit:1256472|Reduce reauth timeout for editing site JS to 10 minutes (T419605)]] (duration: 07m 32s) [production]
20:14 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp1102.eqiad.wmnet with reason: host reimage [production]
20:13 <alexsanford@deploy2002> alexsanford: Continuing with sync [production]
20:11 <alexsanford@deploy2002> alexsanford: Backport for [[gerrit:1256472|Reduce reauth timeout for editing site JS to 10 minutes (T419605)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:09 <alexsanford@deploy2002> Started scap sync-world: Backport for [[gerrit:1256472|Reduce reauth timeout for editing site JS to 10 minutes (T419605)]] [production]
20:08 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp1103.eqiad.wmnet with OS trixie [production]
20:07 <alexsanford> Deployed mitigation for T419605 [production]
19:58 <jforrester@deploy2002> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
19:58 <jforrester@deploy2002> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
19:58 <jforrester@deploy2002> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
19:58 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp1102.eqiad.wmnet with OS trixie [production]
19:57 <jforrester@deploy2002> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
19:54 <jforrester@deploy2002> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
19:54 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp1115.eqiad.wmnet with OS trixie [production]
19:54 <jforrester@deploy2002> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
19:51 <ayounsi@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host hcaptcha-proxy4004.wikimedia.org [production]
19:51 <cdobbins@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1102.eqiad.wmnet with OS trixie [production]
19:50 <cdobbins@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1103.eqiad.wmnet with OS trixie [production]
19:50 <ayounsi@cumin1003> START - Cookbook sre.hosts.reboot-single for host hcaptcha-proxy4004.wikimedia.org [production]
19:47 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1115.eqiad.wmnet with OS trixie [production]
19:47 <ayounsi@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host hcaptcha-proxy4003.wikimedia.org [production]
19:46 <ayounsi@cumin1003> START - Cookbook sre.hosts.reboot-single for host hcaptcha-proxy4003.wikimedia.org [production]
19:44 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp1115.eqiad.wmnet with OS trixie [production]
19:44 <eevans@cumin1003> END (PASS) - Cookbook sre.cassandra.roll-reboot (exit_code=0) rolling reboot on P{aqs[1011,1014,1016-1022]*} and P{P:Cassandra} [production]
19:42 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp1103.eqiad.wmnet with OS trixie [production]
19:42 <cdobbins@cumin2002> conftool action : set/pooled=no; selector: name=cp1103.eqiad.wmnet [reason: trixie reimaging] [production]
19:41 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp1101.eqiad.wmnet [reason: trixie reimaging] [production]
19:41 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp1102.eqiad.wmnet with OS trixie [production]
19:41 <cdobbins@cumin2002> conftool action : set/pooled=no; selector: name=cp1102.eqiad.wmnet [reason: trixie reimaging] [production]
19:40 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1101.eqiad.wmnet with OS trixie [production]
19:39 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp1100.eqiad.wmnet [reason: trixie reimaging] [production]
19:37 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1100.eqiad.wmnet with OS trixie [production]
19:30 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1115.eqiad.wmnet with OS trixie [production]
19:18 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1101.eqiad.wmnet with reason: host reimage [production]
19:14 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1100.eqiad.wmnet with reason: host reimage [production]
19:13 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp1101.eqiad.wmnet with reason: host reimage [production]
19:13 <akhatun@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-edit-type-enrich-next: apply [production]
19:13 <akhatun@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-edit-type-enrich-next: apply [production]
19:10 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp1100.eqiad.wmnet with reason: host reimage [production]
18:59 <inflatador> bking@deploy2002 restarting opensearch-semantic-search eqiad to renew certs [production]
18:57 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp1101.eqiad.wmnet with OS trixie [production]
18:55 <cdobbins@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp1101.eqiad.wmnet with OS trixie [production]
18:54 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp1100.eqiad.wmnet with OS trixie [production]