6001-6050 of 10000 results (43ms)
2025-03-18 ยง
21:43 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host ganeti2048.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
21:34 <toyofuku> web deploy window done [production]
21:28 <toyofuku@deploy2002> Finished scap sync-world: Backport for [[gerrit:1128877|Disable donation LINK on Catalan Wikipedia (T387768)]] (duration: 12m 35s) [production]
21:21 <toyofuku@deploy2002> toyofuku, jdlrobson: Continuing with sync [production]
21:21 <toyofuku@deploy2002> toyofuku, jdlrobson: Backport for [[gerrit:1128877|Disable donation LINK on Catalan Wikipedia (T387768)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:16 <toyofuku@deploy2002> Started scap sync-world: Backport for [[gerrit:1128877|Disable donation LINK on Catalan Wikipedia (T387768)]] [production]
21:10 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp4038.ulsfo.wmnet [production]
21:02 <brett@cumin2002> END (PASS) - Cookbook sre.cdn.roll-upgrade-ats (exit_code=0) Rolling upgrade/restart of Apache Traffic Server on A:drmrs and A:cp for 9.2.9-1wm1 [production]
20:55 <tgr_> late UTC deploys done [production]
20:52 <brett@puppetserver1001> conftool action : set/pooled=no; selector: name=cp4038.ulsfo.wmnet [production]
20:52 <brett> Upgrading cp4038 to Varnish 7 (T378737) [production]
20:52 <tgr@deploy2002> Finished scap sync-world: Backport for [[gerrit:1127954|Enable SUL3 logins on group 0 (T384153)]] (duration: 24m 51s) [production]
20:45 <tgr@deploy2002> tgr: Continuing with sync [production]
20:34 <tgr@deploy2002> tgr: Backport for [[gerrit:1127954|Enable SUL3 logins on group 0 (T384153)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:27 <tgr@deploy2002> Started scap sync-world: Backport for [[gerrit:1127954|Enable SUL3 logins on group 0 (T384153)]] [production]
20:25 <tgr@deploy2002> Finished scap sync-world: Backport for [[gerrit:1128922|Edit check: set up the multi-check a/b test (T384372)]], [[gerrit:1127945|Enable VisualEditor EditCheck multi-check a/b test on test2wiki (T384372)]], [[gerrit:1128777|Growth: enable new way of refreshing LinkRecommendations for pilots (T386250)]] (duration: 16m 36s) [production]
20:25 <bd808> Rebooting deployment-jobrunner05 because things just seem weird (T387631, T387276) [releng]
20:18 <tgr@deploy2002> migr, kemayo, tgr: Continuing with sync [production]
20:16 <tgr@deploy2002> migr, kemayo, tgr: Backport for [[gerrit:1128922|Edit check: set up the multi-check a/b test (T384372)]], [[gerrit:1127945|Enable VisualEditor EditCheck multi-check a/b test on test2wiki (T384372)]], [[gerrit:1128777|Growth: enable new way of refreshing LinkRecommendations for pilots (T386250)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:15 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ganeti2048.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
20:14 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host ganeti2048.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
20:08 <tgr@deploy2002> Started scap sync-world: Backport for [[gerrit:1128922|Edit check: set up the multi-check a/b test (T384372)]], [[gerrit:1127945|Enable VisualEditor EditCheck multi-check a/b test on test2wiki (T384372)]], [[gerrit:1128777|Growth: enable new way of refreshing LinkRecommendations for pilots (T386250)]] [production]
20:06 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp4037.ulsfo.wmnet [production]
19:58 <brett@puppetserver1001> conftool action : set/pooled=no; selector: name=cp4037.ulsfo.wmnet [production]
19:54 <brett> Upgrading remaining ulsfo cache nodes to Varnish 7 (T378737) [production]
19:53 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ganeti2048.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:53 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host ganeti2048.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:51 <dr0ptp4kt> Deployed refinery using scap, then deployed onto hdfs (concludes Deploying Refinery at 37a2ddf for c1126977 / T388654 tlwikisource to allowlist) [production]
19:50 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti2047.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:44 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host ganeti2047.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:43 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ganeti2047.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:43 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host ganeti2047.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:42 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ganeti2047.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:41 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host ganeti2047.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:22 <root@deploy2002> helmfile [aux-k8s-codfw] DONE helmfile.d/aux-k8s-services/jaeger: apply [production]
19:21 <root@deploy2002> helmfile [aux-k8s-codfw] START helmfile.d/aux-k8s-services/jaeger: apply [production]
19:18 <dr0ptp4kt> Deployed refinery using scap, then deployed onto hdfs (concludes Deploying Refinery at 37a2ddf for c1126977 / T388654 tlwikisource to allowlist) [analytics]
19:08 <dr0ptp4kt@deploy2002> Finished deploy [analytics/refinery@37a2ddf] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@37a2ddfc] (duration: 00m 38s) [production]
19:08 <dr0ptp4kt@deploy2002> Started deploy [analytics/refinery@37a2ddf] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@37a2ddfc] [production]
19:07 <dr0ptp4kt@deploy2002> Finished deploy [analytics/refinery@37a2ddf] (thin): Regular analytics weekly train THIN [analytics/refinery@37a2ddfc] (duration: 00m 50s) [production]
19:07 <dr0ptp4kt@deploy2002> Started deploy [analytics/refinery@37a2ddf] (thin): Regular analytics weekly train THIN [analytics/refinery@37a2ddfc] [production]
19:06 <dr0ptp4kt@deploy2002> Finished deploy [analytics/refinery@37a2ddf]: Regular analytics weekly train [analytics/refinery@37a2ddfc] (duration: 02m 20s) [production]
19:04 <dr0ptp4kt@deploy2002> Started deploy [analytics/refinery@37a2ddf]: Regular analytics weekly train [analytics/refinery@37a2ddfc] [production]
19:03 <dr0ptp4kt> Deploying Refinery at 37a2ddf for c1126977 / T388654 tlwikisource to allowlist [production]
19:03 <dr0ptp4kt> Deploying Refinery at 37a2ddf for c1126977 / T388654 tlwikisource to allowlist [analytics]
18:50 <brett@cumin2002> START - Cookbook sre.cdn.roll-upgrade-ats Rolling upgrade/restart of Apache Traffic Server on A:drmrs and A:cp for 9.2.9-1wm1 [production]
18:42 <brett@cumin2002> END (PASS) - Cookbook sre.cdn.roll-upgrade-ats (exit_code=0) Rolling upgrade/restart of Apache Traffic Server on P{cp70[02-16].magru.wmnet} and A:cp for 9.2.9-1wm1 [production]
18:35 <topranks> re-enabling cr1-drmrs external circuits after upgrade [production]
18:13 <topranks> reboot cr1-drmrs to update JunOS (router is drained of traffic) T364092 [production]
18:13 <cmooney@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 7 hosts with reason: Upgrade cr1-drmrs JunOS [production]