2025-03-18
19:08 <dr0ptp4kt@deploy2002> Started deploy [analytics/refinery@37a2ddf] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@37a2ddfc] [production]
19:07 <dr0ptp4kt@deploy2002> Finished deploy [analytics/refinery@37a2ddf] (thin): Regular analytics weekly train THIN [analytics/refinery@37a2ddfc] (duration: 00m 50s) [production]
19:07 <dr0ptp4kt@deploy2002> Started deploy [analytics/refinery@37a2ddf] (thin): Regular analytics weekly train THIN [analytics/refinery@37a2ddfc] [production]
19:06 <dr0ptp4kt@deploy2002> Finished deploy [analytics/refinery@37a2ddf]: Regular analytics weekly train [analytics/refinery@37a2ddfc] (duration: 02m 20s) [production]
19:04 <dr0ptp4kt@deploy2002> Started deploy [analytics/refinery@37a2ddf]: Regular analytics weekly train [analytics/refinery@37a2ddfc] [production]
19:03 <dr0ptp4kt> Deploying Refinery at 37a2ddf for c1126977 / T388654 tlwikisource to allowlist [production]
18:50 <brett@cumin2002> START - Cookbook sre.cdn.roll-upgrade-ats Rolling upgrade/restart of Apache Traffic Server on A:drmrs and A:cp for 9.2.9-1wm1 [production]
18:42 <brett@cumin2002> END (PASS) - Cookbook sre.cdn.roll-upgrade-ats (exit_code=0) Rolling upgrade/restart of Apache Traffic Server on P{cp70[02-16].magru.wmnet} and A:cp for 9.2.9-1wm1 [production]
18:35 <topranks> re-enabling cr1-drmrs external circuits after upgrade [production]
18:13 <topranks> reboot cr1-drmrs to update JunOS (router is drained of traffic) T364092 [production]
18:13 <cmooney@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 7 hosts with reason: Upgrade cr1-drmrs JunOS [production]
17:54 <root@deploy2002> helmfile [aux-k8s-codfw] DONE helmfile.d/aux-k8s-services/jaeger: apply [production]
17:53 <root@deploy2002> helmfile [aux-k8s-codfw] START helmfile.d/aux-k8s-services/jaeger: apply [production]
17:40 <root@deploy2002> helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
17:40 <root@deploy2002> helmfile [aux-k8s-codfw] START helmfile.d/admin 'apply'. [production]
17:31 <root@deploy2002> helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
17:27 <root@deploy2002> helmfile [aux-k8s-codfw] START helmfile.d/admin 'apply'. [production]
17:26 <topranks> resetting PIC 0 on cr1-drmrs (QSFP ports) to move link from port 1 to port 3 T389071 [production]
17:25 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-research: apply [production]
17:25 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-research: apply [production]
17:22 <root@deploy2002> helmfile [aux-k8s-codfw] DONE helmfile.d/aux-k8s-services/jaeger: apply [production]
17:20 <swfrench@deploy2002> Finished scap sync-world: Switch mw-wikifunctions to PHP 8.1 - T383845 (duration: 11m 46s) [production]
17:15 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti2048.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
17:12 <swfrench@deploy2002> Started scap sync-world: Switch mw-wikifunctions to PHP 8.1 - T383845 [production]
17:12 <root@deploy2002> helmfile [aux-k8s-codfw] START helmfile.d/aux-k8s-services/jaeger: apply [production]
17:11 <root@deploy2002> helmfile [aux-k8s-codfw] DONE helmfile.d/aux-k8s-services/jaeger: apply [production]
17:09 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host ganeti2048.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
17:05 <topranks> move traffic off cr1-drmrs to allow for PIC reset / port reconfiguration T389071 [production]
17:04 <root@deploy2002> helmfile [aux-k8s-codfw] START helmfile.d/aux-k8s-services/jaeger: apply [production]
17:00 <elukey@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host puppetserver2004.codfw.wmnet with OS bookworm [production]
16:51 <fabfur@cumin1002> conftool action : set/pooled=yes; selector: name=cp4038.ulsfo.wmnet [production]
16:39 <brett@cumin2002> START - Cookbook sre.cdn.roll-upgrade-ats Rolling upgrade/restart of Apache Traffic Server on P{cp70[02-16].magru.wmnet} and A:cp for 9.2.9-1wm1 [production]
16:38 <elukey> restart pybal on lvs1020 and lvs1019 to pick up kartotherian svc changes [production]
16:38 <fabfur> repooling cp4038 (T388147) [production]
16:38 <Ammar> T389226 Ran mwscript-k8s --comment="T389226" -f -- extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=mediawikiwiki --logwiki=metawiki 'Schwarze Feder' 'AndreasKemper' [production]
16:37 <Ammar> T389226 Ran mwscript-k8s --comment="T389226" -f -- extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=bnwikivoyage --logwiki=metawiki 'Arafatuniofdhaka' 'আরাফাত হোসেন ভূঁইয়া' [production]
16:37 <fabfur> enabled puppet on A:cp (T388147) [production]
16:35 <brett@cumin2002> END (FAIL) - Cookbook sre.cdn.roll-upgrade-ats (exit_code=1) Rolling upgrade/restart of Apache Traffic Server on A:magru and A:cp for 9.2.9-1wm1 [production]
16:34 <brett@cumin2002> START - Cookbook sre.cdn.roll-upgrade-ats Rolling upgrade/restart of Apache Traffic Server on A:magru and A:cp for 9.2.9-1wm1 [production]
16:33 <elukey> restart pybal on lvs2013 (kartotherian's svc change) [production]
16:33 <root@deploy2002> helmfile [aux-k8s-codfw] DONE helmfile.d/aux-k8s-services/jaeger: apply [production]
16:33 <root@deploy2002> helmfile [aux-k8s-codfw] START helmfile.d/aux-k8s-services/jaeger: apply [production]
16:29 <elukey> restart pybal on lvs2014 [production]
16:29 <fabfur@cumin1002> conftool action : set/pooled=no; selector: name=cp4038.ulsfo.wmnet [production]
16:28 <fabfur> enabled puppet and depooled cp4038 [production]
16:27 <elukey> disable puppet on lvs low traffic hosts in eqiad/codfw to restart pybal (kartotherian svc change) [production]
16:25 <fabfur> disabled puppet on A:cp for T388147 [production]
16:24 <root@deploy2002> helmfile [aux-k8s-codfw] DONE helmfile.d/aux-k8s-services/jaeger: apply [production]
16:21 <root@deploy2002> helmfile [aux-k8s-codfw] START helmfile.d/aux-k8s-services/jaeger: apply [production]
16:17 <elukey> removed kartotherian-related confd error files from config-master2001 - related to a maintenance issue [production]