351-400 of 10000 results (109ms)
2025-04-15 ยง
10:38 <sukhe> sudo cumin 'A:durum' 'disable-puppet "rolling out CR 1132669"' [production]
10:37 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on A:cp-upload_codfw [production]
10:34 <ladsgroup@deploy1003> Started scap sync-world: Backport for [[gerrit:1136670|Bump thumbnail steps to 95% (T360589)]] [production]
10:33 <ladsgroup@deploy1003> sync-world aborted: Backport for [[gerrit:1136670|Bump thumbnail steps to 95% (T360589)]] (duration: 14m 11s) [production]
10:27 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P75008 and previous config saved to /var/cache/conftool/dbconfig/20250415-102705-fceratto.json [production]
10:26 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on P{cp[5023-5024].eqsin.wmnet} and A:cp [production]
10:24 <vgutierrez@cumin1002> END (FAIL) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=1) rolling upgrade of Varnish on A:cp-text_eqsin [production]
10:19 <ladsgroup@deploy1003> Started scap sync-world: Backport for [[gerrit:1136670|Bump thumbnail steps to 95% (T360589)]] [production]
10:11 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P75007 and previous config saved to /var/cache/conftool/dbconfig/20250415-101158-fceratto.json [production]
10:00 <dcausse@deploy1003> Finished deploy [wdqs/wdqs@fe88851] (wcqs): version 0.3.156 (duration: 02m 25s) [production]
09:58 <dcausse@deploy1003> Started deploy [wdqs/wdqs@fe88851] (wcqs): version 0.3.156 [production]
09:57 <dcausse@deploy1003> Finished deploy [wdqs/wdqs@fe88851]: version 0.3.156 (T326311) (duration: 14m 31s) [production]
09:56 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1158 (T391056)', diff saved to https://phabricator.wikimedia.org/P75006 and previous config saved to /var/cache/conftool/dbconfig/20250415-095650-fceratto.json [production]
09:54 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1158 (T391056)', diff saved to https://phabricator.wikimedia.org/P75005 and previous config saved to /var/cache/conftool/dbconfig/20250415-095442-fceratto.json [production]
09:54 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1014,1018].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance [production]
09:54 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1158.eqiad.wmnet with reason: Maintenance [production]
09:43 <dcausse@deploy1003> Started deploy [wdqs/wdqs@fe88851]: version 0.3.156 (T326311) [production]
09:15 <jnuche@deploy1003> sync-world aborted: testwikis to 1.44.0-wmf.25 refs T386220 (duration: 14m 36s) [production]
09:00 <jnuche@deploy1003> Started scap sync-world: testwikis to 1.44.0-wmf.25 refs T386220 [production]
08:51 <dcausse@deploy1003> Finished deploy [wdqs/wdqs@4186ae7] (wcqs): test deploy new scap config to wcqs2001.codfw.wmnet (T221709) (duration: 00m 20s) [production]
08:51 <dcausse@deploy1003> Started deploy [wdqs/wdqs@4186ae7] (wcqs): test deploy new scap config to wcqs2001.codfw.wmnet (T221709) [production]
08:42 <XioNoX> drain arelion eqsin-codfw link [production]
08:09 <dcausse@deploy1003> Finished deploy [wdqs/wdqs@4186ae7]: test deploy new scap config to wdqs2025.codfw.wmnet (T221709) (duration: 00m 18s) [production]
08:09 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
08:09 <dcausse@deploy1003> Started deploy [wdqs/wdqs@4186ae7]: test deploy new scap config to wdqs2025.codfw.wmnet (T221709) [production]
08:08 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
07:47 <godog> upgrade thanos to 0.38.0 on prometheus100[57] - T383966 [production]
07:28 <Emperor> make sure all disks are mounted correctly prior to disk-swap testing T391854 ms-be1091 [production]
07:28 <Emperor> make sure all disks are mounted correctly prior to disk-swap testing T391854 [production]
07:10 <elukey@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on ms-be1091.eqiad.wmnet with reason: dcops maintenance [production]
07:06 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-upload_codfw [production]
07:06 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-text_codfw [production]
07:06 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-upload_eqsin [production]
07:05 <kartik@deploy1003> helmfile [staging] DONE helmfile.d/services/machinetranslation: apply [production]
07:05 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-text_eqsin [production]
07:04 <vgutierrez> rolling upgrade to varnish 7.1.1-1.1~bpo11+wmf3 in eqsin and codfw - T391334 [production]
06:50 <kartik@deploy1003> helmfile [staging] START helmfile.d/services/machinetranslation: apply [production]
06:48 <kart_> Updated cxserver to 2025-04-07-053106-production (T390732, T390711) [production]
06:48 <kartik@deploy1003> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
06:47 <kartik@deploy1003> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
06:46 <kartik@deploy1003> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
06:45 <kartik@deploy1003> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]
06:45 <kartik@deploy1003> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
06:44 <kartik@deploy1003> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
05:03 <marostegui@cumin1002> dbctl commit (dc=all): 'Repool pc6 T391454', diff saved to https://phabricator.wikimedia.org/P75003 and previous config saved to /var/cache/conftool/dbconfig/20250415-050307-marostegui.json [production]
04:57 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on pc2016.codfw.wmnet,pc1016.eqiad.wmnet with reason: Maintenance [production]
04:57 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool pc6 T391454', diff saved to https://phabricator.wikimedia.org/P75002 and previous config saved to /var/cache/conftool/dbconfig/20250415-045700-marostegui.json [production]
04:10 <mwpresync@deploy1003> Pruned MediaWiki: 1.44.0-wmf.22 (duration: 10m 03s) [production]
03:43 <mwpresync@deploy1003> sync-world failed: <CalledProcessError> Command 'sudo -u mwbuilder /srv/mwbuilder/release/make-container-image/build-images.py /srv/mediawiki-staging/scap/image-build --staging-dir /srv/mediawiki-staging --mediawiki-versions 1.44.0-wmf.24,1.44.0-wmf.25 --multiversion-image-name docker-registry.discovery.wmnet/restricted/mediawiki-multiversion --multiversion-debug-image-name docker-registry.discov [production]
03:02 <mwpresync@deploy1003> Started scap sync-world: testwikis to 1.44.0-wmf.25 refs T386220 [production]