2026-04-01
15:22 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T421714, prepare newly-reimaged host) xfer wikidata from wcqs1001.eqiad.wmnet -> wcqs1003.eqiad.wmnet, repooling both afterwards [production]
15:20 <bking@cumin2002> START - Cookbook sre.wdqs.data-transfer (T421714, prepare newly-reimaged host) xfer wikidata from wcqs1001.eqiad.wmnet -> wcqs1003.eqiad.wmnet, repooling both afterwards [production]
15:13 <jforrester@deploy1003> Finished scap sync-world: Backport for [[gerrit:1266290|Wikifunctions: Switch cache from mcrouter-wikifunctions to special access (T411807)]] (duration: 12m 53s) [production]
15:12 <lucaswerkmeister-wmde@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-experimental: apply [production]
15:11 <lucaswerkmeister-wmde@deploy1003> helmfile [codfw] START helmfile.d/services/mw-experimental: apply [production]
15:11 <lucaswerkmeister-wmde@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-experimental: apply [production]
15:10 <lucaswerkmeister-wmde@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-experimental: apply [production]
15:09 <jforrester@deploy1003> jforrester: Continuing with sync [production]
15:02 <jforrester@deploy1003> jforrester: Backport for [[gerrit:1266290|Wikifunctions: Switch cache from mcrouter-wikifunctions to special access (T411807)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
15:01 <jforrester@deploy1003> Started scap sync-world: Backport for [[gerrit:1266290|Wikifunctions: Switch cache from mcrouter-wikifunctions to special access (T411807)]] [production]
15:00 <btullis@cumin1003> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host db1208.eqiad.wmnet [production]
14:59 <taavi@dns1004> END - running authdns-update [production]
14:59 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wcqs1003.eqiad.wmnet with OS bullseye [production]
14:57 <taavi@dns1004> START - running authdns-update [production]
14:56 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker2005.codfw.wmnet [production]
14:55 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
14:55 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
14:53 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance [production]
14:52 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1259 (T419635)', diff saved to https://phabricator.wikimedia.org/P90188 and previous config saved to /var/cache/conftool/dbconfig/20260401-145256-fceratto.json [production]
14:52 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 34968 [production]
14:52 <ayounsi@cumin1003> START - Cookbook sre.network.peering with action 'email' for AS: 34968 [production]
14:50 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker2005.codfw.wmnet [production]
14:50 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker2004.codfw.wmnet [production]
14:48 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host db1208.eqiad.wmnet [production]
14:47 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker2004.codfw.wmnet [production]
14:45 <fabfur@cumin1003> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_ulsfo - 3.2 upgrade (T421402) [production]
14:44 <fabfur@cumin1003> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_ulsfo - 3.2 upgrade (T421402) [production]
14:44 <fabfur> upgrading ulsfo to haproxy 3.2 (T421402) [production]
14:42 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1259', diff saved to https://phabricator.wikimedia.org/P90187 and previous config saved to /var/cache/conftool/dbconfig/20260401-144247-fceratto.json [production]
14:32 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1259', diff saved to https://phabricator.wikimedia.org/P90186 and previous config saved to /var/cache/conftool/dbconfig/20260401-143239-fceratto.json [production]
14:28 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wcqs1003.eqiad.wmnet with reason: host reimage [production]
14:28 <jforrester@deploy1003> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
14:28 <jforrester@deploy1003> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
14:27 <jforrester@deploy1003> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
14:26 <jforrester@deploy1003> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
14:26 <jforrester@deploy1003> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
14:25 <jforrester@deploy1003> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
14:22 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1259 (T419635)', diff saved to https://phabricator.wikimedia.org/P90184 and previous config saved to /var/cache/conftool/dbconfig/20260401-142231-fceratto.json [production]
14:22 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on wcqs1003.eqiad.wmnet with reason: host reimage [production]
14:21 <jforrester@deploy1003> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
14:21 <jforrester@deploy1003> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
14:20 <jforrester@deploy1003> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
14:19 <jforrester@deploy1003> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
14:19 <jforrester@deploy1003> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
14:18 <jforrester@deploy1003> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
14:16 <jforrester@deploy1003> Finished scap sync-world: Backport for [[gerrit:1266190|MemcachedWrapper: Hash key when longer than 250 characters]], [[gerrit:1266219|Extend queue processing times for abstract fragments (T421581)]] (duration: 08m 14s) [production]
14:13 <brouberol@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts an-worker1148.eqiad.wmnet [production]
14:13 <brouberol@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1148.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin1003" [production]
14:13 <brouberol@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:13 <brouberol@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1148.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin1003" [production]