|
2026-04-01
ยง
|
| 15:22 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T421714, prepare newly-reimaged host) xfer wikidata from wcqs1001.eqiad.wmnet -> wcqs1003.eqiad.wmnet, repooling both afterwards |
[production] |
| 15:20 |
<bking@cumin2002> |
START - Cookbook sre.wdqs.data-transfer (T421714, prepare newly-reimaged host) xfer wikidata from wcqs1001.eqiad.wmnet -> wcqs1003.eqiad.wmnet, repooling both afterwards |
[production] |
| 15:13 |
<jforrester@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1266290|Wikifunctions: Switch cache from mcrouter-wikifunctions to special access (T411807)]] (duration: 12m 53s) |
[production] |
| 15:12 |
<lucaswerkmeister-wmde@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/mw-experimental: apply |
[production] |
| 15:11 |
<lucaswerkmeister-wmde@deploy1003> |
helmfile [codfw] START helmfile.d/services/mw-experimental: apply |
[production] |
| 15:11 |
<lucaswerkmeister-wmde@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-experimental: apply |
[production] |
| 15:10 |
<lucaswerkmeister-wmde@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-experimental: apply |
[production] |
| 15:09 |
<jforrester@deploy1003> |
jforrester: Continuing with sync |
[production] |
| 15:02 |
<jforrester@deploy1003> |
jforrester: Backport for [[gerrit:1266290|Wikifunctions: Switch cache from mcrouter-wikifunctions to special access (T411807)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 15:01 |
<jforrester@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1266290|Wikifunctions: Switch cache from mcrouter-wikifunctions to special access (T411807)]] |
[production] |
| 15:00 |
<btullis@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host db1208.eqiad.wmnet |
[production] |
| 14:59 |
<taavi@dns1004> |
END - running authdns-update |
[production] |
| 14:59 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wcqs1003.eqiad.wmnet with OS bullseye |
[production] |
| 14:57 |
<taavi@dns1004> |
START - running authdns-update |
[production] |
| 14:56 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker2005.codfw.wmnet |
[production] |
| 14:55 |
<javiermonton@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply |
[production] |
| 14:55 |
<javiermonton@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply |
[production] |
| 14:53 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance |
[production] |
| 14:52 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1259 (T419635)', diff saved to https://phabricator.wikimedia.org/P90188 and previous config saved to /var/cache/conftool/dbconfig/20260401-145256-fceratto.json |
[production] |
| 14:52 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 34968 |
[production] |
| 14:52 |
<ayounsi@cumin1003> |
START - Cookbook sre.network.peering with action 'email' for AS: 34968 |
[production] |
| 14:50 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker2005.codfw.wmnet |
[production] |
| 14:50 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker2004.codfw.wmnet |
[production] |
| 14:48 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host db1208.eqiad.wmnet |
[production] |
| 14:47 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker2004.codfw.wmnet |
[production] |
| 14:45 |
<fabfur@cumin1003> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_ulsfo - 3.2 upgrade (T421402) |
[production] |
| 14:44 |
<fabfur@cumin1003> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_ulsfo - 3.2 upgrade (T421402) |
[production] |
| 14:44 |
<fabfur> |
upgrading ulsfo to haproxy 3.2 (T421402) |
[production] |
| 14:42 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1259', diff saved to https://phabricator.wikimedia.org/P90187 and previous config saved to /var/cache/conftool/dbconfig/20260401-144247-fceratto.json |
[production] |
| 14:32 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1259', diff saved to https://phabricator.wikimedia.org/P90186 and previous config saved to /var/cache/conftool/dbconfig/20260401-143239-fceratto.json |
[production] |
| 14:28 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wcqs1003.eqiad.wmnet with reason: host reimage |
[production] |
| 14:28 |
<jforrester@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply |
[production] |
| 14:28 |
<jforrester@deploy1003> |
helmfile [eqiad] START helmfile.d/services/wikifunctions: apply |
[production] |
| 14:27 |
<jforrester@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply |
[production] |
| 14:26 |
<jforrester@deploy1003> |
helmfile [codfw] START helmfile.d/services/wikifunctions: apply |
[production] |
| 14:26 |
<jforrester@deploy1003> |
helmfile [staging] DONE helmfile.d/services/wikifunctions: apply |
[production] |
| 14:25 |
<jforrester@deploy1003> |
helmfile [staging] START helmfile.d/services/wikifunctions: apply |
[production] |
| 14:22 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1259 (T419635)', diff saved to https://phabricator.wikimedia.org/P90184 and previous config saved to /var/cache/conftool/dbconfig/20260401-142231-fceratto.json |
[production] |
| 14:22 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wcqs1003.eqiad.wmnet with reason: host reimage |
[production] |
| 14:21 |
<jforrester@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply |
[production] |
| 14:21 |
<jforrester@deploy1003> |
helmfile [eqiad] START helmfile.d/services/wikifunctions: apply |
[production] |
| 14:20 |
<jforrester@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply |
[production] |
| 14:19 |
<jforrester@deploy1003> |
helmfile [codfw] START helmfile.d/services/wikifunctions: apply |
[production] |
| 14:19 |
<jforrester@deploy1003> |
helmfile [staging] DONE helmfile.d/services/wikifunctions: apply |
[production] |
| 14:18 |
<jforrester@deploy1003> |
helmfile [staging] START helmfile.d/services/wikifunctions: apply |
[production] |
| 14:16 |
<jforrester@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1266190|MemcachedWrapper: Hash key when longer than 250 characters]], [[gerrit:1266219|Extend queue processing times for abstract fragments (T421581)]] (duration: 08m 14s) |
[production] |
| 14:13 |
<brouberol@cumin1003> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts an-worker1148.eqiad.wmnet |
[production] |
| 14:13 |
<brouberol@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1148.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin1003" |
[production] |
| 14:13 |
<brouberol@cumin1003> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
| 14:13 |
<brouberol@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1148.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin1003" |
[production] |