|
2026-02-17
ยง
|
| 15:16 |
<sfaci@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/test-kitchen: apply |
[production] |
| 15:14 |
<jayme@cumin1003> |
START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-codfw or A:lvs-secondary-codfw) and A:bullseye and A:lvs |
[production] |
| 15:13 |
<ladsgroup@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1239329|sqwiki: remove editor usergroup (T415196)]] (duration: 07m 45s) |
[production] |
| 15:11 |
<jayme@cumin1003> |
START - Cookbook sre.loadbalancer.migrate-service-ipip for alias: wikikube-staging-worker-codfw@codfw |
[production] |
| 15:11 |
<jayme@cumin1003> |
END (PASS) - Cookbook sre.loadbalancer.migrate-service-ipip (exit_code=0) for alias: wikikube-staging-worker-eqiad@eqiad |
[production] |
| 15:11 |
<jayme@cumin1003> |
END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on (A:lvs-low-traffic-eqiad or A:lvs-secondary-eqiad) and A:bullseye and A:lvs |
[production] |
| 15:09 |
<jayme@cumin1003> |
START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-eqiad or A:lvs-secondary-eqiad) and A:bullseye and A:lvs |
[production] |
| 15:09 |
<ladsgroup@deploy2002> |
ladsgroup, anzx: Continuing with sync |
[production] |
| 15:08 |
<ladsgroup@deploy2002> |
ladsgroup, anzx: Backport for [[gerrit:1239329|sqwiki: remove editor usergroup (T415196)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 15:07 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.dns.admin (exit_code=0) DNS admin: pool site magru [reason: XioNoX: maint work done, T416442] |
[production] |
| 15:07 |
<sukhe@cumin1003> |
START - Cookbook sre.dns.admin DNS admin: pool site magru [reason: XioNoX: maint work done, T416442] |
[production] |
| 15:06 |
<sukhe@cumin1003> |
END (FAIL) - Cookbook sre.dns.admin (exit_code=99) DNS admin: pool site magru [reason: Xionix maint work done, T416442] |
[production] |
| 15:06 |
<sukhe@cumin1003> |
START - Cookbook sre.dns.admin DNS admin: pool site magru [reason: Xionix maint work done, T416442] |
[production] |
| 15:06 |
<jayme@cumin1003> |
START - Cookbook sre.loadbalancer.migrate-service-ipip for alias: wikikube-staging-worker-eqiad@eqiad |
[production] |
| 15:05 |
<ladsgroup@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1239329|sqwiki: remove editor usergroup (T415196)]] |
[production] |
| 15:02 |
<phuedx@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1239672|Test Kitchen: Set event intake service name]] (duration: 11m 56s) |
[production] |
| 15:01 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for 40 hosts |
[production] |
| 15:01 |
<sukhe@cumin1003> |
START - Cookbook sre.hosts.remove-downtime for 40 hosts |
[production] |
| 14:59 |
<sukhe@dns1004> |
END - running authdns-update |
[production] |
| 14:58 |
<sukhe> |
running authdns-update after magru depool |
[production] |
| 14:58 |
<sukhe@dns1004> |
START - running authdns-update |
[production] |
| 14:58 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=yes; selector: cluster=dnsbox,dc=magru [reason: magru maintenance done] |
[production] |
| 14:58 |
<vgutierrez> |
upload golang-github-mmatczuk-anyflag-dev 0.0~git20240709.eb9e24c-1 to trixie-wikimedia (apt.wm.o) - T401832 |
[production] |
| 14:57 |
<phuedx@deploy2002> |
phuedx: Continuing with sync |
[production] |
| 14:55 |
<brouberol@deploy2002> |
helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 14:54 |
<brouberol@deploy2002> |
helmfile [staging-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 14:52 |
<phuedx@deploy2002> |
phuedx: Backport for [[gerrit:1239672|Test Kitchen: Set event intake service name]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 14:50 |
<phuedx@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1239672|Test Kitchen: Set event intake service name]] |
[production] |
| 14:47 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 14:44 |
<XioNoX> |
mr1-magru> request system reboot - T416442 |
[production] |
| 14:36 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 14:35 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 14:34 |
<ladsgroup@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1239935|lift IP cap for event at Tshwane University of Technology (T417578)]] (duration: 06m 45s) |
[production] |
| 14:30 |
<ladsgroup@deploy2002> |
anzx, ladsgroup: Continuing with sync |
[production] |
| 14:29 |
<ladsgroup@deploy2002> |
anzx, ladsgroup: Backport for [[gerrit:1239935|lift IP cap for event at Tshwane University of Technology (T417578)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 14:27 |
<ladsgroup@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1239935|lift IP cap for event at Tshwane University of Technology (T417578)]] |
[production] |
| 14:26 |
<brouberol@deploy2002> |
helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 14:26 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 14:23 |
<XioNoX> |
asw1-b4-magru> request system reboot - T416442 |
[production] |
| 14:23 |
<brouberol@deploy2002> |
helmfile [staging-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 14:16 |
<ladsgroup@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1239855|Add ParserOutputFlags::PREVENT_SELECTIVE_UPDATE (T348236)]] (duration: 08m 13s) |
[production] |
| 14:15 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2172 (T415786)', diff saved to https://phabricator.wikimedia.org/P88846 and previous config saved to /var/cache/conftool/dbconfig/20260217-141510-marostegui.json |
[production] |
| 14:15 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2172.codfw.wmnet with reason: Maintenance |
[production] |
| 14:14 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2155 (T415786)', diff saved to https://phabricator.wikimedia.org/P88845 and previous config saved to /var/cache/conftool/dbconfig/20260217-141457-marostegui.json |
[production] |
| 14:12 |
<ayounsi@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on asw1-b4-magru,asw1-b4-magru IPv6,asw1-b4-magru.mgmt with reason: router upgrade |
[production] |
| 14:12 |
<ladsgroup@deploy2002> |
ladsgroup, cscott: Continuing with sync |
[production] |
| 14:10 |
<ladsgroup@deploy2002> |
ladsgroup, cscott: Backport for [[gerrit:1239855|Add ParserOutputFlags::PREVENT_SELECTIVE_UPDATE (T348236)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 14:08 |
<ayounsi@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on asw1-b3-magru,asw1-b3-magru IPv6,asw1-b3-magru.mgmt with reason: router upgrade |
[production] |
| 14:08 |
<ladsgroup@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1239855|Add ParserOutputFlags::PREVENT_SELECTIVE_UPDATE (T348236)]] |
[production] |
| 14:07 |
<vgutierrez> |
upload golang-github-florianl-go-tc_0.4.7 to trixie-wikimedia (apt.wm.o) - T401832 |
[production] |