2025-09-10
ยง
|
12:34 |
<ladsgroup@cumin1003> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db1236* gradually with 4 steps - Work done |
[production] |
12:31 |
<fnegri@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component maintain-kubeusers (T403964) |
[tools] |
12:31 |
<kartik@deploy1003> |
helmfile [ml-serve-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . |
[production] |
12:31 |
<fnegri@cloudcumin1001> |
END (ERROR) - Cookbook wmcs.toolforge.component.deploy (exit_code=97) for component maintain-kubeusers (T403964) |
[tools] |
12:30 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db2191.codfw.wmnet |
[production] |
12:30 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db2191 gradually with 4 steps - Upgrade of db2191.codfw.wmnet completed |
[production] |
12:30 |
<fnegri@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component maintain-kubeusers (T403964) |
[tools] |
12:30 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1252 (T402763)', diff saved to https://phabricator.wikimedia.org/P83131 and previous config saved to /var/cache/conftool/dbconfig/20250910-123011-ladsgroup.json |
[production] |
12:26 |
<kartik@deploy1003> |
helmfile [ml-serve-eqiad] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . |
[production] |
12:25 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host dse-k8s-worker1014.eqiad.wmnet with OS bookworm |
[production] |
12:21 |
<kartik@deploy1003> |
helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . |
[production] |
12:20 |
<stevemunene@deploy1003> |
helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
12:19 |
<stevemunene@deploy1003> |
helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. |
[production] |
12:09 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db1252 (T402763)', diff saved to https://phabricator.wikimedia.org/P83128 and previous config saved to /var/cache/conftool/dbconfig/20250910-120940-ladsgroup.json |
[production] |
12:09 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1252.eqiad.wmnet with reason: Maintenance |
[production] |
12:09 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1249 (T402763)', diff saved to https://phabricator.wikimedia.org/P83127 and previous config saved to /var/cache/conftool/dbconfig/20250910-120917-ladsgroup.json |
[production] |
12:00 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2212 (T402763)', diff saved to https://phabricator.wikimedia.org/P83125 and previous config saved to /var/cache/conftool/dbconfig/20250910-120024-fceratto.json |
[production] |
11:56 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db1173.eqiad.wmnet |
[production] |
11:56 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db1173 gradually with 4 steps - Upgrade of db1173.eqiad.wmnet completed |
[production] |
11:54 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1249', diff saved to https://phabricator.wikimedia.org/P83122 and previous config saved to /var/cache/conftool/dbconfig/20250910-115409-ladsgroup.json |
[production] |
11:48 |
<ladsgroup@cumin1003> |
START - Cookbook sre.mysql.pool db1236* gradually with 4 steps - Work done |
[production] |
11:45 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db2212 (T402925)', diff saved to https://phabricator.wikimedia.org/P83120 and previous config saved to /var/cache/conftool/dbconfig/20250910-114549-ladsgroup.json |
[production] |
11:45 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2212.codfw.wmnet with reason: Maintenance |
[production] |
11:44 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.pool db2191 gradually with 4 steps - Upgrade of db2191.codfw.wmnet completed |
[production] |
11:43 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1236.eqiad.wmnet with reason: Maintenance |
[production] |
11:39 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1249', diff saved to https://phabricator.wikimedia.org/P83117 and previous config saved to /var/cache/conftool/dbconfig/20250910-113902-ladsgroup.json |
[production] |
11:33 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db2191 - Upgrading db2191.codfw.wmnet |
[production] |
11:33 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.depool db2191 - Upgrading db2191.codfw.wmnet |
[production] |
11:33 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.upgrade for db2191.codfw.wmnet |
[production] |
11:25 |
<moritzm> |
installing Linux 6.1.148 on Bookworm hosts |
[production] |
11:23 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1249 (T402763)', diff saved to https://phabricator.wikimedia.org/P83114 and previous config saved to /var/cache/conftool/dbconfig/20250910-112354-ladsgroup.json |
[production] |
11:15 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2212 (T402763)', diff saved to https://phabricator.wikimedia.org/P83113 and previous config saved to /var/cache/conftool/dbconfig/20250910-111503-fceratto.json |
[production] |
11:13 |
<ladsgroup@cumin1003> |
END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db1236.eqiad.wmnet |
[production] |
11:10 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.pool db1173 gradually with 4 steps - Upgrade of db1173.eqiad.wmnet completed |
[production] |
11:09 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2212 (T402763)', diff saved to https://phabricator.wikimedia.org/P83111 and previous config saved to /var/cache/conftool/dbconfig/20250910-110937-fceratto.json |
[production] |
11:09 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2212.codfw.wmnet with reason: Maintenance |
[production] |
11:07 |
<ladsgroup@cumin1003> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db1236 - Upgrading db1236.eqiad.wmnet |
[production] |
11:07 |
<ladsgroup@cumin1003> |
START - Cookbook sre.mysql.depool db1236 - Upgrading db1236.eqiad.wmnet |
[production] |
11:07 |
<moritzm> |
kick off full OSM import for the new maps cluster in codfw T381565 |
[production] |
11:07 |
<ladsgroup@cumin1003> |
START - Cookbook sre.mysql.upgrade for db1236.eqiad.wmnet |
[production] |
11:05 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1236.eqiad.wmnet with reason: Clean up the mess |
[production] |
11:05 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host maps1014.eqiad.wmnet |
[production] |
11:04 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db1173 - Upgrading db1173.eqiad.wmnet |
[production] |
11:04 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.depool db1173 - Upgrading db1173.eqiad.wmnet |
[production] |
11:04 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.upgrade for db1173.eqiad.wmnet |
[production] |
11:03 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db1249 (T402763)', diff saved to https://phabricator.wikimedia.org/P83110 and previous config saved to /var/cache/conftool/dbconfig/20250910-110343-ladsgroup.json |
[production] |
11:03 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1249.eqiad.wmnet with reason: Maintenance |
[production] |
11:03 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1248 (T402763)', diff saved to https://phabricator.wikimedia.org/P83109 and previous config saved to /var/cache/conftool/dbconfig/20250910-110320-ladsgroup.json |
[production] |
10:58 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host maps1014.eqiad.wmnet |
[production] |
10:57 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db1173.eqiad.wmnet |
[production] |