2025-05-22
12:57 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply |
[production] |
12:56 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-test-k8s: apply |
[production] |
12:56 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-test-k8s: apply |
[production] |
12:53 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1009.eqiad.wmnet |
[production] |
12:53 |
<gkyziridis@deploy1003> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . |
[production] |
12:51 |
<fceratto@cumin1002> |
END (FAIL) - Cookbook sre.mysql.sanitarium_restart (exit_code=99) |
[production] |
12:51 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.sanitarium_restart |
[production] |
12:47 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1009.eqiad.wmnet |
[production] |
12:46 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-test-k8s: apply |
[production] |
12:46 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-test-k8s: apply |
[production] |
12:45 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-test-k8s: apply |
[production] |
12:45 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-test-k8s: apply |
[production] |
12:42 |
<gkyziridis@deploy1003> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . |
[production] |
12:37 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1036 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P76405 and previous config saved to /var/cache/conftool/dbconfig/20250522-123720-root.json |
[production] |
12:34 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-test-k8s: sync |
[production] |
12:34 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-test-k8s: sync |
[production] |
12:28 |
<hashar> |
Gerrit is back and was upgraded from 3.10.4 to 3.10.6 | T390666 |
[production] |
12:25 |
<hashar> |
Stopping Gerrit for upgrade |
[production] |
12:24 |
<hashar@deploy1003> |
Finished deploy [gerrit/gerrit@facd6ee]: Gerrit to 3.10.6 on gerrit1003 - T390666 (duration: 00m 09s) |
[production] |
12:24 |
<hashar@deploy1003> |
Started deploy [gerrit/gerrit@facd6ee]: Gerrit to 3.10.6 on gerrit1003 - T390666 |
[production] |
12:22 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1036 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P76404 and previous config saved to /var/cache/conftool/dbconfig/20250522-122215-root.json |
[production] |
12:21 |
<hashar@deploy1003> |
Finished deploy [gerrit/gerrit@facd6ee]: Gerrit to 3.10.6 on gerrit2002 - T390666 (duration: 00m 10s) |
[production] |
12:21 |
<hashar@deploy1003> |
Started deploy [gerrit/gerrit@facd6ee]: Gerrit to 3.10.6 on gerrit2002 - T390666 |
[production] |
12:14 |
<moritzm> |
installing nodejs security updates |
[production] |
12:12 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1008.eqiad.wmnet |
[production] |
12:07 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1036 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P76400 and previous config saved to /var/cache/conftool/dbconfig/20250522-120709-root.json |
[production] |
12:04 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1008.eqiad.wmnet |
[production] |
11:52 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1036 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P76398 and previous config saved to /var/cache/conftool/dbconfig/20250522-115203-root.json |
[production] |
11:51 |
<jmm@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/thumbor: apply |
[production] |
11:47 |
<jmm@deploy1003> |
helmfile [eqiad] START helmfile.d/services/thumbor: apply |
[production] |
11:37 |
<hnowlan@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply |
[production] |
11:37 |
<hnowlan@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-cron: apply |
[production] |
11:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1036 (re)pooling @ 20%: Repooling', diff saved to https://phabricator.wikimedia.org/P76394 and previous config saved to /var/cache/conftool/dbconfig/20250522-113657-root.json |
[production] |
11:33 |
<ladsgroup@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply |
[production] |
11:32 |
<ladsgroup@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-cron: apply |
[production] |
11:29 |
<jmm@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/thumbor: apply |
[production] |
11:26 |
<jmm@deploy1003> |
helmfile [codfw] START helmfile.d/services/thumbor: apply |
[production] |
11:22 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1183 T394507', diff saved to https://phabricator.wikimedia.org/P76393 and previous config saved to /var/cache/conftool/dbconfig/20250522-112245-marostegui.json |
[production] |
11:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1036 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P76392 and previous config saved to /var/cache/conftool/dbconfig/20250522-112152-root.json |
[production] |
11:19 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1007.eqiad.wmnet |
[production] |
11:16 |
<jmm@deploy1003> |
helmfile [staging] DONE helmfile.d/services/thumbor: apply |
[production] |
11:16 |
<jmm@deploy1003> |
helmfile [staging] START helmfile.d/services/thumbor: apply |
[production] |
11:15 |
<volans> |
uploaded debmonitor-client_0.4.1 to apt.wikimedia.org buster-wikimedia,bullseye-wikimedia,bookworm-wikimedia,trixie-wikimedia |
[production] |
11:15 |
<marostegui> |
Migrate es1036 es6 eqiad dbmaint to MariaDB 10.11 T394469 |
[production] |
11:14 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on es1036.eqiad.wmnet with reason: Maintenance |
[production] |
11:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool es1036 T394469', diff saved to https://phabricator.wikimedia.org/P76391 and previous config saved to /var/cache/conftool/dbconfig/20250522-111422-marostegui.json |
[production] |
11:11 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1007.eqiad.wmnet |
[production] |
11:06 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab2002.wikimedia.org with reason: Upgrade GitLab to 17.10 |
[production] |
11:06 |
<hnowlan@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply |
[production] |
11:05 |
<hnowlan@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-cron: apply |
[production] |