2025-04-10
12:01 |
<btullis@cumin1002> |
START - Cookbook sre.network.configure-switch-interfaces for host an-worker1169 |
[production] |
12:00 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-main: apply |
[production] |
11:59 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-main: apply |
[production] |
11:59 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-search: apply |
[production] |
11:58 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-search: apply |
[production] |
11:58 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-product: apply |
[production] |
11:57 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-product: apply |
[production] |
11:57 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-platform-eng: apply |
[production] |
11:57 |
<jiji@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc2054.codfw.wmnet |
[production] |
11:57 |
<jiji@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1054.eqiad.wmnet |
[production] |
11:56 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-platform-eng: apply |
[production] |
11:56 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-wmde: apply |
[production] |
11:56 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-wmde: apply |
[production] |
11:55 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-ml: apply |
[production] |
11:55 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-ml: apply |
[production] |
11:54 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-research: apply |
[production] |
11:54 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-research: apply |
[production] |
11:53 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
11:53 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1159 (T391056)', diff saved to https://phabricator.wikimedia.org/P74828 and previous config saved to /var/cache/conftool/dbconfig/20250410-115328-fceratto.json |
[production] |
11:52 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
11:50 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1159 (T391056)', diff saved to https://phabricator.wikimedia.org/P74827 and previous config saved to /var/cache/conftool/dbconfig/20250410-115037-fceratto.json |
[production] |
11:50 |
<jiji@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host mc2054.codfw.wmnet |
[production] |
11:50 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1159.eqiad.wmnet with reason: Maintenance |
[production] |
11:50 |
<jiji@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host mc1054.eqiad.wmnet |
[production] |
11:32 |
<cgoubert@deploy1003> |
Started scap sync-world: Rebuilding mediawiki images to pick up new base images 1135694 - T387208 |
[production] |
11:28 |
<claime> |
Rebuilding php base images to pick up 1135694 - T387208 |
[production] |
11:26 |
<ladsgroup@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1135688|Bump thumbnail steps to 85% (T360589)]] (duration: 16m 20s) |
[production] |
11:17 |
<ladsgroup@deploy1003> |
ladsgroup: Continuing with sync |
[production] |
11:15 |
<ladsgroup@deploy1003> |
ladsgroup: Backport for [[gerrit:1135688|Bump thumbnail steps to 85% (T360589)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
11:10 |
<ladsgroup@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1135688|Bump thumbnail steps to 85% (T360589)]] |
[production] |
11:08 |
<cgoubert@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1133935|MWScript.php: exit code on mesh, longer timeout (T390972 T387208)]] (duration: 22m 15s) |
[production] |
10:55 |
<cgoubert@deploy1003> |
cgoubert: Continuing with sync |
[production] |
10:54 |
<cgoubert@deploy1003> |
cgoubert: Backport for [[gerrit:1133935|MWScript.php: exit code on mesh, longer timeout (T390972 T387208)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
10:45 |
<cgoubert@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1133935|MWScript.php: exit code on mesh, longer timeout (T390972 T387208)]] |
[production] |
10:28 |
<elukey@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/rest-gateway: sync |
[production] |
10:28 |
<elukey@deploy1003> |
helmfile [eqiad] START helmfile.d/services/rest-gateway: sync |
[production] |
10:26 |
<elukey> |
rest-gateway from now on calls citoid on its ingress endpoint |
[production] |
10:23 |
<phedenskog@deploy1003> |
Finished deploy [performance/navtiming@94fa387]: Disable navtiming performance metrics in Graphite (duration: 00m 50s) |
[production] |
10:23 |
<phedenskog@deploy1003> |
Started deploy [performance/navtiming@94fa387]: Disable navtiming performance metrics in Graphite |
[production] |
10:21 |
<elukey@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/rest-gateway: sync |
[production] |
10:20 |
<elukey@deploy1003> |
helmfile [codfw] START helmfile.d/services/rest-gateway: sync |
[production] |
10:19 |
<cgoubert@deploy1003> |
Started scap sync-world: Rebuilding mediawiki images to pick up new base images 1135379 - T387208 |
[production] |
10:19 |
<cgoubert@deploy1003> |
sync-world aborted: Rebuilding mediawiki images to pick up new base images 1135379 - T387208 (duration: 35m 23s) |
[production] |
09:55 |
<hnowlan@deploy1003> |
helmfile [staging] DONE helmfile.d/services/rest-gateway: apply |
[production] |
09:55 |
<hnowlan@deploy1003> |
helmfile [staging] START helmfile.d/services/rest-gateway: apply |
[production] |
09:55 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) config_reloading A:liberica |
[production] |
09:50 |
<fabfur@cumin1002> |
START - Cookbook sre.loadbalancer.admin config_reloading A:liberica |
[production] |
09:45 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply |
[production] |
09:45 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply |
[production] |
09:44 |
<cgoubert@deploy1003> |
Started scap sync-world: Rebuilding mediawiki images to pick up new base images 1135379 - T387208 |
[production] |