2025-01-14
ยง
|
09:47 |
<root@cumin1002> |
END (PASS) - Cookbook sre.puppet.renew-cert (exit_code=0) for dbprov2006.codfw.wmnet: Renew puppet certificate - root@cumin1002 |
[production] |
09:47 |
<jelto> |
homer 'cr*codfw*' commit 'T377877' |
[production] |
09:45 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on pc[2014-2016].codfw.wmnet,pc1016.eqiad.wmnet with reason: reorganizing pc4 |
[production] |
09:44 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 5:00:00 on pc[2014-2016].codfw.wmnet,pc1016.eqiad.wmnet with reason: reorganizing pc4 |
[production] |
09:44 |
<root@cumin1002> |
START - Cookbook sre.puppet.renew-cert for dbprov2006.codfw.wmnet: Renew puppet certificate - root@cumin1002 |
[production] |
09:43 |
<jelto> |
homer 'lsw1-c3-codfw*' commit 'T377877' |
[production] |
09:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool pc4 T383398', diff saved to https://phabricator.wikimedia.org/P72028 and previous config saved to /var/cache/conftool/dbconfig/20250114-094350-marostegui.json |
[production] |
09:43 |
<vgutierrez@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_codfw and A:cp |
[production] |
09:43 |
<hashar@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1111106|knwiki, knwikisource, knwiktionary, knwikiquote: update logo, wordmark (T382802)]], [[gerrit:1111109|hiwikisource: logo fix (T310961)]] (duration: 16m 21s) |
[production] |
09:38 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2215.codfw.wmnet with OS bookworm |
[production] |
09:35 |
<hashar@deploy2002> |
anzx, hashar: Continuing with sync |
[production] |
09:33 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on es[1022,1043].eqiad.wmnet with reason: cloning |
[production] |
09:33 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on es[1022,1043].eqiad.wmnet with reason: cloning |
[production] |
09:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool es1022 T382569', diff saved to https://phabricator.wikimedia.org/P72027 and previous config saved to /var/cache/conftool/dbconfig/20250114-093315-marostegui.json |
[production] |
09:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1044 (re)pooling @ 1%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P72026 and previous config saved to /var/cache/conftool/dbconfig/20250114-093216-root.json |
[production] |
09:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Add es1044 to dbctl depooled T382569', diff saved to https://phabricator.wikimedia.org/P72025 and previous config saved to /var/cache/conftool/dbconfig/20250114-093147-marostegui.json |
[production] |
09:31 |
<hashar@deploy2002> |
anzx, hashar: Backport for [[gerrit:1111106|knwiki, knwikisource, knwiktionary, knwikiquote: update logo, wordmark (T382802)]], [[gerrit:1111109|hiwikisource: logo fix (T310961)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
09:26 |
<hashar@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1111106|knwiki, knwikisource, knwiktionary, knwikiquote: update logo, wordmark (T382802)]], [[gerrit:1111109|hiwikisource: logo fix (T310961)]] |
[production] |
09:19 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2215.codfw.wmnet with reason: host reimage |
[production] |
09:15 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2215.codfw.wmnet with reason: host reimage |
[production] |
09:12 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-search: apply |
[production] |
09:11 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-search: apply |
[production] |
09:10 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-research: apply |
[production] |
09:10 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-research: apply |
[production] |
09:09 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-wmde: apply |
[production] |
09:09 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-wmde: apply |
[production] |
09:09 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-ml: apply |
[production] |
09:08 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-ml: apply |
[production] |
09:06 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
09:05 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
08:59 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2214.codfw.wmnet with OS bookworm |
[production] |
08:59 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker2215 |
[production] |
08:59 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.move-vlan for host wikikube-worker2215 |
[production] |
08:59 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.reimage for host wikikube-worker2215.codfw.wmnet with OS bookworm |
[production] |
08:58 |
<jelto@cumin1002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wikikube-worker2215.codfw.wmnet with OS bookworm |
[production] |
08:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1023 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72024 and previous config saved to /var/cache/conftool/dbconfig/20250114-085802-root.json |
[production] |
08:55 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2213.codfw.wmnet with OS bookworm |
[production] |
08:55 |
<jynus@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on dbprov2006.codfw.wmnet with reason: os upgrade |
[production] |
08:55 |
<jynus@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on dbprov2006.codfw.wmnet with reason: os upgrade |
[production] |
08:53 |
<gmodena@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1110777|Revert^2 "config: remove eventbus instrumentation setting"]] (duration: 21m 52s) |
[production] |
08:51 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2212.codfw.wmnet with OS bookworm |
[production] |
08:43 |
<gmodena@deploy2002> |
otto, gmodena: Continuing with sync |
[production] |
08:42 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1023 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72023 and previous config saved to /var/cache/conftool/dbconfig/20250114-084256-root.json |
[production] |
08:40 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2214.codfw.wmnet with reason: host reimage |
[production] |
08:38 |
<gmodena@deploy2002> |
otto, gmodena: Backport for [[gerrit:1110777|Revert^2 "config: remove eventbus instrumentation setting"]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
08:36 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2213.codfw.wmnet with reason: host reimage |
[production] |
08:34 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2214.codfw.wmnet with reason: host reimage |
[production] |
08:32 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2212.codfw.wmnet with reason: host reimage |
[production] |
08:31 |
<gmodena@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1110777|Revert^2 "config: remove eventbus instrumentation setting"]] |
[production] |
08:31 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2213.codfw.wmnet with reason: host reimage |
[production] |