2025-01-14
ยง
|
11:17 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1044 (re)pooling @ 50%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P72039 and previous config saved to /var/cache/conftool/dbconfig/20250114-111754-root.json |
[production] |
11:16 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Remove es1020 from dbctl for decommission T383578', diff saved to https://phabricator.wikimedia.org/P72038 and previous config saved to /var/cache/conftool/dbconfig/20250114-111647-marostegui.json |
[production] |
11:14 |
<jynus@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2239.codfw.wmnet with reason: reboot |
[production] |
11:14 |
<jynus@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2239.codfw.wmnet with reason: reboot |
[production] |
11:13 |
<jynus@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbprov1003.eqiad.wmnet with reason: os upgrade |
[production] |
11:13 |
<jynus@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on dbprov1003.eqiad.wmnet with reason: os upgrade |
[production] |
11:05 |
<vgutierrez@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-upload_eqsin and A:cp |
[production] |
11:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1044 (re)pooling @ 25%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P72037 and previous config saved to /var/cache/conftool/dbconfig/20250114-110248-root.json |
[production] |
10:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1044 (re)pooling @ 10%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P72036 and previous config saved to /var/cache/conftool/dbconfig/20250114-104743-root.json |
[production] |
10:45 |
<marostegui@dns1006> |
END - running authdns-update |
[production] |
10:43 |
<marostegui@dns1006> |
START - running authdns-update |
[production] |
10:43 |
<marostegui@dns1006> |
END - running authdns-update |
[production] |
10:41 |
<marostegui@dns1006> |
START - running authdns-update |
[production] |
10:37 |
<vgutierrez@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_eqsin and A:cp |
[production] |
10:35 |
<marostegui> |
Reboot db2235 m5 codfw master |
[production] |
10:34 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2235.codfw.wmnet with reason: upgrade |
[production] |
10:34 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 5:00:00 on db2235.codfw.wmnet with reason: upgrade |
[production] |
10:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1044 (re)pooling @ 5%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P72035 and previous config saved to /var/cache/conftool/dbconfig/20250114-103238-root.json |
[production] |
10:27 |
<vgutierrez@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-text_codfw and A:cp |
[production] |
10:17 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1044 (re)pooling @ 4%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P72034 and previous config saved to /var/cache/conftool/dbconfig/20250114-101732-root.json |
[production] |
10:05 |
<vgutierrez@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_codfw and A:cp |
[production] |
10:03 |
<vgutierrez@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-upload_codfw and A:cp |
[production] |
10:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1044 (re)pooling @ 3%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P72033 and previous config saved to /var/cache/conftool/dbconfig/20250114-100227-root.json |
[production] |
10:00 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
09:59 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
09:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repool pc4 T383398', diff saved to https://phabricator.wikimedia.org/P72032 and previous config saved to /var/cache/conftool/dbconfig/20250114-095404-marostegui.json |
[production] |
09:53 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Promote pc2014 to codfw pc4 master dbmaint T383398', diff saved to https://phabricator.wikimedia.org/P72031 and previous config saved to /var/cache/conftool/dbconfig/20250114-095320-marostegui.json |
[production] |
09:50 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[2212-2215].codfw.wmnet |
[production] |
09:50 |
<jelto@cumin1002> |
START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[2212-2215].codfw.wmnet |
[production] |
09:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1044 (re)pooling @ 2%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P72030 and previous config saved to /var/cache/conftool/dbconfig/20250114-094722-root.json |
[production] |
09:47 |
<root@cumin1002> |
END (PASS) - Cookbook sre.puppet.renew-cert (exit_code=0) for dbprov2006.codfw.wmnet: Renew puppet certificate - root@cumin1002 |
[production] |
09:47 |
<jelto> |
homer 'cr*codfw*' commit 'T377877' |
[production] |
09:45 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on pc[2014-2016].codfw.wmnet,pc1016.eqiad.wmnet with reason: reorganizing pc4 |
[production] |
09:44 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 5:00:00 on pc[2014-2016].codfw.wmnet,pc1016.eqiad.wmnet with reason: reorganizing pc4 |
[production] |
09:44 |
<root@cumin1002> |
START - Cookbook sre.puppet.renew-cert for dbprov2006.codfw.wmnet: Renew puppet certificate - root@cumin1002 |
[production] |
09:43 |
<jelto> |
homer 'lsw1-c3-codfw*' commit 'T377877' |
[production] |
09:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool pc4 T383398', diff saved to https://phabricator.wikimedia.org/P72028 and previous config saved to /var/cache/conftool/dbconfig/20250114-094350-marostegui.json |
[production] |
09:43 |
<vgutierrez@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_codfw and A:cp |
[production] |
09:43 |
<hashar@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1111106|knwiki, knwikisource, knwiktionary, knwikiquote: update logo, wordmark (T382802)]], [[gerrit:1111109|hiwikisource: logo fix (T310961)]] (duration: 16m 21s) |
[production] |
09:38 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2215.codfw.wmnet with OS bookworm |
[production] |
09:35 |
<hashar@deploy2002> |
anzx, hashar: Continuing with sync |
[production] |
09:33 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on es[1022,1043].eqiad.wmnet with reason: cloning |
[production] |
09:33 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on es[1022,1043].eqiad.wmnet with reason: cloning |
[production] |
09:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool es1022 T382569', diff saved to https://phabricator.wikimedia.org/P72027 and previous config saved to /var/cache/conftool/dbconfig/20250114-093315-marostegui.json |
[production] |
09:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1044 (re)pooling @ 1%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P72026 and previous config saved to /var/cache/conftool/dbconfig/20250114-093216-root.json |
[production] |
09:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Add es1044 to dbctl depooled T382569', diff saved to https://phabricator.wikimedia.org/P72025 and previous config saved to /var/cache/conftool/dbconfig/20250114-093147-marostegui.json |
[production] |
09:31 |
<hashar@deploy2002> |
anzx, hashar: Backport for [[gerrit:1111106|knwiki, knwikisource, knwiktionary, knwikiquote: update logo, wordmark (T382802)]], [[gerrit:1111109|hiwikisource: logo fix (T310961)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
09:26 |
<hashar@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1111106|knwiki, knwikisource, knwiktionary, knwikiquote: update logo, wordmark (T382802)]], [[gerrit:1111109|hiwikisource: logo fix (T310961)]] |
[production] |
09:19 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2215.codfw.wmnet with reason: host reimage |
[production] |
09:15 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2215.codfw.wmnet with reason: host reimage |
[production] |