2024-11-05
§
|
17:13 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance es1028 (T376905)', diff saved to https://phabricator.wikimedia.org/P70943 and previous config saved to /var/cache/conftool/dbconfig/20241105-171330-ladsgroup.json |
[production] |
17:06 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling es1028 (T376905)', diff saved to https://phabricator.wikimedia.org/P70942 and previous config saved to /var/cache/conftool/dbconfig/20241105-170636-ladsgroup.json |
[production] |
17:06 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1028.eqiad.wmnet with reason: Maintenance |
[production] |
17:06 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es1028.eqiad.wmnet with reason: Maintenance |
[production] |
17:06 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance es1031 (T376905)', diff saved to https://phabricator.wikimedia.org/P70941 and previous config saved to /var/cache/conftool/dbconfig/20241105-170609-ladsgroup.json |
[production] |
16:51 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance es1031', diff saved to https://phabricator.wikimedia.org/P70940 and previous config saved to /var/cache/conftool/dbconfig/20241105-165103-ladsgroup.json |
[production] |
16:37 |
<lucaswerkmeister-wmde@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1087507|Fixup paths to moved resources (T379080)]] (duration: 08m 02s) |
[production] |
16:35 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance es1031', diff saved to https://phabricator.wikimedia.org/P70939 and previous config saved to /var/cache/conftool/dbconfig/20241105-163556-ladsgroup.json |
[production] |
16:34 |
<cdanis@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
16:32 |
<lucaswerkmeister-wmde@deploy2002> |
lucaswerkmeister-wmde: Continuing with sync |
[production] |
16:32 |
<lucaswerkmeister-wmde@deploy2002> |
lucaswerkmeister-wmde: Backport for [[gerrit:1087507|Fixup paths to moved resources (T379080)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
16:32 |
<cdanis@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
16:29 |
<lucaswerkmeister-wmde@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1087507|Fixup paths to moved resources (T379080)]] |
[production] |
16:20 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance es1031 (T376905)', diff saved to https://phabricator.wikimedia.org/P70938 and previous config saved to /var/cache/conftool/dbconfig/20241105-162048-ladsgroup.json |
[production] |
16:14 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling es1031 (T376905)', diff saved to https://phabricator.wikimedia.org/P70937 and previous config saved to /var/cache/conftool/dbconfig/20241105-161455-ladsgroup.json |
[production] |
16:14 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1031.eqiad.wmnet with reason: Maintenance |
[production] |
16:14 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es1031.eqiad.wmnet with reason: Maintenance |
[production] |
16:13 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance es1033 (T376905)', diff saved to https://phabricator.wikimedia.org/P70936 and previous config saved to /var/cache/conftool/dbconfig/20241105-161340-ladsgroup.json |
[production] |
16:01 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host pc1017.eqiad.wmnet with OS bookworm |
[production] |
16:00 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1014.eqiad.wmnet |
[production] |
15:58 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance es1033', diff saved to https://phabricator.wikimedia.org/P70935 and previous config saved to /var/cache/conftool/dbconfig/20241105-155833-ladsgroup.json |
[production] |
15:54 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1014.eqiad.wmnet |
[production] |
15:54 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti1014.eqiad.wmnet |
[production] |
15:54 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1014.eqiad.wmnet |
[production] |
15:53 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1042.eqiad.wmnet to cluster eqiad and group B |
[production] |
15:51 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.addnode for new host ganeti1042.eqiad.wmnet to cluster eqiad and group B |
[production] |
15:51 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1041.eqiad.wmnet to cluster eqiad and group B |
[production] |
15:50 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.addnode for new host ganeti1041.eqiad.wmnet to cluster eqiad and group B |
[production] |
15:48 |
<moritzm> |
remove ganeti1013 from active ganeti nodes T378921 |
[production] |
15:47 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1013.eqiad.wmnet |
[production] |
15:43 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance es1033', diff saved to https://phabricator.wikimedia.org/P70934 and previous config saved to /var/cache/conftool/dbconfig/20241105-154326-ladsgroup.json |
[production] |
15:40 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc1017.eqiad.wmnet with reason: host reimage |
[production] |
15:37 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on pc1017.eqiad.wmnet with reason: host reimage |
[production] |
15:32 |
<hashar> |
Switched PCC workers to Java 17 via https://horizon.wikimedia.org/project/prefixpuppet/?tab=prefix_puppet__puppet-pcc-worker # T359795 |
[production] |
15:28 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance es1033 (T376905)', diff saved to https://phabricator.wikimedia.org/P70933 and previous config saved to /var/cache/conftool/dbconfig/20241105-152819-ladsgroup.json |
[production] |
15:27 |
<hashar> |
Switched deployment-deploy04.deployment-prep.eqiad1.wikimedia.cloud to Java 17 # T359795 |
[production] |
15:21 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling es1033 (T376905)', diff saved to https://phabricator.wikimedia.org/P70932 and previous config saved to /var/cache/conftool/dbconfig/20241105-152139-ladsgroup.json |
[production] |
15:21 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1033.eqiad.wmnet with reason: Maintenance |
[production] |
15:21 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es1033.eqiad.wmnet with reason: Maintenance |
[production] |
15:21 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance es1026 (T376905)', diff saved to https://phabricator.wikimedia.org/P70931 and previous config saved to /var/cache/conftool/dbconfig/20241105-152114-ladsgroup.json |
[production] |
15:20 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.reimage for host pc1017.eqiad.wmnet with OS bookworm |
[production] |
15:18 |
<hashar> |
Switched WMCS integration instances from Java 11 to Java 17 via Horizon project-wide config. That was forgotten in T359795 and blocks today's Jenkins upgrade (T379059) |
[production] |
15:15 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host pc1017.eqiad.wmnet with OS bookworm |
[production] |
15:06 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance es1026', diff saved to https://phabricator.wikimedia.org/P70929 and previous config saved to /var/cache/conftool/dbconfig/20241105-150607-ladsgroup.json |
[production] |
15:02 |
<cdanis@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/chart-renderer: apply |
[production] |
15:02 |
<cdanis@deploy2002> |
helmfile [eqiad] START helmfile.d/services/chart-renderer: apply |
[production] |
15:02 |
<cdanis@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/chart-renderer: apply |
[production] |
15:01 |
<cdanis@deploy2002> |
helmfile [codfw] START helmfile.d/services/chart-renderer: apply |
[production] |
15:01 |
<hashar> |
Upgrading CI Jenkins | T379059 |
[production] |
14:53 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc1017.eqiad.wmnet with reason: host reimage |
[production] |