2025-01-16
ยง
|
12:42 |
<marostegui@dns1006> |
START - running authdns-update |
[production] |
12:42 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host mw[2335-2338].codfw.wmnet |
[production] |
12:40 |
<jelto@cumin1002> |
START - Cookbook sre.k8s.pool-depool-node depool for host mw[2335-2338].codfw.wmnet |
[production] |
12:39 |
<jynus@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts dbprov2001.codfw.wmnet |
[production] |
12:38 |
<jynus@cumin1002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts dbprov1002.eqiad.wmnet |
[production] |
12:38 |
<jynus@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
12:38 |
<jynus@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dbprov1002.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - jynus@cumin1002" |
[production] |
12:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Promote pc1014 to pc4 eqiad master dbmaint and enable pc4 back in eqiad and codfw T383398', diff saved to https://phabricator.wikimedia.org/P72118 and previous config saved to /var/cache/conftool/dbconfig/20250116-123656-marostegui.json |
[production] |
12:33 |
<jynus@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dbprov1002.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - jynus@cumin1002" |
[production] |
12:32 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on pc2014.codfw.wmnet,pc[1014,1016].eqiad.wmnet with reason: reorganizing pc4 |
[production] |
12:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool pc4 eqiad and codfw dbmaint T383398', diff saved to https://phabricator.wikimedia.org/P72117 and previous config saved to /var/cache/conftool/dbconfig/20250116-123129-marostegui.json |
[production] |
12:29 |
<jynus@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
12:27 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[2232-2235].codfw.wmnet |
[production] |
12:27 |
<jelto@cumin1002> |
START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[2232-2235].codfw.wmnet |
[production] |
12:24 |
<jelto> |
homer 'cr*codfw*' commit 'T377877' |
[production] |
12:23 |
<jynus@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts dbprov1002.eqiad.wmnet |
[production] |
12:23 |
<jelto> |
homer 'lsw1-c6-codfw*' commit 'T377877' |
[production] |
12:23 |
<jynus@cumin1002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts dbprov1001.eqiad.wmnet |
[production] |
12:23 |
<jynus@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
12:23 |
<jynus@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dbprov1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - jynus@cumin1002" |
[production] |
12:22 |
<jynus@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dbprov1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - jynus@cumin1002" |
[production] |
12:22 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2235.codfw.wmnet with OS bookworm |
[production] |
12:20 |
<vgutierrez@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-text_magru and A:cp |
[production] |
12:17 |
<jynus@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
12:14 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2234.codfw.wmnet with OS bookworm |
[production] |
12:09 |
<jynus@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts dbprov1001.eqiad.wmnet |
[production] |
12:06 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2233.codfw.wmnet with OS bookworm |
[production] |
12:02 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2235.codfw.wmnet with reason: host reimage |
[production] |
12:01 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2232.codfw.wmnet with OS bookworm |
[production] |
11:56 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2235.codfw.wmnet with reason: host reimage |
[production] |
11:54 |
<vgutierrez@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_magru and A:cp |
[production] |
11:53 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2234.codfw.wmnet with reason: host reimage |
[production] |
11:50 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2234.codfw.wmnet with reason: host reimage |
[production] |
11:49 |
<kart_> |
Updated cxserver to 2025-01-16-103443-production (T383854, T377966) |
[production] |
11:47 |
<kartik@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
11:46 |
<kartik@deploy2002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
11:46 |
<kartik@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
11:46 |
<kartik@deploy2002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
11:45 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2233.codfw.wmnet with reason: host reimage |
[production] |
11:42 |
<kartik@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
11:42 |
<kartik@deploy2002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
11:42 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2232.codfw.wmnet with reason: host reimage |
[production] |
11:40 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2233.codfw.wmnet with reason: host reimage |
[production] |
11:38 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2232.codfw.wmnet with reason: host reimage |
[production] |
11:38 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker2235 |
[production] |
11:38 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.move-vlan for host wikikube-worker2235 |
[production] |
11:38 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.reimage for host wikikube-worker2235.codfw.wmnet with OS bookworm |
[production] |
11:37 |
<jelto@cumin1002> |
END (FAIL) - Cookbook sre.k8s.renumber-node (exit_code=99) Renumbering for host wikikube-worker2235.codfw.wmnet |
[production] |
11:37 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2235.codfw.wmnet with OS bullseye |
[production] |
11:32 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker2234 |
[production] |