production SAL


2024-12-19
19:38	<cmooney@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: reset dns names for cloudcontrol1011 to newly-assigned ones - cmooney@cumin1002"	[production]
19:38	<cmooney@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: reset dns names for cloudcontrol1011 to newly-assigned ones - cmooney@cumin1002"	[production]
19:35	<cmooney@cumin1002>	START - Cookbook sre.dns.netbox	[production]
19:32	<jhancock@cumin2002>	START - Cookbook sre.hosts.reimage for host cloudcephosd2004-dev.codfw.wmnet with OS bullseye	[production]
19:18	<jclark@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudcontrol1011.eqiad.wmnet with OS bookworm	[production]
19:18	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudcontrol1011.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED	[production]
19:12	<dancy@deploy2002>	rebuilt and synchronized wikiversions files: group2 to 1.44.0-wmf.8 refs T375667	[production]
18:52	<jclark@cumin1002>	START - Cookbook sre.hosts.provision for host cloudcontrol1011.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED	[production]
18:51	<jclark@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
18:51	<jclark@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added mgmt for cloudcontrol1011 - jclark@cumin1002"	[production]
18:51	<jclark@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added mgmt for cloudcontrol1011 - jclark@cumin1002"	[production]
18:49	<jclark@cumin1002>	END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudcontrol1011	[production]
18:49	<jclark@cumin1002>	START - Cookbook sre.network.configure-switch-interfaces for host cloudcontrol1011	[production]
18:48	<swfrench@deploy2002>	Finished scap sync-world: Backport for [[gerrit:1105778|maintenance: fix typo in job status logging (T382517)]], [[gerrit:1105776|maintenance: fix typo in job status logging (T382517)]] (duration: 17m 11s)	[production]
18:47	<jclark@cumin1002>	START - Cookbook sre.dns.netbox	[production]
18:47	<andrew@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd2004-dev.codfw.wmnet with OS bookworm	[production]
18:42	<swfrench@deploy2002>	swfrench: Continuing with sync	[production]
18:41	<swfrench@deploy2002>	swfrench: Backport for [[gerrit:1105778|maintenance: fix typo in job status logging (T382517)]], [[gerrit:1105776|maintenance: fix typo in job status logging (T382517)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
18:32	<bd808@deploy2002>	helmfile [codfw] DONE helmfile.d/services/developer-portal: apply	[production]
18:32	<bd808@deploy2002>	helmfile [codfw] START helmfile.d/services/developer-portal: apply	[production]
18:32	<bd808@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply	[production]
18:31	<bd808@deploy2002>	helmfile [eqiad] START helmfile.d/services/developer-portal: apply	[production]
18:31	<swfrench@deploy2002>	Started scap sync-world: Backport for [[gerrit:1105778|maintenance: fix typo in job status logging (T382517)]], [[gerrit:1105776|maintenance: fix typo in job status logging (T382517)]]	[production]
18:31	<bd808@deploy2002>	helmfile [staging] DONE helmfile.d/services/developer-portal: apply	[production]
18:31	<bd808@deploy2002>	helmfile [staging] START helmfile.d/services/developer-portal: apply	[production]
18:07	<andrew@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudcephosd2004-dev.codfw.wmnet with OS bookworm	[production]
18:06	<andrew@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd2004-dev.codfw.wmnet with OS bookworm	[production]
18:06	<kamila@cumin1002>	END (PASS) - Cookbook sre.k8s.roll-reimage-nodes (exit_code=0) rolling reimage on P{wikikube-worker[1022].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad)	[production]
18:06	<kamila@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1022.eqiad.wmnet with OS bookworm	[production]
17:46	<kamila@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1022.eqiad.wmnet with reason: host reimage	[production]
17:42	<kamila@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1022.eqiad.wmnet with reason: host reimage	[production]
17:39	<andrew@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudcephosd2004-dev.codfw.wmnet with OS bookworm	[production]
17:38	<andrew@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd2004-dev.codfw.wmnet with OS bookworm	[production]
17:23	<kamila@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-worker1022.eqiad.wmnet with OS bookworm	[production]
17:21	<kamila@cumin1002>	START - Cookbook sre.k8s.roll-reimage-nodes rolling reimage on P{wikikube-worker[1022].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad)	[production]
17:19	<btullis@deploy2002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
17:17	<btullis@deploy2002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
17:10	<andrew@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudcephosd2004-dev.codfw.wmnet with OS bookworm	[production]
17:00	<kamila@cumin1002>	END (FAIL) - Cookbook sre.k8s.roll-reimage-nodes (exit_code=1) rolling reimage on P{wikikube-worker[1022].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad)	[production]
16:58	<kamila@cumin1002>	START - Cookbook sre.k8s.roll-reimage-nodes rolling reimage on P{wikikube-worker[1022].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad)	[production]
16:54	<cjming@deploy2002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply	[production]
16:53	<cjming@deploy2002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply	[production]
16:53	<cjming@deploy2002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply	[production]
16:52	<cjming@deploy2002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply	[production]
16:35	<cmooney@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on ripe-atlas-eqsin,ripe-atlas-eqsin IPv6 with reason: Atlas device offline, scheduling reboot	[production]
16:35	<cmooney@cumin1002>	START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on ripe-atlas-eqsin,ripe-atlas-eqsin IPv6 with reason: Atlas device offline, scheduling reboot	[production]
16:28	<cmooney@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on ripe-atlas-eqiad,ripe-atlas-eqiad IPv6 with reason: Atlas device offline, scheduling reboot	[production]
16:28	<cmooney@cumin1002>	START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on ripe-atlas-eqiad,ripe-atlas-eqiad IPv6 with reason: Atlas device offline, scheduling reboot	[production]
16:15	<jayme@cumin1002>	END (PASS) - Cookbook sre.k8s.roll-reimage-nodes (exit_code=0) rolling reimage on P{wikikube-worker[1296-1300].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad)	[production]
16:15	<jayme@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1300.eqiad.wmnet with OS bookworm	[production]