production SAL

2026-02-14
21:09	<ladsgroup@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance	[production]
21:09	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1226 (T410589)', diff saved to https://phabricator.wikimedia.org/P88823 and previous config saved to /var/cache/conftool/dbconfig/20260214-210906-ladsgroup.json	[production]
20:58	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1226', diff saved to https://phabricator.wikimedia.org/P88822 and previous config saved to /var/cache/conftool/dbconfig/20260214-205858-ladsgroup.json	[production]
20:48	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1226', diff saved to https://phabricator.wikimedia.org/P88821 and previous config saved to /var/cache/conftool/dbconfig/20260214-204849-ladsgroup.json	[production]
20:38	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1226 (T410589)', diff saved to https://phabricator.wikimedia.org/P88820 and previous config saved to /var/cache/conftool/dbconfig/20260214-203841-ladsgroup.json	[production]
17:57	<dzahn@cumin2002>	DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Alexandros Kosiaris out of all services on: 2447 hosts	[production]
10:02	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Depooling db1226 (T410589)', diff saved to https://phabricator.wikimedia.org/P88819 and previous config saved to /var/cache/conftool/dbconfig/20260214-100210-ladsgroup.json	[production]
10:02	<ladsgroup@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1226.eqiad.wmnet with reason: Maintenance	[production]
10:01	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1214 (T410589)', diff saved to https://phabricator.wikimedia.org/P88818 and previous config saved to /var/cache/conftool/dbconfig/20260214-100145-ladsgroup.json	[production]
09:51	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1214', diff saved to https://phabricator.wikimedia.org/P88817 and previous config saved to /var/cache/conftool/dbconfig/20260214-095137-ladsgroup.json	[production]
09:41	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1214', diff saved to https://phabricator.wikimedia.org/P88816 and previous config saved to /var/cache/conftool/dbconfig/20260214-094129-ladsgroup.json	[production]
09:31	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1214 (T410589)', diff saved to https://phabricator.wikimedia.org/P88815 and previous config saved to /var/cache/conftool/dbconfig/20260214-093121-ladsgroup.json	[production]
03:37	<brett>	Import lua5.4-maxminddb 0.1.1~deb13u1 into trixie-wikimedia (T401832)	[production]
02:14	<mwpresync@deploy2002>	Finished scap build-images: Publishing wmf/next image (duration: 13m 02s)	[production]
02:01	<mwpresync@deploy2002>	Started scap build-images: Publishing wmf/next image	[production]
01:12	<mutante>	cumin1003 - race condition between puppet and systemd - puppet fails because userdel fails. userdel fails because there is still a pid used by the user. that process is /lib/systemd/systemd --user - ran "loginctl terminate-user"; killed tmux PID; ran puppet again T417465	[production]
2026-02-13
23:25	<brett>	Import prometheus-varnishkafka-exporter 0.1~deb13u1 into trixie-wikimedia (T401832)	[production]
23:01	<brett>	Import varnishkafka 1.2.0~deb13+wmf1 into trixie-wikimedia (T401832)	[production]
22:48	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Depooling db1214 (T410589)', diff saved to https://phabricator.wikimedia.org/P88814 and previous config saved to /var/cache/conftool/dbconfig/20260213-224820-ladsgroup.json	[production]
22:48	<ladsgroup@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1214.eqiad.wmnet with reason: Maintenance	[production]
22:48	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1203 (T410589)', diff saved to https://phabricator.wikimedia.org/P88813 and previous config saved to /var/cache/conftool/dbconfig/20260213-224806-ladsgroup.json	[production]
22:37	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1203', diff saved to https://phabricator.wikimedia.org/P88812 and previous config saved to /var/cache/conftool/dbconfig/20260213-223758-ladsgroup.json	[production]
22:30	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mwlog2003.codfw.wmnet with OS bookworm	[production]
22:30	<pt1979@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002"	[production]
22:29	<sukhe@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on cp2043.codfw.wmnet with reason: host not provisioned yet	[production]
22:28	<sukhe>	disable puppet on cp2043: test host, wmfuniq service broken (needs upgrade for trixie)	[production]
22:27	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1203', diff saved to https://phabricator.wikimedia.org/P88811 and previous config saved to /var/cache/conftool/dbconfig/20260213-222749-ladsgroup.json	[production]
22:25	<pt1979@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002"	[production]
22:17	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1203 (T410589)', diff saved to https://phabricator.wikimedia.org/P88810 and previous config saved to /var/cache/conftool/dbconfig/20260213-221741-ladsgroup.json	[production]
22:03	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mwlog2003.codfw.wmnet with reason: host reimage	[production]
22:01	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on mwlog2003.codfw.wmnet with reason: host reimage	[production]
21:58	<jhancock@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host backup2015.codfw.wmnet with OS trixie	[production]
21:40	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host mwlog2003.codfw.wmnet with OS bookworm	[production]
21:39	<pt1979@cumin2002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host mwlog2003.codfw.wmnet with OS bookworm	[production]
20:54	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host mwlog2003.codfw.wmnet with OS bookworm	[production]
20:41	<jhancock@cumin2002>	END (FAIL) - Cookbook sre.network.configure-switch-interfaces (exit_code=99) for host cloudcephosd2008-dev	[production]
20:41	<jhancock@cumin2002>	START - Cookbook sre.network.configure-switch-interfaces for host cloudcephosd2008-dev	[production]
20:41	<jhancock@cumin2002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudcephosd2008-dev.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED	[production]
20:41	<jhancock@cumin2002>	START - Cookbook sre.hosts.provision for host cloudcephosd2008-dev.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED	[production]
20:40	<jhancock@cumin2002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudcephosd2008-dev.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED	[production]
20:40	<jhancock@cumin2002>	START - Cookbook sre.hosts.provision for host cloudcephosd2008-dev.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED	[production]
20:38	<jhancock@cumin2002>	END (FAIL) - Cookbook sre.network.configure-switch-interfaces (exit_code=99) for host cloudcephosd2008-dev	[production]
20:38	<jhancock@cumin2002>	START - Cookbook sre.network.configure-switch-interfaces for host cloudcephosd2008-dev	[production]
20:38	<jhancock@cumin2002>	START - Cookbook sre.hosts.reimage for host backup2015.codfw.wmnet with OS trixie	[production]
20:38	<jhancock@cumin2002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
20:38	<jhancock@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding cloudcephosd2008-dev to codfw - jhancock@cumin2002"	[production]
20:38	<jhancock@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding cloudcephosd2008-dev to codfw - jhancock@cumin2002"	[production]
20:38	<jhancock@cumin2002>	END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['backup2015']	[production]
20:37	<jhancock@cumin2002>	START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['backup2015']	[production]
20:36	<jhancock@cumin2002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host backup2015.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED	[production]