production SAL

2025-05-21
18:45	<bking@cumin2002>	START - Cookbook sre.hosts.reimage for host cirrussearch1103.eqiad.wmnet with OS bullseye	[production]
18:42	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from elastic1103 to cirrussearch1103	[production]
18:42	<bking@cumin2002>	END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cirrussearch1103	[production]
18:39	<bking@cumin2002>	START - Cookbook sre.network.configure-switch-interfaces for host cirrussearch1103	[production]
18:39	<bking@cumin2002>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch1103 on all recursors	[production]
18:39	<bking@cumin2002>	START - Cookbook sre.dns.wipe-cache cirrussearch1103 on all recursors	[production]
18:39	<bking@cumin2002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
18:39	<bking@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming elastic1103 to cirrussearch1103 - bking@cumin2002"	[production]
18:38	<bking@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming elastic1103 to cirrussearch1103 - bking@cumin2002"	[production]
18:35	<bking@cumin2002>	START - Cookbook sre.dns.netbox	[production]
18:35	<bking@cumin2002>	START - Cookbook sre.hosts.rename from elastic1103 to cirrussearch1103	[production]
18:23	<bking@cumin2002>	conftool action : set/pooled=no:weight=10; selector: name=elastic1060.eqiad.wmnet\|name=elastic1061.eqiad.wmnet\|name=elastic1062.eqiad.wmnet\|name=elastic1063.eqiad.wmnet\|name=elastic1064.eqiad.wmnet\|name=elastic1065.eqiad.wmnet\|name=elastic1066.eqiad.wmnet\|name=elastic1067.eqiad.wmnet\|name=elastic1103.eqiad.wmnet	[production]
18:22	<jhancock@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2003.codfw.wmnet with OS bookworm	[production]
18:21	<dreamyjazz@deploy1003>	Finished scap sync-world: Backport for [[gerrit:1148924|Support creating logs in emptyUserGroup.php (T394914)]] (duration: 13m 18s)	[production]
18:14	<dreamyjazz@deploy1003>	dreamyjazz: Continuing with sync	[production]
18:10	<dreamyjazz@deploy1003>	dreamyjazz: Backport for [[gerrit:1148924|Support creating logs in emptyUserGroup.php (T394914)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there.	[production]
18:08	<dreamyjazz@deploy1003>	Started scap sync-world: Backport for [[gerrit:1148924|Support creating logs in emptyUserGroup.php (T394914)]]	[production]
17:33	<swfrench@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply	[production]
17:33	<swfrench@deploy1003>	helmfile [eqiad] START helmfile.d/services/mw-cron: apply	[production]
16:46	<fceratto@cumin1002>	END (FAIL) - Cookbook sre.mysql.sanitarium-restart (exit_code=99)	[production]
16:46	<fceratto@cumin1002>	START - Cookbook sre.mysql.sanitarium-restart	[production]
16:46	<fceratto@cumin1002>	END (FAIL) - Cookbook sre.mysql.sanitarium-restart (exit_code=99)	[production]
16:46	<fceratto@cumin1002>	START - Cookbook sre.mysql.sanitarium-restart	[production]
16:45	<fceratto@cumin1002>	END (ERROR) - Cookbook sre.mysql.sanitarium-restart (exit_code=97)	[production]
16:45	<fceratto@cumin1002>	START - Cookbook sre.mysql.sanitarium-restart	[production]
16:40	<eevans@cumin1002>	END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:restbase-codfw: Upgrading to Java 11.0.27 - eevans@cumin1002	[production]
16:13	<hnowlan@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply	[production]
16:12	<hnowlan@deploy1003>	helmfile [eqiad] START helmfile.d/services/mw-cron: apply	[production]
16:03	<jhancock@cumin2002>	START - Cookbook sre.hosts.reimage for host sretest2003.codfw.wmnet with OS bookworm	[production]
16:03	<jhancock@cumin2002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2003.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED	[production]
15:56	<jgiannelos@deploy1003>	helmfile [staging] DONE helmfile.d/services/mobileapps: sync	[production]
15:56	<jgiannelos@deploy1003>	helmfile [staging] START helmfile.d/services/mobileapps: sync	[production]
15:25	<jhancock@cumin2002>	START - Cookbook sre.hosts.provision for host sretest2003.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED	[production]
15:25	<jynus>	forgetting 4 old instances @ orchestrator-web T384274	[production]
15:22	<jhancock@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host pc2018.codfw.wmnet with OS bookworm	[production]
15:21	<jhancock@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002"	[production]
15:19	<jhancock@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002"	[production]
15:16	<btullis@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply	[production]
15:15	<btullis@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply	[production]
15:02	<jhancock@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc2018.codfw.wmnet with reason: host reimage	[production]
14:59	<jhancock@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on pc2018.codfw.wmnet with reason: host reimage	[production]
14:43	<jhancock@cumin2002>	START - Cookbook sre.hosts.reimage for host pc2018.codfw.wmnet with OS bookworm	[production]
14:43	<moritzm>	installing postgresql-15 security updates	[production]
14:42	<sukhe@cumin1002>	END (PASS) - Cookbook sre.dns.roll-restart (exit_code=0) rolling restart_daemons on A:dnsbox and not P{dns7001*} and A:dnsbox	[production]
14:40	<jynus@cumin1002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2200.codfw.wmnet,db1216.eqiad.wmnet with reason: Restart x3	[production]
14:38	<hnowlan@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply	[production]
14:37	<hnowlan@deploy1003>	helmfile [eqiad] START helmfile.d/services/mw-cron: apply	[production]
14:32	<jmm@cumin2002>	END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-eqiad	[production]
14:30	<sbassett>	Deployed updated security fix for T392341 (04) to 1.45-wmf.2	[production]
14:27	<jmm@cumin2002>	START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-eqiad	[production]