production SAL

2025-01-15
19:21	<aqu@deploy2002>	Finished deploy [airflow-dags/analytics@cd03eb7]: Cascading backfill under projectview hourly (duration: 01m 06s)	[production]
19:20	<aqu@deploy2002>	Started deploy [airflow-dags/analytics@cd03eb7]: Cascading backfill under projectview hourly	[production]
19:15	<andrew@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bookworm	[production]
19:15	<andrew@cumin1002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcephosd1012.eqiad.wmnet with OS bookworm	[production]
19:08	<andrew@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bookworm	[production]
19:08	<andrew@cumin1002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bookworm	[production]
19:07	<sukhe@cumin1002>	END (FAIL) - Cookbook sre.dns.admin (exit_code=99) DNS admin: depool site ulsfo [reason: this is a test, not actual depool, no task ID specified]	[production]
19:07	<sukhe@cumin1002>	START - Cookbook sre.dns.admin DNS admin: depool site ulsfo [reason: this is a test, not actual depool, no task ID specified]	[production]
18:51	<andrew@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bookworm	[production]
18:50	<andrew@cumin1002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bookworm	[production]
18:22	<andrew@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bookworm	[production]
18:22	<andrew@cumin1002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bookworm	[production]
18:16	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db1163 (T371742)', diff saved to https://phabricator.wikimedia.org/P72080 and previous config saved to /var/cache/conftool/dbconfig/20250115-181629-ladsgroup.json	[production]
18:16	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1163.eqiad.wmnet with reason: Maintenance	[production]
18:16	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 12:00:00 on db1163.eqiad.wmnet with reason: Maintenance	[production]
18:09	<volans@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:05:00 on sretest1002.eqiad.wmnet with reason: testing cookbook	[production]
18:06	<volans@cumin2002>	START - Cookbook sre.hosts.downtime for 0:05:00 on sretest1002.eqiad.wmnet with reason: testing cookbook	[production]
18:05	<volans@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:05:00 on sretest1001.eqiad.wmnet with reason: testing cookbook	[production]
17:59	<lucaswerkmeister-wmde@deploy2002>	Finished scap sync-world: Backport for [[gerrit:1111166|Added new stream config for haproxy_requestctl (T383392)]] (duration: 13m 38s)	[production]
17:58	<andrew@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bookworm	[production]
17:56	<cmooney@cumin1002>	END (PASS) - Cookbook sre.hosts.dhcp (exit_code=0) for host cloudcephosd1012.eqiad.wmnet	[production]
17:54	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1163 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72079 and previous config saved to /var/cache/conftool/dbconfig/20250115-175456-root.json	[production]
17:53	<cmooney@cumin1002>	START - Cookbook sre.hosts.dhcp for host cloudcephosd1012.eqiad.wmnet	[production]
17:53	<kamila@cumin1002>	END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1102-1106].eqiad.wmnet	[production]
17:53	<kamila@cumin1002>	START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1102-1106].eqiad.wmnet	[production]
17:52	<lucaswerkmeister-wmde@deploy2002>	lucaswerkmeister-wmde, fabfur: Continuing with sync	[production]
17:52	<lucaswerkmeister-wmde@deploy2002>	lucaswerkmeister-wmde, fabfur: Backport for [[gerrit:1111166|Added new stream config for haproxy_requestctl (T383392)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
17:52	<kamila@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1103.eqiad.wmnet with OS bookworm	[production]
17:51	<kamila_>	homer creqiad commit T377876	[production]
17:49	<kamila@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1102.eqiad.wmnet with OS bookworm	[production]
17:47	<kamila@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1105.eqiad.wmnet with OS bookworm	[production]
17:45	<lucaswerkmeister-wmde@deploy2002>	Started scap sync-world: Backport for [[gerrit:1111166|Added new stream config for haproxy_requestctl (T383392)]]	[production]
17:43	<kamila@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1106.eqiad.wmnet with OS bookworm	[production]
17:39	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1163 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72077 and previous config saved to /var/cache/conftool/dbconfig/20250115-173951-root.json	[production]
17:36	<andrew@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1012.eqiad.wmnet with OS bookworm	[production]
17:34	<kamila@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1103.eqiad.wmnet with reason: host reimage	[production]
17:30	<kamila@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1102.eqiad.wmnet with reason: host reimage	[production]
17:26	<kamila@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1105.eqiad.wmnet with reason: host reimage	[production]
17:26	<andrew@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bookworm	[production]
17:26	<andrew@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1012.eqiad.wmnet with OS bookworm	[production]
17:25	<root@cumin1002>	END (PASS) - Cookbook sre.puppet.renew-cert (exit_code=0) for dbprov1006.eqiad.wmnet: Renew puppet certificate - root@cumin1002	[production]
17:24	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1163 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72076 and previous config saved to /var/cache/conftool/dbconfig/20250115-172445-root.json	[production]
17:24	<hnowlan>	running `decommission` for 5 codfw jobrunners	[production]
17:24	<kamila@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1106.eqiad.wmnet with reason: host reimage	[production]
17:22	<kamila@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1103.eqiad.wmnet with reason: host reimage	[production]
17:21	<kamila@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1105.eqiad.wmnet with reason: host reimage	[production]
17:21	<root@cumin1002>	START - Cookbook sre.puppet.renew-cert for dbprov1006.eqiad.wmnet: Renew puppet certificate - root@cumin1002	[production]
17:21	<kamila@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1102.eqiad.wmnet with reason: host reimage	[production]
17:21	<kamila@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1106.eqiad.wmnet with reason: host reimage	[production]
17:18	<eevans@cumin1002>	END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:cassandra-dev: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002	[production]