production SAL

2951-3000 of 10000 results (124ms)

2024-12-10 §
21:41	<cjming@deploy2002>	cjming, dani: Backport for [[gerrit:1101875\|Reader Survey: Increase coverage (T378660)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
21:37	<cjming@deploy2002>	Started scap sync-world: Backport for [[gerrit:1101875\|Reader Survey: Increase coverage (T378660)]]	[production]
21:35	<cjming@deploy2002>	Finished scap sync-world: Backport for [[gerrit:1101600\|LanguageConverter: Ignore content inside <math> and <svg> elements (T381617)]] (duration: 11m 55s)	[production]
21:30	<cjming@deploy2002>	bvibber, cjming: Continuing with sync	[production]
21:27	<cjming@deploy2002>	bvibber, cjming: Backport for [[gerrit:1101600\|LanguageConverter: Ignore content inside <math> and <svg> elements (T381617)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
21:23	<cjming@deploy2002>	Started scap sync-world: Backport for [[gerrit:1101600\|LanguageConverter: Ignore content inside <math> and <svg> elements (T381617)]]	[production]
21:22	<ryankemper@cumin2002>	END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) (T376150, xfer wdqs scholarly 2023(public)->2026(internal)) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2026.codfw.wmnet w/ force delete existing files, repooling source-only afterwards	[production]
20:55	<mforns@deploy2002>	Finished deploy [airflow-dags/analytics@2af4e1a]: Fix for the Commons Impact Metrics job (duration: 01m 38s)	[production]
20:54	<mforns@deploy2002>	Started deploy [airflow-dags/analytics@2af4e1a]: Fix for the Commons Impact Metrics job	[production]
20:47	<mforns@deploy2002>	Finished deploy [analytics/refinery@25c1946] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@25c1946c] (duration: 00m 27s)	[production]
20:46	<mforns@deploy2002>	Started deploy [analytics/refinery@25c1946] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@25c1946c]	[production]
20:46	<mforns@deploy2002>	Finished deploy [analytics/refinery@25c1946] (thin): Regular analytics weekly train THIN [analytics/refinery@25c1946c] (duration: 00m 31s)	[production]
20:45	<mforns@deploy2002>	Started deploy [analytics/refinery@25c1946] (thin): Regular analytics weekly train THIN [analytics/refinery@25c1946c]	[production]
20:45	<mforns@deploy2002>	Finished deploy [analytics/refinery@25c1946]: Regular analytics weekly train [analytics/refinery@25c1946c] (duration: 13m 12s)	[production]
20:38	<ryankemper@cumin2002>	START - Cookbook sre.wdqs.data-transfer (T376150, xfer wdqs scholarly 2023(public)->2026(internal)) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2026.codfw.wmnet w/ force delete existing files, repooling source-only afterwards	[production]
20:38	<ryankemper@cumin2002>	END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) (T376150, xfer wdqs scholarly 2023(public)->2026(internal)) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards	[production]
20:37	<ryankemper@cumin2002>	START - Cookbook sre.wdqs.data-transfer (T376150, xfer wdqs scholarly 2023(public)->2026(internal)) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards	[production]
20:32	<mforns@deploy2002>	Started deploy [analytics/refinery@25c1946]: Regular analytics weekly train [analytics/refinery@25c1946c]	[production]
20:28	<jhathaway@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on ms-be1088.eqiad.wmnet with reason: T381919	[production]
20:28	<jhathaway@cumin1002>	START - Cookbook sre.hosts.downtime for 4:00:00 on ms-be1088.eqiad.wmnet with reason: T381919	[production]
20:04	<jhathaway@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on ms-be1088.eqiad.wmnet with reason: T381919	[production]
20:04	<jhathaway@cumin1002>	START - Cookbook sre.hosts.downtime for 3:00:00 on ms-be1088.eqiad.wmnet with reason: T381919	[production]
18:35	<marostegui@cumin1002>	dbctl commit (dc=all): 'db2158 (re)pooling @ 100%: Pooling in production', diff saved to https://phabricator.wikimedia.org/P71693 and previous config saved to /var/cache/conftool/dbconfig/20241210-183545-root.json	[production]
18:20	<marostegui@cumin1002>	dbctl commit (dc=all): 'db2158 (re)pooling @ 75%: Pooling in production', diff saved to https://phabricator.wikimedia.org/P71692 and previous config saved to /var/cache/conftool/dbconfig/20241210-182040-root.json	[production]
18:05	<marostegui@cumin1002>	dbctl commit (dc=all): 'db2158 (re)pooling @ 50%: Pooling in production', diff saved to https://phabricator.wikimedia.org/P71691 and previous config saved to /var/cache/conftool/dbconfig/20241210-180534-root.json	[production]
18:02	<dbrant@deploy2002>	helmfile [codfw] DONE helmfile.d/services/mobileapps: apply	[production]
18:02	<dbrant@deploy2002>	helmfile [codfw] START helmfile.d/services/mobileapps: apply	[production]
18:02	<dbrant@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply	[production]
18:01	<elukey@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - elukey@cumin1002"	[production]
18:01	<dbrant@deploy2002>	helmfile [eqiad] START helmfile.d/services/mobileapps: apply	[production]
18:00	<dbrant@deploy2002>	helmfile [staging] DONE helmfile.d/services/mobileapps: apply	[production]
18:00	<dbrant@deploy2002>	helmfile [staging] START helmfile.d/services/mobileapps: apply	[production]
17:55	<hnowlan@deploy1003>	helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply	[production]
17:54	<hnowlan@deploy1003>	helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply	[production]
17:50	<marostegui@cumin1002>	dbctl commit (dc=all): 'db2158 (re)pooling @ 25%: Pooling in production', diff saved to https://phabricator.wikimedia.org/P71690 and previous config saved to /var/cache/conftool/dbconfig/20241210-175029-root.json	[production]
17:47	<hnowlan@deploy1003>	helmfile [codfw] DONE helmfile.d/services/mw-videoscaler: apply	[production]
17:47	<hnowlan@deploy1003>	helmfile [codfw] START helmfile.d/services/mw-videoscaler: apply	[production]
17:42	<hnowlan@deploy1003>	helmfile [codfw] DONE helmfile.d/services/mw-videoscaler: apply	[production]
17:42	<hnowlan@deploy1003>	helmfile [codfw] START helmfile.d/services/mw-videoscaler: apply	[production]
17:42	<hnowlan@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/mw-videoscaler: apply	[production]
17:41	<hnowlan@deploy1003>	helmfile [eqiad] START helmfile.d/services/mw-videoscaler: apply	[production]
17:35	<marostegui@cumin1002>	dbctl commit (dc=all): 'db2158 (re)pooling @ 10%: Pooling in production', diff saved to https://phabricator.wikimedia.org/P71688 and previous config saved to /var/cache/conftool/dbconfig/20241210-173524-root.json	[production]
17:30	<elukey@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-lab1002.eqiad.wmnet with reason: host reimage	[production]
17:29	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2158.codfw.wmnet with reason: maintenance	[production]
17:28	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 1:00:00 on db2158.codfw.wmnet with reason: maintenance	[production]
17:25	<elukey@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on ml-lab1002.eqiad.wmnet with reason: host reimage	[production]
17:24	<herron@cumin1002>	dbctl commit (dc=all): 'depooling db2158 T381901', diff saved to https://phabricator.wikimedia.org/P71687 and previous config saved to /var/cache/conftool/dbconfig/20241210-172424-herron.json	[production]
17:18	<hnowlan@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/mw-videoscaler: apply	[production]
17:18	<hnowlan@deploy1003>	helmfile [eqiad] START helmfile.d/services/mw-videoscaler: apply	[production]
17:13	<elukey@cumin1002>	START - Cookbook sre.hosts.reimage for host ml-lab1002.eqiad.wmnet with OS bookworm	[production]