analytics SAL

951-1000 of 2404 results (24ms)

2018-08-01 §
10:06	<elukey>	restart all the yarn nodemanagers after minor max memory allocation change	[analytics]
09:19	<elukey>	restart webrequest-load-wf-text-2018-8-1-7 (died due to yarn restarts)	[analytics]
06:59	<elukey>	restart eventlogging on eventlog1002 to pick up new logging settings	[analytics]
2018-07-28 §
17:29	<elukey>	restart eventlogging on eventlog1002 after tons of disconnects (still not clear what happened)	[analytics]
2018-07-27 §
15:18	<joal>	Deploying AQS with scap	[analytics]
11:42	<joal>	Restart mediawiki-history-denormalize oozie job after deploy	[analytics]
2018-07-26 §
17:09	<joal>	Restart mediawiki-history-reduced job after deploy	[analytics]
14:03	<joal>	Restart webrequest-bundle load job to pick new pageview definition	[analytics]
13:59	<joal>	Start wikidata-coeditors job	[analytics]
07:57	<joal>	Deploying refinery with scap - 2nd try	[analytics]
2018-07-25 §
15:42	<joal>	Deploying refinery onto HDFS	[analytics]
14:38	<joal>	Release refinery v0.0.67 to archiva	[analytics]
2018-07-24 §
11:46	<joal>	Cheked that oozie webrequest upload warning for hour 2018-07-24-07 contains only false positive	[analytics]
2018-07-18 §
08:48	<elukey>	re-run hour 7 of webrequest upload/text via Hue (failed due to a hadoop node restart)	[analytics]
2018-07-10 §
10:43	<elukey>	restart map reduce history server on an1001 as attempt to see if related with yarn.w.o unresponsiveness	[analytics]
10:03	<elukey>	bounce yarn RM on an100[12], some socket errors after the ip6 interface rollout	[analytics]
08:20	<joal>	Update AQS druid backend datasource to 2018-06	[analytics]
2018-07-05 §
10:36	<elukey>	restart oozie on analytics1003 - connection timeouts from thorium after mariadb maintenance	[analytics]
10:34	<elukey>	restart hive metastore on an1003, errors after mariadb maintenance this morning	[analytics]
07:44	<elukey>	all jobs re-enabled	[analytics]
06:26	<elukey>	stop camus to allow mariadb restart on analytics1003	[analytics]
2018-07-02 §
14:56	<elukey>	resume cassandra bundle via hue	[analytics]
13:27	<elukey>	suspend cassandra bundle via Hue to ease the reimage of aqs1004	[analytics]
09:12	<joal>	Rerun mediawiki-geoeditors-load-wf-2018-06 after having fixed the wmf_raw.mediawiki_private_cu_changes table issueb	[analytics]
07:12	<joal>	Restart cassandra bundle	[analytics]
2018-06-28 §
14:46	<elukey>	upgrade piwik 3.2.1 to matomo (new name/package) 3.5.1	[analytics]
11:27	<joal>	Change mediawiki-reduced table format to be parquet and restart mediawiki-reduced oozie job	[analytics]
11:19	<joal>	Restart druid uniques daily-monthly-aggregated indexation jobs	[analytics]
11:19	<joal>	Start backfilling job cassandra pageviews-top-countries ceiled-values	[analytics]
10:20	<joal>	Deploying refinery to HDFS	[analytics]
10:09	<joal>	Deploying refinery using scap	[analytics]
09:03	<joal>	deploying AQS pageviews-bycountry ceiled value glue code	[analytics]
07:41	<fdans>	testing load of 2 months of per country pageviews with the new ceiled value	[analytics]
06:10	<elukey>	move /srv/kafka to a dedicated 60G partition on deployment-jumbo hosts in deployment-prep	[analytics]
2018-06-27 §
21:51	<elukey>	piwik maintenance completed	[analytics]
13:08	<elukey>	piwik upgraded to 3.2.1 on bohrium + started the db migration procedure (will last 2/3h probably)	[analytics]
12:57	<elukey>	set Piwik in maintenance mode as prep step for backup + upgrade	[analytics]
2018-06-20 §
19:54	<ottomata>	removed Kafka MirrorMaker from kafka10(12\|13\|14)	[analytics]
2018-06-18 §
11:57	<joal>	Restart oozie webrequest refine jobs	[analytics]
11:19	<joal>	Launch oozie webrequest refine jobs for the failing hour 2018-06-14-11	[analytics]
10:18	<joal>	Deployed refiney on hdfs	[analytics]
10:18	<joal>	Deployed refinery with scap	[analytics]
2018-06-15 §
09:00	<joal>	Deleting corrupted file hdfs://analytics-hadoop/user/joal/wmf/data/raw/webrequest/webrequest_upload/hourly/2018/06/14/11/webrequest_upload.1004.10.1214791.15490650727.1528974000000._COPYING_ to prevent webrequest refine jobs from failing. No data will be lost as the correct file exist.	[analytics]
2018-06-14 §
19:29	<joal>	try rerunning webrequest-load-wf-upload-2018-6-14-11	[analytics]
13:14	<elukey>	re-run failed webrequest-upload/text jobs (namenodes restarted)	[analytics]
2018-06-11 §
13:56	<ottomata>	bouncing eventlogging processes to apply kafka event time producing	[analytics]
2018-06-08 §
11:45	<joal>	Launching manual sqooping of revision and archive table to recover from failure	[analytics]
2018-06-01 §
08:37	<joal>	Restart every druid loading oozie job (except mediawiki reduced) to pick new configuration	[analytics]
08:33	<joal>	Restart mediawiki-history-denormalize oozie job after deploy	[analytics]
08:24	<joal>	Deploy refinery on HDFS	[analytics]