2018-08-01 §
10:06 <elukey> restart all the yarn nodemanagers after minor max memory allocation change [analytics]
09:19 <elukey> restart webrequest-load-wf-text-2018-8-1-7 (died due to yarn restarts) [analytics]
06:59 <elukey> restart eventlogging on eventlog1002 to pick up new logging settings [analytics]
2018-07-28 §
17:29 <elukey> restart eventlogging on eventlog1002 after tons of disconnects (still not clear what happened) [analytics]
2018-07-27 §
15:18 <joal> Deploying AQS with scap [analytics]
11:42 <joal> Restart mediawiki-history-denormalize oozie job after deploy [analytics]
2018-07-26 §
17:09 <joal> Restart mediawiki-history-reduced job after deploy [analytics]
14:03 <joal> Restart webrequest-bundle load job to pick new pageview definition [analytics]
13:59 <joal> Start wikidata-coeditors job [analytics]
07:57 <joal> Deploying refinery with scap - 2nd try [analytics]
2018-07-25 §
15:42 <joal> Deploying refinery onto HDFS [analytics]
14:38 <joal> Release refinery v0.0.67 to archiva [analytics]
2018-07-24 §
11:46 <joal> Checked that oozie webrequest upload warning for hour 2018-07-24-07 contains only false positives [analytics]
2018-07-18 §
08:48 <elukey> re-run hour 7 of webrequest upload/text via Hue (failed due to a hadoop node restart) [analytics]
2018-07-10 §
10:43 <elukey> restart map reduce history server on an1001 as an attempt to see if it is related to yarn.w.o unresponsiveness [analytics]
10:03 <elukey> bounce yarn RM on an100[12], some socket errors after the ip6 interface rollout [analytics]
08:20 <joal> Update AQS druid backend datasource to 2018-06 [analytics]
2018-07-05 §
10:36 <elukey> restart oozie on analytics1003 - connection timeouts from thorium after mariadb maintenance [analytics]
10:34 <elukey> restart hive metastore on an1003, errors after mariadb maintenance this morning [analytics]
07:44 <elukey> all jobs re-enabled [analytics]
06:26 <elukey> stop camus to allow mariadb restart on analytics1003 [analytics]
2018-07-02 §
14:56 <elukey> resume cassandra bundle via hue [analytics]
13:27 <elukey> suspend cassandra bundle via Hue to ease the reimage of aqs1004 [analytics]
09:12 <joal> Rerun mediawiki-geoeditors-load-wf-2018-06 after having fixed the wmf_raw.mediawiki_private_cu_changes table issue [analytics]
07:12 <joal> Restart cassandra bundle [analytics]
2018-06-28 §
14:46 <elukey> upgrade piwik 3.2.1 to matomo (new name/package) 3.5.1 [analytics]
11:27 <joal> Change mediawiki-reduced table format to be parquet and restart mediawiki-reduced oozie job [analytics]
11:19 <joal> Restart druid uniques daily-monthly-aggregated indexation jobs [analytics]
11:19 <joal> Start backfilling job cassandra pageviews-top-countries ceiled-values [analytics]
10:20 <joal> Deploying refinery to HDFS [analytics]
10:09 <joal> Deploying refinery using scap [analytics]
09:03 <joal> deploying AQS pageviews-bycountry ceiled value glue code [analytics]
07:41 <fdans> testing load of 2 months of per country pageviews with the new ceiled value [analytics]
06:10 <elukey> move /srv/kafka to a dedicated 60G partition on deployment-jumbo hosts in deployment-prep [analytics]
2018-06-27 §
21:51 <elukey> piwik maintenance completed [analytics]
13:08 <elukey> piwik upgraded to 3.2.1 on bohrium + started the db migration procedure (will last 2/3h probably) [analytics]
12:57 <elukey> set Piwik in maintenance mode as prep step for backup + upgrade [analytics]
2018-06-20 §
19:54 <ottomata> removed Kafka MirrorMaker from kafka10(12|13|14) [analytics]
2018-06-18 §
11:57 <joal> Restart oozie webrequest refine jobs [analytics]
11:19 <joal> Launch oozie webrequest refine jobs for the failing hour 2018-06-14-11 [analytics]
10:18 <joal> Deployed refinery on hdfs [analytics]
10:18 <joal> Deployed refinery with scap [analytics]
2018-06-15 §
09:00 <joal> Deleting corrupted file hdfs://analytics-hadoop/user/joal/wmf/data/raw/webrequest/webrequest_upload/hourly/2018/06/14/11/webrequest_upload.1004.10.1214791.15490650727.1528974000000._COPYING_ to prevent webrequest refine jobs from failing. No data will be lost as the correct file exists. [analytics]
2018-06-14 §
19:29 <joal> try rerunning webrequest-load-wf-upload-2018-6-14-11 [analytics]
13:14 <elukey> re-run failed webrequest-upload/text jobs (namenodes restarted) [analytics]
2018-06-11 §
13:56 <ottomata> bouncing eventlogging processes to apply kafka event time producing [analytics]
2018-06-08 §
11:45 <joal> Launching manual sqooping of revision and archive table to recover from failure [analytics]
2018-06-01 §
08:37 <joal> Restart every druid loading oozie job (except mediawiki reduced) to pick new configuration [analytics]
08:33 <joal> Restart mediawiki-history-denormalize oozie job after deploy [analytics]
08:24 <joal> Deploy refinery on HDFS [analytics]