2020-12-01 §
09:03 <elukey> clean up old hive metastore/server logs on an-coord1001 to free space [analytics]
2020-11-30 §
17:51 <joal> Deploy refinery onto hdfs [analytics]
17:49 <joal> Kill-restart mediawiki-history-load job after refactor (1 coordinator per table) and tables addition [analytics]
17:32 <joal> Kill-restart mediawiki-history-reduced job for druid-public datasource number of shards update [analytics]
17:32 <joal> Deploy refinery using scap for naming hotfix [analytics]
15:29 <ottomata> migrated EventLogging schemas SpecialMuteSubmit and SpecialInvestigate to EventGate - T268517 [analytics]
14:56 <joal> Deploying refinery onto hdfs [analytics]
14:49 <joal> Create new hive tables for newly sqooped data [analytics]
14:45 <joal> Deploy refinery using scap [analytics]
09:08 <elukey> force execution of refinery-drop-pageview-actor-hourly-partitions on an-launcher1002 (after args fixup from Joseph) [analytics]
2020-11-27 §
14:51 <elukey> roll restart zookeeper on druid* nodes for openjdk upgrades [analytics]
10:29 <elukey> restart eventlogging_to_druid_editattemptstep_hourly on an-launcher1002 (failed) to see if the hive metastore works [analytics]
10:27 <elukey> restart oozie and presto-server on an-coord1001 for openjdk upgrades [analytics]
10:27 <elukey> restart hive server and metastore on an-coord1001 - openjdk upgrades + problem with high GC caused by a job [analytics]
08:05 <elukey> roll restart druid public cluster for openjdk upgrades [analytics]
2020-11-26 §
13:52 <elukey> roll restart druid daemons on druid analytics to pick up new openjdk upgrades [analytics]
13:08 <elukey> force umount/mount of all /mnt/hdfs mountpoints to pick up openjdk upgrades [analytics]
09:07 <elukey> force purging https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article/en.wikipedia/all-access/user/Diego_Maradona/daily/2020110500/2020112500 from caches [analytics]
08:40 <elukey> roll restart cassandra on aqs10* for openjdk upgrades [analytics]
2020-11-25 §
19:04 <joal> Killing job application_1605880843685_18336 as it consumes too much resources [analytics]
18:40 <elukey> restart turnilo to pick up new netflow config changes [analytics]
16:46 <elukey> move analytics1066 to C3 [analytics]
16:11 <elukey> move analytics1065 to C3 [analytics]
15:38 <elukey> move stat1004 to A5 [analytics]
2020-11-24 §
19:33 <elukey> kill and restart webrequest_load bundle to pick up analytics-hive.eqiad.wmnet settings [analytics]
19:05 <elukey> deploy refinery to hdfs (even if not really needed) [analytics]
18:47 <elukey> deploy analytics refinery as part of the regular weekly train [analytics]
15:38 <elukey> move druid1005 from rack B7 to B6 [analytics]
14:59 <elukey> move analytics1072 from rack B2 to B3 [analytics]
09:16 <elukey> drop principals and keytabs for analytics10[42-57] - T267932 [analytics]
2020-11-21 §
08:10 <elukey> remove big stderrlog file in /var/lib/hadoop/data/d/yarn/logs/application_1605880843685_1450 on an-worker1110 [analytics]
08:05 <elukey> remove big stderrlog file in /var/lib/hadoop/data/e/yarn/logs/application_1605880843685_1450 on an-worker1105 [analytics]
2020-11-20 §
21:09 <razzi> truncate /var/lib/hadoop/data/u/yarn/logs/application_1605880843685_0581/container_e27_1605880843685_0581_01_000171/stderr logfile on an-worker1098 [analytics]
2020-11-19 §
16:35 <elukey> roll restart hadoop workers for openjdk upgrades [analytics]
07:07 <elukey> roll restart java daemons on Hadoop test for openjdk upgrades [analytics]
06:50 <elukey> restart refinery-import-siteinfo-dumps.service on an-launcher1002 [analytics]
2020-11-18 §
09:22 <elukey> set dns_canonicalize_hostname = false to all kerberos clients [analytics]
2020-11-17 §
23:09 <mforns> restarted browser general oozie job [analytics]
23:00 <mforns> finished deploying refinery (regular weekly deployment train) [analytics]
22:36 <mforns> deploying refinery (regular weekly deployment train) [analytics]
15:11 <elukey> drop backup@localhost user from an-coord1001's mariadb meta instance (not used anymore) [analytics]
15:09 <elukey> drop 'dump' user from an-coord1001's analytics meta (related to dbprov hosts, previous attempts before db1108) [analytics]
14:57 <elukey> shutdown stat1008 for ram expansion [analytics]
11:28 <elukey> set analytics meta instance on an-coord1002 as replica of an-coord1001 [analytics]
2020-11-16 §
10:41 <klausman> about to update stat1008 to new kernel and rocm [analytics]
09:13 <joal> Rerun webrequest-refine for hours 0 to 6 of day 2020-11-16 - This will prevent webrequest-druid-daily from getting loaded with incoherent data due to bucketing change [analytics]
08:45 <joal> Correct webrequest job directly on HDFS and restart webrequest bundle oozie job [analytics]
08:43 <joal> Kill webrequest bundle to correct typo [analytics]
08:31 <joal> Restart webrequest bundle oozie job with update [analytics]
08:31 <joal> Restart webrequest bun [analytics]