2020-05-21
§
|
17:24 |
<elukey> |
add druid100[7,8] to the druid public cluster (not serving load balancer traffic for the moment, only joining the cluster) - T252771 |
[analytics] |
16:44 |
<elukey> |
roll restart druid historical nodes on druid100[4-6] (public cluster) to pick up new settings - T252771 |
[analytics] |
14:02 |
<elukey> |
restart druid kafka supervisor for wmf_netflow after maintenance |
[analytics] |
13:53 |
<elukey> |
restart druid-historical on an-druid100[1,2] to pick up new settings |
[analytics] |
13:17 |
<elukey> |
kill wmf_netflow druid supervisor for maintenance |
[analytics] |
13:13 |
<elukey> |
stop druid-daemons on druid100[1-3] (one at the time) to move the druid partition from /srv/druid to /srv (didn't think about it before) - T252771 |
[analytics] |
09:16 |
<elukey> |
move Druid Analytics SQL in Superset to druid://an-druid1001.eqiad.wmnet:8082/druid/v2/sql/ |
[analytics] |
09:05 |
<elukey> |
move turnilo to an-druid1001 (beefier host) |
[analytics] |
08:15 |
<elukey> |
roll restart of all druid historicals in the analytics cluster to pick up new settings |
[analytics] |
2020-05-07
§
|
17:39 |
<ottomata> |
deploying fix to refinery bin/camus CamusPartitionChecker when using dynamic stream configs |
[analytics] |
16:49 |
<joal> |
Restart and babysit mediawiki-history-denormalize-wf-2020-04 |
[analytics] |
16:37 |
<elukey> |
roll restart of all the nodemanagers on the hadoop cluster to pick up new jvm settings |
[analytics] |
13:53 |
<elukey> |
move stat1007 to role::statistics::explorer (adding jupyterhub) |
[analytics] |
11:00 |
<joal> |
Moving application_1583418280867_334532 to the nice queue |
[analytics] |
10:58 |
<joal> |
Rerun wikidata-articleplaceholder_metrics-wf-2020-5-6 |
[analytics] |
07:45 |
<elukey> |
re-run mediawiki-history-denormalize |
[analytics] |
07:43 |
<elukey> |
kill application_1583418280867_333560 after a chat with David, the job is consuming ~2TB of RAM |
[analytics] |
07:32 |
<elukey> |
re-run mediawiki history load |
[analytics] |
07:18 |
<elukey> |
execute yarn application -movetoqueue application_1583418280867_332862 -queue root.nice |
[analytics] |
07:06 |
<elukey> |
restart mediawiki-history-load via hue |
[analytics] |
06:41 |
<elukey> |
restart oozie on an-coord1001 |
[analytics] |
05:46 |
<elukey> |
re-run mediarequest-hourly-wf-2020-5-6-19 |
[analytics] |
05:35 |
<elukey> |
re-run two failed hours for webrequest load text (07/05T05) and upload (06/05T23) |
[analytics] |