2020-09-24
§
|
13:24 |
<elukey> |
moved the hadoop cluster to puppet TLS certificates |
[analytics] |
13:20 |
<elukey> |
re-enable timers on an-launcher1002 after maintenance |
[analytics] |
09:51 |
<elukey> |
stop all timers on an-launcher1002 to ease maintenance |
[analytics] |
09:41 |
<elukey> |
force re-creation of jupyterhub's default venv on stat1006 after reimage |
[analytics] |
07:29 |
<klausman> |
Starting reimaging of stat1006 |
[analytics] |
06:48 |
<elukey> |
on an-launcher1002: sudo -u hdfs kerberos-run-command hdfs hdfs dfs -rm -r -skipTrash /var/log/hadoop-yarn/apps/mirrys/logs/* |
[analytics] |
06:45 |
<elukey> |
on an-launcher1002: sudo -u hdfs kerberos-run-command hdfs hdfs dfs -rm -r -skipTrash /var/log/hadoop-yarn/apps/analytics-privatedata/logs/* |
[analytics] |
06:39 |
<elukey> |
manually ran "/usr/bin/find /srv/backup/hadoop/namenode -mtime +15 -delete" on an-master1002 to free some space in the backup partition |
[analytics] |
2020-09-16
§
|
19:12 |
<joal> |
Manually kill webrequest-hour oozie job that started before the restart could happen (waiting for previous hour to be finished) |
[analytics] |
19:00 |
<joal> |
Kill-restart data-quality-hourly bundle after deploy |
[analytics] |
18:57 |
<joal> |
Kill-restart webrequest after deploy |
[analytics] |
18:44 |
<joal> |
Kill restart mediawiki-history-reduced job after deploy |
[analytics] |
17:59 |
<joal> |
Deploy refinery onto HDFS |
[analytics] |
17:46 |
<joal> |
Deploy refinery using scap |
[analytics] |
15:27 |
<elukey> |
update the TLS backend certificate for Analytics UIs (unified one) to include hue-next.w.o as SAN |
[analytics] |
12:11 |
<klausman> |
stat1008 updated to use rock/rocm DKMS driver and back in operation |
[analytics] |
11:28 |
<klausman> |
starting to upgrade to rock-dkms driver on stat1008 |
[analytics] |