| 
      
        2021-04-29
      
      §
     | 
  
    
  | 15:55 | 
  <razzi> | 
  restart hadoop-yarn-nodemanager and hadoop-hdfs-datanode on an-worker1100 for hadoop to recognize new disk /dev/sdl | 
  [analytics] | 
            
  | 15:38 | 
  <ottomata> | 
  enabling event_sanitized_main jobs - T273789 | 
  [analytics] | 
            
  | 14:57 | 
  <elukey> | 
  run mysql_upgrade on an-coord1001 to complete the buster upgrade - T278424 | 
  [analytics] | 
            
  | 14:44 | 
  <hnowlan> | 
  restored all eventlogging jobs to eventlog1003 | 
  [analytics] | 
            
  | 14:21 | 
  <hnowlan> | 
  bump eventlog1003 CPUs to 6 | 
  [analytics] | 
            
  | 13:53 | 
  <joal> | 
  Rerun failed pageview-hourly-wf-2021-4-29-11 and pageview-hourly-wf-2021-4-29-12 | 
  [analytics] | 
            
  | 13:09 | 
  <joal> | 
  Rerun failed pageview-hourly-wf-2021-4-29-11 | 
  [analytics] | 
            
  | 12:35 | 
  <hnowlan> | 
  restarting 2 processors on eventlog1002 | 
  [analytics] | 
            
  | 12:02 | 
  <hnowlan> | 
  stopping processors on eventlog1002 to migrate to eventlog1003 | 
  [analytics] | 
            
  | 11:50 | 
  <elukey> | 
  manual stop of one of the eventlog processors on eventlog1002 to see if 1003 takes it over | 
  [analytics] | 
            
  | 02:59 | 
  <milimetric> | 
  deployed hotfix for referrer job | 
  [analytics] | 
            
  
    | 
      
        2021-04-28
      
      §
     | 
  
    
  | 17:46 | 
  <hnowlan> | 
  eventlog1003 joined to groups successfully | 
  [analytics] | 
            
  | 17:36 | 
  <razzi> | 
  sudo mkdir /srv/log/eventlogging and sudo chown eventlogging:eventlogging /srv/log/eventlogging to workaround missing directory puppet error (to be puppetized later) | 
  [analytics] | 
            
  | 17:31 | 
  <razzi> | 
  remove deployment cache on eventlogging1003: sudo rm -fr /srv/deployment/eventlogging/analytics-cache/ | 
  [analytics] | 
            
  | 17:26 | 
  <razzi> | 
  manually change /srv/deployment/eventlogging/analytics/.git/DEPLOY_HEAD to deployment1002 on deployment1002 to fix puppet scap error | 
  [analytics] | 
            
  | 16:53 | 
  <hnowlan> | 
  stopping deployment-eventlog05 in deployment-prep | 
  [analytics] | 
            
  | 14:42 | 
  <milimetric> | 
  deployed refinery with 0.1.9 jars and synced to hdfs | 
  [analytics] | 
            
  | 14:30 | 
  <elukey> | 
  chown -R analytics-deploy:analytics-deploy /srv/deployment/analytics on an-coord1001 | 
  [analytics] | 
            
  | 12:50 | 
  <ottomata> | 
  applied data_purge jobs in analytics test cluster; old data will now be dropped there - T273789 | 
  [analytics] | 
            
  
    | 
      
        2021-04-21
      
      §
     | 
  
    
  | 21:30 | 
  <ottomata> | 
  temporariliy disabling sanitize_eventlogging_analytics_delayed jobs until T280813 is completed (probably tomorrow) | 
  [analytics] | 
            
  | 20:04 | 
  <ottomata> | 
  renaming event_santized hive table directories to lower case and repairing table partition paths - T280813 | 
  [analytics] | 
            
  | 09:28 | 
  <elukey> | 
  roll restart druid-overlord on druid* after an-coord1001 maintenance | 
  [analytics] | 
            
  | 09:08 | 
  <elukey> | 
  upgrade hue on an-tool1009 to 4.9.0-2 | 
  [analytics] | 
            
  | 08:31 | 
  <elukey> | 
  re-enable timers on an-launcher1002 and airflow on an-airflow1001 after maintenance on an-coord1001 | 
  [analytics] | 
            
  | 07:08 | 
  <elukey> | 
  reimage an-coord1001 after partition reshape (/var/lib/mysql folded in /srv) | 
  [analytics] | 
            
  | 06:51 | 
  <elukey> | 
  stop airflow on an-airflow1001 | 
  [analytics] | 
            
  | 06:49 | 
  <elukey> | 
  stop all services on an-coord1001 as prep step for reimage | 
  [analytics] | 
            
  | 06:45 | 
  <elukey> | 
  PURGE BINARY LOGS BEFORE '2021-04-14 00:00:00'; on an-coord1001 to free some space before the reimage | 
  [analytics] | 
            
  | 06:00 | 
  <elukey> | 
  stop timers on an-launcher1002 as prep step for an-coord1001 reimage | 
  [analytics] |