| 
      
        2021-04-21
      
      §
     | 
  
    
  | 21:30 | 
  <ottomata> | 
  temporariliy disabling sanitize_eventlogging_analytics_delayed jobs until T280813 is completed (probably tomorrow) | 
  [analytics] | 
            
  | 20:04 | 
  <ottomata> | 
  renaming event_santized hive table directories to lower case and repairing table partition paths - T280813 | 
  [analytics] | 
            
  | 09:28 | 
  <elukey> | 
  roll restart druid-overlord on druid* after an-coord1001 maintenance | 
  [analytics] | 
            
  | 09:08 | 
  <elukey> | 
  upgrade hue on an-tool1009 to 4.9.0-2 | 
  [analytics] | 
            
  | 08:31 | 
  <elukey> | 
  re-enable timers on an-launcher1002 and airflow on an-airflow1001 after maintenance on an-coord1001 | 
  [analytics] | 
            
  | 07:08 | 
  <elukey> | 
  reimage an-coord1001 after partition reshape (/var/lib/mysql folded in /srv) | 
  [analytics] | 
            
  | 06:51 | 
  <elukey> | 
  stop airflow on an-airflow1001 | 
  [analytics] | 
            
  | 06:49 | 
  <elukey> | 
  stop all services on an-coord1001 as prep step for reimage | 
  [analytics] | 
            
  | 06:45 | 
  <elukey> | 
  PURGE BINARY LOGS BEFORE '2021-04-14 00:00:00'; on an-coord1001 to free some space before the reimage | 
  [analytics] | 
            
  | 06:00 | 
  <elukey> | 
  stop timers on an-launcher1002 as prep step for an-coord1001 reimage | 
  [analytics] | 
            
  
    | 
      
        2021-04-14
      
      §
     | 
  
    
  | 14:05 | 
  <elukey> | 
  run build/env/bin/hue migrate on an-tool1009 after the hue upgade | 
  [analytics] | 
            
  | 13:10 | 
  <elukey> | 
  rollback hue-next to 4.8 - issues not present in staging | 
  [analytics] | 
            
  | 13:00 | 
  <elukey> | 
  upgrade Hue to 4.9 on an-tool1009 - hue-next.wikimedia.org | 
  [analytics] | 
            
  | 10:02 | 
  <elukey> | 
  roll restart yarn nodemanagers on hadoop prod (attempt to see if they entered in a weird state, graceful restart) | 
  [analytics] | 
            
  | 09:54 | 
  <elukey> | 
  kill long running mediawiki-job refine erroring out application_1615988861843_166906 | 
  [analytics] | 
            
  | 09:46 | 
  <elukey> | 
  kill application_1615988861843_163186 for the same reason | 
  [analytics] | 
            
  | 09:43 | 
  <elukey> | 
  kill application_1615988861843_164387 to see if any improvement to socket consumption is made | 
  [analytics] | 
            
  | 09:14 | 
  <elukey> | 
  run "sudo kill `pgrep -f sqoop`" on an-launcher1002 to clean up old test processes still running | 
  [analytics] |