| 
      
        2021-02-23
      
      §
     | 
  
    
  | 21:20 | 
  <ottomata> | 
  started nodemanager on an-worker1112 | 
  [analytics] | 
            
  | 21:15 | 
  <razzi> | 
  rebalance kafka partitions for webrequest_upload partition 2 | 
  [analytics] | 
            
  | 19:31 | 
  <elukey> | 
  roll out new uid/gid for mapred/druid/analytics/yarn/hdfs for all buster nodes (no op for stretch) | 
  [analytics] | 
            
  | 17:47 | 
  <elukey> | 
  change uid/gid for yarn/mapred/analytics/hdfs/druid on stat100x, an-presto100x | 
  [analytics] | 
            
  | 15:57 | 
  <elukey> | 
  an-launcher1002's timers restored | 
  [analytics] | 
            
  | 15:28 | 
  <elukey> | 
  stop timers on an-launcher1002 to change gid/uid for yarn/hdfs/mapred/analytics/druid and to reboot for kernel updates | 
  [analytics] | 
            
  | 15:23 | 
  <elukey> | 
  deploy new uid/gid scheme for yarn/mapred/analytics/hdfs/druid on an-tool100[8,9] | 
  [analytics] | 
            
  | 15:22 | 
  <elukey> | 
  deploy new uid/gid scheme for yarn/mapred/analytics/hdfs/druid on an-airflow1001, an-test* buster nodes | 
  [analytics] | 
            
  | 15:05 | 
  <klausman> | 
  an-master1001 ~ $ sudo -u hdfs kerberos-run-command hdfs hdfs dfs -chgrp analytics-privatedata-users /wmf/data/raw/webrequest/webrequest_text/hourly/2021/02/22/01/webrequest* | 
  [analytics] | 
            
  | 14:51 | 
  <elukey> | 
  drop /srv/backup-1007 on stat1008 to free space | 
  [analytics] | 
            
  
    | 
      
        2021-02-18
      
      §
     | 
  
    
  | 18:38 | 
  <razzi> | 
  rebalance kafka partition for webrequest_upload partition 1 | 
  [analytics] | 
            
  | 17:27 | 
  <elukey> | 
  an-coord1002 back in service with raid1 configured | 
  [analytics] | 
            
  | 15:48 | 
  <elukey> | 
  stop hive/mysql on an-coord1002 as precautionary step to rebuild the md array | 
  [analytics] | 
            
  | 13:10 | 
  <elukey> | 
  failover analytics-hive to an-coord1001 after maintenance (DNS change) | 
  [analytics] | 
            
  | 11:32 | 
  <elukey> | 
  restart hive daemons on an-coord1001 to pick up new parquet settings | 
  [analytics] | 
            
  | 10:07 | 
  <elukey> | 
  hive failover to an-coord1002 to apply new hive settings to an-coord1001 | 
  [analytics] | 
            
  | 10:00 | 
  <elukey> | 
  restart hive daemons on an-coord1002 (standby coord) to pick up new default parquet file format change | 
  [analytics] | 
            
  | 09:46 | 
  <elukey> | 
  upgrade presto to 0.246-wmf on an-coord1001, an-presto*, stat100x | 
  [analytics] | 
            
  
    | 
      
        2021-02-12
      
      §
     | 
  
    
  | 19:19 | 
  <milimetric> | 
  deployed refinery with query syntax fix for the last broken cassandra job and an updated EL whitelist | 
  [analytics] | 
            
  | 18:34 | 
  <razzi> | 
  rebalance kafka partitions for atskafka_test_webrequest_text | 
  [analytics] | 
            
  | 18:31 | 
  <razzi> | 
  rebalance kafka partitions for __consumer_offsets | 
  [analytics] | 
            
  | 17:48 | 
  <joal> | 
  Rerun wikidata-articleplaceholder_metrics-wf-2021-2-10 | 
  [analytics] | 
            
  | 17:47 | 
  <joal> | 
  Rerun wikidata-specialentitydata_metrics-wf-2021-2-10 | 
  [analytics] | 
            
  | 17:43 | 
  <joal> | 
  Rerun wikidata-json_entity-weekly-wf-2021-02-01 | 
  [analytics] | 
            
  | 17:08 | 
  <elukey> | 
  reboot presto workers for kernel upgrade | 
  [analytics] | 
            
  | 16:32 | 
  <mforns> | 
  finished deployment of analytics-refinery | 
  [analytics] | 
            
  | 15:26 | 
  <mforns> | 
  started deployment of analytics-refinery | 
  [analytics] |