| 
      
        2023-06-27
      
      §
     | 
  
    
  | 14:53 | 
  <mforns> | 
  deployed airflow analytics to unbreak DataHub's Druid ingestion | 
  [analytics] | 
            
  | 13:32 | 
  <joal> | 
  Rerun druid_load_pageviews_hourly_aggregated_daily after deploy | 
  [analytics] | 
            
  | 13:32 | 
  <joal> | 
  druid_load_pageviews_hourly_aggregated_dailyRerun | 
  [analytics] | 
            
  | 13:25 | 
  <joal> | 
  Deploy Airflow | 
  [analytics] | 
            
  | 11:10 | 
  <joal> | 
  Deploy refinery onto HDFS | 
  [analytics] | 
            
  | 11:01 | 
  <stevemunene> | 
  upgrading an-test-worker1003 to bullseye, keeping `/srv/hadoop` intact | 
  [analytics] | 
            
  | 10:55 | 
  <joal> | 
  Deploy refinery using scap | 
  [analytics] | 
            
  | 09:42 | 
  <stevemunene> | 
  !log run puppet on hadoop-masters this does a refresh of the hdfs nodes | 
  [analytics] | 
            
  | 09:38 | 
  <stevemunene> | 
  Exclude analytics1061_1069 from HDFS and YARN | 
  [analytics] | 
            
  | 09:21 | 
  <btullis> | 
  upgrading an-test-worker1002 to bullseye, keeping `/srv/hadoop` intact | 
  [analytics] | 
            
  | 08:38 | 
  <elukey> | 
  revoked puppet cert for 'varnishkafka' and cleaned up its cergen's files in puppet private | 
  [analytics] | 
            
  | 07:14 | 
  <elukey> | 
  `sudo kill `pgrep -u paramd`` on stat1005 to unblock puppet | 
  [analytics] | 
            
  
    | 
      
        2023-06-22
      
      §
     | 
  
    
  | 15:57 | 
  <btullis> | 
  adding new bigtop-1.5 packages to apt.wikimedia.org for bullseye | 
  [analytics] | 
            
  | 15:50 | 
  <elukey> | 
  update the webrequest_sampled_live druid kafka supervisor to add the https field - T340097 | 
  [analytics] | 
            
  | 15:18 | 
  <btullis> | 
  cleared status for aqs_hourly.wait_for_webrequest run 13:00 and the downstream task on an-test-client1001. | 
  [analytics] | 
            
  | 15:07 | 
  <btullis> | 
  clearing task for refine_webrequest_hourly_test_text hour 13:00 | 
  [analytics] | 
            
  | 14:36 | 
  <btullis> | 
  restarted airflow-webserver and airflow-scheduler on an-test-client1001 with version 2.6.1. | 
  [analytics] | 
            
  | 14:11 | 
  <btullis> | 
  redeploying datahub to staging to try to get upgrade to 0.10.0 working. | 
  [analytics] | 
            
  | 14:02 | 
  <stevemunene> | 
  running sre.hadoop.roll-restart-masters restart the Namenodes to completely remove any reference of analytics106[1-3] T317861 | 
  [analytics] | 
            
  | 13:47 | 
  <stevemunene> | 
  run puppet on hadoop-masters | 
  [analytics] | 
            
  | 13:43 | 
  <stevemunene> | 
  Remove analytics106[1-3] from the HDFS topology | 
  [analytics] | 
            
  | 13:16 | 
  <elukey> | 
  move varnishafka instances in eqiad to PKI - T337825 | 
  [analytics] | 
            
  | 13:14 | 
  <btullis> | 
  deploying the new eventgate-wikimedia container to eventgate-main | 
  [analytics] | 
            
  | 08:57 | 
  <btullis> | 
  cleared airflow task for `projectview_geo.move_data_to_archive` | 
  [analytics] |