| 
      
        2024-11-25
      
      §
     | 
  
    
  | 16:07 | 
  <hashar> | 
  gerrit: renamed analytics/statsv to performance/statsv | 
  [releng] | 
            
  | 16:06 | 
  <hashar> | 
  gerrit: renamed performance/statsv to performance/statsv-back | 
  [releng] | 
            
  | 16:05 | 
  <robh@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dns7001.mgmt.magru.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | 
  [production] | 
            
  | 16:05 | 
  <jayme@cumin2002> | 
  END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on D{wikikube-worker[1305-1312].eqiad.wmnet} and (A:wikikube-staging-worker-codfw or A:wikikube-staging-master-codfw or A:wikikube-staging-worker-eqiad or A:wikikube-staging-master-eqiad or A:wikikube-worker-codfw or A:wikikube-master-codfw or A:wikikube-worker-eqiad or A:wikikube-master-eqiad or A:ml-serve-worker-eqiad or A:ml-se | 
  [production] | 
            
  | 16:04 | 
  <brouberol@deploy2002> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. | 
  [production] | 
            
  | 16:04 | 
  <fabfur@cumin1002> | 
  END (PASS) - Cookbook sre.cdn.roll-upgrade-ats (exit_code=0) Rolling upgrade/restart of Apache Traffic Server on A:cp-text_drmrs and A:cp for 9.2.6-1wm2 | 
  [production] | 
            
  | 16:04 | 
  <brouberol@deploy2002> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | 
  [production] | 
            
  | 16:04 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1224', diff saved to https://phabricator.wikimedia.org/P71134 and previous config saved to /var/cache/conftool/dbconfig/20241125-160408-ladsgroup.json | 
  [production] | 
            
  | 16:02 | 
  <fabfur@cumin1002> | 
  END (PASS) - Cookbook sre.cdn.roll-upgrade-ats (exit_code=0) Rolling upgrade/restart of Apache Traffic Server on A:cp-upload_drmrs and A:cp for 9.2.6-1wm2 | 
  [production] | 
            
  | 15:58 | 
  <robh@cumin2002> | 
  START - Cookbook sre.hosts.provision for host dns7001.mgmt.magru.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | 
  [production] | 
            
  | 15:52 | 
  <Lucas_WMDE> | 
  UTC afternoon backport+config window done (apologies for the temporary flood of “Use of QuickSurveys survey” deprecation warnings – should be fixed again) | 
  [production] | 
            
  | 15:52 | 
  <robh@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host dns7001.mgmt.magru.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | 
  [production] | 
            
  | 15:49 | 
  <lucaswerkmeister-wmde@deploy2002> | 
  Finished scap sync-world: Backport for [[gerrit:1097410|Reader Survey: Fix question (T378660)]] (duration: 13m 02s) | 
  [production] | 
            
  | 15:49 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1224', diff saved to https://phabricator.wikimedia.org/P71133 and previous config saved to /var/cache/conftool/dbconfig/20241125-154901-ladsgroup.json | 
  [production] | 
            
  | 15:48 | 
  <robh@cumin2002> | 
  START - Cookbook sre.hosts.provision for host dns7001.mgmt.magru.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | 
  [production] | 
            
  | 15:47 | 
  <robh@cumin2002> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 15:47 | 
  <robh@cumin2002> | 
  END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: magru swaps - robh@cumin2002" | 
  [production] | 
            
  | 15:46 | 
  <robh@cumin2002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: magru swaps - robh@cumin2002" | 
  [production] | 
            
  | 15:46 | 
  <claime> | 
  homer cr*eqiad* commit 'T380027' | 
  [production] | 
            
  | 15:42 | 
  <robh@cumin2002> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 15:41 | 
  <lucaswerkmeister-wmde@deploy2002> | 
  lucaswerkmeister-wmde, dani: Continuing with sync | 
  [production] | 
            
  | 15:41 | 
  <cgoubert@cumin1002> | 
  END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts kubernetes[1009-1014].eqiad.wmnet | 
  [production] | 
            
  | 15:41 | 
  <cgoubert@cumin1002> | 
  END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) | 
  [production] | 
            
  | 15:40 | 
  <brouberol@deploy2002> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-wmde: apply | 
  [production] | 
            
  | 15:40 | 
  <brouberol@deploy2002> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-wmde: apply | 
  [production] | 
            
  | 15:40 | 
  <lucaswerkmeister-wmde@deploy2002> | 
  lucaswerkmeister-wmde, dani: Backport for [[gerrit:1097410|Reader Survey: Fix question (T378660)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 15:39 | 
  <cgoubert@cumin1002> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 15:38 | 
  <cgoubert@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1309.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 15:37 | 
  <jynus@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on backup2011.codfw.wmnet with reason: Reboot | 
  [production] | 
            
  | 15:37 | 
  <jynus@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 1:00:00 on backup2011.codfw.wmnet with reason: Reboot | 
  [production] | 
            
  | 15:37 | 
  <jynus@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on backup2010.codfw.wmnet with reason: Reboot | 
  [production] | 
            
  | 15:37 | 
  <jynus@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 1:00:00 on backup2010.codfw.wmnet with reason: Reboot | 
  [production] | 
            
  | 15:36 | 
  <lucaswerkmeister-wmde@deploy2002> | 
  Started scap sync-world: Backport for [[gerrit:1097410|Reader Survey: Fix question (T378660)]] | 
  [production] | 
            
  | 15:36 | 
  <robh@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cp7001.mgmt.magru.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | 
  [production] | 
            
  | 15:33 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1224 (T380449)', diff saved to https://phabricator.wikimedia.org/P71132 and previous config saved to /var/cache/conftool/dbconfig/20241125-153354-ladsgroup.json | 
  [production] | 
            
  | 15:32 | 
  <hashar> | 
  Deleted old fundraising branches fundraising/REL1_27 fundraising/REL1_31 and fundraising/REL1_35 from mediawiki/core and mediawiki/vendor | 
  [releng] | 
            
  | 15:31 | 
  <robh@cumin2002> | 
  START - Cookbook sre.hosts.provision for host cp7001.mgmt.magru.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | 
  [production] | 
            
  | 15:23 | 
  <lucaswerkmeister-wmde@deploy2002> | 
  dani, lucaswerkmeister-wmde: Continuing with sync | 
  [production] | 
            
  | 15:21 | 
  <lucaswerkmeister-wmde@deploy2002> | 
  dani, lucaswerkmeister-wmde: Backport for [[gerrit:1093987|Reader Survey: Deploy on enwiki (T378660)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 15:19 | 
  <cgoubert@cumin1002> | 
  START - Cookbook sre.hosts.decommission for hosts kubernetes[1009-1014].eqiad.wmnet | 
  [production] | 
            
  | 15:18 | 
  <cgoubert@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1309.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 15:18 | 
  <hashar> | 
  Deleted old release branch of mediawiki/core and replaced them with signed tag (affecting REL1_23 to REL1_38) | 
  [releng] | 
            
  | 15:18 | 
  <robh@cumin1002> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 15:17 | 
  <lucaswerkmeister-wmde@deploy2002> | 
  Started scap sync-world: Backport for [[gerrit:1093987|Reader Survey: Deploy on enwiki (T378660)]] | 
  [production] | 
            
  | 15:15 | 
  <robh@cumin1002> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 15:15 | 
  <cgoubert@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1309.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 15:14 | 
  <lucaswerkmeister-wmde@deploy2002> | 
  Finished scap sync-world: Backport for [[gerrit:1094511|New stream config for Android Rabbit Holes feature. (T380107)]] (duration: 15m 45s) | 
  [production] | 
            
  | 15:11 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Depooling db1224 (T380449)', diff saved to https://phabricator.wikimedia.org/P71131 and previous config saved to /var/cache/conftool/dbconfig/20241125-151103-ladsgroup.json | 
  [production] | 
            
  | 15:10 | 
  <ladsgroup@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1224.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 15:10 | 
  <ladsgroup@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 6:00:00 on db1224.eqiad.wmnet with reason: Maintenance | 
  [production] |