| 2024-12-11
      
      § | 
    
  | 09:08 | <jelto@cumin1002> | END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host kubernetes[2011-2014].codfw.wmnet | [production] | 
            
  | 09:08 | <jelto@cumin1002> | START - Cookbook sre.k8s.pool-depool-node depool for host kubernetes[2011-2014].codfw.wmnet | [production] | 
            
  | 09:06 | <jelto@cumin1002> | END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host kubernetes[2011-2014].codfw.wmnet | [production] | 
            
  | 09:05 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depool db2136 to upgrade MariaDB 10.11 T378940', diff saved to https://phabricator.wikimedia.org/P71694 and previous config saved to /var/cache/conftool/dbconfig/20241211-090538-marostegui.json | [production] | 
            
  | 09:04 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2136.codfw.wmnet with reason: maintenance | [production] | 
            
  | 09:04 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 1:00:00 on db2136.codfw.wmnet with reason: maintenance | [production] | 
            
  | 09:04 | <jelto@cumin1002> | START - Cookbook sre.k8s.pool-depool-node depool for host kubernetes[2011-2014].codfw.wmnet | [production] | 
            
  | 08:12 | <fnegri@cloudcumin1001> | END (PASS) - Cookbook wmcs.openstack.quota_increase (exit_code=0) (T381911) | [codesearch] | 
            
  | 08:12 | <fnegri@cloudcumin1001> | START - Cookbook wmcs.openstack.quota_increase (T381911) | [codesearch] | 
            
  | 02:30 | <eileen> | civicrm upgraded from 3ef855ca to ddda6d67 | [production] | 
            
  | 01:36 | <ryankemper@cumin2002> | END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) (T376150, xfer wdqs scholarly 2023(public)->2026(internal)) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2027.codfw.wmnet w/ force delete existing files, repooling both afterwards | [production] | 
            
  | 00:50 | <ryankemper@cumin2002> | START - Cookbook sre.wdqs.data-transfer (T376150, xfer wdqs scholarly 2023(public)->2026(internal)) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2027.codfw.wmnet w/ force delete existing files, repooling both afterwards | [production] | 
            
  
    | 2024-12-10
      
      § | 
    
  | 23:35 | <eileen> | config revision changed from b3741848 to ca701cba add phone update job | [production] | 
            
  | 22:54 | <jhathaway@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on ms-be1088.eqiad.wmnet with reason: T381919 | [production] | 
            
  | 22:54 | <jhathaway@cumin1002> | START - Cookbook sre.hosts.downtime for 1:00:00 on ms-be1088.eqiad.wmnet with reason: T381919 | [production] | 
            
  | 22:49 | <jhathaway@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be1088.eqiad.wmnet with OS bookworm | [production] | 
            
  | 22:36 | <cjming> | end of UTC late backport window | [production] | 
            
  | 22:22 | <jhathaway@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be1088.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 22:19 | <jhathaway@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be1088.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 22:15 | <cjming@deploy2002> | Finished scap sync-world: Backport for [[gerrit:1100864|Disable stats collection when WMF_MAINTENANCE_OFFLINE is set (T380609)]] (duration: 11m 24s) | [production] | 
            
  | 22:10 | <cjming@deploy2002> | cwhite, cjming: Continuing with sync | [production] | 
            
  | 22:08 | <cjming@deploy2002> | cwhite, cjming: Backport for [[gerrit:1100864|Disable stats collection when WMF_MAINTENANCE_OFFLINE is set (T380609)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 22:04 | <cjming@deploy2002> | Started scap sync-world: Backport for [[gerrit:1100864|Disable stats collection when WMF_MAINTENANCE_OFFLINE is set (T380609)]] | [production] | 
            
  | 21:59 | <cjming@deploy2002> | Finished scap sync-world: Backport for [[gerrit:1101840|Beta Cluster: Enable MetricsPlatform extension on all wikis (T381849 T381853)]] (duration: 10m 50s) | [production] | 
            
  | 21:56 | <jhathaway@cumin1002> | START - Cookbook sre.hosts.reimage for host ms-be1088.eqiad.wmnet with OS bookworm | [production] | 
            
  | 21:53 | <cjming@deploy2002> | cjming, phuedx: Continuing with sync | [production] | 
            
  | 21:52 | <cjming@deploy2002> | cjming, phuedx: Backport for [[gerrit:1101840|Beta Cluster: Enable MetricsPlatform extension on all wikis (T381849 T381853)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 21:48 | <cjming@deploy2002> | Started scap sync-world: Backport for [[gerrit:1101840|Beta Cluster: Enable MetricsPlatform extension on all wikis (T381849 T381853)]] | [production] | 
            
  | 21:47 | <eileen> | ivicrm upgraded from f9c89e50 to 3ef855ca | [production] | 
            
  | 21:47 | <cjming@deploy2002> | Finished scap sync-world: Backport for [[gerrit:1101875|Reader Survey: Increase coverage (T378660)]] (duration: 10m 02s) | [production] | 
            
  | 21:41 | <cjming@deploy2002> | cjming, dani: Continuing with sync | [production] | 
            
  | 21:41 | <cjming@deploy2002> | cjming, dani: Backport for [[gerrit:1101875|Reader Survey: Increase coverage (T378660)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 21:37 | <cjming@deploy2002> | Started scap sync-world: Backport for [[gerrit:1101875|Reader Survey: Increase coverage (T378660)]] | [production] | 
            
  | 21:35 | <cjming@deploy2002> | Finished scap sync-world: Backport for [[gerrit:1101600|LanguageConverter: Ignore content inside <math> and <svg> elements (T381617)]] (duration: 11m 55s) | [production] | 
            
  | 21:30 | <cjming@deploy2002> | bvibber, cjming: Continuing with sync | [production] | 
            
  | 21:27 | <cjming@deploy2002> | bvibber, cjming: Backport for [[gerrit:1101600|LanguageConverter: Ignore content inside <math> and <svg> elements (T381617)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 21:23 | <cjming@deploy2002> | Started scap sync-world: Backport for [[gerrit:1101600|LanguageConverter: Ignore content inside <math> and <svg> elements (T381617)]] | [production] | 
            
  | 21:22 | <ryankemper@cumin2002> | END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) (T376150, xfer wdqs scholarly 2023(public)->2026(internal)) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2026.codfw.wmnet w/ force delete existing files, repooling source-only afterwards | [production] | 
            
  | 21:00 | <mforns> | re-ran commons_impact_metrics_monthly 2024-11 and cleared dependent DAGs which had timed out | [analytics] | 
            
  | 20:56 | <mforns> | deployed airflow-analytics to fix the commons impact metrics pipeline | [analytics] | 
            
  | 20:55 | <mforns@deploy2002> | Finished deploy [airflow-dags/analytics@2af4e1a]: Fix for the Commons Impact Metrics job (duration: 01m 38s) | [production] | 
            
  | 20:54 | <mforns@deploy2002> | Started deploy [airflow-dags/analytics@2af4e1a]: Fix for the Commons Impact Metrics job | [production] | 
            
  | 20:50 | <mforns> | finished deployment of refinery to fix Commons Impact Metrics job | [analytics] | 
            
  | 20:47 | <mforns@deploy2002> | Finished deploy [analytics/refinery@25c1946] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@25c1946c] (duration: 00m 27s) | [production] | 
            
  | 20:46 | <mforns@deploy2002> | Started deploy [analytics/refinery@25c1946] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@25c1946c] | [production] | 
            
  | 20:46 | <mforns@deploy2002> | Finished deploy [analytics/refinery@25c1946] (thin): Regular analytics weekly train THIN [analytics/refinery@25c1946c] (duration: 00m 31s) | [production] | 
            
  | 20:45 | <mforns@deploy2002> | Started deploy [analytics/refinery@25c1946] (thin): Regular analytics weekly train THIN [analytics/refinery@25c1946c] | [production] | 
            
  | 20:45 | <mforns@deploy2002> | Finished deploy [analytics/refinery@25c1946]: Regular analytics weekly train [analytics/refinery@25c1946c] (duration: 13m 12s) | [production] | 
            
  | 20:38 | <ryankemper@cumin2002> | START - Cookbook sre.wdqs.data-transfer (T376150, xfer wdqs scholarly 2023(public)->2026(internal)) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2026.codfw.wmnet w/ force delete existing files, repooling source-only afterwards | [production] | 
            
  | 20:38 | <ryankemper@cumin2002> | END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) (T376150, xfer wdqs scholarly 2023(public)->2026(internal)) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards | [production] |