| 
      
        2025-09-15
      
      §
     | 
  
    
  | 09:34 | 
  <elukey@deploy1003> | 
  helmfile [staging] DONE helmfile.d/services/wikifeeds: sync | 
  [production] | 
            
  | 09:34 | 
  <elukey@deploy1003> | 
  helmfile [staging] START helmfile.d/services/wikifeeds: sync | 
  [production] | 
            
  | 09:34 | 
  <elukey@deploy1003> | 
  helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: sync | 
  [production] | 
            
  | 09:33 | 
  <elukey@deploy1003> | 
  helmfile [staging] START helmfile.d/services/changeprop-jobqueue: sync | 
  [production] | 
            
  | 09:33 | 
  <ladsgroup@deploy1003> | 
  Started scap sync-world: Backport: [[gerrit:1188285|Reduce db lock timeout in LinksUpdate and CategoryMembershipChangeJob]] (T366938) | 
  [production] | 
            
  | 09:33 | 
  <elukey@deploy1003> | 
  helmfile [staging] DONE helmfile.d/services/changeprop: sync | 
  [production] | 
            
  | 09:33 | 
  <elukey@deploy1003> | 
  helmfile [staging] START helmfile.d/services/changeprop: sync | 
  [production] | 
            
  | 09:32 | 
  <elukey@deploy1003> | 
  helmfile [staging] DONE helmfile.d/services/mobileapps: sync | 
  [production] | 
            
  | 09:31 | 
  <elukey@deploy1003> | 
  helmfile [staging] START helmfile.d/services/mobileapps: sync | 
  [production] | 
            
  | 09:30 | 
  <elukey@deploy1003> | 
  helmfile [staging] DONE helmfile.d/services/termbox: sync | 
  [production] | 
            
  | 09:30 | 
  <stevemunene@cumin1003> | 
  END (PASS) - Cookbook sre.k8s.wipe-cluster (exit_code=0) Wipe the K8s cluster dse-codfw: Cleanup the dse-k8s-codfw cluster | 
  [production] | 
            
  | 09:30 | 
  <effie> | 
  stopping puppet on A:lvs-low-traffic-eqiad and A:lvs-low-traffic-codfw | 
  [production] | 
            
  | 09:30 | 
  <elukey@deploy1003> | 
  helmfile [staging] START helmfile.d/services/termbox: sync | 
  [production] | 
            
  | 09:28 | 
  <stevemunene@deploy1003> | 
  helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'sync'. | 
  [production] | 
            
  | 09:25 | 
  <stevemunene@deploy1003> | 
  helmfile [dse-k8s-codfw] START helmfile.d/admin 'sync'. | 
  [production] | 
            
  | 09:24 | 
  <hashar> | 
  integration: added back to Jenkins integration-agent-docker-1043 | 
  [releng] | 
            
  | 09:20 | 
  <elukey@deploy1003> | 
  helmfile [staging] DONE helmfile.d/services/linkrecommendation: sync | 
  [production] | 
            
  | 09:20 | 
  <elukey@deploy1003> | 
  helmfile [staging] START helmfile.d/services/linkrecommendation: sync | 
  [production] | 
            
  | 09:20 | 
  <hashar> | 
  integration: stopped and started "integration-agent-docker-1043 which has been disconnected from Jenkins since April 2nd | 
  [releng] | 
            
  | 09:20 | 
  <stevemunene@deploy1003> | 
  helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. | 
  [production] | 
            
  | 09:19 | 
  <stevemunene@deploy1003> | 
  helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. | 
  [production] | 
            
  | 09:13 | 
  <stevemunene@cumin1003> | 
  START - Cookbook sre.k8s.wipe-cluster Wipe the K8s cluster dse-codfw: Cleanup the dse-k8s-codfw cluster | 
  [production] | 
            
  | 08:54 | 
  <btullis@cumin1003> | 
  START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling restart_daemons on A:kafka-jumbo-eqiad | 
  [production] | 
            
  | 08:15 | 
  <stevemunene@deploy1003> | 
  helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. | 
  [production] | 
            
  | 08:14 | 
  <stevemunene@deploy1003> | 
  helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. | 
  [production] | 
            
  | 08:07 | 
  <stevemunene@deploy1003> | 
  helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. | 
  [production] | 
            
  | 08:04 | 
  <stevemunene@deploy1003> | 
  helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. | 
  [production] | 
            
  | 07:44 | 
  <slyngshede@cumin1003> | 
  DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Jly out of all services on: 2418 hosts | 
  [production] | 
            
  | 07:39 | 
  <jmm@cumin2002> | 
  DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Jly out of all services on: 2418 hosts | 
  [production] | 
            
  | 07:20 | 
  <volans> | 
  restarted the gerrit job, was not reporting updates since last friday | 
  [tools.wikibugs] | 
            
  | 07:18 | 
  <jmm@cumin2002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on maps1011.eqiad.wmnet with reason: in setup | 
  [production] | 
            
  | 07:17 | 
  <jmm@cumin2002> | 
  DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging jwheeler out of all services on: 2418 hosts | 
  [production] | 
            
  | 06:56 | 
  <moritzm> | 
  reindex gis database on maps1011 following initial OSM import T381565 | 
  [production] | 
            
  | 06:55 | 
  <jynus> | 
  restarted atftpd on install1004 | 
  [production] | 
            
  
    | 
      
        2025-09-12
      
      §
     | 
  
    
  | 21:25 | 
  <dzahn@cumin2002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on zuul1002.eqiad.wmnet with reason: in setup | 
  [production] | 
            
  | 21:25 | 
  <dzahn@cumin2002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on zuul2002.codfw.wmnet with reason: in setup | 
  [production] | 
            
  | 20:20 | 
  <ladsgroup@cumin1003> | 
  END (FAIL) - Cookbook sre.mysql.pool (exit_code=99) db1160* gradually with 4 steps - Work done | 
  [production] | 
            
  | 20:05 | 
  <ladsgroup@cumin1003> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 20:05 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1226 (T402763)', diff saved to https://phabricator.wikimedia.org/P83301 and previous config saved to /var/cache/conftool/dbconfig/20250912-200538-ladsgroup.json | 
  [production] | 
            
  | 19:50 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1226', diff saved to https://phabricator.wikimedia.org/P83300 and previous config saved to /var/cache/conftool/dbconfig/20250912-195030-ladsgroup.json | 
  [production] | 
            
  | 19:35 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1226', diff saved to https://phabricator.wikimedia.org/P83299 and previous config saved to /var/cache/conftool/dbconfig/20250912-193523-ladsgroup.json | 
  [production] | 
            
  | 19:20 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1226 (T402763)', diff saved to https://phabricator.wikimedia.org/P83298 and previous config saved to /var/cache/conftool/dbconfig/20250912-192015-ladsgroup.json | 
  [production] | 
            
  | 19:14 | 
  <James_F> | 
  Zuul: [mediawiki/extensions/DataTable2] Add Scribuntu phan dependency | 
  [releng] | 
            
  | 19:13 | 
  <ladsgroup@cumin1003> | 
  dbctl commit (dc=all): 'Depooling db1226 (T402763)', diff saved to https://phabricator.wikimedia.org/P83297 and previous config saved to /var/cache/conftool/dbconfig/20250912-191338-ladsgroup.json | 
  [production] |