| 2025-09-22
      
      § | 
    
  | 16:54 | <andrew@cumin2002> | START - Cookbook sre.hosts.reimage for host cloudcephosd1023.eqiad.wmnet with OS bookworm | [production] | 
            
  | 16:51 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) | [admin] | 
            
  | 16:50 | <andrew@cloudcumin1001> | START - Cookbook wmcs.ceph.osd.reactivate | [admin] | 
            
  | 16:48 | <andrew@cumin2002> | END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcephosd1025.eqiad.wmnet with OS bookworm | [production] | 
            
  | 16:47 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.ceph.osd.reactivate (exit_code=99) | [admin] | 
            
  | 16:47 | <andrew@cloudcumin1001> | START - Cookbook wmcs.ceph.osd.reactivate | [admin] | 
            
  | 16:45 | <andrew@cumin2002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1020.eqiad.wmnet with OS bookworm | [production] | 
            
  | 16:43 | <andrew@cumin2002> | START - Cookbook sre.hosts.reimage for host cloudcephosd1025.eqiad.wmnet with OS bookworm | [production] | 
            
  | 16:41 | <andrew@cumin2002> | END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1025.eqiad.wmnet'] | [production] | 
            
  | 16:41 | <James_F> | Zuul: [mediawiki/extensions/ReaderExperiments] Add WikimediaMessages dependency | [releng] | 
            
  | 16:37 | <dcaro@cloudcumin1001> | END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api | [tools] | 
            
  | 16:32 | <dcaro@cloudcumin1001> | START - Cookbook wmcs.toolforge.component.deploy for component components-api | [tools] | 
            
  | 16:32 | <andrew@cumin2002> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1025.eqiad.wmnet'] | [production] | 
            
  | 16:31 | <andrew@cumin2002> | END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcephosd1025.eqiad.wmnet with OS bookworm | [production] | 
            
  | 16:28 | <andrew@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1020.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 16:22 | <andrew@cumin2002> | START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1020.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 16:16 | <dcaro@cloudcumin1001> | END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api | [toolsbeta] | 
            
  | 16:12 | <andrew@cumin2002> | START - Cookbook sre.hosts.reimage for host cloudcephosd1025.eqiad.wmnet with OS bookworm | [production] | 
            
  | 16:11 | <dcaro@cloudcumin1001> | START - Cookbook wmcs.toolforge.component.deploy for component components-api | [toolsbeta] | 
            
  | 16:10 | <jhathaway@cumin2002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on sretest2001.codfw.wmnet with reason: T383173 | [production] | 
            
  | 16:05 | <andrew@cumin2002> | START - Cookbook sre.hosts.reimage for host cloudcephosd1020.eqiad.wmnet with OS bookworm | [production] | 
            
  | 16:02 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) | [admin] | 
            
  | 16:01 | <andrew@cumin2002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1019.eqiad.wmnet with OS bookworm | [production] | 
            
  | 16:01 | <andrew@cloudcumin1001> | START - Cookbook wmcs.ceph.osd.reactivate | [admin] | 
            
  | 15:59 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.ceph.osd.reactivate (exit_code=99) | [admin] | 
            
  | 15:59 | <andrew@cloudcumin1001> | START - Cookbook wmcs.ceph.osd.reactivate | [admin] | 
            
  | 15:59 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) | [admin] | 
            
  | 15:59 | <andrew@cloudcumin1001> | START - Cookbook wmcs.ceph.osd.reactivate | [admin] | 
            
  | 15:45 | <toyofuku@deploy1003> | Finished scap sync-world: Backport for [[gerrit:1187052|Enable search recommendation on Wikipedia (T402048)]] (duration: 11m 35s) | [production] | 
            
  | 15:43 | <andrew@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1019.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 15:40 | <toyofuku@deploy1003> | jdlrobson, toyofuku: Continuing with sync | [production] | 
            
  | 15:39 | <andrew@cumin2002> | START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1019.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 15:37 | <toyofuku@deploy1003> | jdlrobson, toyofuku: Backport for [[gerrit:1187052|Enable search recommendation on Wikipedia (T402048)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. | [production] | 
            
  | 15:33 | <toyofuku@deploy1003> | Started scap sync-world: Backport for [[gerrit:1187052|Enable search recommendation on Wikipedia (T402048)]] | [production] | 
            
  | 15:22 | <andrew@cumin2002> | START - Cookbook sre.hosts.reimage for host cloudcephosd1019.eqiad.wmnet with OS bookworm | [production] | 
            
  | 15:19 | <pt1979@cumin2002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on fasw2-c8a-codfw,fasw2-c8b-codfw with reason: pfw1-codfw relocation | [production] | 
            
  | 15:17 | <pt1979@cumin2002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pfw1-codfw with reason: pfw1-codfw relocation | [production] | 
            
  | 15:15 | <moritzm> | installing clamav security updates | [production] | 
            
  | 15:11 | <pt1979@cumin2002> | DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on ‘pfw1-codfw’ with reason: ‘pfw1 | [production] | 
            
  | 14:32 | <dcausse@deploy1003> | helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 14:32 | <dcausse@deploy1003> | helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 14:31 | <dcausse@deploy1003> | helmfile [codfw] DONE helmfile.d/services/rdf-streaming-updater: apply | [production] | 
            
  | 14:31 | <dcausse@deploy1003> | helmfile [codfw] START helmfile.d/services/rdf-streaming-updater: apply | [production] | 
            
  | 14:22 | <brouberol@deploy1003> | helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 14:21 | <brouberol@deploy1003> | helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. | [production] | 
            
  | 14:20 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 14:20 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 14:15 | <btullis> | restarting the hadoop-yarn-resourcemanager.service on an-master1003 and then an-master1004 for T404871 | [analytics] | 
            
  | 14:12 | <brouberol@deploy1003> | helmfile [codfw] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 14:11 | <brouberol@deploy1003> | helmfile [codfw] START helmfile.d/admin 'apply'. | [production] |