1-50 of 10000 results (94ms)
2026-03-16 ยง
20:16 <xcollazo@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich: apply [production]
20:15 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp2041.codfw.wmnet with reason: host reimage [production]
20:15 <jhancock@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephmon2007-dev.codfw.wmnet with reason: host reimage [production]
20:15 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6003.drmrs.wmnet with OS trixie [production]
20:13 <catrope@deploy2002> kharlan, catrope: Continuing with sync [production]
20:12 <xcollazo@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich: apply [production]
20:12 <catrope@deploy2002> kharlan, catrope: Backport for [[gerrit:1248665|Enable passwordless login in production (T419198)]], [[gerrit:1253572|Instrument clicks on external links to selected domains (T419837)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:12 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp2042.codfw.wmnet with reason: host reimage [production]
20:11 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp2041.codfw.wmnet with reason: host reimage [production]
20:10 <catrope@deploy2002> Started scap sync-world: Backport for [[gerrit:1248665|Enable passwordless login in production (T419198)]], [[gerrit:1253572|Instrument clicks on external links to selected domains (T419837)]] [production]
20:09 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6014.drmrs.wmnet with OS trixie [production]
20:03 <brett@cumin2002> START - Cookbook sre.hosts.decommission for hosts cp[2027-2040].codfw.wmnet [production]
20:01 <dreamyjazz@deploy2002> Finished scap sync-world: Backport for [[gerrit:1251589|Uninstall GlobalBlocking from closed wikis (T420062)]] (duration: 08m 20s) [production]
19:57 <dreamyjazz@deploy2002> dreamyjazz: Continuing with sync [production]
19:55 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp6004.drmrs.wmnet with reason: host reimage [production]
19:55 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephmon2007-dev.codfw.wmnet with OS bookworm [production]
19:54 <dreamyjazz@deploy2002> dreamyjazz: Backport for [[gerrit:1251589|Uninstall GlobalBlocking from closed wikis (T420062)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
19:54 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp2042.codfw.wmnet with OS trixie [production]
19:53 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudcephmon2007-dev.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:53 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp2041.codfw.wmnet with OS trixie [production]
19:52 <dreamyjazz@deploy2002> Started scap sync-world: Backport for [[gerrit:1251589|Uninstall GlobalBlocking from closed wikis (T420062)]] [production]
19:52 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host cloudcephmon2007-dev.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:51 <dreamyjazz@deploy2002> Finished scap sync-world: Backport for [[gerrit:1251582|Uninstall AbuseFilter from closed wikis with no AbuseFilter logs (T420063)]] (duration: 09m 26s) [production]
19:51 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp6003.drmrs.wmnet with reason: host reimage [production]
19:47 <dreamyjazz@deploy2002> dreamyjazz: Continuing with sync [production]
19:47 <mutante> releases2003 - rm rsync-srv-org-wikimedia-releases-releases2003.* - alerts flapping since server reboot - puppet code needs to be improved to ensure units are removed when primary server is switched (T420246) [production]
19:47 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp6004.drmrs.wmnet with reason: host reimage [production]
19:46 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp6003.drmrs.wmnet with reason: host reimage [production]
19:44 <dreamyjazz@deploy2002> dreamyjazz: Backport for [[gerrit:1251582|Uninstall AbuseFilter from closed wikis with no AbuseFilter logs (T420063)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
19:43 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp6014.drmrs.wmnet with reason: host reimage [production]
19:42 <dreamyjazz@deploy2002> Started scap sync-world: Backport for [[gerrit:1251582|Uninstall AbuseFilter from closed wikis with no AbuseFilter logs (T420063)]] [production]
19:41 <jhancock@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudcephmon2007-dev [production]
19:41 <jhancock@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host cloudcephmon2007-dev [production]
19:40 <jhancock@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
19:40 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: updating cloudcephmon2007-dev in codfw - jhancock@cumin2002" [production]
19:40 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp6014.drmrs.wmnet with reason: host reimage [production]
19:39 <ladsgroup@deploy2002> Finished scap sync-world: Backport for [[gerrit:1253622|Revert "Media: Use previous step for non-standard width between steps and original" (T419927)]] (duration: 07m 10s) [production]
19:35 <ladsgroup@deploy2002> ladsgroup: Continuing with sync [production]
19:34 <ladsgroup@deploy2002> ladsgroup: Backport for [[gerrit:1253622|Revert "Media: Use previous step for non-standard width between steps and original" (T419927)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
19:32 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: updating cloudcephmon2007-dev in codfw - jhancock@cumin2002" [production]
19:32 <ladsgroup@deploy2002> Started scap sync-world: Backport for [[gerrit:1253622|Revert "Media: Use previous step for non-standard width between steps and original" (T419927)]] [production]
19:28 <brett@cumin2002> END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on P{cp404[5-6].ulsfo.wmnet} and A:cp [production]
19:28 <brett@cumin2002> cookbooks.sre.cdn.roll-reboot finished rebooting cp4046.ulsfo.wmnet [production]
19:27 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp6004.drmrs.wmnet with OS trixie [production]
19:27 <jhancock@cumin2002> START - Cookbook sre.dns.netbox [production]
19:27 <cdobbins@cumin2002> conftool action : set/pooled=no; selector: name=cp6004.drmrs.wmnet [reason: trixie reimaging] [production]
19:27 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp6003.drmrs.wmnet with OS trixie [production]
19:26 <cdobbins@cumin2002> conftool action : set/pooled=no; selector: name=cp6003.drmrs.wmnet [reason: trixie reimaging] [production]
19:25 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp6002.drmrs.wmnet [reason: trixie reimaging] [production]
19:25 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp6001.drmrs.wmnet [reason: trixie reimaging] [production]