1051-1100 of 10000 results (129ms)
2025-05-05 ยง
22:59 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['wdqs1017.eqiad.wmnet'] [production]
22:57 <zabe@deploy1003> zabe, novemlinguae: Continuing with sync [production]
22:57 <zabe@deploy1003> zabe, novemlinguae: Backport for [[gerrit:1140661|core-Permissions: refactor enwiki wgRemoveGroups]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
22:52 <zabe@deploy1003> Started scap sync-world: Backport for [[gerrit:1140661|core-Permissions: refactor enwiki wgRemoveGroups]] [production]
22:47 <ryankemper@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs1017.eqiad.wmnet'] [production]
22:46 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs1017.eqiad.wmnet'] [production]
22:46 <ryankemper@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs1017.eqiad.wmnet'] [production]
22:46 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1017.eqiad.wmnet with OS bullseye [production]
22:35 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apus-fe1003.wikimedia.org with OS bookworm [production]
22:12 <sbassett> Deployed security fix (2) for T392341 [production]
21:57 <sbassett> Deployed security fix (1) for T392341 [production]
21:34 <ryankemper@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs1017.eqiad.wmnet with OS bullseye [production]
21:15 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host apus-fe1003.wikimedia.org with OS bookworm [production]
21:14 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apus-fe1003.wikimedia.org with OS bookworm [production]
21:03 <jsn@deploy1003> Finished scap sync-world: Backport for [[gerrit:1141969|Fix link for first set of Patroller Tools surveys (T389401)]] (duration: 14m 43s) [production]
20:59 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host apus-fe1003.wikimedia.org with OS bookworm [production]
20:59 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1017.eqiad.wmnet with OS bullseye [production]
20:56 <jsn@deploy1003> jsn: Continuing with sync [production]
20:56 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host apus-fe1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
20:56 <cjming@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
20:55 <cjming@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
20:55 <jsn@deploy1003> jsn: Backport for [[gerrit:1141969|Fix link for first set of Patroller Tools surveys (T389401)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:51 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host apus-fe1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
20:50 <vriley@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host apus-fe1003 [production]
20:49 <vriley@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host apus-fe1003 [production]
20:48 <jsn@deploy1003> Started scap sync-world: Backport for [[gerrit:1141969|Fix link for first set of Patroller Tools surveys (T389401)]] [production]
20:48 <vriley@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:48 <vriley@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt apus-fe1003 - vriley@cumin1002" [production]
20:48 <vriley@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt apus-fe1003 - vriley@cumin1002" [production]
20:44 <ryankemper@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on wdqs[2008,2014-2015].codfw.wmnet,wdqs[1011,1016].eqiad.wmnet with reason: T388134 [production]
20:41 <vriley@cumin1002> START - Cookbook sre.dns.netbox [production]
20:35 <jsn@deploy1003> Finished scap sync-world: Backport for [[gerrit:1141569|Design Research Participant Survey: Undeploy (T392325)]], [[gerrit:1141947|Deploy first set of Patroller Tools surveys (T389401)]] (duration: 19m 58s) [production]
20:28 <jsn@deploy1003> dani, jsn: Continuing with sync [production]
20:21 <jsn@deploy1003> dani, jsn: Backport for [[gerrit:1141569|Design Research Participant Survey: Undeploy (T392325)]], [[gerrit:1141947|Deploy first set of Patroller Tools surveys (T389401)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:15 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1184.eqiad.wmnet with OS bullseye [production]
20:15 <vriley@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1002" [production]
20:15 <jsn@deploy1003> Started scap sync-world: Backport for [[gerrit:1141569|Design Research Participant Survey: Undeploy (T392325)]], [[gerrit:1141947|Deploy first set of Patroller Tools surveys (T389401)]] [production]
20:11 <vriley@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1002" [production]
19:58 <ryankemper@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs1017.eqiad.wmnet with OS bullseye [production]
19:46 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1184.eqiad.wmnet with reason: host reimage [production]
19:43 <vriley@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1184.eqiad.wmnet with reason: host reimage [production]
19:37 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-worker1185.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:35 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host an-worker1185.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:27 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1184.eqiad.wmnet with OS bullseye [production]
18:12 <aokoth@deploy1003> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
18:07 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-worker1184.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:07 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
18:07 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
18:03 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host an-worker1184.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:02 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-worker1184.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]