2025-05-20
02:18 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment eqiad1 for service: project,cinder |
[admin] |
02:18 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.restart_openstack on deployment eqiad1 for service: project,cinder |
[admin] |
00:54 |
<rzl@deploy1003> |
Stopping before sync operations |
[production] |
00:54 |
<rzl@deploy1003> |
Started scap sync-world: 1147901 |
[production] |
00:52 |
<wmbot~bd808@tools-bastion-12> |
Restart to pick up new image |
[tools.gitlab-content] |
00:51 |
<wmbot~bd808@tools-bastion-12> |
Built new image from fadea6d5 |
[tools.gitlab-content] |
00:29 |
<wmbot~bd808@tools-bastion-12> |
Built and deployed image from aa4d2285. The webservice is using the new golang implementation now. |
[tools.gitlab-content] |
00:21 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1039.eqiad.wmnet' (T394727) |
[admin] |
00:11 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1039.eqiad.wmnet' (T394727) |
[admin] |
2025-05-19
23:01 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cirrussearch1081.eqiad.wmnet with OS bullseye |
[production] |
22:58 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1039.eqiad.wmnet' (T394727) |
[admin] |
22:47 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1039.eqiad.wmnet' (T394727) |
[admin] |
22:45 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.reimage for host thanos-be2007.codfw.wmnet with OS bullseye |
[production] |
22:43 |
<andrew@cloudcumin1001> |
END (ERROR) - Cookbook wmcs.openstack.restart_openstack (exit_code=97) on deployment eqiad1 for service: project,nova |
[admin] |
22:43 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.restart_openstack on deployment eqiad1 for service: project,nova |
[admin] |
22:35 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment eqiad1 for service: project,nova |
[admin] |
22:30 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1072.eqiad.wmnet with OS bookworm |
[production] |
22:29 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.restart_openstack on deployment eqiad1 for service: project,nova |
[admin] |
22:29 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=0) on eqiad1, with recreate False, for hosts list: ['cloudvirt1072'] |
[cloudvirt-canary] |
22:29 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cirrussearch1081.eqiad.wmnet with reason: host reimage |
[production] |
22:28 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary on eqiad1, with recreate False, for hosts list: ['cloudvirt1072'] |
[cloudvirt-canary] |
22:28 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cirrussearch1080.eqiad.wmnet with OS bullseye |
[production] |
22:27 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=99) on eqiad1, with recreate False, for hosts list: ['cloudvirt1072'] |
[cloudvirt-canary] |
22:27 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary on eqiad1, with recreate False, for hosts list: ['cloudvirt1072'] |
[cloudvirt-canary] |
22:26 |
<andrew@cloudcumin1001> |
END (ERROR) - Cookbook wmcs.openstack.restart_openstack (exit_code=97) on deployment eqiad1 for service: project,nova |
[admin] |
22:26 |
<bking@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on relforge[1003-1004,1008-1010].eqiad.wmnet with reason: decom in progress |
[production] |
22:26 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.restart_openstack on deployment eqiad1 for service: project,nova |
[admin] |
22:25 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cirrussearch1081.eqiad.wmnet with reason: host reimage |
[production] |
22:25 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=99) on eqiad1, with recreate False, for hosts list: ['cloudvirt1072'] |
[cloudvirt-canary] |
22:24 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary on eqiad1, with recreate False, for hosts list: ['cloudvirt1072'] |
[cloudvirt-canary] |
22:23 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=99) on eqiad1, with recreate False, for hosts list: ['cloudvirt1072'] |
[cloudvirt-canary] |
22:22 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary on eqiad1, with recreate False, for hosts list: ['cloudvirt1072'] |
[cloudvirt-canary] |
22:21 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=99) on eqiad1, with recreate False, for hosts list: ['cloudvirt1072'] |
[cloudvirt-canary] |
22:20 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary on eqiad1, with recreate False, for hosts list: ['cloudvirt1072'] |
[cloudvirt-canary] |
22:11 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host cirrussearch1081 |
[production] |
22:11 |
<bking@cumin2002> |
START - Cookbook sre.hosts.move-vlan for host cirrussearch1081 |
[production] |
22:11 |
<bking@cumin2002> |
START - Cookbook sre.hosts.reimage for host cirrussearch1081.eqiad.wmnet with OS bullseye |
[production] |
22:08 |
<ryankemper@cumin2002> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) for hosts relforge[1003-1004].eqiad.wmnet |
[production] |
22:06 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1039.eqiad.wmnet' (T394727) |
[admin] |
22:04 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Remove db1255 and db2241 from s8 (T351820)', diff saved to https://phabricator.wikimedia.org/P76317 and previous config saved to /var/cache/conftool/dbconfig/20250519-220432-ladsgroup.json |
[production] |
22:03 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cirrussearch1080.eqiad.wmnet with reason: host reimage |
[production] |
22:02 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on A:cp-text_eqiad - <bound method SREBatchRunnerBase._reason of <cookbooks.sre.cdn.roll-upgrade-varnish.RollUpgradeVarnishRunner object at 0x7f1ff9224b80>> |
[production] |
22:02 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Remove db1209 and db2195 from x3 (T351820)', diff saved to https://phabricator.wikimedia.org/P76316 and previous config saved to /var/cache/conftool/dbconfig/20250519-220201-ladsgroup.json |
[production] |
22:01 |
<ryankemper@cumin2002> |
START - Cookbook sre.hosts.decommission for hosts relforge[1003-1004].eqiad.wmnet |
[production] |
22:01 |
<swfrench@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/shellbox-video: apply |
[production] |
22:00 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cirrussearch1080.eqiad.wmnet with reason: host reimage |
[production] |
22:00 |
<swfrench@deploy1003> |
helmfile [codfw] START helmfile.d/services/shellbox-video: apply |
[production] |
21:58 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on A:cp-upload_eqiad - <bound method SREBatchRunnerBase._reason of <cookbooks.sre.cdn.roll-upgrade-varnish.RollUpgradeVarnishRunner object at 0x7f9083afdf10>> |
[production] |
21:58 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1072.eqiad.wmnet with reason: host reimage |
[production] |
21:56 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1039.eqiad.wmnet' (T394727) |
[admin] |