51-100 of 10000 results (103ms)
2026-05-15 ยง
09:49 <jiji@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-mcrouter: apply [production]
09:48 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie [production]
09:48 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2010.codfw.wmnet with OS trixie [production]
09:33 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie [production]
09:32 <mvernon@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ms-be2064.codfw.wmnet with OS bullseye [production]
09:32 <topranks> Migrate cr4-ulsfo link to asw1-23-ulsfo to tagged interface T424611 [production]
09:30 <jiji@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-mcrouter: apply [production]
09:30 <jiji@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-mcrouter: apply [production]
09:30 <mvernon@cumin2002> END (FAIL) - Cookbook sre.swift.convert-disks (exit_code=99) for host ms-be2065 [production]
09:30 <jiji@deploy1003> helmfile [codfw] START helmfile.d/services/mw-mcrouter: apply [production]
09:10 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2010.codfw.wmnet with OS trixie [production]
09:08 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on db2218.codfw.wmnet with reason: Host crashed T426383 [production]
09:08 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host ms-be2064 [production]
09:08 <mvernon@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ms-be2064 [production]
09:06 <mvernon@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host ms-be2064 [production]
09:06 <mvernon@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ms-be2064.codfw.wmnet 56.32.192.10.in-addr.arpa 6.5.0.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
09:06 <mvernon@cumin2002> START - Cookbook sre.dns.wipe-cache ms-be2064.codfw.wmnet 56.32.192.10.in-addr.arpa 6.5.0.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
09:06 <mvernon@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
09:06 <mvernon@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host ms-be2064 - mvernon@cumin2002" [production]
09:06 <mvernon@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host ms-be2064 - mvernon@cumin2002" [production]
09:03 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie [production]
09:02 <mvernon@cumin2002> START - Cookbook sre.dns.netbox [production]
09:02 <mvernon@cumin2002> START - Cookbook sre.hosts.move-vlan for host ms-be2064 [production]
09:01 <mvernon@cumin2002> START - Cookbook sre.hosts.reimage for host ms-be2064.codfw.wmnet with OS bullseye [production]
09:00 <marostegui@cumin1003> dbctl commit (dc=all): 'Depool db2218 T426380', diff saved to https://phabricator.wikimedia.org/P92553 and previous config saved to /var/cache/conftool/dbconfig/20260515-090000-marostegui.json [production]
08:58 <marostegui@cumin1003> dbctl commit (dc=all): 'Promote db2220 to s7 primary T426380', diff saved to https://phabricator.wikimedia.org/P92552 and previous config saved to /var/cache/conftool/dbconfig/20260515-085836-marostegui.json [production]
08:56 <marostegui> Starting s7 codfw failover from db2218 to db2220 - T426380 [production]
08:54 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 28 hosts with reason: Primary switchover s7 T426380 [production]
08:54 <marostegui@cumin1003> dbctl commit (dc=all): 'Set db2220 with weight 0 T426380', diff saved to https://phabricator.wikimedia.org/P92551 and previous config saved to /var/cache/conftool/dbconfig/20260515-085420-marostegui.json [production]
08:41 <mvernon@cumin2002> START - Cookbook sre.swift.convert-disks for host ms-be2065 [production]
08:41 <mvernon@cumin2002> END (FAIL) - Cookbook sre.swift.convert-disks (exit_code=99) for host ms-be2064 [production]
08:28 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host kafka-logging1006.eqiad.wmnet with OS trixie [production]
08:17 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host kafka-logging1006.eqiad.wmnet with OS trixie [production]
08:16 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
08:05 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
08:03 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
08:03 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
07:58 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
07:58 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
07:55 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
07:55 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
07:55 <mvernon@cumin2002> START - Cookbook sre.swift.convert-disks for host ms-be2064 [production]
07:54 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
07:54 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
07:42 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2010.codfw.wmnet with OS trixie [production]
07:41 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.powercycle (exit_code=0) for host sretest2010 [production]
07:39 <elukey@cumin1003> START - Cookbook sre.hosts.powercycle for host sretest2010 [production]
07:10 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie [production]
02:37 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host db1290.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
02:34 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host db1290.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]