1851-1900 of 10000 results (136ms)
2025-06-02 ยง
09:55 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host rpki2003.codfw.wmnet [production]
09:55 <marostegui@cumin1002> dbctl commit (dc=all): 'es2039 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P76800 and previous config saved to /var/cache/conftool/dbconfig/20250602-095514-root.json [production]
09:45 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es2039.codfw.wmnet with reason: Maintenance [production]
09:44 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool es2039 T395647', diff saved to https://phabricator.wikimedia.org/P76798 and previous config saved to /var/cache/conftool/dbconfig/20250602-094402-marostegui.json [production]
09:40 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netmon1003.wikimedia.org [production]
09:33 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host netmon1003.wikimedia.org [production]
09:28 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ncredir7003.magru.wmnet with reason: host reimage [production]
09:27 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netmon2002.wikimedia.org [production]
09:24 <jmm@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ncredir7003.magru.wmnet with reason: host reimage [production]
09:22 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host netmon2002.wikimedia.org [production]
09:13 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host puppetboard1003.eqiad.wmnet [production]
09:10 <jelto> update gitlab-settings artifact retention to 6 month - T395014 [production]
09:09 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host puppetboard1003.eqiad.wmnet [production]
09:02 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host puppetboard2003.codfw.wmnet [production]
08:58 <jmm@cumin1003> START - Cookbook sre.hosts.reimage for host ncredir7003.magru.wmnet with OS bookworm [production]
08:58 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host puppetboard2003.codfw.wmnet [production]
08:51 <jmm@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM ncredir7003.magru.wmnet - jmm@cumin1003" [production]
08:51 <jmm@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM ncredir7003.magru.wmnet - jmm@cumin1003" [production]
08:51 <jmm@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ncredir7003.magru.wmnet on all recursors [production]
08:51 <jmm@cumin1003> START - Cookbook sre.dns.wipe-cache ncredir7003.magru.wmnet on all recursors [production]
08:51 <jmm@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:51 <jmm@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM ncredir7003.magru.wmnet - jmm@cumin1003" [production]
08:51 <jmm@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM ncredir7003.magru.wmnet - jmm@cumin1003" [production]
08:51 <marostegui@cumin1002> dbctl commit (dc=all): 'es1040 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P76796 and previous config saved to /var/cache/conftool/dbconfig/20250602-085105-root.json [production]
08:47 <jmm@cumin1003> START - Cookbook sre.dns.netbox [production]
08:47 <jmm@cumin1003> START - Cookbook sre.ganeti.makevm for new host ncredir7003.magru.wmnet [production]
08:45 <jmm@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ncredir7003.magru.wmnet with OS bookworm [production]
08:42 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts ncredir7003.magru.wmnet [production]
08:42 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:42 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: ncredir7003.magru.wmnet decommissioned, removing all IPs except the asset tag one - jmm@cumin2002" [production]
08:41 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: ncredir7003.magru.wmnet decommissioned, removing all IPs except the asset tag one - jmm@cumin2002" [production]
08:37 <jmm@cumin2002> START - Cookbook sre.dns.netbox [production]
08:36 <marostegui@cumin1002> dbctl commit (dc=all): 'es1040 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P76795 and previous config saved to /var/cache/conftool/dbconfig/20250602-083559-root.json [production]
08:33 <jmm@cumin2002> START - Cookbook sre.hosts.decommission for hosts ncredir7003.magru.wmnet [production]
08:20 <marostegui@cumin1002> dbctl commit (dc=all): 'es1040 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P76794 and previous config saved to /var/cache/conftool/dbconfig/20250602-082053-root.json [production]
08:11 <jmm@cumin1003> START - Cookbook sre.hosts.reimage for host ncredir7003.magru.wmnet with OS bookworm [production]
08:05 <marostegui@cumin1002> dbctl commit (dc=all): 'es1040 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P76793 and previous config saved to /var/cache/conftool/dbconfig/20250602-080547-root.json [production]
07:58 <phuedx@deploy1003> Finished scap sync-world: Backport for [[gerrit:1152253|Beta Cluster: Support A/B experiments (T393918)]] (duration: 35m 59s) [production]
07:50 <marostegui@cumin1002> dbctl commit (dc=all): 'es1040 (re)pooling @ 20%: Repooling', diff saved to https://phabricator.wikimedia.org/P76792 and previous config saved to /var/cache/conftool/dbconfig/20250602-075041-root.json [production]
07:49 <phuedx@deploy1003> phuedx, dr0ptp4kt: Continuing with sync [production]
07:38 <phuedx@deploy1003> phuedx, dr0ptp4kt: Backport for [[gerrit:1152253|Beta Cluster: Support A/B experiments (T393918)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
07:36 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti7002.magru.wmnet [production]
07:35 <marostegui@cumin1002> dbctl commit (dc=all): 'es1040 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P76791 and previous config saved to /var/cache/conftool/dbconfig/20250602-073535-root.json [production]
07:27 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host ganeti7002.magru.wmnet [production]
07:22 <phuedx@deploy1003> Started scap sync-world: Backport for [[gerrit:1152253|Beta Cluster: Support A/B experiments (T393918)]] [production]
07:20 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host bast5004.wikimedia.org [production]
07:13 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 6 hosts with reason: Maintenance [production]
07:13 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es1040.eqiad.wmnet with reason: Maintenance [production]
07:08 <marostegui@cumin1002> dbctl commit (dc=all): 'db1169 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P76790 and previous config saved to /var/cache/conftool/dbconfig/20250602-070837-root.json [production]
07:08 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host bast4005.wikimedia.org [production]