1551-1600 of 10000 results (31ms)
2025-06-03 ยง
07:56 <jmm@cumin1003> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host prometheus7002.magru.wmnet [production]
07:56 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2154 (T395241)', diff saved to https://phabricator.wikimedia.org/P76899 and previous config saved to /var/cache/conftool/dbconfig/20250603-075622-fceratto.json [production]
07:56 <jmm@cumin1003> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
07:49 <mvernon@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on thanos-be2009.codfw.wmnet with reason: host reimage [production]
07:49 <jmm@cumin1003> END (PASS) - Cookbook sre.misc-clusters.restart-reboot-config-master (exit_code=0) rolling reboot on A:config-master [production]
07:46 <jmm@cumin1003> START - Cookbook sre.dns.netbox [production]
07:46 <jmm@cumin1003> START - Cookbook sre.ganeti.makevm for new host prometheus7002.magru.wmnet [production]
07:46 <mvernon@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on thanos-be2009.codfw.wmnet with reason: host reimage [production]
07:44 <jmm@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) config-master.discovery.wmnet. on all recursors [production]
07:44 <jmm@cumin1003> START - Cookbook sre.dns.wipe-cache config-master.discovery.wmnet. on all recursors [production]
07:43 <jmm@cumin1003> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host bast7002.wikimedia.org [production]
07:43 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host bast7002.wikimedia.org with OS bookworm [production]
07:41 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P76898 and previous config saved to /var/cache/conftool/dbconfig/20250603-074113-fceratto.json [production]
07:38 <jmm@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) config-master.discovery.wmnet. on all recursors [production]
07:38 <jmm@cumin1003> START - Cookbook sre.dns.wipe-cache config-master.discovery.wmnet. on all recursors [production]
07:37 <jmm@cumin1003> START - Cookbook sre.misc-clusters.restart-reboot-config-master rolling reboot on A:config-master [production]
07:27 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on bast7002.wikimedia.org with reason: host reimage [production]
07:26 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P76897 and previous config saved to /var/cache/conftool/dbconfig/20250603-072604-fceratto.json [production]
07:25 <tchanders@deploy1003> Finished scap sync-world: Backport for [[gerrit:1142649|Assign IP auto-reveal rights to certain groups (T386492)]] (duration: 10m 39s) [production]
07:23 <mvernon@cumin1002> START - Cookbook sre.hosts.reimage for host thanos-be2009.codfw.wmnet with OS bullseye [production]
07:23 <jmm@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on bast7002.wikimedia.org with reason: host reimage [production]
07:18 <tchanders@deploy1003> tchanders: Continuing with sync [production]
07:16 <tchanders@deploy1003> tchanders: Backport for [[gerrit:1142649|Assign IP auto-reveal rights to certain groups (T386492)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
07:14 <tchanders@deploy1003> Started scap sync-world: Backport for [[gerrit:1142649|Assign IP auto-reveal rights to certain groups (T386492)]] [production]
07:10 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2154 (T395241)', diff saved to https://phabricator.wikimedia.org/P76896 and previous config saved to /var/cache/conftool/dbconfig/20250603-071057-fceratto.json [production]
07:06 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
07:06 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
07:02 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
07:02 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
07:01 <marostegui@cumin1002> dbctl commit (dc=all): 'es1037 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P76895 and previous config saved to /var/cache/conftool/dbconfig/20250603-070155-root.json [production]
07:00 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2154 (T395241)', diff saved to https://phabricator.wikimedia.org/P76894 and previous config saved to /var/cache/conftool/dbconfig/20250603-070036-fceratto.json [production]
07:00 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2154.codfw.wmnet with reason: Maintenance [production]
07:00 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2152 (T395241)', diff saved to https://phabricator.wikimedia.org/P76893 and previous config saved to /var/cache/conftool/dbconfig/20250603-070021-fceratto.json [production]
06:58 <jmm@cumin1003> START - Cookbook sre.hosts.reimage for host bast7002.wikimedia.org with OS bookworm [production]
06:54 <jmm@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM bast7002.wikimedia.org - jmm@cumin1003" [production]
06:54 <jmm@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM bast7002.wikimedia.org - jmm@cumin1003" [production]
06:54 <jmm@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) bast7002.wikimedia.org on all recursors [production]
06:54 <jmm@cumin1003> START - Cookbook sre.dns.wipe-cache bast7002.wikimedia.org on all recursors [production]
06:54 <jmm@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
06:54 <jmm@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM bast7002.wikimedia.org - jmm@cumin1003" [production]
06:53 <jmm@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM bast7002.wikimedia.org - jmm@cumin1003" [production]
06:48 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host thanos-be2009.codfw.wmnet with OS bullseye [production]
06:46 <marostegui@cumin1002> dbctl commit (dc=all): 'es1037 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P76892 and previous config saved to /var/cache/conftool/dbconfig/20250603-064649-root.json [production]
06:45 <jmm@cumin1003> START - Cookbook sre.dns.netbox [production]
06:45 <jmm@cumin1003> START - Cookbook sre.ganeti.makevm for new host bast7002.wikimedia.org [production]
06:45 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2152', diff saved to https://phabricator.wikimedia.org/P76891 and previous config saved to /var/cache/conftool/dbconfig/20250603-064513-fceratto.json [production]
06:41 <marostegui@cumin1002> dbctl commit (dc=all): 'es2037 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P76890 and previous config saved to /var/cache/conftool/dbconfig/20250603-064147-root.json [production]
06:37 <marostegui> Decrease buffer size on clouddb1016:s8 T390954 [production]
06:31 <marostegui@cumin1002> dbctl commit (dc=all): 'es1037 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P76889 and previous config saved to /var/cache/conftool/dbconfig/20250603-063144-root.json [production]
06:30 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1154.eqiad.wmnet with reason: Setting up x3 T390954 [production]