2025-07-11
11:35 <fceratto@cumin1002> dbctl commit (dc=all): 'Depool es1034 for upgrade', diff saved to https://phabricator.wikimedia.org/P78915 and previous config saved to /var/cache/conftool/dbconfig/20250711-113532-fceratto.json [production]
11:34 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on es1034.eqiad.wmnet with reason: Maintenance [production]
11:31 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on es1032.eqiad.wmnet with reason: Maintenance [production]
11:30 <fceratto@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for es1031.eqiad.wmnet [production]
11:30 <fceratto@cumin1002> START - Cookbook sre.hosts.remove-downtime for es1031.eqiad.wmnet [production]
11:29 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2192.codfw.wmnet with reason: Maintenance [production]
11:29 <marostegui@cumin1002> dbctl commit (dc=all): 'es1039 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P78914 and previous config saved to /var/cache/conftool/dbconfig/20250711-112933-root.json [production]
11:26 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1037.eqiad.wmnet with OS bullseye [production]
11:26 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on es1031.eqiad.wmnet with reason: Maintenance [production]
11:14 <marostegui@cumin1002> dbctl commit (dc=all): 'es1039 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P78913 and previous config saved to /var/cache/conftool/dbconfig/20250711-111428-root.json [production]
11:10 <AntiComposite> cvn-app12 restart all bots [cvn]
11:09 <AntiComposite> cvn-app10 restart all bots [cvn]
10:59 <marostegui@cumin1002> dbctl commit (dc=all): 'es1039 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P78912 and previous config saved to /var/cache/conftool/dbconfig/20250711-105922-root.json [production]
10:50 <marostegui@cumin1002> dbctl commit (dc=all): 'db2192 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P78911 and previous config saved to /var/cache/conftool/dbconfig/20250711-105039-root.json [production]
10:35 <marostegui@cumin1002> dbctl commit (dc=all): 'db2192 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P78910 and previous config saved to /var/cache/conftool/dbconfig/20250711-103533-root.json [production]
10:32 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/changeprop: apply [production]
10:32 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/changeprop: apply [production]
10:31 <hnowlan@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
10:31 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
10:30 <hnowlan@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
10:30 <hnowlan@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
10:26 <jmm@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1003.eqiad.wmnet with OS trixie [production]
10:20 <marostegui@cumin1002> dbctl commit (dc=all): 'db2192 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P78909 and previous config saved to /var/cache/conftool/dbconfig/20250711-102027-root.json [production]
10:05 <marostegui@cumin1002> dbctl commit (dc=all): 'db2192 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P78908 and previous config saved to /var/cache/conftool/dbconfig/20250711-100522-root.json [production]
10:01 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2192', diff saved to https://phabricator.wikimedia.org/P78907 and previous config saved to /var/cache/conftool/dbconfig/20250711-100106-root.json [production]
10:00 <marostegui@cumin1002> dbctl commit (dc=all): 'db2192 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P78906 and previous config saved to /var/cache/conftool/dbconfig/20250711-100033-root.json [production]
09:45 <marostegui@cumin1002> dbctl commit (dc=all): 'db2192 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P78905 and previous config saved to /var/cache/conftool/dbconfig/20250711-094527-root.json [production]
09:41 <fnegri@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for tools-k8s-worker-nfs-77, tools-k8s-worker-nfs-68, tools-k8s-worker-nfs-37 [tools]
09:39 <root@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2192.codfw.wmnet with reason: Maintenance [production]
09:31 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2192 T399280', diff saved to https://phabricator.wikimedia.org/P78904 and previous config saved to /var/cache/conftool/dbconfig/20250711-093115-root.json [production]
09:30 <marostegui@cumin1002> dbctl commit (dc=all): 'Promote db2213 to s5 primary T399280', diff saved to https://phabricator.wikimedia.org/P78903 and previous config saved to /var/cache/conftool/dbconfig/20250711-093006-marostegui.json [production]
09:29 <marostegui> Starting s5 codfw failover from db2192 to db2213 - T399280 [production]
09:27 <jmm@cumin1003> START - Cookbook sre.hosts.reimage for host sretest1003.eqiad.wmnet with OS trixie [production]
09:25 <moritzm> imported perccli for trixie-wikimedia T391083 [production]
09:25 <fnegri@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-77, tools-k8s-worker-nfs-68, tools-k8s-worker-nfs-37 [tools]
09:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Remove db2213 from API/vslow/dump T399280', diff saved to https://phabricator.wikimedia.org/P78902 and previous config saved to /var/cache/conftool/dbconfig/20250711-091812-root.json [production]
09:15 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 22 hosts with reason: Primary switchover s5 T399280 [production]
09:12 <marostegui@cumin1002> dbctl commit (dc=all): 'db2223 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P78901 and previous config saved to /var/cache/conftool/dbconfig/20250711-091242-root.json [production]
09:04 <jmm@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1003.eqiad.wmnet with OS trixie [production]
08:57 <marostegui@cumin1002> dbctl commit (dc=all): 'db2223 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P78900 and previous config saved to /var/cache/conftool/dbconfig/20250711-085736-root.json [production]
08:51 <elukey@deploy1003> helmfile [codfw] DONE helmfile.d/admin 'sync'. [production]
08:51 <elukey@deploy1003> helmfile [codfw] START helmfile.d/admin 'sync'. [production]
08:51 <elukey@deploy1003> helmfile [eqiad] DONE helmfile.d/admin 'sync'. [production]
08:51 <elukey@deploy1003> helmfile [eqiad] START helmfile.d/admin 'sync'. [production]
08:42 <marostegui@cumin1002> dbctl commit (dc=all): 'db2223 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P78899 and previous config saved to /var/cache/conftool/dbconfig/20250711-084230-root.json [production]
08:42 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/changeprop: sync [production]
08:41 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/changeprop: sync [production]
08:34 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts puppetserver2003.codfw.wmnet [production]
08:34 <jmm@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:34 <jmm@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: puppetserver2003.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jmm@cumin1003" [production]