451-500 of 10000 results (151ms)
2026-06-09 ยง
08:08 <jiji@cumin1003> conftool action : set/pooled=false; selector: dnsdisc=docker-registry,name=codfw [production]
07:59 <ayounsi@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
07:59 <ayounsi@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: fix typoes - ayounsi@cumin1003" [production]
07:59 <ayounsi@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: fix typoes - ayounsi@cumin1003" [production]
07:52 <ayounsi@cumin1003> START - Cookbook sre.dns.netbox [production]
07:47 <brouberol@dns1004> END - running authdns-update [production]
07:46 <brouberol@dns1004> START - running authdns-update [production]
07:44 <brouberol@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/aux-k8s-services/kafka-ui: apply [production]
07:43 <brouberol@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/aux-k8s-services/kafka-ui: apply [production]
07:43 <brouberol@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/aux-k8s-services/kafka-ui: apply [production]
07:42 <brouberol@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/aux-k8s-services/kafka-ui: apply [production]
07:41 <brouberol@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/aux-k8s-services/kafka-ui: apply [production]
07:39 <brouberol@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/aux-k8s-services/kafka-ui: apply [production]
07:38 <brouberol@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
07:37 <brouberol@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/admin 'apply'. [production]
07:37 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host db1237.eqiad.wmnet with OS trixie [production]
07:36 <marostegui@cumin1003> END (ERROR) - Cookbook sre.mysql.major-upgrade (exit_code=97) [production]
07:36 <brouberol@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
07:36 <brouberol@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
07:36 <marostegui@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
07:26 <fceratto@dns1004> END - running authdns-update [production]
07:24 <fceratto@dns1004> START - running authdns-update [production]
07:22 <marostegui@dns1004> END - running authdns-update [production]
07:21 <marostegui@dns1004> START - running authdns-update [production]
07:19 <elukey@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
07:19 <elukey@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Fix dse-k8s-wdqs2002 duplicate ipv6 address - elukey@cumin1003" [production]
07:19 <elukey@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Fix dse-k8s-wdqs2002 duplicate ipv6 address - elukey@cumin1003" [production]
07:16 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1160.eqiad.wmnet with reason: Maintenance [production]
07:12 <elukey@cumin1003> START - Cookbook sre.dns.netbox [production]
07:11 <fceratto@cumin1003> END (ERROR) - Cookbook sre.mysql.pool (exit_code=97) pool db1160: Repooling [production]
07:11 <fceratto@cumin1003> START - Cookbook sre.mysql.pool pool db1160: Repooling [production]
07:11 <fceratto@cumin1003> END (ERROR) - Cookbook sre.mysql.pool (exit_code=97) pool db1160: Repooling [production]
07:11 <fceratto@cumin1003> START - Cookbook sre.mysql.pool pool db1160: Repooling [production]
07:00 <marostegui@cumin1003> END (FAIL) - Cookbook sre.mysql.major-upgrade (exit_code=99) [production]
07:00 <marostegui@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db1237.eqiad.wmnet with OS trixie [production]
06:24 <fceratto@cumin1003> dbctl commit (dc=all): 'Depool db1160 T426086', diff saved to https://phabricator.wikimedia.org/P93940 and previous config saved to /var/cache/conftool/dbconfig/20260609-062412-fceratto.json [production]
06:17 <cscott@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-parsoid: apply [production]
06:16 <cscott@deploy1003> helmfile [codfw] START helmfile.d/services/mw-parsoid: apply [production]
06:16 <cscott@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply [production]
06:16 <cscott@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply [production]
06:15 <cscott@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-parsoid: apply [production]
06:15 <cscott@deploy1003> helmfile [codfw] START helmfile.d/services/mw-parsoid: apply [production]
06:15 <cscott@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply [production]
06:14 <cscott@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply [production]
06:12 <fceratto@cumin1003> dbctl commit (dc=all): 'Promote db1244 to s4 primary and set section read-write T426086', diff saved to https://phabricator.wikimedia.org/P93939 and previous config saved to /var/cache/conftool/dbconfig/20260609-061222-fceratto.json [production]
06:11 <fceratto@cumin1003> dbctl commit (dc=all): 'Set s4 eqiad as read-only for maintenance - T426086', diff saved to https://phabricator.wikimedia.org/P93938 and previous config saved to /var/cache/conftool/dbconfig/20260609-061131-fceratto.json [production]
06:10 <federico3> Starting s4 eqiad failover from db1160 to db1244 - T426086 [production]
06:01 <fceratto@cumin1003> dbctl commit (dc=all): 'Set db1244 with weight 0 T426086', diff saved to https://phabricator.wikimedia.org/P93937 and previous config saved to /var/cache/conftool/dbconfig/20260609-060121-fceratto.json [production]
06:01 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 40 hosts with reason: Primary switchover s4 T426086 [production]
05:40 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host db1237.eqiad.wmnet with OS trixie [production]