2026-03-24
18:36 <daniel@deploy2002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
18:35 <daniel@deploy2002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
18:25 <daniel@deploy2002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
18:24 <daniel@deploy2002> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
18:20 <akhatun@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-edit-type-enrich-next: apply [production]
18:20 <akhatun@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-edit-type-enrich-next: apply [production]
18:13 <daniel@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
18:11 <daniel@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
18:07 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on phab2002.codfw.wmnet with reason: T420228 [production]
18:01 <aokoth@deploy2002> helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
18:01 <aokoth@deploy2002> helmfile [aux-k8s-codfw] START helmfile.d/admin 'apply'. [production]
18:01 <aokoth@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
18:00 <aokoth@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
18:00 <mutante> codesearch9.codesearch - systemctl restart hound_proxy (T421147) [production]
17:34 <brett@cumin2002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) rebooting P{lvs7003.magru.wmnet} and A:liberica [production]
17:30 <brett@cumin2002> START - Cookbook sre.loadbalancer.admin rebooting P{lvs7003.magru.wmnet} and A:liberica [production]
17:20 <blake@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-ext: apply [production]
17:20 <blake@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-ext: apply [production]
17:20 <blake@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-api-ext: apply [production]
17:20 <blake@deploy2002> helmfile [codfw] START helmfile.d/services/mw-api-ext: apply [production]
17:00 <blake@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-web: apply [production]
17:00 <blake@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-web: apply [production]
17:00 <blake@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-web: apply [production]
17:00 <blake@deploy2002> helmfile [codfw] START helmfile.d/services/mw-web: apply [production]
16:47 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp1115.eqiad.wmnet with OS trixie [production]
16:38 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp1113.* [production]
16:32 <fceratto@cumin1003> START - Cookbook sre.mysql.pool pool db1170: Degraded drive replaced T420873 [production]
16:24 <sfaci@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/test-kitchen-next: apply [production]
16:24 <sfaci@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/test-kitchen-next: apply [production]
16:22 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1113.eqiad.wmnet with OS trixie [production]
16:05 <brouberol@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
16:04 <brouberol@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
16:03 <blake@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-web: apply [production]
16:03 <brouberol@deploy2002> helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
16:03 <blake@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-web: apply [production]
16:03 <blake@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-web: apply [production]
16:03 <blake@deploy2002> helmfile [codfw] START helmfile.d/services/mw-web: apply [production]
16:03 <brouberol@deploy2002> helmfile [aux-k8s-codfw] START helmfile.d/admin 'apply'. [production]
15:59 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1113.eqiad.wmnet with reason: host reimage [production]
15:54 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp1113.eqiad.wmnet with reason: host reimage [production]
15:54 <bjensen> Services portion of the datacenter switchover is complete [production]
15:50 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aux-k8s-worker2009.codfw.wmnet with OS trixie [production]
15:46 <blake@cumin1003> END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) depool all services in codfw: Datacenter Switchover - T413974 [production]
15:38 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
15:38 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp1113.eqiad.wmnet with OS trixie [production]
15:38 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1113.eqiad.wmnet with OS trixie [production]
15:36 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
15:34 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aux-k8s-worker2009.codfw.wmnet with reason: host reimage [production]
15:30 <elukey@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on aux-k8s-worker2009.codfw.wmnet with reason: host reimage [production]
15:20 <blake@cumin1003> START - Cookbook sre.discovery.datacenter depool all services in codfw: Datacenter Switchover - T413974 [production]