2026-04-02 §
10:19 <jayme@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
10:19 <jayme@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
10:19 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host wikikube-worker-exp2001.codfw.wmnet [production]
10:19 <moritzm> installing freetype security updates [production]
10:19 <jayme@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
10:19 <jayme@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
10:18 <jayme@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
10:18 <daniel@deploy1003> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
10:18 <jayme@deploy1003> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. [production]
10:18 <jayme@deploy1003> helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. [production]
10:18 <jayme@deploy1003> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. [production]
10:18 <jayme@deploy1003> helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. [production]
10:18 <jayme@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. [production]
10:17 <jayme@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. [production]
10:17 <jayme@deploy1003> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
10:17 <jayme@deploy1003> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
10:17 <jayme@deploy1003> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
10:17 <jayme@deploy1003> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
10:17 <jayme@deploy1003> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
10:17 <daniel@deploy1003> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
10:17 <jayme@deploy1003> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
10:17 <jayme@deploy1003> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
10:16 <jmm@deploy1003> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
10:16 <jayme@deploy1003> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
10:15 <jiji@cumin1003> START - Cookbook sre.hosts.reboot-single for host wikikube-worker-exp2001.codfw.wmnet [production]
10:14 <jmm@deploy1003> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]
10:12 <jmm@deploy1003> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
10:10 <jmm@deploy1003> helmfile [codfw] START helmfile.d/services/thumbor: apply [production]
10:05 <jmm@deploy1003> helmfile [staging] DONE helmfile.d/services/thumbor: apply [production]
10:05 <jmm@deploy1003> helmfile [staging] START helmfile.d/services/thumbor: apply [production]
09:48 <javiermonton@deploy1003> helmfile [codfw] DONE helmfile.d/services/eventgate-main: sync [production]
09:48 <javiermonton@deploy1003> helmfile [codfw] START helmfile.d/services/eventgate-main: sync [production]
09:48 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db2156 (T419635)', diff saved to https://phabricator.wikimedia.org/P90211 and previous config saved to /var/cache/conftool/dbconfig/20260402-094834-fceratto.json [production]
09:48 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2156.codfw.wmnet with reason: Maintenance [production]
09:48 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2149 (T419635)', diff saved to https://phabricator.wikimedia.org/P90210 and previous config saved to /var/cache/conftool/dbconfig/20260402-094808-fceratto.json [production]
09:48 <javiermonton@deploy1003> helmfile [eqiad] DONE helmfile.d/services/eventgate-main: sync [production]
09:47 <javiermonton@deploy1003> helmfile [eqiad] START helmfile.d/services/eventgate-main: sync [production]
09:45 <javiermonton@deploy1003> helmfile [staging] DONE helmfile.d/services/eventgate-main: sync [production]
09:45 <javiermonton@deploy1003> helmfile [staging] START helmfile.d/services/eventgate-main: sync [production]
09:38 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2149', diff saved to https://phabricator.wikimedia.org/P90209 and previous config saved to /var/cache/conftool/dbconfig/20260402-093759-fceratto.json [production]
09:33 <jmm@cumin2002> END (PASS) - Cookbook sre.misc-clusters.restart-reboot-config-master (exit_code=0) rolling reboot on A:config-master-codfw [production]
09:29 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) config-master.discovery.wmnet. on all recursors [production]
09:29 <jmm@cumin2002> START - Cookbook sre.dns.wipe-cache config-master.discovery.wmnet. on all recursors [production]
09:29 <jmm@cumin2002> START - Cookbook sre.misc-clusters.restart-reboot-config-master rolling reboot on A:config-master-codfw [production]
09:27 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2149', diff saved to https://phabricator.wikimedia.org/P90208 and previous config saved to /var/cache/conftool/dbconfig/20260402-092751-fceratto.json [production]
09:27 <gkyziridis@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
09:27 <gkyziridis@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
09:19 <moritzm> upgrading Envoy on the config-master servers to 1.35.9 T419637 T410975 [production]
09:17 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2149 (T419635)', diff saved to https://phabricator.wikimedia.org/P90207 and previous config saved to /var/cache/conftool/dbconfig/20260402-091743-fceratto.json [production]
08:55 <gkyziridis@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]