201-250 of 10000 results (127ms)
2026-06-01 ยง
12:46 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
12:44 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
12:43 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
12:42 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
12:41 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
12:39 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1180 (T426633)', diff saved to https://phabricator.wikimedia.org/P93435 and previous config saved to /var/cache/conftool/dbconfig/20260601-123926-fceratto.json [production]
12:35 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
12:35 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
12:29 <bwojtowicz@deploy1003> helmfile [ml-serve-eqiad] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
12:28 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2027.codfw.wmnet [production]
12:28 <bwojtowicz@deploy1003> helmfile [ml-serve-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
12:27 <bwojtowicz@deploy1003> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
12:27 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of kubestagemaster2005.codfw.wmnet to plain [production]
12:26 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of kubestagemaster2005.codfw.wmnet to plain [production]
12:26 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2027.codfw.wmnet [production]
12:26 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2027.codfw.wmnet [production]
12:23 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of kubestagemaster2005.codfw.wmnet to drbd [production]
12:20 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
12:17 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
12:15 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-cluster (exit_code=0) depool all services in eqiad/ml-serve-eqiad: maintenance [production]
12:15 <dpogorzelski@cumin1003> START - Cookbook sre.k8s.pool-depool-cluster depool all services in eqiad/ml-serve-eqiad: maintenance [production]
12:11 <dpogorzelski@cumin1003> conftool action : set/pooled=false; selector: dnsdisc=inference,name=eqiad [production]
12:07 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of kubestagemaster2005.codfw.wmnet to drbd [production]
12:05 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2027.codfw.wmnet [production]
12:04 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2027.codfw.wmnet [production]
12:04 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti2027.codfw.wmnet [production]
12:04 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2027.codfw.wmnet [production]
11:59 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-cluster (exit_code=0) pool all services in eqiad/ml-serve-eqiad: maintenance [production]
11:59 <dpogorzelski@cumin1003> START - Cookbook sre.k8s.pool-depool-cluster pool all services in eqiad/ml-serve-eqiad: maintenance [production]
11:39 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1180 (T426633)', diff saved to https://phabricator.wikimedia.org/P93434 and previous config saved to /var/cache/conftool/dbconfig/20260601-113911-fceratto.json [production]
11:39 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1180.eqiad.wmnet with reason: Maintenance [production]
11:38 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1173 (T426633)', diff saved to https://phabricator.wikimedia.org/P93433 and previous config saved to /var/cache/conftool/dbconfig/20260601-113843-fceratto.json [production]
11:37 <moritzm> installing Exim security updates [production]
11:36 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
11:34 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
11:33 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
11:33 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
11:32 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
11:32 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
11:32 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
11:28 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
11:28 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
11:28 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1173', diff saved to https://phabricator.wikimedia.org/P93432 and previous config saved to /var/cache/conftool/dbconfig/20260601-112835-fceratto.json [production]
11:25 <jmm@deploy1003> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
11:23 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
11:23 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
11:22 <moritzm> installing imagemagick security updates [production]
11:22 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
11:22 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
11:22 <jmm@deploy1003> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]