1-50 of 10000 results (115ms)
2026-03-20 ยง
18:43 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5021.eqsin.wmnet with reason: host reimage [production]
18:39 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5021.eqsin.wmnet with reason: host reimage [production]
18:28 <cdobbins@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on lvs2013.codfw.wmnet with reason: reboot [production]
18:16 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-tcp-proxy rolling reboot on A:tcpproxy and A:tcpproxy [production]
18:14 <jhathaway@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on db1253.eqiad.wmnet with reason: T420041 [production]
17:59 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp5021.eqsin.wmnet with OS trixie [production]
17:54 <cdobbins@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp5021.eqsin.wmnet with OS trixie [production]
17:51 <cdobbins@cumin2002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host lvs2014.codfw.wmnet [production]
17:40 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on contint1003.wikimedia.org with reason: jenkins on java21 [production]
17:39 <cdobbins@cumin2002> START - Cookbook sre.hosts.reboot-single for host lvs2014.codfw.wmnet [production]
16:54 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-worker1172.eqiad.wmnet with OS bullseye [production]
16:54 <btullis@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1172.eqiad.wmnet with OS bullseye [production]
16:46 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-worker1172.eqiad.wmnet with OS bullseye [production]
16:33 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp5021.eqsin.wmnet with OS trixie [production]
16:32 <cdobbins@cumin2002> conftool action : set/pooled=no; selector: name=cp5021.eqsin.wmnet [reason: trixie reimaging] [production]
16:09 <btullis@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1172.eqiad.wmnet with OS bullseye [production]
16:08 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc-wf2002.codfw.wmnet [production]
16:02 <jiji@cumin1003> START - Cookbook sre.hosts.reboot-single for host mc-wf2002.codfw.wmnet [production]
15:51 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc-wf2001.codfw.wmnet [production]
15:45 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
15:45 <jiji@cumin1003> START - Cookbook sre.hosts.reboot-single for host mc-wf2001.codfw.wmnet [production]
15:43 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc2041.codfw.wmnet [production]
15:37 <jiji@cumin1003> START - Cookbook sre.hosts.reboot-single for host mc2041.codfw.wmnet [production]
15:32 <cparle@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-experimental: apply [production]
15:32 <cparle@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-experimental: apply [production]
15:29 <jiji@cumin1003> END (PASS) - Cookbook sre.memcached.roll-reboot-restart (exit_code=0) rolling reboot on A:memcached-eqiad [production]
15:16 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc2040.codfw.wmnet [production]
15:10 <jiji@cumin1003> START - Cookbook sre.hosts.reboot-single for host mc2040.codfw.wmnet [production]
15:02 <trueg@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs-queryhammer: apply [production]
15:01 <trueg@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs-queryhammer: apply [production]
15:00 <trueg@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs-queryhammer: apply [production]
14:59 <trueg@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs-queryhammer: apply [production]
14:58 <trueg@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs-queryhammer: apply [production]
14:58 <trueg@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs-queryhammer: apply [production]
14:57 <trueg@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs-queryhammer: apply [production]
14:56 <trueg@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs-queryhammer: apply [production]
14:55 <trueg@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs-queryhammer: apply [production]
14:50 <trueg@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs-queryhammer: apply [production]
14:45 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
14:45 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
14:45 <blake@cumin1003> END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on P{wikikube-worker[2001-2002].codfw.wmnet} and (A:wikikube-master-codfw or A:wikikube-worker-codfw) [production]
14:45 <blake@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2002.codfw.wmnet [production]
14:45 <blake@cumin1003> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2002.codfw.wmnet [production]
14:44 <trueg@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs-queryhammer: apply [production]
14:44 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
14:37 <blake@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker2002.codfw.wmnet [production]
14:37 <blake@cumin1003> START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker2002.codfw.wmnet [production]
14:37 <blake@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2001.codfw.wmnet [production]
14:37 <blake@cumin1003> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2001.codfw.wmnet [production]
14:36 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]