2026-03-31
17:25 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp1110.eqiad.wmnet with reason: host reimage [production]
17:18 <eevans@cumin1003> START - Cookbook sre.cassandra.roll-restart for nodes matching restbase[1031,2034]*: Upgrade Cassandra to 4.1.11 — T418417 - eevans@cumin1003 [production]
17:16 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1003.eqiad.wmnet [production]
17:13 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
17:12 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
17:10 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1003.eqiad.wmnet [production]
17:10 <joal@deploy1003> Finished deploy [analytics/refinery@8d91f24] (thin): Regular analytics weekly train THIN [analytics/refinery@8d91f242] (duration: 01m 56s) [production]
17:08 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp1110.eqiad.wmnet with OS trixie [production]
17:08 <joal@deploy1003> Started deploy [analytics/refinery@8d91f24] (thin): Regular analytics weekly train THIN [analytics/refinery@8d91f242] [production]
17:08 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1110.eqiad.wmnet with OS trixie [production]
17:08 <joal@deploy1003> Finished deploy [analytics/refinery@8d91f24]: Regular analytics weekly train [analytics/refinery@8d91f242] (duration: 07m 47s) [production]
17:07 <otto@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
17:07 <otto@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
17:07 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
17:07 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
17:07 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
17:06 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
17:00 <joal@deploy1003> Started deploy [analytics/refinery@8d91f24]: Regular analytics weekly train [analytics/refinery@8d91f242] [production]
16:59 <joal@deploy1003> Finished deploy [analytics/refinery@8d91f24] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@8d91f242] (duration: 01m 53s) [production]
16:58 <joal@deploy1003> Started deploy [analytics/refinery@8d91f24] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@8d91f242] [production]
16:56 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp1110.eqiad.wmnet with OS trixie [production]
16:55 <eevans@cumin1003> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching aqs2001.codfw.wmnet: Upgrade Cassandra to 4.1.11 — T418417 - eevans@cumin1003 [production]
16:54 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
16:54 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
16:51 <moritzm> installing Bind security updates (client-side tools and libs) [production]
16:48 <eevans@cumin1003> START - Cookbook sre.cassandra.roll-restart for nodes matching aqs2001.codfw.wmnet: Upgrade Cassandra to 4.1.11 — T418417 - eevans@cumin1003 [production]
16:45 <eevans@cumin1003> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching aqs1010.eqiad.wmnet: Upgrade Cassandra to 4.1.11 — T418417 - eevans@cumin1003 [production]
16:45 <ayounsi@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1365.eqiad.wmnet with OS trixie [production]
16:40 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2238 (T419635)', diff saved to https://phabricator.wikimedia.org/P90094 and previous config saved to /var/cache/conftool/dbconfig/20260331-164004-fceratto.json [production]
16:37 <eevans@cumin1003> START - Cookbook sre.cassandra.roll-restart for nodes matching aqs1010.eqiad.wmnet: Upgrade Cassandra to 4.1.11 — T418417 - eevans@cumin1003 [production]
16:31 <eevans@cumin1003> END (PASS) - Cookbook sre.misc-clusters.roll-restart-restbase (exit_code=0) rolling restart_daemons on A:restbase [production]
16:29 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2238', diff saved to https://phabricator.wikimedia.org/P90092 and previous config saved to /var/cache/conftool/dbconfig/20260331-162956-fceratto.json [production]
16:29 <ayounsi@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1365.eqiad.wmnet with reason: host reimage [production]
16:28 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp1114.eqiad.wmnet [reason: trixie reimaging] [production]
16:22 <ayounsi@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1365.eqiad.wmnet with reason: host reimage [production]
16:22 <akhatun@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-edit-type-enrich-next: apply [production]
16:22 <akhatun@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-edit-type-enrich-next: apply [production]
16:19 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2238', diff saved to https://phabricator.wikimedia.org/P90091 and previous config saved to /var/cache/conftool/dbconfig/20260331-161947-fceratto.json [production]
16:15 <eevans@cumin1003> START - Cookbook sre.misc-clusters.roll-restart-restbase rolling restart_daemons on A:restbase [production]
16:14 <ebysans@deploy1003> helmfile [staging] DONE helmfile.d/services/media-analytics: apply [production]
16:12 <rzl@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-experimental: apply [production]
16:12 <rzl@deploy1003> helmfile [codfw] START helmfile.d/services/mw-experimental: apply [production]
16:11 <rzl@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-experimental: apply [production]
16:10 <rzl@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-experimental: apply [production]
16:10 <ayounsi@cumin1003> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker1365 [production]
16:10 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1365 [production]
16:10 <ayounsi@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1365 [production]
16:10 <ayounsi@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wikikube-worker1365.eqiad.wmnet 206.48.64.10.in-addr.arpa 6.0.2.0.8.4.0.0.4.6.0.0.0.1.0.0.7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
16:10 <ayounsi@cumin1003> START - Cookbook sre.dns.wipe-cache wikikube-worker1365.eqiad.wmnet 206.48.64.10.in-addr.arpa 6.0.2.0.8.4.0.0.4.6.0.0.0.1.0.0.7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
16:10 <ayounsi@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]