2301-2350 of 10000 results (127ms)
2025-03-11 ยง
09:57 <jelto@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'sync'. [production]
09:55 <jelto@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. [production]
09:55 <jelto@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'sync'. [production]
09:53 <vgutierrez@cumin1002> END (FAIL) - Cookbook sre.loadbalancer.liberica-admin (exit_code=1) depooling P{lvs4010.ulsfo.wmnet} and A:liberica [production]
09:52 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.liberica-admin depooling P{lvs4010.ulsfo.wmnet} and A:liberica [production]
09:48 <jelto@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. [production]
09:48 <jelto@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'sync'. [production]
09:41 <kart_> Script run: `mwscript updateCollation.php --wiki=kkwiki --previous-collation=uppercase` (T384395) [production]
09:32 <elukey@puppetserver1001> conftool action : set/pooled=inactive; selector: name=maps1006.eqiad.wmnet,dc=eqiad,cluster=maps,service=kartotherian-k8s-ssl [production]
09:32 <elukey@puppetserver1001> conftool action : set/pooled=inactive; selector: name=maps2006.codfw.wmnet,dc=codfw,cluster=maps,service=kartotherian-k8s-ssl [production]
09:28 <jelto@cumin1002> START - Cookbook sre.k8s.wipe-cluster Wipe the K8s cluster staging-codfw: Kubernetes upgrade [production]
08:59 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1037.eqiad.wmnet [production]
08:52 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1037.eqiad.wmnet [production]
08:51 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2144.codfw.wmnet,db1151.eqiad.wmnet with reason: Maintenance [production]
08:50 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1037.eqiad.wmnet [production]
08:47 <kartik@deploy2002> Finished scap sync-world: Backport for [[gerrit:1126483|Add uca collation for Kazakh (T384395)]] (duration: 12m 13s) [production]
08:41 <kartik@deploy2002> kartik, jhsoby: Continuing with sync [production]
08:38 <kartik@deploy2002> kartik, jhsoby: Backport for [[gerrit:1126483|Add uca collation for Kazakh (T384395)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:37 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
08:37 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
08:37 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
08:37 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
08:35 <kartik@deploy2002> Started scap sync-world: Backport for [[gerrit:1126483|Add uca collation for Kazakh (T384395)]] [production]
08:32 <kartik@deploy2002> Finished scap sync-world: Backport for [[gerrit:1126220|EventLogging: Improve handling when suggestions are not present (T388467)]] (duration: 26m 56s) [production]
08:23 <kartik@deploy2002> abi, kartik: Continuing with sync [production]
08:12 <kartik@deploy2002> abi, kartik: Backport for [[gerrit:1126220|EventLogging: Improve handling when suggestions are not present (T388467)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:08 <moritzm> installing systemd bugfix updates from Bookworm point release [production]
08:05 <kartik@deploy2002> Started scap sync-world: Backport for [[gerrit:1126220|EventLogging: Improve handling when suggestions are not present (T388467)]] [production]
08:02 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 8 hosts with reason: Cloning [production]
08:00 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1217.eqiad.wmnet with reason: Maintenance [production]
07:59 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1228.eqiad.wmnet with reason: Maintenance [production]
07:23 <marostegui@cumin1002> END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db1228.eqiad.wmnet [production]
07:19 <marostegui@cumin1002> START - Cookbook sre.mysql.upgrade for db1228.eqiad.wmnet [production]
07:19 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1228.eqiad.wmnet with reason: Maintenance [production]
07:13 <marostegui> Failover m2 from db1228 to db1164 - T388396 [production]
07:00 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2160,2233].codfw.wmnet,db[1164,1217,1228].eqiad.wmnet with reason: Primary switchover m2 T388396 [production]
06:45 <marostegui> Drop rt database from m1 T388437 [production]
06:44 <marostegui> Remove rt grants from m1 T388437 [production]
04:03 <mwpresync@deploy2002> Pruned MediaWiki: 1.44.0-wmf.17 (duration: 03m 02s) [production]
03:54 <eileen> civicrm upgraded from f2222fcd to ec20a105 [production]
03:52 <mwpresync@deploy2002> Finished scap sync-world: testwikis to 1.44.0-wmf.20 refs T386215 (duration: 49m 13s) [production]
03:03 <mwpresync@deploy2002> Started scap sync-world: testwikis to 1.44.0-wmf.20 refs T386215 [production]
00:22 <aaron@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
00:21 <aaron@deploy2002> helmfile [staging] START helmfile.d/services/changeprop-jobqueue: apply [production]
00:18 <aaron@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
00:13 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ms-be2089.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
00:08 <pt1979@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host restbase1045.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
00:07 <pt1979@cumin1002> START - Cookbook sre.hosts.provision for host restbase1045.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
00:07 <aaron@deploy2002> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
00:02 <pt1979@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host restbase1045.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]