351-400 of 10000 results (117ms)
2025-09-15 ยง
11:35 <fabfur> restarting pybal on lvs1020 to apply https://gerrit.wikimedia.org/r/c/operations/puppet/+/1188309 (T404388) [production]
11:35 <ladsgroup@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db1160* gradually with 4 steps - Work done [production]
11:32 <btullis@cumin1003> END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons. [production]
11:26 <btullis@cumin1003> START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons. [production]
11:25 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2207', diff saved to https://phabricator.wikimedia.org/P83310 and previous config saved to /var/cache/conftool/dbconfig/20250915-112512-ladsgroup.json [production]
11:10 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2207 (T402925)', diff saved to https://phabricator.wikimedia.org/P83308 and previous config saved to /var/cache/conftool/dbconfig/20250915-111005-ladsgroup.json [production]
11:08 <jiji@cumin1003> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on A:lvs-secondary-eqiad and A:lvs [production]
10:55 <jiji@cumin1003> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on A:lvs-secondary-eqiad and A:lvs [production]
10:44 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db2207 (T402925)', diff saved to https://phabricator.wikimedia.org/P83305 and previous config saved to /var/cache/conftool/dbconfig/20250915-104420-ladsgroup.json [production]
10:44 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2207.codfw.wmnet with reason: Maintenance [production]
10:40 <jayme@cumin1002> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "[not really into teleological thinking] - jayme@cumin1002" [production]
10:40 <jayme@cumin1002> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: [not really into teleological thinking] - jayme@cumin1002 [production]
10:39 <jayme@cumin1002> START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: [not really into teleological thinking] - jayme@cumin1002 [production]
10:39 <jayme@cumin1002> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "[not really into teleological thinking] - jayme@cumin1002" [production]
10:38 <btullis@cumin1003> END (PASS) - Cookbook sre.kafka.roll-restart-mirror-maker (exit_code=0) restart MirrorMaker for Kafka A:kafka-mirror-maker-jumbo-eqiad cluster: Roll restart of jvm daemons. [production]
10:20 <btullis@cumin1003> START - Cookbook sre.kafka.roll-restart-mirror-maker restart MirrorMaker for Kafka A:kafka-mirror-maker-jumbo-eqiad cluster: Roll restart of jvm daemons. [production]
10:17 <jiji@cumin1003> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on A:lvs-secondary-eqiad and A:lvs [production]
10:15 <ladsgroup@deploy1003> Finished scap sync-world: Backport: [[gerrit:1188285|Reduce db lock timeout in LinksUpdate and CategoryMembershipChangeJob]] (T366938) (duration: 42m 06s) [production]
10:14 <ladsgroup@cumin1003> START - Cookbook sre.mysql.pool db1160* gradually with 4 steps - Work done [production]
10:09 <jiji@cumin1003> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on A:lvs-secondary-eqiad and A:lvs [production]
10:04 <elukey@deploy1003> helmfile [codfw] DONE helmfile.d/services/wikifeeds: sync [production]
10:02 <elukey@deploy1003> helmfile [codfw] START helmfile.d/services/wikifeeds: sync [production]
09:57 <elukey@deploy1003> helmfile [codfw] DONE helmfile.d/services/termbox: sync [production]
09:56 <elukey@deploy1003> helmfile [codfw] START helmfile.d/services/termbox: sync [production]
09:55 <elukey@deploy1003> helmfile [codfw] DONE helmfile.d/services/mobileapps: sync [production]
09:54 <elukey@deploy1003> helmfile [codfw] START helmfile.d/services/mobileapps: sync [production]
09:53 <elukey@deploy1003> helmfile [codfw] DONE helmfile.d/services/linkrecommendation: sync [production]
09:52 <elukey@deploy1003> helmfile [codfw] START helmfile.d/services/linkrecommendation: sync [production]
09:47 <btullis@cumin1003> END (PASS) - Cookbook sre.kafka.roll-restart-reboot-brokers (exit_code=0) rolling restart_daemons on A:kafka-jumbo-eqiad [production]
09:41 <elukey@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
09:40 <elukey@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync [production]
09:37 <elukey@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop: sync [production]
09:37 <elukey@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop: sync [production]
09:34 <elukey@deploy1003> helmfile [staging] DONE helmfile.d/services/wikifeeds: sync [production]
09:34 <elukey@deploy1003> helmfile [staging] START helmfile.d/services/wikifeeds: sync [production]
09:34 <elukey@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
09:33 <elukey@deploy1003> helmfile [staging] START helmfile.d/services/changeprop-jobqueue: sync [production]
09:33 <ladsgroup@deploy1003> Started scap sync-world: Backport: [[gerrit:1188285|Reduce db lock timeout in LinksUpdate and CategoryMembershipChangeJob]] (T366938) [production]
09:33 <elukey@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: sync [production]
09:33 <elukey@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: sync [production]
09:32 <elukey@deploy1003> helmfile [staging] DONE helmfile.d/services/mobileapps: sync [production]
09:31 <elukey@deploy1003> helmfile [staging] START helmfile.d/services/mobileapps: sync [production]
09:30 <elukey@deploy1003> helmfile [staging] DONE helmfile.d/services/termbox: sync [production]
09:30 <stevemunene@cumin1003> END (PASS) - Cookbook sre.k8s.wipe-cluster (exit_code=0) Wipe the K8s cluster dse-codfw: Cleanup the dse-k8s-codfw cluster [production]
09:30 <effie> stopping puppet on A:lvs-low-traffic-eqiad and A:lvs-low-traffic-codfw [production]
09:30 <elukey@deploy1003> helmfile [staging] START helmfile.d/services/termbox: sync [production]
09:28 <stevemunene@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'sync'. [production]
09:25 <stevemunene@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'sync'. [production]
09:20 <elukey@deploy1003> helmfile [staging] DONE helmfile.d/services/linkrecommendation: sync [production]
09:20 <elukey@deploy1003> helmfile [staging] START helmfile.d/services/linkrecommendation: sync [production]