1251-1300 of 10000 results (45ms)
2025-07-08 ยง
12:10 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/api-gateway: apply [production]
12:09 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
12:08 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
12:06 <hnowlan@deploy1003> helmfile [staging] DONE helmfile.d/services/api-gateway: apply [production]
12:06 <hnowlan@deploy1003> helmfile [staging] START helmfile.d/services/api-gateway: apply [production]
12:04 <jmm@cumin2002> END (PASS) - Cookbook sre.ldap.roll-restart-reboot-replica (exit_code=0) rolling restart_daemons on A:ldap-replicas-eqiad [production]
12:03 <jmm@cumin2002> START - Cookbook sre.ldap.roll-restart-reboot-replica rolling restart_daemons on A:ldap-replicas-eqiad [production]
12:02 <jmm@cumin1002> END (PASS) - Cookbook sre.elasticsearch.restart-nginx (exit_code=0) rolling restart_daemons on A:elastic-eqiad [production]
12:01 <jmm@cumin2002> END (PASS) - Cookbook sre.ldap.roll-restart-reboot-replica (exit_code=0) rolling restart_daemons on A:ldap-replicas-codfw [production]
12:00 <hnowlan@deploy1003> helmfile [staging] DONE helmfile.d/services/api-gateway: apply [production]
12:00 <hnowlan@deploy1003> helmfile [staging] START helmfile.d/services/api-gateway: apply [production]
12:00 <jmm@cumin2002> START - Cookbook sre.ldap.roll-restart-reboot-replica rolling restart_daemons on A:ldap-replicas-codfw [production]
11:59 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-ctrl1002.eqiad.wmnet [production]
11:54 <btullis@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on cephosd[2001-2003].codfw.wmnet with reason: Bootstrapping new ceph cluster [production]
11:52 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
11:52 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
11:52 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-ctrl1002.eqiad.wmnet [production]
11:52 <moritzm> restarting FPM on Phabricator nodes to pick up OpenSSL updates [production]
11:49 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-ctrl1001.eqiad.wmnet [production]
11:49 <moritzm> restarting exim on Phabricator nodes to pick up OpenSSL updates [production]
11:44 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-ctrl1001.eqiad.wmnet [production]
11:42 <jmm@cumin1002> START - Cookbook sre.elasticsearch.restart-nginx rolling restart_daemons on A:elastic-eqiad [production]
11:39 <jynus> upgrade db2201 mariadb package T394487 [production]
11:37 <jmm@cumin1002> END (PASS) - Cookbook sre.elasticsearch.restart-nginx (exit_code=0) rolling restart_daemons on A:relforge [production]
11:36 <jmm@cumin1002> START - Cookbook sre.elasticsearch.restart-nginx rolling restart_daemons on A:relforge [production]
11:35 <btullis@cumin1003> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host db1208.eqiad.wmnet [production]
11:35 <hashar> Restarted Apache on gerrit1003 and gerrit2002 [production]
11:31 <zabe@deploy1003> Finished scap sync-world: Backport for [[gerrit:1167162|Remove redundant group0 config for categorylinks]], [[gerrit:1167164|Set categorylinks to read new in cebwiki (T397912)]] (duration: 09m 35s) [production]
11:29 <hnowlan@deploy1003> helmfile [staging] DONE helmfile.d/services/mobileapps: sync [production]
11:29 <hnowlan@deploy1003> helmfile [staging] START helmfile.d/services/mobileapps: sync [production]
11:27 <moritzm> restarting apache on mirror1001 to pick up openssl sec updates [production]
11:27 <jmm@cumin1002> END (PASS) - Cookbook sre.elasticsearch.restart-nginx (exit_code=0) rolling restart_daemons on A:elastic-codfw [production]
11:25 <zabe@deploy1003> zabe: Continuing with sync [production]
11:24 <zabe@deploy1003> zabe: Backport for [[gerrit:1167162|Remove redundant group0 config for categorylinks]], [[gerrit:1167164|Set categorylinks to read new in cebwiki (T397912)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
11:24 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host db1208.eqiad.wmnet [production]
11:23 <marostegui@cumin1002> dbctl commit (dc=all): 'db1159 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P78807 and previous config saved to /var/cache/conftool/dbconfig/20250708-112344-root.json [production]
11:22 <zabe@deploy1003> Started scap sync-world: Backport for [[gerrit:1167162|Remove redundant group0 config for categorylinks]], [[gerrit:1167164|Set categorylinks to read new in cebwiki (T397912)]] [production]
11:20 <jynus> upgrade db1216 mariadb package T394487 [production]
11:15 <moritzm> restarting slapd on seaborgium/serpens to pick up OpenSSL updates [production]
11:10 <jmm@cumin2002> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-eqiad [production]
11:09 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-cluster (exit_code=0) [production]
11:08 <marostegui@cumin1002> dbctl commit (dc=all): 'db1159 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P78806 and previous config saved to /var/cache/conftool/dbconfig/20250708-110838-root.json [production]
11:07 <jmm@cumin1002> START - Cookbook sre.elasticsearch.restart-nginx rolling restart_daemons on A:elastic-codfw [production]
11:06 <marostegui@cumin1002> dbctl commit (dc=all): 'db2157 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P78805 and previous config saved to /var/cache/conftool/dbconfig/20250708-110656-root.json [production]
11:06 <jmm@cumin2002> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-eqiad [production]
11:06 <jmm@cumin1002> END (PASS) - Cookbook sre.elasticsearch.restart-nginx (exit_code=0) rolling restart_daemons on A:cloudelastic [production]
11:04 <jmm@cumin1002> START - Cookbook sre.elasticsearch.restart-nginx rolling restart_daemons on A:cloudelastic [production]
11:03 <jynus@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2201.codfw.wmnet,db1216.eqiad.wmnet with reason: MariaDB package update [production]
11:00 <jmm@cumin2002> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-codfw [production]
11:00 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host matomo1003.eqiad.wmnet [production]