2025-06-02 §
11:51 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on doh7003.wikimedia.org with reason: host reimage [production]
11:49 <jmm@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on doh7003.wikimedia.org with reason: host reimage [production]
11:48 <marostegui@cumin1002> START - Cookbook sre.mysql.pool es2047 gradually with 4 steps - Pool es2047.codfw.wmnet in after cloning [production]
11:47 <bwojtowicz@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'article-descriptions' for release 'main' . [production]
11:46 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-rw1001.wikimedia.org [production]
11:42 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host ldap-rw1001.wikimedia.org [production]
11:40 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1156', diff saved to https://phabricator.wikimedia.org/P76812 and previous config saved to /var/cache/conftool/dbconfig/20250602-114001-fceratto.json [production]
11:39 <claime> cgoubert@mwmaint1002:~$ sudo systemctl restart mediawiki_job_generatecaptcha.service - T388531 [production]
11:37 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-rw2001.wikimedia.org [production]
11:33 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host ldap-rw2001.wikimedia.org [production]
11:32 <claime> Manual run of cronjobs/generatecaptcha on k8s - T388531 [production]
11:31 <cgoubert@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply [production]
11:30 <cgoubert@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-cron: apply [production]
11:24 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1156 (T395241)', diff saved to https://phabricator.wikimedia.org/P76811 and previous config saved to /var/cache/conftool/dbconfig/20250602-112453-fceratto.json [production]
11:23 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-maint1001.eqiad.wmnet [production]
11:19 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host ldap-maint1001.eqiad.wmnet [production]
11:18 <jmm@cumin1003> START - Cookbook sre.hosts.reimage for host doh7003.wikimedia.org with OS bookworm [production]
11:17 <jmm@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM doh7003.wikimedia.org - jmm@cumin1003" [production]
11:17 <jmm@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM doh7003.wikimedia.org - jmm@cumin1003" [production]
11:17 <jmm@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) doh7003.wikimedia.org on all recursors [production]
11:17 <jmm@cumin1003> START - Cookbook sre.dns.wipe-cache doh7003.wikimedia.org on all recursors [production]
11:17 <jmm@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM doh7003.wikimedia.org - jmm@cumin1003" [production]
11:17 <jmm@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
11:16 <jmm@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM doh7003.wikimedia.org - jmm@cumin1003" [production]
11:15 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1156 (T395241)', diff saved to https://phabricator.wikimedia.org/P76810 and previous config saved to /var/cache/conftool/dbconfig/20250602-111519-fceratto.json [production]
11:15 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1014,1018].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance [production]
11:14 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1156.eqiad.wmnet with reason: Maintenance [production]
11:11 <jmm@cumin1003> START - Cookbook sre.dns.netbox [production]
11:11 <jmm@cumin1003> START - Cookbook sre.ganeti.makevm for new host doh7003.wikimedia.org [production]
11:10 <marostegui@cumin1002> dbctl commit (dc=all): 'es2039 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P76809 and previous config saved to /var/cache/conftool/dbconfig/20250602-111044-root.json [production]
10:58 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum7003.magru.wmnet with OS bookworm [production]
10:58 <jmm@cumin1003> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host durum7003.magru.wmnet [production]
10:58 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-maint2001.codfw.wmnet [production]
10:55 <marostegui@cumin1002> dbctl commit (dc=all): 'es2039 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P76808 and previous config saved to /var/cache/conftool/dbconfig/20250602-105539-root.json [production]
10:54 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host ldap-maint2001.codfw.wmnet [production]
10:48 <marostegui@cumin1002> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) es2036 gradually with 4 steps - Pool es2036.codfw.wmnet in after cloning [production]
10:41 <kamila@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/aux-k8s-services/jaeger: apply [production]
10:40 <kamila@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/aux-k8s-services/jaeger: apply [production]
10:40 <marostegui@cumin1002> dbctl commit (dc=all): 'es2039 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P76806 and previous config saved to /var/cache/conftool/dbconfig/20250602-104032-root.json [production]
10:40 <kamila@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/aux-k8s-services/jaeger: apply [production]
10:40 <kamila@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/aux-k8s-services/jaeger: apply [production]
10:37 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum7003.magru.wmnet with reason: host reimage [production]
10:34 <jmm@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on durum7003.magru.wmnet with reason: host reimage [production]
10:31 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host debmonitor1003.eqiad.wmnet [production]
10:27 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host debmonitor1003.eqiad.wmnet [production]
10:25 <marostegui@cumin1002> dbctl commit (dc=all): 'es2039 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P76804 and previous config saved to /var/cache/conftool/dbconfig/20250602-102526-root.json [production]
10:22 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host debmonitor2003.codfw.wmnet [production]
10:18 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host debmonitor2003.codfw.wmnet [production]
10:12 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host rpki1001.eqiad.wmnet [production]
10:10 <marostegui@cumin1002> dbctl commit (dc=all): 'es2039 (re)pooling @ 20%: Repooling', diff saved to https://phabricator.wikimedia.org/P76802 and previous config saved to /var/cache/conftool/dbconfig/20250602-101020-root.json [production]