2025-06-02
§
|
12:24 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
12:24 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM prometheus7002.magru.wmnet - jmm@cumin1003" |
[production] |
12:22 |
<jmm@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM prometheus7002.magru.wmnet - jmm@cumin1003" |
[production] |
12:20 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1182 (T395241)', diff saved to https://phabricator.wikimedia.org/P76819 and previous config saved to /var/cache/conftool/dbconfig/20250602-122001-fceratto.json |
[production] |
12:17 |
<jmm@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |
12:17 |
<jmm@cumin1003> |
START - Cookbook sre.ganeti.makevm for new host prometheus7002.magru.wmnet |
[production] |
12:12 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host doh7003.wikimedia.org |
[production] |
12:12 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host doh7003.wikimedia.org with OS bookworm |
[production] |
12:11 |
<claime> |
cgoubert@mwmaint1002:~$ sudo systemctl restart mediawiki_job_generatecaptcha.service - T388531 |
[production] |
12:11 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host apt-staging2001.codfw.wmnet |
[production] |
12:10 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of global-data-captcha-render |
[production] |
12:10 |
<mvernon@cumin1002> |
START - Cookbook sre.swift.check-dbs Checking container DBs of global-data-captcha-render |
[production] |
12:10 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1182 (T395241)', diff saved to https://phabricator.wikimedia.org/P76818 and previous config saved to /var/cache/conftool/dbconfig/20250602-121041-fceratto.json |
[production] |
12:10 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1182.eqiad.wmnet with reason: Maintenance |
[production] |
12:10 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1156 (T395241)', diff saved to https://phabricator.wikimedia.org/P76817 and previous config saved to /var/cache/conftool/dbconfig/20250602-121016-fceratto.json |
[production] |
12:07 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host apt-staging2001.codfw.wmnet |
[production] |
12:00 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host build2002.codfw.wmnet |
[production] |
11:59 |
<bwojtowicz@deploy1003> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revision-models' for release 'main' . |
[production] |
11:57 |
<bwojtowicz@deploy1003> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'readability' for release 'main' . |
[production] |
11:55 |
<bwojtowicz@deploy1003> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'ores-legacy' for release 'main' . |
[production] |
11:55 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1156', diff saved to https://phabricator.wikimedia.org/P76814 and previous config saved to /var/cache/conftool/dbconfig/20250602-115509-fceratto.json |
[production] |
11:54 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host build2002.codfw.wmnet |
[production] |
11:51 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on doh7003.wikimedia.org with reason: host reimage |
[production] |
11:49 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on doh7003.wikimedia.org with reason: host reimage |
[production] |
11:48 |
<marostegui@cumin1002> |
START - Cookbook sre.mysql.pool es2047 gradually with 4 steps - Pool es2047.codfw.wmnet in after cloning |
[production] |
11:47 |
<bwojtowicz@deploy1003> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'article-descriptions' for release 'main' . |
[production] |
11:46 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-rw1001.wikimedia.org |
[production] |
11:42 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host ldap-rw1001.wikimedia.org |
[production] |
11:40 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1156', diff saved to https://phabricator.wikimedia.org/P76812 and previous config saved to /var/cache/conftool/dbconfig/20250602-114001-fceratto.json |
[production] |
11:39 |
<claime> |
cgoubert@mwmaint1002:~$ sudo systemctl restart mediawiki_job_generatecaptcha.service - T388531 |
[production] |
11:37 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-rw2001.wikimedia.org |
[production] |
11:33 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host ldap-rw2001.wikimedia.org |
[production] |
11:32 |
<claime> |
Manual run of cronjobs/generatecaptcha on k8s - T388531 |
[production] |
11:31 |
<cgoubert@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply |
[production] |
11:30 |
<cgoubert@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-cron: apply |
[production] |
11:24 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1156 (T395241)', diff saved to https://phabricator.wikimedia.org/P76811 and previous config saved to /var/cache/conftool/dbconfig/20250602-112453-fceratto.json |
[production] |
11:23 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-maint1001.eqiad.wmnet |
[production] |
11:19 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host ldap-maint1001.eqiad.wmnet |
[production] |
11:18 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reimage for host doh7003.wikimedia.org with OS bookworm |
[production] |
11:17 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM doh7003.wikimedia.org - jmm@cumin1003" |
[production] |
11:17 |
<jmm@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM doh7003.wikimedia.org - jmm@cumin1003" |
[production] |
11:17 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) doh7003.wikimedia.org on all recursors |
[production] |
11:17 |
<jmm@cumin1003> |
START - Cookbook sre.dns.wipe-cache doh7003.wikimedia.org on all recursors |
[production] |
11:17 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM doh7003.wikimedia.org - jmm@cumin1003" |
[production] |
11:17 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
11:16 |
<jmm@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM doh7003.wikimedia.org - jmm@cumin1003" |
[production] |
11:15 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1156 (T395241)', diff saved to https://phabricator.wikimedia.org/P76810 and previous config saved to /var/cache/conftool/dbconfig/20250602-111519-fceratto.json |
[production] |
11:15 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1014,1018].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance |
[production] |
11:14 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1156.eqiad.wmnet with reason: Maintenance |
[production] |
11:11 |
<jmm@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |