2025-09-24 §
15:03 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.bootstrap_and_add [admin]
15:03 <jasmine@cumin1003> START - Cookbook sre.switchdc.mediawiki.03-set-db-readonly for datacenter switchover from eqiad to codfw [production]
15:03 <jasmine@cumin1003> END (PASS) - Cookbook sre.switchdc.mediawiki.02-set-readonly (exit_code=0) for datacenter switchover from eqiad to codfw [production]
15:02 <jasmine@cumin1003> MediaWiki read-only period starts at: 2025-09-24 15:02:35.395589 [production]
15:02 <jasmine@cumin1003> START - Cookbook sre.switchdc.mediawiki.02-set-readonly for datacenter switchover from eqiad to codfw [production]
15:02 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=99) [admin]
15:02 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.bootstrap_and_add [admin]
15:01 <cmooney@dns2005> START - running authdns-update [production]
15:01 <jasmine@cumin1003> END (PASS) - Cookbook sre.switchdc.mediawiki.01-stop-maintenance (exit_code=0) for datacenter switchover from eqiad to codfw [production]
14:59 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
14:59 <jasmine@cumin1003> START - Cookbook sre.switchdc.mediawiki.01-stop-maintenance for datacenter switchover from eqiad to codfw [production]
14:58 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephmon1006.eqiad.wmnet with OS bookworm [production]
14:58 <jasmine@cumin1003> END (PASS) - Cookbook sre.switchdc.mediawiki.00-reduce-ttl (exit_code=0) for datacenter switchover from eqiad to codfw [production]
14:52 <jasmine@cumin1003> START - Cookbook sre.switchdc.mediawiki.00-reduce-ttl for datacenter switchover from eqiad to codfw [production]
14:51 <jasmine@cumin1003> END (PASS) - Cookbook sre.switchdc.mediawiki.00-downtime-db-readonly-checks (exit_code=0) for datacenter switchover from eqiad to codfw [production]
14:51 <jasmine@cumin1003> START - Cookbook sre.switchdc.mediawiki.00-downtime-db-readonly-checks for datacenter switchover from eqiad to codfw [production]
14:47 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
14:38 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephmon1006.eqiad.wmnet with reason: host reimage [production]
14:35 <jasmine@deploy1003> Locking from deployment [ALL REPOSITORIES]: Datacenter Switchover - T399891 [production]
14:34 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephmon1006.eqiad.wmnet with reason: host reimage [production]
14:13 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephmon1006.eqiad.wmnet with OS bookworm [production]
14:04 <klausman@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:04 <klausman@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. [production]
14:04 <klausman@deploy1003> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. [production]
14:04 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephmon1005.eqiad.wmnet with OS bookworm [production]
14:03 <klausman@deploy1003> helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. [production]
14:03 <klausman@deploy1003> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. [production]
14:03 <fceratto@cumin1002> END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Managing sanitization for wikis tokwiki in section s5 [production]
14:02 <klausman@deploy1003> helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. [production]
13:56 <fceratto@cumin1002> START - Cookbook sre.mysql.sanitize-wiki Managing sanitization for wikis tokwiki in section s5 [production]
13:49 <fceratto@cumin1002> END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Checking sanitization for wikis tokwiki in section s5 [production]
13:49 <dcaro> patched all tools with new resource defaults, everything looks good [tools]
13:48 <fceratto@cumin1002> START - Cookbook sre.mysql.sanitize-wiki Checking sanitization for wikis tokwiki in section s5 [production]
13:48 <fceratto@cumin1002> END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Managing sanitization for wikis tokwiki in section s5 [production]
13:46 <fceratto@cumin1002> START - Cookbook sre.mysql.sanitize-wiki Managing sanitization for wikis tokwiki in section s5 [production]
13:46 <andrew@cumin2002> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on cloudcephmon1005.eqiad.wmnet with reason: host reimage [production]
13:46 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephmon1005.eqiad.wmnet with reason: host reimage [production]
13:45 <cmooney@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:45 <cmooney@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: generate new snippet files for reverse range for 2620:0:861:fe17::/64 - cmooney@cumin1003" [production]
13:41 <cmooney@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: generate new snippet files for reverse range for 2620:0:861:fe17::/64 - cmooney@cumin1003" [production]
13:38 <cmooney@cumin1003> START - Cookbook sre.dns.netbox [production]
13:37 <Emperor> update envoyproxy to 1.29.12 on apus rgw nodes T405469 [production]
13:37 <cmooney@cumin1003> END (ERROR) - Cookbook sre.dns.netbox (exit_code=97) [production]
13:37 <cmooney@cumin1003> START - Cookbook sre.dns.netbox [production]
13:34 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api [tools]
13:27 <Emperor> update envoyproxy to 1.29.12 on thanos-fe nodes T405469 [production]
13:27 <moritzm> upgrade Envoy on puppet servers T403663 [production]
13:25 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephmon1005.eqiad.wmnet with OS bookworm [production]
13:22 <cmooney@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:22 <cmooney@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update dns - cmooney@cumin1003" [production]