1-50 of 10000 results (91ms)
2025-04-03 ยง
10:10 <mvernon@cumin2002> START - Cookbook sre.hosts.reimage for host apus-fe2003.codfw.wmnet with OS bookworm [production]
10:02 <fabfur@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 15 days, 0:00:00 on cp4047.ulsfo.wmnet with reason: HW errors [production]
09:59 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
09:59 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
09:59 <fabfur> disable puppet on A:cp-eqsin [production]
09:58 <fabfur> applying https://gerrit.wikimedia.org/r/c/operations/puppet/+/1133850 to use TLS on tmpfs on A:cp-eqsin (T384227) [production]
09:54 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
09:54 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
09:54 <akosiaris> deploy https://gerrit.wikimedia.org/r/c/operations/deployment-charts/+/1133745 in all k8s ingresses to stop ingressgateway from forcefully setting the HTTP server header in the responses to "istio-envoy" [production]
09:52 <akosiaris@deploy1003> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
09:52 <akosiaris@deploy1003> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
09:52 <godog> lvextend --resizefs --size +1TB vg0/srv on mwlog[12]002 [production]
09:52 <akosiaris@deploy1003> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
09:51 <akosiaris@deploy1003> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
09:51 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
09:51 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
09:15 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
09:15 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
09:08 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3006.esams.wmnet [production]
09:08 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3006.esams.wmnet [production]
09:03 <fabfur> secure deleting certificates in /etc/ssl/private from A:cp-ulsfo (T384227) [production]
09:00 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti3006.esams.wmnet [production]
08:53 <fabfur> secure deleting certificates in /etc/ssl/private from A:cp-magru (T384227) [production]
08:48 <jnuche@deploy1003> Finished deploy [releng/jenkins-deploy@c274545] (releasing): (no justification provided) (duration: 01m 03s) [production]
08:47 <jnuche@deploy1003> Started deploy [releng/jenkins-deploy@c274545] (releasing): (no justification provided) [production]
08:46 <jnuche@deploy1003> Finished deploy [releng/jenkins-deploy@c274545] (releasing): (no justification provided) (duration: 00m 54s) [production]
08:45 <jnuche@deploy1003> Started deploy [releng/jenkins-deploy@c274545] (releasing): (no justification provided) [production]
08:42 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3006.esams.wmnet [production]
08:29 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3005.esams.wmnet [production]
08:28 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3005.esams.wmnet [production]
08:24 <elukey@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
08:22 <elukey@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync [production]
08:21 <hashar> Upgrading CI Jenkins [production]
08:21 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti3005.esams.wmnet [production]
08:20 <slyngshede@dns1004> END - running authdns-update [production]
08:18 <slyngshede@dns1004> START - running authdns-update [production]
08:12 <kevinbazira@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
08:12 <slyngshede@dns1004> START - running authdns-update [production]
08:06 <kevinbazira@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
08:05 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3005.esams.wmnet [production]
07:54 <moritzm> failover ganeti masters in esams to ganeti3007/3008 [production]
07:47 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3008.esams.wmnet [production]
07:47 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3008.esams.wmnet [production]
07:44 <jiji@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc2044.codfw.wmnet [production]
07:44 <jiji@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1044.eqiad.wmnet [production]
07:41 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti3008.esams.wmnet [production]
07:38 <jiji@cumin1002> START - Cookbook sre.hosts.reboot-single for host mc2044.codfw.wmnet [production]
07:38 <jiji@cumin1002> START - Cookbook sre.hosts.reboot-single for host mc1044.eqiad.wmnet [production]
07:36 <moritzm> added spiderpig-access LDAP group T390338 [production]
07:31 <fabfur> applying patch to use TLS on tmpfs on A:cp-ulsfo (T384227) [production]