301-350 of 10000 results (117ms)
2026-03-16 ยง
14:56 <ladsgroup@deploy2002> ladsgroup: Backport for [[gerrit:1253518|Revert "mediawiki.util: Prefer prev step over non-standard in adjustThumbWidthForSteps" (T419927)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:55 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:55 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:54 <ladsgroup@deploy2002> Started scap sync-world: Backport for [[gerrit:1253518|Revert "mediawiki.util: Prefer prev step over non-standard in adjustThumbWidthForSteps" (T419927)]] [production]
14:53 <andrew@cumin2002> START - Cookbook sre.hosts.reboot-single for host cloudgw1004.eqiad.wmnet [production]
14:53 <jiji@cumin1003> END (PASS) - Cookbook sre.memcached.roll-reboot-restart (exit_code=0) rolling reboot on A:memcached-gutter-codfw [production]
14:52 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2028.codfw.wmnet [production]
14:51 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:51 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2028.codfw.wmnet [production]
14:51 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:50 <mvernon@cumin1003> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling reboot on A:swift-fe-eqiad [production]
14:34 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudgw1003.eqiad.wmnet with OS trixie [production]
14:31 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1020.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:31 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host dse-k8s-worker1020.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:30 <fceratto@deploy2002> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
14:28 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:28 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:26 <fceratto@deploy2002> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
14:22 <blake@cumin1003> START - Cookbook sre.k8s.reboot-nodes rolling reboot on P{wikikube-worker[1002-1327].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) [production]
14:22 <blake@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1002-1003].eqiad.wmnet [production]
14:22 <blake@cumin1003> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1002-1003].eqiad.wmnet [production]
14:21 <blake@cumin1003> END (ERROR) - Cookbook sre.k8s.reboot-nodes (exit_code=97) rolling reboot on P{wikikube-worker[1002-1327].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) [production]
14:21 <blake@cumin1003> START - Cookbook sre.k8s.reboot-nodes rolling reboot on P{wikikube-worker[1002-1327].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) [production]
14:20 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:20 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:20 <fceratto@deploy2002> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
14:18 <sgimeno@deploy2002> Finished scap sync-world: Backport for [[gerrit:1253461|fix(anon warning): remove wring type=signup param (T415160)]], [[gerrit:1253450|AccountCreation: track account registrations for WE1.8 experiments (T416100)]] (duration: 09m 16s) [production]
14:17 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:17 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:14 <sgimeno@deploy2002> sgimeno: Continuing with sync [production]
14:13 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:13 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:11 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:11 <sgimeno@deploy2002> sgimeno: Backport for [[gerrit:1253461|fix(anon warning): remove wring type=signup param (T415160)]], [[gerrit:1253450|AccountCreation: track account registrations for WE1.8 experiments (T416100)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:11 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-sre: apply [production]
14:10 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1003.eqiad.wmnet with reason: host reimage [production]
14:09 <sgimeno@deploy2002> Started scap sync-world: Backport for [[gerrit:1253461|fix(anon warning): remove wring type=signup param (T415160)]], [[gerrit:1253450|AccountCreation: track account registrations for WE1.8 experiments (T416100)]] [production]
14:08 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2026.codfw.wmnet [production]
14:08 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2026.codfw.wmnet [production]
14:08 <fceratto@deploy2002> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
14:04 <arnaudb@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on gerrit2002.wikimedia.org with reason: testing [production]
14:03 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1003.eqiad.wmnet with reason: host reimage [production]
14:02 <arnaudb@cumin1003> DONE (ERROR) - Cookbook sre.hosts.downtime (exit_code=97) for 2:00:00 on gerrit2002.wikimedia.org with reason: T418256 [production]
14:01 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow1003.eqiad.wmnet [production]
13:58 <fceratto@deploy2002> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
13:54 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2026.codfw.wmnet [production]
13:54 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow1003.eqiad.wmnet [production]
13:45 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow1002.eqiad.wmnet [production]
13:45 <mszwarc@deploy2002> Finished scap sync-world: Backport for [[gerrit:1253046|bowiki: update logos (T419268)]] (duration: 06m 17s) [production]
13:45 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudgw1003.eqiad.wmnet with OS trixie [production]