201-250 of 10000 results (9ms)
2025-11-18 ยง
16:10 <swfrench-wmf> migrating etcd to PKI certs on conf2004 - T352245 [production]
16:08 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on releases1003.eqiad.wmnet with reason: releases [production]
16:04 <brennen@deploy2002> Finished deploy [phabricator/deployment@8b1bc09]: deploy phab1004 for T409947 (duration: 00m 56s) [production]
16:03 <brennen@deploy2002> Started deploy [phabricator/deployment@8b1bc09]: deploy phab1004 for T409947 [production]
16:03 <brennen@deploy2002> Finished deploy [phabricator/deployment@8b1bc09]: deploy phab2002 for T409947 (duration: 00m 31s) [production]
16:02 <Amir1> cumin2024@db2205.codfw.wmnet[(none)]> drop database if exists fixcopyrightwiki; drop database if exists langcomwiki; drop database if exists mowiki; drop database if exists mowiktionary; (T297297) [production]
16:02 <brennen@deploy2002> Started deploy [phabricator/deployment@8b1bc09]: deploy phab2002 for T409947 [production]
16:02 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab1004.eqiad.wmnet with reason: deployment [production]
16:01 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab2002.codfw.wmnet with reason: deployment [production]
15:56 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1062.eqiad.wmnet' [admin]
15:55 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1061.eqiad.wmnet' [admin]
15:55 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1060.eqiad.wmnet with reason: host reimage [production]
15:54 <James_F> Zuul: [mediawiki/extensions/Capiunto] Move out of production section, for T410172 [releng]
15:54 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1061.eqiad.wmnet' [admin]
15:53 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/mobileapps: sync [production]
15:52 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host zuul2001.codfw.wmnet [production]
15:52 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/mobileapps: sync [production]
15:51 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host zuul2001.codfw.wmnet [production]
15:51 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1060.eqiad.wmnet with reason: host reimage [production]
15:50 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1061.eqiad.wmnet' [admin]
15:48 <swfrench-wmf> transferred etcd-mirror replication to conf2006 - T352245 [production]
15:48 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be1088.eqiad.wmnet [production]
15:48 <sukhe@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts hcaptcha-proxy7002.wikimedia.org [production]
15:48 <sukhe@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:48 <sukhe@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: hcaptcha-proxy7002.wikimedia.org decommissioned, removing all IPs except the asset tag one - sukhe@cumin1003" [production]
15:47 <sukhe@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: hcaptcha-proxy7002.wikimedia.org decommissioned, removing all IPs except the asset tag one - sukhe@cumin1003" [production]
15:44 <sukhe@cumin1003> START - Cookbook sre.dns.netbox [production]
15:42 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ms-be1088.eqiad.wmnet [production]
15:41 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host zuul1002.eqiad.wmnet [production]
15:40 <sukhe@cumin1003> START - Cookbook sre.hosts.decommission for hosts hcaptcha-proxy7002.wikimedia.org [production]
15:39 <swfrench-wmf> silenced EtcdReplicationDown db7447af-851f-4faa-a4fd-b535ee9fbcdb - T352245 [production]
15:38 <JJMC89> copypatrol-backend-prod-02 sudo systemctl restart copypatrol-backend-check-changes [copypatrol]
15:37 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host zuul1002.eqiad.wmnet [production]
15:35 <swfrench-wmf> disable puppet on A:conf-codfw - T352245 [production]
15:35 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1061.eqiad.wmnet' [admin]
15:34 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudvirt1060.eqiad.wmnet with OS trixie [production]
15:33 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=0) on host 'cloudvirt1060.eqiad.wmnet' [admin]
15:30 <sgimeno@deploy2002> Finished scap sync-world: Backport for [[gerrit:1206851|undeploy Extension:Capiunto (T410172)]] (duration: 46m 51s) [production]
15:29 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1059.eqiad.wmnet with OS trixie [production]
15:21 <hashar> Reloaded Zuul for "add option to opt out recursive dependencies" | https://gerrit.wikimedia.org/r/c/integration/config/+/1202059 . That is a noop. [releng]
15:16 <sgimeno@deploy2002> novemlinguae, sgimeno: Continuing with sync [production]
15:13 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.restart_openstack (exit_code=99) on deployment codfw1dev for all services [admin]
15:12 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.restart_openstack on deployment codfw1dev for all services [admin]
15:11 <sgimeno@deploy2002> novemlinguae, sgimeno: Backport for [[gerrit:1206851|undeploy Extension:Capiunto (T410172)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
15:06 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-test: apply [production]
15:05 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-test: apply [production]
15:03 <btullis@dns1004> END - running authdns-update [production]
15:03 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1059.eqiad.wmnet with reason: host reimage [production]
15:02 <btullis@dns1004> START - running authdns-update [production]
14:59 <taavi> $ toolforge jobs restart irc # to get it to join channels after the bouncer pod was moved to a different node [tools.wikibugs]