2024-10-09 §
14:07 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host debmonitor1003.eqiad.wmnet [production]
14:07 <slyngshede@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idp2004.wikimedia.org [production]
14:06 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot finished rebooting dns1004.wikimedia.org [production]
14:05 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host debmonitor2003.codfw.wmnet [production]
14:05 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host backup1012.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:04 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host backup1012.eqiad.wmnet with OS bookworm [production]
14:03 <slyngshede@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM idp2004.wikimedia.org [production]
14:02 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host backup1012.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:02 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host debmonitor2003.codfw.wmnet [production]
14:01 <jhancock@cumin2002> START - Cookbook sre.dns.netbox [production]
14:01 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host flink-zk1002.eqiad.wmnet [production]
13:58 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1203', diff saved to https://phabricator.wikimedia.org/P69517 and previous config saved to /var/cache/conftool/dbconfig/20241009-135812-ladsgroup.json [production]
13:57 <brouberol@cumin1002> START - Cookbook sre.hosts.reboot-single for host flink-zk1002.eqiad.wmnet [production]
13:56 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host zookeeper-test1002.eqiad.wmnet [production]
13:55 <slyngshede@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idp-test2005.wikimedia.org [production]
13:54 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host flink-zk1003.eqiad.wmnet [production]
13:53 <jayme@deploy1003> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
13:53 <jayme@deploy1003> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
13:53 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot begin reboot of dns1004.wikimedia.org [production]
13:52 <sukhe@cumin1002> START - Cookbook sre.dns.roll-reboot rolling reboot on A:dnsbox [production]
13:52 <brouberol@cumin1002> START - Cookbook sre.hosts.reboot-single for host zookeeper-test1002.eqiad.wmnet [production]
13:51 <jayme@deploy1003> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
13:51 <slyngshede@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM idp-test2005.wikimedia.org [production]
13:51 <slyngshede@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idp-test2004.wikimedia.org [production]
13:50 <brouberol@cumin1002> START - Cookbook sre.hosts.reboot-single for host flink-zk1003.eqiad.wmnet [production]
13:50 <jynus@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on backup[1010-1011].eqiad.wmnet with reason: T376800 [production]
13:50 <jynus@cumin1002> START - Cookbook sre.hosts.downtime for 1:00:00 on backup[1010-1011].eqiad.wmnet with reason: T376800 [production]
13:49 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cloudcephosd1028.eqiad.wmnet [production]
13:49 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host flink-zk1001.eqiad.wmnet [production]
13:48 <Lucas_WMDE> UTC afternoon backport+config window done [production]
13:48 <slyngshede@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM idp-test2004.wikimedia.org [production]
13:48 <lucaswerkmeister-wmde@deploy2002> Finished scap sync-world: Backport for [[gerrit:1078774|[brwikimedia] Enable the CampaignEvents extension (T376747)]] (duration: 07m 04s) [production]
13:48 <slyngshede@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idp1004.wikimedia.org [production]
13:45 <brouberol@cumin1002> START - Cookbook sre.hosts.reboot-single for host flink-zk1001.eqiad.wmnet [production]
13:45 <brouberol@cumin1002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host flink-zk1001.eqiad.wmnet [production]
13:44 <brouberol@cumin1002> START - Cookbook sre.hosts.reboot-single for host flink-zk1001.eqiad.wmnet [production]
13:44 <lucaswerkmeister-wmde@deploy2002> albertoleoncio, lucaswerkmeister-wmde: Continuing with sync [production]
13:44 <slyngshede@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM idp1004.wikimedia.org [production]
13:43 <lucaswerkmeister-wmde@deploy2002> albertoleoncio, lucaswerkmeister-wmde: Backport for [[gerrit:1078774|[brwikimedia] Enable the CampaignEvents extension (T376747)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:43 <slyngshede@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idp-test1004.wikimedia.org [production]
13:43 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1203 (T367856)', diff saved to https://phabricator.wikimedia.org/P69516 and previous config saved to /var/cache/conftool/dbconfig/20241009-134305-ladsgroup.json [production]
13:42 <brouberol@cumin1002> END (ERROR) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=97) for Zookeeper A:zookeeper-flink-eqiad cluster: Roll restart of jvm daemons. [production]
13:42 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host cloudcephosd1028.eqiad.wmnet [production]
13:41 <lucaswerkmeister-wmde@deploy2002> Started scap sync-world: Backport for [[gerrit:1078774|[brwikimedia] Enable the CampaignEvents extension (T376747)]] [production]
13:41 <brouberol@cumin1002> START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-flink-eqiad cluster: Roll restart of jvm daemons. [production]
13:39 <slyngshede@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM idp-test1004.wikimedia.org [production]
13:39 <sukhe@cumin1002> START - Cookbook sre.dns.roll-restart-reboot-durum rolling reboot on A:durum [production]
13:39 <Lucas_WMDE> lucaswerkmeister-wmde@deploy2002 $ printf 'https://en.wikipedia.org/static/images/%s\n' 'project-logos/sdwiki.png' 'project-logos/sdwiki-1.5x.png' 'project-logos/sdwiki-2x.png' 'mobile/copyright/wikipedia-wordmark-sd.svg' 'mobile/copyright/wikipedia-tagline-sd.svg' | mwscript-k8s --attach -- purgeList.php # T376536 [production]
13:35 <lucaswerkmeister-wmde@deploy2002> Finished scap sync-world: Backport for [[gerrit:1078680|sdwiki: Add new logo and tagline (T376536)]] (duration: 19m 34s) [production]
13:33 <slyngshede@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idm2001.wikimedia.org [production]