351-400 of 10000 results (22ms)
2025-01-14 ยง
09:33 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on es[1022,1043].eqiad.wmnet with reason: cloning [production]
09:33 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on es[1022,1043].eqiad.wmnet with reason: cloning [production]
09:33 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool es1022 T382569', diff saved to https://phabricator.wikimedia.org/P72027 and previous config saved to /var/cache/conftool/dbconfig/20250114-093315-marostegui.json [production]
09:32 <marostegui@cumin1002> dbctl commit (dc=all): 'es1044 (re)pooling @ 1%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P72026 and previous config saved to /var/cache/conftool/dbconfig/20250114-093216-root.json [production]
09:31 <marostegui@cumin1002> dbctl commit (dc=all): 'Add es1044 to dbctl depooled T382569', diff saved to https://phabricator.wikimedia.org/P72025 and previous config saved to /var/cache/conftool/dbconfig/20250114-093147-marostegui.json [production]
09:31 <hashar@deploy2002> anzx, hashar: Backport for [[gerrit:1111106|knwiki, knwikisource, knwiktionary, knwikiquote: update logo, wordmark (T382802)]], [[gerrit:1111109|hiwikisource: logo fix (T310961)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
09:26 <hashar@deploy2002> Started scap sync-world: Backport for [[gerrit:1111106|knwiki, knwikisource, knwiktionary, knwikiquote: update logo, wordmark (T382802)]], [[gerrit:1111109|hiwikisource: logo fix (T310961)]] [production]
09:19 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2215.codfw.wmnet with reason: host reimage [production]
09:15 <jelto@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2215.codfw.wmnet with reason: host reimage [production]
09:12 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-search: apply [production]
09:11 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-search: apply [production]
09:10 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-research: apply [production]
09:10 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-research: apply [production]
09:09 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-wmde: apply [production]
09:09 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-wmde: apply [production]
09:09 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-ml: apply [production]
09:08 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-ml: apply [production]
09:06 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-test: apply [production]
09:05 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-test: apply [production]
08:59 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2214.codfw.wmnet with OS bookworm [production]
08:59 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker2215 [production]
08:59 <jelto@cumin1002> START - Cookbook sre.hosts.move-vlan for host wikikube-worker2215 [production]
08:59 <jelto@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2215.codfw.wmnet with OS bookworm [production]
08:58 <jelto@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wikikube-worker2215.codfw.wmnet with OS bookworm [production]
08:58 <marostegui@cumin1002> dbctl commit (dc=all): 'es1023 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72024 and previous config saved to /var/cache/conftool/dbconfig/20250114-085802-root.json [production]
08:55 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2213.codfw.wmnet with OS bookworm [production]
08:55 <jynus@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on dbprov2006.codfw.wmnet with reason: os upgrade [production]
08:55 <jynus@cumin1002> START - Cookbook sre.hosts.downtime for 1:00:00 on dbprov2006.codfw.wmnet with reason: os upgrade [production]
08:53 <gmodena@deploy2002> Finished scap sync-world: Backport for [[gerrit:1110777|Revert^2 "config: remove eventbus instrumentation setting"]] (duration: 21m 52s) [production]
08:51 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2212.codfw.wmnet with OS bookworm [production]
08:43 <gmodena@deploy2002> otto, gmodena: Continuing with sync [production]
08:42 <marostegui@cumin1002> dbctl commit (dc=all): 'es1023 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72023 and previous config saved to /var/cache/conftool/dbconfig/20250114-084256-root.json [production]
08:40 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2214.codfw.wmnet with reason: host reimage [production]
08:38 <gmodena@deploy2002> otto, gmodena: Backport for [[gerrit:1110777|Revert^2 "config: remove eventbus instrumentation setting"]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:36 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2213.codfw.wmnet with reason: host reimage [production]
08:34 <jelto@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2214.codfw.wmnet with reason: host reimage [production]
08:32 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2212.codfw.wmnet with reason: host reimage [production]
08:31 <gmodena@deploy2002> Started scap sync-world: Backport for [[gerrit:1110777|Revert^2 "config: remove eventbus instrumentation setting"]] [production]
08:31 <jelto@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2213.codfw.wmnet with reason: host reimage [production]
08:27 <marostegui@cumin1002> dbctl commit (dc=all): 'es1023 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72022 and previous config saved to /var/cache/conftool/dbconfig/20250114-082751-root.json [production]
08:26 <jelto@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2212.codfw.wmnet with reason: host reimage [production]
08:25 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker2215 [production]
08:25 <jelto@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker2215 [production]
08:25 <jelto@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker2215 [production]
08:25 <jelto@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wikikube-worker2215.codfw.wmnet 62.32.192.10.in-addr.arpa 2.6.0.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
08:25 <jelto@cumin1002> START - Cookbook sre.dns.wipe-cache wikikube-worker2215.codfw.wmnet 62.32.192.10.in-addr.arpa 2.6.0.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
08:25 <jelto@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:25 <jelto@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host wikikube-worker2215 - jelto@cumin1002" [production]
08:25 <jelto@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host wikikube-worker2215 - jelto@cumin1002" [production]
08:22 <jelto@cumin1002> START - Cookbook sre.dns.netbox [production]