5001-5050 of 10000 results (39ms)
2024-12-10 ยง
16:20 <klausman@cumin1002> START - Cookbook sre.hosts.provision for host ml-lab1002.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
16:15 <klausman@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ml-lab1002 [production]
16:15 <klausman@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host ml-lab1002 [production]
16:13 <hnowlan@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-videoscaler: apply [production]
16:13 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/mw-videoscaler: apply [production]
16:13 <moritzm> installing postgresql-15 security updates [production]
16:12 <klausman@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:12 <klausman@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update DNS for newly-provisioned ml-lab1002 - klausman@cumin1002" [production]
16:12 <klausman@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update DNS for newly-provisioned ml-lab1002 - klausman@cumin1002" [production]
16:09 <elukey@deploy2002> helmfile [codfw] DONE helmfile.d/admin 'sync'. [production]
16:09 <dzahn@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on phab1004.eqiad.wmnet with reason: nftables [production]
16:09 <elukey@deploy2002> helmfile [codfw] START helmfile.d/admin 'sync'. [production]
16:09 <mutante> phabricator production host needs a maintenance reboot - expect short downtime [production]
16:09 <dzahn@cumin2002> START - Cookbook sre.hosts.downtime for 0:10:00 on phab1004.eqiad.wmnet with reason: nftables [production]
16:09 <elukey@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'sync'. [production]
16:08 <elukey@deploy2002> helmfile [eqiad] START helmfile.d/admin 'sync'. [production]
16:08 <klausman@cumin1002> START - Cookbook sre.dns.netbox [production]
16:07 <moritzm> manually clean out ganeti1009 from puppetdb, decom cookbook got interrupted T381652 [production]
16:06 <jmm@cumin2002> END (PASS) - Cookbook sre.debmonitor.remove-hosts (exit_code=0) for 1 hosts: ganeti1009.eqiad.wmnet [production]
16:06 <jmm@cumin2002> START - Cookbook sre.debmonitor.remove-hosts for 1 hosts: ganeti1009.eqiad.wmnet [production]
16:03 <marostegui@cumin1002> dbctl commit (dc=all): 'db1169 (re)pooling @ 100%: Pooling in production', diff saved to https://phabricator.wikimedia.org/P71685 and previous config saved to /var/cache/conftool/dbconfig/20241210-160322-root.json [production]
15:53 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-videoscaler: apply [production]
15:53 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-videoscaler: apply [production]
15:52 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-videoscaler: apply [production]
15:52 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-videoscaler: apply [production]
15:48 <marostegui@cumin1002> dbctl commit (dc=all): 'db1169 (re)pooling @ 75%: Pooling in production', diff saved to https://phabricator.wikimedia.org/P71684 and previous config saved to /var/cache/conftool/dbconfig/20241210-154816-root.json [production]
15:48 <moritzm> installing usb.ids updates from Bullseye point release [production]
15:45 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
15:45 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
15:35 <moritzm> installing imagemagick security updates [production]
15:33 <marostegui@cumin1002> dbctl commit (dc=all): 'db1169 (re)pooling @ 50%: Pooling in production', diff saved to https://phabricator.wikimedia.org/P71683 and previous config saved to /var/cache/conftool/dbconfig/20241210-153311-root.json [production]
15:18 <marostegui@cumin1002> dbctl commit (dc=all): 'db1169 (re)pooling @ 25%: Pooling in production', diff saved to https://phabricator.wikimedia.org/P71682 and previous config saved to /var/cache/conftool/dbconfig/20241210-151805-root.json [production]
15:15 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on wdqs1025.eqiad.wmnet with reason: T376150 [production]
15:15 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on wdqs1025.eqiad.wmnet with reason: T376150 [production]
15:06 <Lucas_WMDE> UTC afternoon backport+config window done [production]
15:05 <lucaswerkmeister-wmde@deploy2002> Finished scap sync-world: Backport for [[gerrit:1101830|Event Logging: Update streamName and schemaId (T364460)]] (duration: 25m 40s) [production]
15:03 <marostegui@cumin1002> dbctl commit (dc=all): 'db1169 (re)pooling @ 10%: Pooling in production', diff saved to https://phabricator.wikimedia.org/P71681 and previous config saved to /var/cache/conftool/dbconfig/20241210-150300-root.json [production]
15:00 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde, wangombe: Continuing with sync [production]
14:47 <klausman@cumin1002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts ml-lab1002.eqiad.wmnet [production]
14:47 <klausman@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:47 <klausman@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: ml-lab1002.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - klausman@cumin1002" [production]
14:46 <klausman@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: ml-lab1002.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - klausman@cumin1002" [production]
14:44 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde, wangombe: Backport for [[gerrit:1101830|Event Logging: Update streamName and schemaId (T364460)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:42 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1169.eqiad.wmnet with reason: Maintenance [production]
14:42 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db1169.eqiad.wmnet with reason: Maintenance [production]
14:40 <lucaswerkmeister-wmde@deploy2002> Started scap sync-world: Backport for [[gerrit:1101830|Event Logging: Update streamName and schemaId (T364460)]] [production]
14:39 <samtar@deploy2002> Finished scap sync-world: Backport for [[gerrit:1101871|Revert "IS/IS-l: wgUseCodexSpecialBlock for beta, prod test.wiki"]] (duration: 10m 15s) [production]
14:32 <samtar@deploy2002> samtar: Continuing with sync [production]
14:32 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1169.eqiad.wmnet with reason: Maintenance [production]
14:32 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db1169.eqiad.wmnet with reason: Maintenance [production]