2025-10-28 §
16:48 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.restart_openstack on deployment codfw1dev for all services [admin]
16:44 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts es1031.eqiad.wmnet [production]
16:44 <marostegui@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:44 <marostegui@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: es1031.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1003" [production]
16:44 <marostegui@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: es1031.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1003" [production]
16:41 <otto@deploy2002> helmfile [staging] DONE helmfile.d/services/edit-analytics: apply [production]
16:41 <otto@deploy2002> helmfile [staging] START helmfile.d/services/edit-analytics: apply [production]
16:40 <marostegui@cumin1003> START - Cookbook sre.dns.netbox [production]
16:38 <otto@deploy2002> helmfile [staging] DONE helmfile.d/services/edit-analytics: apply [production]
16:38 <otto@deploy2002> helmfile [staging] START helmfile.d/services/edit-analytics: apply [production]
16:34 <marostegui@cumin1003> START - Cookbook sre.hosts.decommission for hosts es1031.eqiad.wmnet [production]
16:32 <marostegui@cumin1003> dbctl commit (dc=all): 'Remove es1031 from dbctl T408600', diff saved to https://phabricator.wikimedia.org/P84315 and previous config saved to /var/cache/conftool/dbconfig/20251028-163252-marostegui.json [production]
16:07 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api [tools]
16:07 <taavi> delete paws, paws-master security groups, long obsolete as paws is now in a separate project [tools]
16:01 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component components-api [tools]
15:44 <jhancock@cumin1003> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['ml-serve2001'] [production]
15:43 <swfrench-wmf> rolling run-puppet-agent on A:cp hosts for haproxy config change [production]
15:41 <otto@deploy2002> helmfile [staging] START helmfile.d/services/edit-analytics: apply [production]
15:34 <jhancock@cumin1003> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ml-serve2001'] [production]
15:31 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api [toolsbeta]
15:26 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component components-api [toolsbeta]
15:23 <swfrench-wmf> disable-puppet on A:cp hosts for haproxy config change [production]
15:16 <brennen@deploy2002> Finished deploy [phabricator/deployment@5fbb350]: deploy phab1004 for T408575 (duration: 06m 09s) [production]
15:11 <swfrench-wmf> applied mediawiki-common network policy updates in mw-script / mw-cron - T309738 [production]
15:10 <brennen@deploy2002> Started deploy [phabricator/deployment@5fbb350]: deploy phab1004 for T408575 [production]
15:09 <brennen@deploy2002> Finished deploy [phabricator/deployment@5fbb350]: deploy phab1004 for T408575 (duration: 00m 34s) [production]
15:09 <swfrench@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-cron: apply [production]
15:09 <swfrench@deploy2002> helmfile [codfw] START helmfile.d/services/mw-cron: apply [production]
15:09 <brennen@deploy2002> Started deploy [phabricator/deployment@5fbb350]: deploy phab1004 for T408575 [production]
15:09 <swfrench@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply [production]
15:09 <swfrench@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-cron: apply [production]
15:06 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab2002.codfw.wmnet with reason: reboot for kernel [production]
15:05 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab1004.eqiad.wmnet with reason: reboot for kernel [production]
14:59 <dancy@deploy2002> Installation of scap version "4.218.0" completed for 2 hosts [production]
14:57 <dancy@deploy2002> Installing scap version "4.218.0" for 2 host(s) [production]
14:52 <elukey@puppetserver1001> conftool action : set/pooled=true; selector: dnsdisc=kartotherian,name=eqiad [production]
14:45 <hashar> Restarted CI Jenkins [production]
14:42 <taavi@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component ingress-nginx [tools]
14:42 <hashar> Restarting Gerrit [production]
14:39 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component ingress-nginx [tools]
14:35 <wmbot~bd808@tools-bastion-14> Ingress config not working with newly upgraded nginx-ingress-controller:v1.13.3 [tools.iw]
14:18 <wmbot~bd808@tools-bastion-14> kubectl apply --validate=true -f ingress.yaml [tools.iw]
14:18 <wmbot~bd808@tools-bastion-14> kubectl delete ingress/iw-domain [tools.iw]
14:17 <wmbot~bd808@tools-bastion-14> kubectl delete ingress/iw-root [tools.iw]
14:16 <wmbot~bd808@tools-bastion-14> `kubectl apply --validate=true -f ingress.yaml` reports unchanged, but https://iw.toolforge.org/bash not working. [tools.iw]
14:01 <hashar> Reloaded Zuul for "Configure the REL1_45 test and gate pipelines" | https://gerrit.wikimedia.org/r/c/integration/config/+/1197724 | T407911 [releng]
13:43 <derick@deploy2002> Finished scap sync-world: Backport for [[gerrit:1199291|Remove hCaptcha site key from private/readme.php]] (duration: 08m 58s) [production]
13:39 <derick@deploy2002> mszwarc, derick: Continuing with sync [production]
13:38 <derick@deploy2002> mszwarc, derick: Backport for [[gerrit:1199291|Remove hCaptcha site key from private/readme.php]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:34 <derick@deploy2002> Started scap sync-world: Backport for [[gerrit:1199291|Remove hCaptcha site key from private/readme.php]] [production]