551-600 of 10000 results (26ms)
2025-06-23 §
05:58 <marostegui@cumin1002> dbctl commit (dc=all): 'db2215 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P78557 and previous config saved to /var/cache/conftool/dbconfig/20250623-055840-root.json [production]
05:55 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 7 hosts with reason: T397597 [production]
05:48 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/blunderbuss: apply [production]
05:47 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/blunderbuss: apply [production]
05:47 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2215 T397419', diff saved to https://phabricator.wikimedia.org/P78556 and previous config saved to /var/cache/conftool/dbconfig/20250623-054725-marostegui.json [production]
05:46 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1160 (T396130)', diff saved to https://phabricator.wikimedia.org/P78555 and previous config saved to /var/cache/conftool/dbconfig/20250623-054633-marostegui.json [production]
05:46 <marostegui@cumin1002> dbctl commit (dc=all): 'Promote db2196 to x1 primary T397419', diff saved to https://phabricator.wikimedia.org/P78554 and previous config saved to /var/cache/conftool/dbconfig/20250623-054616-marostegui.json [production]
05:45 <marostegui> Starting x1 codfw failover from db2215 to db2196 - T397419 [production]
05:42 <marostegui@cumin1002> dbctl commit (dc=all): 'Set db2196 with weight 0 T397419', diff saved to https://phabricator.wikimedia.org/P78553 and previous config saved to /var/cache/conftool/dbconfig/20250623-054206-root.json [production]
05:41 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 15 hosts with reason: Primary switchover x1 T397419 [production]
05:38 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1160 (T396130)', diff saved to https://phabricator.wikimedia.org/P78552 and previous config saved to /var/cache/conftool/dbconfig/20250623-053857-marostegui.json [production]
05:38 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1160.eqiad.wmnet with reason: Maintenance [production]
05:33 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1150.eqiad.wmnet with reason: Maintenance [production]
2025-06-22 §
21:42 <Krinkle> Updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/1162179 [releng]
00:09 <andrewbogott> rebooting tools-prometheus-8 [tools]
2025-06-21 §
20:07 <wmbot~anticomposite@tools-bastion-13> stewardbots/StewardBot/manage.sh restart # Disconnected [tools.stewardbots]
16:15 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment codfw1dev for all services [admin]
16:10 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.restart_openstack on deployment codfw1dev for all services [admin]
16:09 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for tools-k8s-worker-nfs-54, tools-k8s-worker-nfs-12 [tools]
15:58 <andrewbogott> rebooting tools-k8s-worker-nfs-54 tools-k8s-worker-nfs-12, lots of D state [tools]
15:57 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-54, tools-k8s-worker-nfs-12 [tools]
10:09 <wmbot~dcaro@acme> END (PASS) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=0) [tools]
09:27 <wmbot~dcaro@acme> START - Cookbook wmcs.openstack.cloudvirt.vm_console [tools]
09:27 <wmbot~dcaro@acme> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=99) [tools]
09:26 <wmbot~dcaro@acme> START - Cookbook wmcs.openstack.cloudvirt.vm_console [tools]
03:16 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment eqiad1 for all services [admin]
03:08 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.restart_openstack on deployment eqiad1 for all services [admin]
02:54 <Krinkle> Updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/1162106 [releng]
2025-06-20 §
21:14 <cjming@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
21:14 <cjming@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
20:58 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment eqiad1 for service: project,nova,cinder,neutron [admin]
20:51 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.restart_openstack on deployment eqiad1 for service: project,nova,cinder,neutron [admin]
20:49 <AntiComposite> cvn-app12: restart all bots [cvn]
20:48 <AntiComposite> cvn-app10: restart all bots [cvn]
20:34 <wmbot~anticomposite@tools-bastion-13> stewardbots/StewardBot/manage.sh restart # Pinged out [tools.stewardbots]
19:19 <sukhe> sudo cumin -b11 'A:cp' "run-puppet-agent --enable 'merging CR 1160381'": T390924 [production]
19:19 <sukhe> sudo cumin -b11 'A:cp' "run-puppet-agent 'merging CR 1160381'": T390924 [production]
19:16 <sukhe> enabling puppet on cp4037 to merge CR 1160381: add `ismobile=1' for mobile requests: T390924 [production]
19:10 <sukhe> sudo cumin 'A:cp' "disable-puppet 'merging CR 1160381'": T390924 [production]
18:57 <dduvall> ran `helm --namespace gitlab-runner uninstall docker-hub-mirror` to fix helm state. reapplying production cluster configuration [releng]
18:41 <dduvall> deleted docker-hub-mirror statefulset and admission controller deployment. reapplying production cluster configuration [releng]
18:36 <bking@cumin2002> conftool action : set/weight=10:pooled=no; selector: name=cirrussearch2113\.codfw\.wmnet [production]
18:34 <aokoth@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/services/miscweb: apply [production]
18:33 <aokoth@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/services/miscweb: apply [production]
18:32 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
18:30 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
18:19 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
18:18 <dduvall> seeing numerous image pull errors in gitlab-cloud-runner cluster [releng]
18:08 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
18:05 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]