8101-8150 of 10000 results (31ms)
2024-10-14 ยง
12:32 <elukey@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: aux-k8s-ctrl1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - elukey@cumin1002" [production]
12:32 <elukey@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: aux-k8s-ctrl1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - elukey@cumin1002" [production]
12:30 <hnowlan> removed all aqsv1 service components from aqs* hosts [production]
12:30 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1218', diff saved to https://phabricator.wikimedia.org/P69800 and previous config saved to /var/cache/conftool/dbconfig/20241014-123025-arnaudb.json [production]
12:28 <elukey@cumin1002> START - Cookbook sre.dns.netbox [production]
12:28 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=99) on k3s.catalyst.eqiad1.wikimedia.cloud [catalyst]
12:27 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.vps.refresh_puppet_certs on k3s.catalyst.eqiad1.wikimedia.cloud [catalyst]
12:27 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=99) on k3s.catalyst.eqiad1.wikimedia.cloud [catalyst]
12:27 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.vps.refresh_puppet_certs on k3s.catalyst.eqiad1.wikimedia.cloud [catalyst]
12:26 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=99) on k3s.catalyst.eqiad1.wikimedia.cloud [catalyst]
12:26 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.vps.refresh_puppet_certs on k3s.catalyst.eqiad1.wikimedia.cloud [catalyst]
12:24 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=99) on k3s.catalyst.eqiad1.wikimedia.cloud [catalyst]
12:24 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.vps.refresh_puppet_certs on k3s.catalyst.eqiad1.wikimedia.cloud [catalyst]
12:23 <elukey@cumin1002> START - Cookbook sre.hosts.decommission for hosts aux-k8s-ctrl1001.eqiad.wmnet [production]
12:22 <elukey@puppetserver1001> conftool action : set/pooled=inactive; selector: name=aux-k8s-worker1001.eqiad.wmnet [production]
12:22 <elukey@puppetserver1001> conftool action : set/pooled=inactive; selector: name=aux-k8s-ctrl1001.eqiad.wmnet [production]
12:15 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1218', diff saved to https://phabricator.wikimedia.org/P69799 and previous config saved to /var/cache/conftool/dbconfig/20241014-121518-arnaudb.json [production]
12:09 <elukey> increase etcd k8s aux cluster from 3 -> 5 - T344230 [production]
12:00 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1218 (T367781)', diff saved to https://phabricator.wikimedia.org/P69798 and previous config saved to /var/cache/conftool/dbconfig/20241014-120011-arnaudb.json [production]
11:59 <aborrero@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
11:59 <aborrero@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cloudlb2004-dev cloud-private adddress - aborrero@cumin1002" [production]
11:59 <aborrero@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cloudlb2004-dev cloud-private adddress - aborrero@cumin1002" [production]
11:57 <arnaudb@cumin1002> dbctl commit (dc=all): 'Depooling db1218 (T367781)', diff saved to https://phabricator.wikimedia.org/P69797 and previous config saved to /var/cache/conftool/dbconfig/20241014-115755-arnaudb.json [production]
11:57 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1218.eqiad.wmnet with reason: Maintenance [production]
11:57 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db1218.eqiad.wmnet with reason: Maintenance [production]
11:57 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1207 (T367781)', diff saved to https://phabricator.wikimedia.org/P69796 and previous config saved to /var/cache/conftool/dbconfig/20241014-115732-arnaudb.json [production]
11:56 <Dreamy_Jazz> Started time limited scan on enwiki for MediaModeration - https://wikitech.wikimedia.org/wiki/MediaModeration [production]
11:56 <aborrero@cumin1002> START - Cookbook sre.dns.netbox [production]
11:55 <btullis> roll-restarting nginx and envoy on wcqs-public nodes for T374240 [analytics]
11:52 <btullis@cumin1002> END (PASS) - Cookbook sre.wdqs.restart-nginx-envoy (exit_code=0) rolling restart_daemons on A:wcqs-public [production]
11:52 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.mysql.clone (exit_code=0) of db2194.codfw.wmnet onto db2227.codfw.wmnet [production]
11:50 <btullis@cumin1002> START - Cookbook sre.wdqs.restart-nginx-envoy rolling restart_daemons on A:wcqs-public [production]
11:50 <hnowlan@deploy2002> Finished deploy [restbase/deploy@26112d4]: Remove unused AQS components. Add bdrwiki (T371761) (duration: 15m 38s) [production]
11:45 <Dreamy_Jazz> Restarting MediaModeration scanning script - https://wikitech.wikimedia.org/wiki/MediaModeration [production]
11:44 <btullis> restarted postgresql@13-main on an-db1001, followed by all airflow schedulers, for T374240 [analytics]
11:43 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1173 (T376905)', diff saved to https://phabricator.wikimedia.org/P69794 and previous config saved to /var/cache/conftool/dbconfig/20241014-114341-ladsgroup.json [production]
11:43 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1173.eqiad.wmnet with reason: Maintenance [production]
11:43 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1173.eqiad.wmnet with reason: Maintenance [production]
11:43 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1168 (T376905)', diff saved to https://phabricator.wikimedia.org/P69793 and previous config saved to /var/cache/conftool/dbconfig/20241014-114316-ladsgroup.json [production]
11:42 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1207', diff saved to https://phabricator.wikimedia.org/P69792 and previous config saved to /var/cache/conftool/dbconfig/20241014-114225-arnaudb.json [production]
11:39 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2219 (T367781)', diff saved to https://phabricator.wikimedia.org/P69791 and previous config saved to /var/cache/conftool/dbconfig/20241014-113941-arnaudb.json [production]
11:34 <hnowlan@deploy2002> Started deploy [restbase/deploy@26112d4]: Remove unused AQS components. Add bdrwiki (T371761) [production]
11:31 <andrewtavis-wmde@deploy2002> Finished deploy [airflow-dags/wmde@c9a2532]: (no justification provided) (duration: 00m 08s) [production]
11:30 <andrewtavis-wmde@deploy2002> Started deploy [airflow-dags/wmde@c9a2532]: (no justification provided) [production]
11:28 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P69790 and previous config saved to /var/cache/conftool/dbconfig/20241014-112809-ladsgroup.json [production]
11:27 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1207', diff saved to https://phabricator.wikimedia.org/P69789 and previous config saved to /var/cache/conftool/dbconfig/20241014-112719-arnaudb.json [production]
11:26 <claime> Running ./redis-check-aof --fix on rdb1014 tcp_6379 instance - T376961 [production]
11:24 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2219', diff saved to https://phabricator.wikimedia.org/P69788 and previous config saved to /var/cache/conftool/dbconfig/20241014-112434-arnaudb.json [production]
11:16 <ladsgroup@deploy2002> Finished scap sync-world: Creating bclwikisource (T377084) (duration: 06m 49s) [production]
11:13 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P69787 and previous config saved to /var/cache/conftool/dbconfig/20241014-111302-ladsgroup.json [production]