601-650 of 10000 results (110ms)
2025-04-09 ยง
21:22 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: reimage row A - bking@cumin2002 - T388610 [production]
21:22 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-druid1007.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:22 <jforrester@deploy1003> Started scap sync-world: Backport for [[gerrit:1126660|[test2wiki] Enable Wikifunctions client mode (T383106)]], [[gerrit:1135500|MWMultiVersion: Recognise the new wikifunctionsclient dblist]] [production]
21:22 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-druid1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:20 <jclark@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host an-druid1006 [production]
21:20 <jclark@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host an-druid1006 [production]
21:20 <jclark@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host an-druid1007 [production]
21:20 <jclark@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host an-druid1007 [production]
21:19 <jclark@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
21:19 <jclark@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added mgmt for an-druid - jclark@cumin1002" [production]
21:19 <jclark@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added mgmt for an-druid - jclark@cumin1002" [production]
21:19 <jforrester@deploy1003> Sync cancelled. [production]
21:15 <jclark@cumin1002> START - Cookbook sre.dns.netbox [production]
21:15 <jforrester@deploy1003> jforrester: Backport for [[gerrit:1126660|[test2wiki] Enable Wikifunctions client mode (T383106)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:11 <ejegg> fundraising civicrm upgraded from b20436a2 to 38a7a649 [production]
21:11 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host an-druid1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:11 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host an-druid1007.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:08 <jforrester@deploy1003> Started scap sync-world: Backport for [[gerrit:1126660|[test2wiki] Enable Wikifunctions client mode (T383106)]] [production]
19:37 <dancy@deploy1003> Installation of scap version "4.153.0" completed for 2 hosts [production]
19:35 <dancy@deploy1003> Installing scap version "4.153.0" for 2 host(s) [production]
19:24 <fab@deploy1003> Finished deploy [airflow-dags/research@ea5f3de]: (no justification provided) (duration: 00m 41s) [production]
19:24 <fab@deploy1003> Started deploy [airflow-dags/research@ea5f3de]: (no justification provided) [production]
19:14 <dzahn@cumin1002> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: security release [production]
18:48 <dzahn@cumin1002> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1004.wikimedia.org with reason: security release [production]
18:40 <dzahn@cumin1002> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: security release [production]
18:20 <brennen@deploy1003> rebuilt and synchronized wikiversions files: group1 to 1.44.0-wmf.24 refs T386219 [production]
18:06 <brennen> 1.44.0-wmf.24 train status (T386219): logs quiet, no current blockers, moving to group1 [production]
18:04 <bking@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on 9 hosts with reason: adding net-new role [production]
17:50 <swfrench@deploy1003> Finished scap sync-world: Test scap run after switching to PHP 8.1 container image for maintenance scripts - T390225 (duration: 03m 10s) [production]
17:47 <swfrench@deploy1003> Started scap sync-world: Test scap run after switching to PHP 8.1 container image for maintenance scripts - T390225 [production]
17:46 <swfrench@deploy1003> Stopping before sync operations [production]
17:45 <dzahn@cumin1002> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1003.wikimedia.org with reason: security release [production]
17:45 <swfrench@deploy1003> Started scap sync-world: Test stop-before-sync scap run after switching to PHP 8.1 container image for maintenance scripts - T390225 [production]
17:38 <dzahn@cumin1002> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: security release [production]
17:34 <bking@cumin2002> END (ERROR) - Cookbook sre.elasticsearch.rolling-operation (exit_code=97) Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row A - bking@cumin2002 - T388610 [production]
17:33 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row A - bking@cumin2002 - T388610 [production]
17:21 <ladsgroup@deploy1003> Finished scap sync-world: Backport for [[gerrit:1135454|Increase max db connection count before circuit breaking (T390510)]] (duration: 16m 47s) [production]
17:19 <mutante> apt1002 - updating thirdparty/gitlab-bullseye gitlab-ce package version [production]
17:12 <ladsgroup@deploy1003> ladsgroup: Continuing with sync [production]
17:11 <ladsgroup@deploy1003> ladsgroup: Backport for [[gerrit:1135454|Increase max db connection count before circuit breaking (T390510)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
17:04 <bking@cumin2002> END (ERROR) - Cookbook sre.elasticsearch.rolling-operation (exit_code=97) Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row A - bking@cumin2002 - T388610 [production]
17:04 <sukhe> forcing rechecks for pc1011 and db1151 [production]
17:04 <ladsgroup@deploy1003> Started scap sync-world: Backport for [[gerrit:1135454|Increase max db connection count before circuit breaking (T390510)]] [production]
17:04 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row A - bking@cumin2002 - T388610 [production]
17:01 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row A - bking@cumin2002 - T388610 [production]
17:01 <bking@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch2109.codfw.wmnet on all recursors [production]
17:01 <bking@cumin2002> START - Cookbook sre.dns.wipe-cache cirrussearch2109.codfw.wmnet on all recursors [production]
17:01 <bking@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch2085.codfw.wmnet on all recursors [production]
17:01 <bking@cumin2002> START - Cookbook sre.dns.wipe-cache cirrussearch2085.codfw.wmnet on all recursors [production]
16:59 <sukhe> [END] sudo cumin -b11 "O:mariadb::core" "run-puppet-agent" [production]