2025-04-15
14:56 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P75035 and previous config saved to /var/cache/conftool/dbconfig/20250415-145613-fceratto.json [production]
14:52 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:52 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:41 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P75034 and previous config saved to /var/cache/conftool/dbconfig/20250415-144106-fceratto.json [production]
14:40 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row C - bking@cumin2002 - T388610 [production]
14:39 <cgoubert@deploy1003> Finished scap sync-world: Backport for [[gerrit:1136133|shwiktionary: Add bs as import source (T391621)]] (duration: 19m 28s) [production]
14:39 <bking@cumin2002> END (ERROR) - Cookbook sre.elasticsearch.rolling-operation (exit_code=97) Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row C - bking@cumin2002 - T388610 [production]
14:38 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row C - bking@cumin2002 - T388610 [production]
14:38 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on A:cp-text_drmrs [production]
14:33 <cgoubert@deploy1003> aleksandar, cgoubert: Continuing with sync [production]
14:31 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on A:cp-upload_drmrs [production]
14:31 <cgoubert@deploy1003> aleksandar, cgoubert: Backport for [[gerrit:1136133|shwiktionary: Add bs as import source (T391621)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:25 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1194 (T391056)', diff saved to https://phabricator.wikimedia.org/P75033 and previous config saved to /var/cache/conftool/dbconfig/20250415-142558-fceratto.json [production]
14:25 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-upload_eqiad and A:cp [production]
14:25 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-text_eqiad and A:cp [production]
14:25 <vgutierrez> rolling upgrade to varnish 7.1.1-1.1~bpo11+wmf3 in eqiad - T391334 [production]
14:23 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1194 (T391056)', diff saved to https://phabricator.wikimedia.org/P75032 and previous config saved to /var/cache/conftool/dbconfig/20250415-142349-fceratto.json [production]
14:23 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1194.eqiad.wmnet with reason: Maintenance [production]
14:23 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1191 (T391056)', diff saved to https://phabricator.wikimedia.org/P75031 and previous config saved to /var/cache/conftool/dbconfig/20250415-142327-fceratto.json [production]
14:22 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row C - bking@cumin2002 - T388610 [production]
14:22 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cirrussearch2065.codfw.wmnet with OS bullseye [production]
14:20 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on A:cp-text_esams and not P{cp3073.esams.wmnet} and A:cp [production]
14:20 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:20 <cgoubert@deploy1003> Started scap sync-world: Backport for [[gerrit:1136133|shwiktionary: Add bs as import source (T391621)]] [production]
14:20 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:18 <cgoubert@deploy1003> Finished scap sync-world: Backport for [[gerrit:1136696|tests(Mentorship): add coverage for UncachedMenteeOverviewDataProvider (T391695)]], [[gerrit:1136698|perf(Mentorship): extract sub-queries from UncachedMenteeOverviewDataProvider (T391695)]], [[gerrit:1136700|perf(Mentorship): batch filtering mentees in UncachedMenteeOverviewDataProvider (T391695)]] (duration: 18m 30s) [production]
14:15 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on A:cp-upload_esams and not P{cp3081.esams.wmnet} and A:cp [production]
14:11 <cgoubert@deploy1003> migr, cgoubert: Continuing with sync [production]
14:11 <cgoubert@deploy1003> migr, cgoubert: Backport for [[gerrit:1136696|tests(Mentorship): add coverage for UncachedMenteeOverviewDataProvider (T391695)]], [[gerrit:1136698|perf(Mentorship): extract sub-queries from UncachedMenteeOverviewDataProvider (T391695)]], [[gerrit:1136700|perf(Mentorship): batch filtering mentees in UncachedMenteeOverviewDataProvider (T391695)]] synced to the testservers (https://wikitech.wikimedia [production]
14:08 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P75030 and previous config saved to /var/cache/conftool/dbconfig/20250415-140820-fceratto.json [production]
14:07 <urandom> bootstrapping Cassandra/restbase1044-c — T389423 [production]
14:04 <sukhe@dns1004> END - running authdns-update [production]
14:02 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cirrussearch2065.codfw.wmnet with reason: host reimage [production]
14:01 <sukhe@dns1004> START - running authdns-update [production]
13:59 <cgoubert@deploy1003> Started scap sync-world: Backport for [[gerrit:1136696|tests(Mentorship): add coverage for UncachedMenteeOverviewDataProvider (T391695)]], [[gerrit:1136698|perf(Mentorship): extract sub-queries from UncachedMenteeOverviewDataProvider (T391695)]], [[gerrit:1136700|perf(Mentorship): batch filtering mentees in UncachedMenteeOverviewDataProvider (T391695)]] [production]
13:59 <sukhe@dns1004> END - running authdns-update [production]
13:56 <sukhe@dns1004> START - running authdns-update [production]
13:56 <cgoubert@deploy1003> Finished scap sync-world: Backport for [[gerrit:1136701|tests(Mentorship): add coverage for UncachedMenteeOverviewDataProvider (T391695)]], [[gerrit:1136702|perf(Mentorship): extract sub-queries from UncachedMenteeOverviewDataProvider (T391695)]], [[gerrit:1136703|perf(Mentorship): batch filtering mentees in UncachedMenteeOverviewDataProvider (T391695)]] (duration: 18m 27s) [production]
13:55 <tappof@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "add data::pdus to exports - tappof@cumin1002 - T387231" [production]
13:55 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cirrussearch2065.codfw.wmnet with reason: host reimage [production]
13:55 <tappof@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "add data::pdus to exports - tappof@cumin1002 - T387231" [production]
13:53 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P75029 and previous config saved to /var/cache/conftool/dbconfig/20250415-135313-fceratto.json [production]
13:49 <cgoubert@deploy1003> migr, cgoubert: Continuing with sync [production]
13:49 <cgoubert@deploy1003> migr, cgoubert: Backport for [[gerrit:1136701|tests(Mentorship): add coverage for UncachedMenteeOverviewDataProvider (T391695)]], [[gerrit:1136702|perf(Mentorship): extract sub-queries from UncachedMenteeOverviewDataProvider (T391695)]], [[gerrit:1136703|perf(Mentorship): batch filtering mentees in UncachedMenteeOverviewDataProvider (T391695)]] synced to the testservers (https://wikitech.wikimedia [production]
13:45 <tappof@cumin1002> END (ERROR) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=97) generate netbox hiera data: "add data::pdus to exports - tappof@cumin1002 - T387231" [production]
13:45 <tappof@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "add data::pdus to exports - tappof@cumin1002 - T387231" [production]
13:40 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host cirrussearch2065 [production]
13:40 <bking@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cirrussearch2065 [production]
13:40 <bking@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host cirrussearch2065 [production]
13:40 <bking@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch2065.codfw.wmnet 68.32.192.10.in-addr.arpa 8.6.0.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]