2025-04-03
08:45 <jnuche@deploy1003> Started deploy [releng/jenkins-deploy@c274545] (releasing): (no justification provided) [production]
08:42 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3006.esams.wmnet [production]
08:29 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3005.esams.wmnet [production]
08:28 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3005.esams.wmnet [production]
08:24 <elukey@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
08:22 <elukey@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync [production]
08:21 <hashar> Upgrading CI Jenkins [production]
08:21 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti3005.esams.wmnet [production]
08:20 <slyngshede@dns1004> END - running authdns-update [production]
08:18 <slyngshede@dns1004> START - running authdns-update [production]
08:12 <kevinbazira@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
08:12 <slyngshede@dns1004> START - running authdns-update [production]
08:06 <kevinbazira@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
08:05 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3005.esams.wmnet [production]
07:54 <moritzm> failover ganeti masters in esams to ganeti3007/3008 [production]
07:47 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3008.esams.wmnet [production]
07:47 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3008.esams.wmnet [production]
07:44 <jiji@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc2044.codfw.wmnet [production]
07:44 <jiji@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1044.eqiad.wmnet [production]
07:41 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti3008.esams.wmnet [production]
07:38 <jiji@cumin1002> START - Cookbook sre.hosts.reboot-single for host mc2044.codfw.wmnet [production]
07:38 <jiji@cumin1002> START - Cookbook sre.hosts.reboot-single for host mc1044.eqiad.wmnet [production]
07:36 <moritzm> added spiderpig-access LDAP group T390338 [production]
07:31 <fabfur> applying patch to use TLS on tmpfs on A:cp-ulsfo (T384227) [production]
07:28 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3008.esams.wmnet [production]
07:27 <fabfur> disabling puppet on A:cp-ulsfo to deploy https://gerrit.wikimedia.org/r/c/operations/puppet/+/1133405 (T384227) [production]
07:22 <elukey> restart docker on deploy1003 to pick up max-concurrent-uploads=1 - T390251 [production]
07:08 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3007.esams.wmnet [production]
07:08 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3007.esams.wmnet [production]
07:07 <elukey@deploy1003> helmfile [eqiad] DONE helmfile.d/services/changeprop: sync [production]
07:07 <elukey@deploy1003> helmfile [eqiad] START helmfile.d/services/changeprop: sync [production]
07:00 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti3007.esams.wmnet [production]
06:54 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3007.esams.wmnet [production]
00:39 <urandom> starting `nodetool garbagecollect` on Cassandra/sessionstore2006 [production]
00:16 <tstarling@deploy1003> Finished scap sync-world: Backport for [[gerrit:1133575|Temporarily disable Lua profiler (T389734)]] (duration: 15m 04s) [production]
00:15 <zabe> zabe@mwmaint1002:~$ cat group2.dblist | xargs -I{} bash -c "echo {}; mwscript extensions/AbuseFilter/maintenance/MigrateESRefToAflTable.php {} --deletedump /home/zabe/afl_text_table_deletedump/{} --dump /home/zabe/afl_text_table_dump/{} --sleep 0.4" # T381599 [production]
00:09 <tstarling@deploy1003> tstarling: Continuing with sync [production]
00:08 <tstarling@deploy1003> tstarling: Backport for [[gerrit:1133575|Temporarily disable Lua profiler (T389734)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
00:01 <tstarling@deploy1003> Started scap sync-world: Backport for [[gerrit:1133575|Temporarily disable Lua profiler (T389734)]] [production]
2025-04-02
23:32 <urandom> starting `nodetool garbagecollect` on Cassandra/sessionstore1006 [production]
23:28 <urandom> starting `nodetool garbagecollect` on Cassandra/sessionstore2005 [production]
22:38 <jhathaway> puppet private repo changes completed, T385995 [production]
22:01 <brett> Import ncmonitor 1.3.3 into bookworm-wikimedia [production]
22:00 <dreamyjazz@deploy1003> Finished scap sync-world: Backport for [[gerrit:1133538|AbuseLogger: properly distinguish between global filters and central DB (T390904)]] (duration: 25m 19s) [production]
21:55 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 [production]
21:53 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 [production]
21:53 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 [production]
21:53 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 [production]
21:53 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 [production]
21:53 <dreamyjazz@deploy1003> dreamyjazz: Continuing with sync [production]