2025-04-03
08:45 |
<jnuche@deploy1003> |
Started deploy [releng/jenkins-deploy@c274545] (releasing): (no justification provided) |
[production] |
08:42 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3006.esams.wmnet |
[production] |
08:29 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3005.esams.wmnet |
[production] |
08:28 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3005.esams.wmnet |
[production] |
08:24 |
<elukey@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync |
[production] |
08:22 |
<elukey@deploy1003> |
helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync |
[production] |
08:21 |
<hashar> |
Upgrading CI Jenkins |
[production] |
08:21 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti3005.esams.wmnet |
[production] |
08:20 |
<slyngshede@dns1004> |
END - running authdns-update |
[production] |
08:18 |
<slyngshede@dns1004> |
START - running authdns-update |
[production] |
08:12 |
<kevinbazira@deploy1003> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
08:12 |
<slyngshede@dns1004> |
START - running authdns-update |
[production] |
08:06 |
<kevinbazira@deploy1003> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
08:05 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3005.esams.wmnet |
[production] |
07:54 |
<moritzm> |
failover ganeti masters in esams to ganeti3007/3008 |
[production] |
07:47 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3008.esams.wmnet |
[production] |
07:47 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3008.esams.wmnet |
[production] |
07:44 |
<jiji@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc2044.codfw.wmnet |
[production] |
07:44 |
<jiji@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1044.eqiad.wmnet |
[production] |
07:41 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti3008.esams.wmnet |
[production] |
07:38 |
<jiji@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host mc2044.codfw.wmnet |
[production] |
07:38 |
<jiji@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host mc1044.eqiad.wmnet |
[production] |
07:36 |
<moritzm> |
added spiderpig-access LDAP group T390338 |
[production] |
07:31 |
<fabfur> |
applying patch to use TLS on tmpfs on A:cp-ulsfo (T384227) |
[production] |
07:28 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3008.esams.wmnet |
[production] |
07:27 |
<fabfur> |
disabling puppet on A:cp-ulsfo to deploy https://gerrit.wikimedia.org/r/c/operations/puppet/+/1133405 (T384227) |
[production] |
07:22 |
<elukey> |
restart docker on deploy1003 to pick up max-concurrent-uploads=1 - T390251 |
[production] |
07:08 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3007.esams.wmnet |
[production] |
07:08 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3007.esams.wmnet |
[production] |
07:07 |
<elukey@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/changeprop: sync |
[production] |
07:07 |
<elukey@deploy1003> |
helmfile [eqiad] START helmfile.d/services/changeprop: sync |
[production] |
07:00 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti3007.esams.wmnet |
[production] |
06:54 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3007.esams.wmnet |
[production] |
00:39 |
<urandom> |
starting `nodetool garbagecollect` on Cassandra/sessionstore2006 |
[production] |
00:16 |
<tstarling@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1133575|Temporarily disable Lua profiler (T389734)]] (duration: 15m 04s) |
[production] |
00:15 |
<zabe> |
zabe@mwmaint1002:~$ cat group2.dblist | xargs -I{} bash -c "echo {}; mwscript extensions/AbuseFilter/maintenance/MigrateESRefToAflTable.php {} --deletedump /home/zabe/afl_text_table_deletedump/{} --dump /home/zabe/afl_text_table_dump/{} --sleep 0.4" # T381599 |
[production] |
00:09 |
<tstarling@deploy1003> |
tstarling: Continuing with sync |
[production] |
00:08 |
<tstarling@deploy1003> |
tstarling: Backport for [[gerrit:1133575|Temporarily disable Lua profiler (T389734)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
00:01 |
<tstarling@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1133575|Temporarily disable Lua profiler (T389734)]] |
[production] |
2025-04-02
23:32 |
<urandom> |
starting `nodetool garbagecollect` on Cassandra/sessionstore1006 |
[production] |
23:28 |
<urandom> |
starting `nodetool garbagecollect` on Cassandra/sessionstore2005 |
[production] |
22:38 |
<jhathaway> |
puppet private repo changes completed, T385995 |
[production] |
22:01 |
<brett> |
Import ncmonitor 1.3.3 into bookworm-wikimedia |
[production] |
22:00 |
<dreamyjazz@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1133538|AbuseLogger: properly distinguish between global filters and central DB (T390904)]] (duration: 25m 19s) |
[production] |
21:55 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:53 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:53 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:53 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:53 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:53 |
<dreamyjazz@deploy1003> |
dreamyjazz: Continuing with sync |
[production] |