2025-04-03
§
|
07:38 |
<jiji@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host mc2044.codfw.wmnet |
[production] |
07:38 |
<jiji@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host mc1044.eqiad.wmnet |
[production] |
07:36 |
<moritzm> |
added spiderpig-access LDAP group T390338 |
[production] |
07:31 |
<fabfur> |
applying patch to use TLS on tmpfs on A:cp-ulsfo (T384227) |
[production] |
07:28 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3008.esams.wmnet |
[production] |
07:27 |
<fabfur> |
disabling puppet on A:cp-ulsfo to deploy https://gerrit.wikimedia.org/r/c/operations/puppet/+/1133405 (T384227) |
[production] |
07:22 |
<elukey> |
restart docker on deploy1003 to pick up max-concurrent-uploads=1 - T390251 |
[production] |
07:08 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3007.esams.wmnet |
[production] |
07:08 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3007.esams.wmnet |
[production] |
07:07 |
<elukey@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/changeprop: sync |
[production] |
07:07 |
<elukey@deploy1003> |
helmfile [eqiad] START helmfile.d/services/changeprop: sync |
[production] |
07:00 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti3007.esams.wmnet |
[production] |
06:54 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3007.esams.wmnet |
[production] |
00:39 |
<urandom> |
starting `nodetool garbagecollect` on Cassandra/sessionstore2006 |
[production] |
00:16 |
<tstarling@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1133575|Temporarily disable Lua profiler (T389734)]] (duration: 15m 04s) |
[production] |
00:15 |
<zabe> |
zabe@mwmaint1002:~$ cat group2.dblist | xargs -I{} bash -c "echo {}; mwscript extensions/AbuseFilter/maintenance/MigrateESRefToAflTable.php {} --deletedump /home/zabe/afl_text_table_deletedump/{} --dump /home/zabe/afl_text_table_dump/{} --sleep 0.4" # T381599 |
[production] |
00:09 |
<tstarling@deploy1003> |
tstarling: Continuing with sync |
[production] |
00:08 |
<tstarling@deploy1003> |
tstarling: Backport for [[gerrit:1133575|Temporarily disable Lua profiler (T389734)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
00:01 |
<tstarling@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1133575|Temporarily disable Lua profiler (T389734)]] |
[production] |
2025-04-02
§
|
23:32 |
<urandom> |
starting `nodetool garbagecollect` on Cassandra/sessionstore1006 |
[production] |
23:28 |
<urandom> |
starting `nodetool garbagecollect` on Cassandra/sessionstore2005 |
[production] |
22:38 |
<jhathaway> |
puppet private repo changes completed, T385995 |
[production] |
22:01 |
<brett> |
Import ncmonitor 1.3.3 into bookworm-wikimedia |
[production] |
22:00 |
<dreamyjazz@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1133538|AbuseLogger: properly distinguish between global filters and central DB (T390904)]] (duration: 25m 19s) |
[production] |
21:55 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:53 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:53 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:53 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:53 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:53 |
<dreamyjazz@deploy1003> |
dreamyjazz: Continuing with sync |
[production] |
21:53 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:53 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:52 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: test only - bking@cumin2002 - T388610 |
[production] |
21:41 |
<dreamyjazz@deploy1003> |
dreamyjazz: Backport for [[gerrit:1133538|AbuseLogger: properly distinguish between global filters and central DB (T390904)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
21:37 |
<urandom> |
starting `nodetool garbagecollect` on Cassandra/sessionstore2004 |
[production] |
21:35 |
<urandom> |
starting `nodetool garbagecollect` on Cassandra/sessionstore1005 |
[production] |
21:35 |
<dreamyjazz@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1133538|AbuseLogger: properly distinguish between global filters and central DB (T390904)]] |
[production] |
21:31 |
<reedy@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1133532|Enable EmailAuth enforcement on group 0/1 (T390662)]] (duration: 15m 42s) |
[production] |
21:23 |
<reedy@deploy1003> |
reedy, tgr: Continuing with sync |
[production] |
21:21 |
<reedy@deploy1003> |
reedy, tgr: Backport for [[gerrit:1133532|Enable EmailAuth enforcement on group 0/1 (T390662)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
21:15 |
<reedy@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1133532|Enable EmailAuth enforcement on group 0/1 (T390662)]] |
[production] |
21:07 |
<reedy@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1133500|SUL3: Fix user ID mismatch during login (immediately after creation) (T388177)]], [[gerrit:1133502|SUL3: Fix user ID mismatch during login (immediately after creation) (T388177)]], [[gerrit:1133503|Remove redundant WaitConditionLoop from CentralAuthTokenManager]], [[gerrit:1133504|Remove redundant WaitConditionLoop from CentralAuthTokenManager]] |
[production] |
21:00 |
<reedy@deploy1003> |
d3r1ck01, matmarex, reedy: Continuing with sync |
[production] |
20:52 |
<reedy@deploy1003> |
d3r1ck01, matmarex, reedy: Backport for [[gerrit:1133500|SUL3: Fix user ID mismatch during login (immediately after creation) (T388177)]], [[gerrit:1133502|SUL3: Fix user ID mismatch during login (immediately after creation) (T388177)]], [[gerrit:1133503|Remove redundant WaitConditionLoop from CentralAuthTokenManager]], [[gerrit:1133504|Remove redundant WaitConditionLoop from CentralAuthTokenManager] |
[production] |
20:47 |
<reedy@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1133500|SUL3: Fix user ID mismatch during login (immediately after creation) (T388177)]], [[gerrit:1133502|SUL3: Fix user ID mismatch during login (immediately after creation) (T388177)]], [[gerrit:1133503|Remove redundant WaitConditionLoop from CentralAuthTokenManager]], [[gerrit:1133504|Remove redundant WaitConditionLoop from CentralAuthTokenManager]] |
[production] |
20:14 |
<reedy@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1133500|SUL3: Fix user ID mismatch during login (immediately after creation) (T388177)]], [[gerrit:1133502|SUL3: Fix user ID mismatch during login (immediately after creation) (T388177)]], [[gerrit:1133503|Remove redundant WaitConditionLoop from CentralAuthTokenManager]], [[gerrit:1133504|Remove redundant WaitConditionLoop from CentralAuthTokenManager]] |
[production] |
19:54 |
<jhathaway> |
rolling out a change to private repo, 1127150, please let me know if any issues arise when merging patches |
[production] |
18:54 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apus-fe2003.codfw.wmnet with OS bookworm |
[production] |
18:35 |
<dancy@deploy1003> |
rebuilt and synchronized wikiversions files: group1 to 1.44.0-wmf.23 refs T386218 |
[production] |
18:35 |
<cstone> |
SmashPig upgraded from b9310c06 to 642ae816 |
[production] |