|
2025-11-20
ยง
|
| 17:12 |
<robh@cumin2002> |
START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker1034.eqiad.wmnet |
[production] |
| 17:10 |
<robh@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on wikikube-worker1034.eqiad.wmnet with reason: C/D Migration |
[production] |
| 17:10 |
<robh@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on wikikube-worker1163.eqiad.wmnet with reason: C/D Migration |
[production] |
| 17:10 |
<robh@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on wikikube-worker1162.eqiad.wmnet with reason: C/D Migration |
[production] |
| 17:03 |
<robh@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on wikikube-worker1159.eqiad.wmnet with reason: C/D Migration |
[production] |
| 17:01 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1006.eqiad.wmnet |
[production] |
| 16:58 |
<robh@cumin2002> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker[1159,1162-1163].eqiad.wmnet |
[production] |
| 16:56 |
<robh@cumin2002> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker1034.eqiad.wmnet |
[production] |
| 16:56 |
<robh@cumin2002> |
START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker[1159,1162-1163].eqiad.wmnet |
[production] |
| 16:55 |
<robh@cumin2002> |
START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker1034.eqiad.wmnet |
[production] |
| 16:55 |
<volans@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component infra-tracing |
[tools] |
| 16:54 |
<robh> |
draining eqiad d3 wikikube hosts for network migration |
[production] |
| 16:54 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1006.eqiad.wmnet |
[production] |
| 16:45 |
<volans@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component logging |
[tools] |
| 16:43 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1005.eqiad.wmnet |
[production] |
| 16:36 |
<volans@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component logging |
[tools] |
| 16:35 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1005.eqiad.wmnet |
[production] |
| 16:19 |
<mforns> |
Finished refinery deploy regular deployment train for 4df475f3ab6ac9f43f9cd9f63008496fbc9e7c16 |
[analytics] |
| 16:16 |
<papaul> |
rebooting sretest1005 to chek LLDP settings |
[production] |
| 16:12 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . |
[production] |
| 16:05 |
<mforns@deploy2002> |
Finished deploy [analytics/refinery@4df475f] (thin): Regular analytics weekly train THIN [analytics/refinery@4df475f3] (duration: 01m 16s) |
[production] |
| 16:03 |
<mforns@deploy2002> |
Started deploy [analytics/refinery@4df475f] (thin): Regular analytics weekly train THIN [analytics/refinery@4df475f3] |
[production] |
| 16:00 |
<dcausse@deploy2002> |
mwscript-k8s job started: extensions/CirrusSearch/maintenance/UpdateSuggesterIndex.php hewiki --masterTimeout=10m --replicationTimeout=5400 --indexChunkSize=3000 --cluster=eqiad --optimize # T410602 reindexing search suggestions on hewiki |
[production] |
| 15:59 |
<mforns@deploy2002> |
Finished deploy [analytics/refinery@4df475f]: Regular analytics weekly train [analytics/refinery@4df475f3] (duration: 02m 25s) |
[production] |
| 15:57 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . |
[production] |
| 15:57 |
<mforns@deploy2002> |
Started deploy [analytics/refinery@4df475f]: Regular analytics weekly train [analytics/refinery@4df475f3] |
[production] |
| 15:56 |
<mforns@deploy2002> |
Finished deploy [analytics/refinery@4df475f] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@4df475f3] (duration: 01m 01s) |
[production] |
| 15:56 |
<volans@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component logging |
[tools] |
| 15:55 |
<mforns@deploy2002> |
Started deploy [analytics/refinery@4df475f] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@4df475f3] |
[production] |
| 15:53 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
| 15:52 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. |
[production] |
| 15:51 |
<mforns> |
Starting refinery deploy regular deployment train for 4df475f3ab6ac9f43f9cd9f63008496fbc9e7c16 |
[analytics] |
| 15:51 |
<volans@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component logging |
[tools] |
| 15:47 |
<volans@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component registry-admission |
[tools] |
| 15:45 |
<ladsgroup@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1207884|Revert "Revert^2 "rdbms: Dismantle concept of groups""]] (duration: 10m 09s) |
[production] |
| 15:40 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db1180 (T410589)', diff saved to https://phabricator.wikimedia.org/P85419 and previous config saved to /var/cache/conftool/dbconfig/20251120-154014-ladsgroup.json |
[production] |
| 15:40 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1180.eqiad.wmnet with reason: Maintenance |
[production] |
| 15:40 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1173 (T410589)', diff saved to https://phabricator.wikimedia.org/P85418 and previous config saved to /var/cache/conftool/dbconfig/20251120-154002-ladsgroup.json |
[production] |
| 15:40 |
<ladsgroup@deploy2002> |
ladsgroup, trainbranchbot: Continuing with sync |
[production] |
| 15:39 |
<ladsgroup@deploy2002> |
ladsgroup, trainbranchbot: Backport for [[gerrit:1207884|Revert "Revert^2 "rdbms: Dismantle concept of groups""]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 15:39 |
<dcausse@deploy2002> |
mwscript-k8s job started: extensions/CirrusSearch/maintenance/UpdateSuggesterIndex.php frwiki --masterTimeout=10m --replicationTimeout=5400 --indexChunkSize=3000 --cluster=eqiad --optimize # T410602 reindexing search suggestions on frwiki |
[production] |
| 15:37 |
<volans@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component registry-admission |
[tools] |
| 15:37 |
<volans@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component registry-admission |
[tools] |
| 15:35 |
<klausman@deploy2002> |
helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 15:35 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . |
[production] |
| 15:35 |
<ladsgroup@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1207884|Revert "Revert^2 "rdbms: Dismantle concept of groups""]] |
[production] |
| 15:34 |
<klausman@deploy2002> |
helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 15:34 |
<klausman@deploy2002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 15:32 |
<klausman@deploy2002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 15:31 |
<klausman@deploy2002> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |