2025-11-20
17:12 <robh@cumin2002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker1034.eqiad.wmnet [production]
17:10 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on wikikube-worker1034.eqiad.wmnet with reason: C/D Migration [production]
17:10 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on wikikube-worker1163.eqiad.wmnet with reason: C/D Migration [production]
17:10 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on wikikube-worker1162.eqiad.wmnet with reason: C/D Migration [production]
17:03 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on wikikube-worker1159.eqiad.wmnet with reason: C/D Migration [production]
17:01 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1006.eqiad.wmnet [production]
16:58 <robh@cumin2002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker[1159,1162-1163].eqiad.wmnet [production]
16:56 <robh@cumin2002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker1034.eqiad.wmnet [production]
16:56 <robh@cumin2002> START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker[1159,1162-1163].eqiad.wmnet [production]
16:55 <robh@cumin2002> START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker1034.eqiad.wmnet [production]
16:55 <volans@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component infra-tracing [tools]
16:54 <robh> draining eqiad d3 wikikube hosts for network migration [production]
16:54 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1006.eqiad.wmnet [production]
16:45 <volans@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component logging [tools]
16:43 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1005.eqiad.wmnet [production]
16:36 <volans@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component logging [tools]
16:35 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1005.eqiad.wmnet [production]
16:19 <mforns> Finished refinery deploy regular deployment train for 4df475f3ab6ac9f43f9cd9f63008496fbc9e7c16 [analytics]
16:16 <papaul> rebooting sretest1005 to check LLDP settings [production]
16:12 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . [production]
16:05 <mforns@deploy2002> Finished deploy [analytics/refinery@4df475f] (thin): Regular analytics weekly train THIN [analytics/refinery@4df475f3] (duration: 01m 16s) [production]
16:03 <mforns@deploy2002> Started deploy [analytics/refinery@4df475f] (thin): Regular analytics weekly train THIN [analytics/refinery@4df475f3] [production]
16:00 <dcausse@deploy2002> mwscript-k8s job started: extensions/CirrusSearch/maintenance/UpdateSuggesterIndex.php hewiki --masterTimeout=10m --replicationTimeout=5400 --indexChunkSize=3000 --cluster=eqiad --optimize # T410602 reindexing search suggestions on hewiki [production]
15:59 <mforns@deploy2002> Finished deploy [analytics/refinery@4df475f]: Regular analytics weekly train [analytics/refinery@4df475f3] (duration: 02m 25s) [production]
15:57 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . [production]
15:57 <mforns@deploy2002> Started deploy [analytics/refinery@4df475f]: Regular analytics weekly train [analytics/refinery@4df475f3] [production]
15:56 <mforns@deploy2002> Finished deploy [analytics/refinery@4df475f] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@4df475f3] (duration: 01m 01s) [production]
15:56 <volans@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component logging [tools]
15:55 <mforns@deploy2002> Started deploy [analytics/refinery@4df475f] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@4df475f3] [production]
15:53 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
15:52 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
15:51 <mforns> Starting refinery deploy regular deployment train for 4df475f3ab6ac9f43f9cd9f63008496fbc9e7c16 [analytics]
15:51 <volans@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component logging [tools]
15:47 <volans@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component registry-admission [tools]
15:45 <ladsgroup@deploy2002> Finished scap sync-world: Backport for [[gerrit:1207884|Revert "Revert^2 "rdbms: Dismantle concept of groups""]] (duration: 10m 09s) [production]
15:40 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db1180 (T410589)', diff saved to https://phabricator.wikimedia.org/P85419 and previous config saved to /var/cache/conftool/dbconfig/20251120-154014-ladsgroup.json [production]
15:40 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1180.eqiad.wmnet with reason: Maintenance [production]
15:40 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1173 (T410589)', diff saved to https://phabricator.wikimedia.org/P85418 and previous config saved to /var/cache/conftool/dbconfig/20251120-154002-ladsgroup.json [production]
15:40 <ladsgroup@deploy2002> ladsgroup, trainbranchbot: Continuing with sync [production]
15:39 <ladsgroup@deploy2002> ladsgroup, trainbranchbot: Backport for [[gerrit:1207884|Revert "Revert^2 "rdbms: Dismantle concept of groups""]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
15:39 <dcausse@deploy2002> mwscript-k8s job started: extensions/CirrusSearch/maintenance/UpdateSuggesterIndex.php frwiki --masterTimeout=10m --replicationTimeout=5400 --indexChunkSize=3000 --cluster=eqiad --optimize # T410602 reindexing search suggestions on frwiki [production]
15:37 <volans@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component registry-admission [tools]
15:37 <volans@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component registry-admission [tools]
15:35 <klausman@deploy2002> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. [production]
15:35 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . [production]
15:35 <ladsgroup@deploy2002> Started scap sync-world: Backport for [[gerrit:1207884|Revert "Revert^2 "rdbms: Dismantle concept of groups""]] [production]
15:34 <klausman@deploy2002> helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. [production]
15:34 <klausman@deploy2002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. [production]
15:32 <klausman@deploy2002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. [production]
15:31 <klausman@deploy2002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. [production]