|
2025-11-28
ยง
|
| 11:31 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1219', diff saved to https://phabricator.wikimedia.org/P86084 and previous config saved to /var/cache/conftool/dbconfig/20251128-113107-marostegui.json |
[production] |
| 11:16 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1219', diff saved to https://phabricator.wikimedia.org/P86083 and previous config saved to /var/cache/conftool/dbconfig/20251128-111600-marostegui.json |
[production] |
| 11:15 |
<kevinbazira@deploy2002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . |
[production] |
| 11:00 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1219 (T410531)', diff saved to https://phabricator.wikimedia.org/P86082 and previous config saved to /var/cache/conftool/dbconfig/20251128-110052-marostegui.json |
[production] |
| 10:57 |
<jynus@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2239.codfw.wmnet with reason: Upgrade and reboot |
[production] |
| 10:54 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1219 (T410531)', diff saved to https://phabricator.wikimedia.org/P86081 and previous config saved to /var/cache/conftool/dbconfig/20251128-105451-marostegui.json |
[production] |
| 10:54 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1219.eqiad.wmnet with reason: Maintenance |
[production] |
| 10:54 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1218 (T410531)', diff saved to https://phabricator.wikimedia.org/P86080 and previous config saved to /var/cache/conftool/dbconfig/20251128-105427-marostegui.json |
[production] |
| 10:39 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1218', diff saved to https://phabricator.wikimedia.org/P86079 and previous config saved to /var/cache/conftool/dbconfig/20251128-103920-marostegui.json |
[production] |
| 10:39 |
<brouberol@deploy2002> |
helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 10:38 |
<brouberol@deploy2002> |
helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 10:37 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 10:36 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 10:26 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox |
[production] |
| 10:26 |
<ayounsi@cumin1003> |
START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox |
[production] |
| 10:24 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1218', diff saved to https://phabricator.wikimedia.org/P86078 and previous config saved to /var/cache/conftool/dbconfig/20251128-102412-marostegui.json |
[production] |
| 10:09 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1218 (T410531)', diff saved to https://phabricator.wikimedia.org/P86077 and previous config saved to /var/cache/conftool/dbconfig/20251128-100905-marostegui.json |
[production] |
| 10:05 |
<bwojtowicz@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . |
[production] |
| 10:04 |
<klausman@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-serve1001.eqiad.wmnet with OS trixie |
[production] |
| 10:02 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1218 (T410531)', diff saved to https://phabricator.wikimedia.org/P86076 and previous config saved to /var/cache/conftool/dbconfig/20251128-100258-marostegui.json |
[production] |
| 10:02 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1218.eqiad.wmnet with reason: Maintenance |
[production] |
| 10:02 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1206 (T410531)', diff saved to https://phabricator.wikimedia.org/P86075 and previous config saved to /var/cache/conftool/dbconfig/20251128-100234-marostegui.json |
[production] |
| 10:02 |
<bwojtowicz@deploy2002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . |
[production] |
| 09:52 |
<jnuche> |
temporarily disabled beta-scap-sync-world to avoid spamming in channel |
[releng] |
| 09:48 |
<klausman@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve1001.eqiad.wmnet with reason: host reimage |
[production] |
| 09:47 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1206', diff saved to https://phabricator.wikimedia.org/P86074 and previous config saved to /var/cache/conftool/dbconfig/20251128-094727-marostegui.json |
[production] |
| 09:44 |
<klausman@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve1001.eqiad.wmnet with reason: host reimage |
[production] |
| 09:32 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1206', diff saved to https://phabricator.wikimedia.org/P86073 and previous config saved to /var/cache/conftool/dbconfig/20251128-093219-marostegui.json |
[production] |
| 09:27 |
<klausman@cumin1003> |
START - Cookbook sre.hosts.reimage for host ml-serve1001.eqiad.wmnet with OS trixie |
[production] |
| 09:23 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2158 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P86072 and previous config saved to /var/cache/conftool/dbconfig/20251128-092341-marostegui.json |
[production] |
| 09:23 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2158.codfw.wmnet with reason: Maintenance |
[production] |
| 09:23 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2151 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P86071 and previous config saved to /var/cache/conftool/dbconfig/20251128-092318-marostegui.json |
[production] |
| 09:17 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1206 (T410531)', diff saved to https://phabricator.wikimedia.org/P86070 and previous config saved to /var/cache/conftool/dbconfig/20251128-091712-marostegui.json |
[production] |
| 09:11 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1206 (T410531)', diff saved to https://phabricator.wikimedia.org/P86069 and previous config saved to /var/cache/conftool/dbconfig/20251128-091116-marostegui.json |
[production] |
| 09:11 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1206.eqiad.wmnet with reason: Maintenance |
[production] |
| 09:10 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1196 (T410531)', diff saved to https://phabricator.wikimedia.org/P86068 and previous config saved to /var/cache/conftool/dbconfig/20251128-091052-marostegui.json |
[production] |
| 09:08 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2151', diff saved to https://phabricator.wikimedia.org/P86067 and previous config saved to /var/cache/conftool/dbconfig/20251128-090810-marostegui.json |
[production] |
| 08:59 |
<jynus@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2239.codfw.wmnet with reason: Upgrade and reboot |
[production] |
| 08:55 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1196', diff saved to https://phabricator.wikimedia.org/P86066 and previous config saved to /var/cache/conftool/dbconfig/20251128-085544-marostegui.json |
[production] |
| 08:53 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2151', diff saved to https://phabricator.wikimedia.org/P86065 and previous config saved to /var/cache/conftool/dbconfig/20251128-085303-marostegui.json |
[production] |
| 08:50 |
<brouberol@dns1004> |
END - running authdns-update |
[production] |
| 08:49 |
<brouberol@dns1004> |
START - running authdns-update |
[production] |
| 08:40 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1196', diff saved to https://phabricator.wikimedia.org/P86064 and previous config saved to /var/cache/conftool/dbconfig/20251128-084037-marostegui.json |
[production] |
| 08:37 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2151 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P86063 and previous config saved to /var/cache/conftool/dbconfig/20251128-083755-marostegui.json |
[production] |
| 08:25 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1196 (T410531)', diff saved to https://phabricator.wikimedia.org/P86062 and previous config saved to /var/cache/conftool/dbconfig/20251128-082529-marostegui.json |
[production] |
| 08:18 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1196 (T410531)', diff saved to https://phabricator.wikimedia.org/P86061 and previous config saved to /var/cache/conftool/dbconfig/20251128-081852-marostegui.json |
[production] |
| 08:18 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1013,1017].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
| 08:18 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1196.eqiad.wmnet with reason: Maintenance |
[production] |
| 08:18 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1195 (T410531)', diff saved to https://phabricator.wikimedia.org/P86060 and previous config saved to /var/cache/conftool/dbconfig/20251128-081820-marostegui.json |
[production] |
| 08:08 |
<moritzm> |
installing Linux 6.1.158 kernel on Bookworm hosts |
[production] |