|
2025-05-30
§
|
| 12:19 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175 (T395241)', diff saved to https://phabricator.wikimedia.org/P76723 and previous config saved to /var/cache/conftool/dbconfig/20250530-121934-fceratto.json |
[production] |
| 12:04 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P76721 and previous config saved to /var/cache/conftool/dbconfig/20250530-120427-fceratto.json |
[production] |
| 12:03 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2187.codfw.wmnet with reason: host reimage |
[production] |
| 12:01 |
<fceratto@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2187.codfw.wmnet with reason: host reimage |
[production] |
| 11:49 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P76720 and previous config saved to /var/cache/conftool/dbconfig/20250530-114921-fceratto.json |
[production] |
| 11:41 |
<fceratto@cumin1002> |
START - Cookbook sre.hosts.reimage for host db2187.codfw.wmnet with OS bookworm |
[production] |
| 11:34 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175 (T395241)', diff saved to https://phabricator.wikimedia.org/P76718 and previous config saved to /var/cache/conftool/dbconfig/20250530-113414-fceratto.json |
[production] |
| 11:24 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2175 (T395241)', diff saved to https://phabricator.wikimedia.org/P76717 and previous config saved to /var/cache/conftool/dbconfig/20250530-112449-fceratto.json |
[production] |
| 11:24 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2175.codfw.wmnet with reason: Maintenance |
[production] |
| 11:24 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T395241)', diff saved to https://phabricator.wikimedia.org/P76716 and previous config saved to /var/cache/conftool/dbconfig/20250530-112423-fceratto.json |
[production] |
| 11:09 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P76715 and previous config saved to /var/cache/conftool/dbconfig/20250530-110917-fceratto.json |
[production] |
| 10:57 |
<JustHannah> |
T395592 Ran mwscript-k8s --comment="T395592" --follow -- extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=enwiki --logwiki=metawiki 'Yusuftahaluleci' 'Yusuf_Taha_Lüleci' |
[production] |
| 10:54 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P76714 and previous config saved to /var/cache/conftool/dbconfig/20250530-105410-fceratto.json |
[production] |
| 10:39 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T395241)', diff saved to https://phabricator.wikimedia.org/P76713 and previous config saved to /var/cache/conftool/dbconfig/20250530-103903-fceratto.json |
[production] |
| 10:28 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2148 (T395241)', diff saved to https://phabricator.wikimedia.org/P76712 and previous config saved to /var/cache/conftool/dbconfig/20250530-102830-fceratto.json |
[production] |
| 10:28 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2148.codfw.wmnet with reason: Maintenance |
[production] |
| 10:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P76711 and previous config saved to /var/cache/conftool/dbconfig/20250530-102416-root.json |
[production] |
| 10:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P76710 and previous config saved to /var/cache/conftool/dbconfig/20250530-100911-root.json |
[production] |
| 09:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P76709 and previous config saved to /var/cache/conftool/dbconfig/20250530-095405-root.json |
[production] |
| 09:43 |
<Lucas_WMDE> |
ssh integration-castor05.integration.eqiad1.wikimedia.cloud sudo -u jenkins-deploy rm -rf /srv/castor/castor-mw-ext-and-skins/master/mwgate-node20 # fix failure seen in mwgate-node20 57273 and 57274 |
[releng] |
| 09:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P76708 and previous config saved to /var/cache/conftool/dbconfig/20250530-093859-root.json |
[production] |
| 09:23 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 20%: Repooling', diff saved to https://phabricator.wikimedia.org/P76707 and previous config saved to /var/cache/conftool/dbconfig/20250530-092353-root.json |
[production] |
| 09:08 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P76706 and previous config saved to /var/cache/conftool/dbconfig/20250530-090847-root.json |
[production] |
| 08:53 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P76705 and previous config saved to /var/cache/conftool/dbconfig/20250530-085341-root.json |
[production] |
| 08:38 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P76704 and previous config saved to /var/cache/conftool/dbconfig/20250530-083836-root.json |
[production] |
| 08:23 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es2040.codfw.wmnet with reason: Maintenance |
[production] |
| 08:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool es2040 T395647', diff saved to https://phabricator.wikimedia.org/P76703 and previous config saved to /var/cache/conftool/dbconfig/20250530-082144-marostegui.json |
[production] |
| 08:20 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on db2187.codfw.wmnet with reason: Reimaging |
[production] |
| 08:18 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depool db2187 for reimaging, see T394884', diff saved to https://phabricator.wikimedia.org/P76702 and previous config saved to /var/cache/conftool/dbconfig/20250530-081804-fceratto.json |
[production] |
| 07:38 |
<taavi> |
reboot tools-static-15 to unstuck NFS things |
[tools] |
| 07:09 |
<elukey@deploy1003> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
| 07:08 |
<elukey@deploy1003> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
| 07:08 |
<bwojtowicz@deploy1003> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'readability' for release 'main' . |
[production] |
| 07:05 |
<elukey@puppetserver1001> |
conftool action : set/pooled=false; selector: dnsdisc=inference,name=eqiad |
[production] |
| 06:29 |
<moritzm> |
uninstalling systemd-coredump (only installed on one host due to an older tests, but not needed and there's open security issues) |
[production] |
|
2025-05-29
§
|
| 23:33 |
<bvibber@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1152167|Validation fix for saving Data: .chart pages with transforms (T395631)]] (duration: 10m 00s) |
[production] |
| 23:27 |
<bvibber@deploy1003> |
bvibber: Continuing with sync |
[production] |
| 23:25 |
<bvibber@deploy1003> |
bvibber: Backport for [[gerrit:1152167|Validation fix for saving Data: .chart pages with transforms (T395631)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 23:23 |
<bvibber@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1152167|Validation fix for saving Data: .chart pages with transforms (T395631)]] |
[production] |
| 22:18 |
<bd808> |
Submitted WikimediaDebug v3.1.0 to addons.mozilla.org for review (T395190, T315111) |
[releng] |
| 22:11 |
<bd808> |
Submitted WikimediaDebug v3.1.0 to Chrome Web Store for review (T395190, T315111) |
[releng] |
| 21:41 |
<cjming> |
end of UTC late backport window |
[production] |
| 21:38 |
<cjming@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1152115|ext.wikimediaEvents: Add XLab PageVisit instrument (T393918 T392313)]] (duration: 09m 54s) |
[production] |
| 21:31 |
<cjming@deploy1003> |
cjming: Continuing with sync |
[production] |
| 21:30 |
<cjming@deploy1003> |
cjming: Backport for [[gerrit:1152115|ext.wikimediaEvents: Add XLab PageVisit instrument (T393918 T392313)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 21:28 |
<cjming@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1152115|ext.wikimediaEvents: Add XLab PageVisit instrument (T393918 T392313)]] |
[production] |
| 21:20 |
<cjming@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1151148|EventStreamConfig: Remove xLab development streams (T393918)]] (duration: 09m 34s) |
[production] |
| 21:12 |
<cjming@deploy1003> |
cjming, phuedx: Continuing with sync |
[production] |
| 21:12 |
<cjming@deploy1003> |
cjming, phuedx: Backport for [[gerrit:1151148|EventStreamConfig: Remove xLab development streams (T393918)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 21:10 |
<cjming@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1151148|EventStreamConfig: Remove xLab development streams (T393918)]] |
[production] |