2025-05-30
|
10:28 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2148.codfw.wmnet with reason: Maintenance |
[production] |
10:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P76711 and previous config saved to /var/cache/conftool/dbconfig/20250530-102416-root.json |
[production] |
10:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P76710 and previous config saved to /var/cache/conftool/dbconfig/20250530-100911-root.json |
[production] |
09:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P76709 and previous config saved to /var/cache/conftool/dbconfig/20250530-095405-root.json |
[production] |
09:43 |
<Lucas_WMDE> |
ssh integration-castor05.integration.eqiad1.wikimedia.cloud sudo -u jenkins-deploy rm -rf /srv/castor/castor-mw-ext-and-skins/master/mwgate-node20 # fix failure seen in mwgate-node20 57273 and 57274 |
[releng] |
09:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P76708 and previous config saved to /var/cache/conftool/dbconfig/20250530-093859-root.json |
[production] |
09:23 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 20%: Repooling', diff saved to https://phabricator.wikimedia.org/P76707 and previous config saved to /var/cache/conftool/dbconfig/20250530-092353-root.json |
[production] |
09:08 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P76706 and previous config saved to /var/cache/conftool/dbconfig/20250530-090847-root.json |
[production] |
08:53 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P76705 and previous config saved to /var/cache/conftool/dbconfig/20250530-085341-root.json |
[production] |
08:38 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2040 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P76704 and previous config saved to /var/cache/conftool/dbconfig/20250530-083836-root.json |
[production] |
08:23 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es2040.codfw.wmnet with reason: Maintenance |
[production] |
08:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool es2040 T395647', diff saved to https://phabricator.wikimedia.org/P76703 and previous config saved to /var/cache/conftool/dbconfig/20250530-082144-marostegui.json |
[production] |
08:20 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on db2187.codfw.wmnet with reason: Reimaging |
[production] |
08:18 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depool db2187 for reimaging, see T394884', diff saved to https://phabricator.wikimedia.org/P76702 and previous config saved to /var/cache/conftool/dbconfig/20250530-081804-fceratto.json |
[production] |
07:38 |
<taavi> |
reboot tools-static-15 to unstick NFS things |
[tools] |
07:09 |
<elukey@deploy1003> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
07:08 |
<elukey@deploy1003> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
07:08 |
<bwojtowicz@deploy1003> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'readability' for release 'main' . |
[production] |
07:05 |
<elukey@puppetserver1001> |
conftool action : set/pooled=false; selector: dnsdisc=inference,name=eqiad |
[production] |
06:29 |
<moritzm> |
uninstalling systemd-coredump (only installed on one host due to an older test, but not needed and there are open security issues) |
[production] |
2025-05-29
|
23:33 |
<bvibber@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1152167|Validation fix for saving Data: .chart pages with transforms (T395631)]] (duration: 10m 00s) |
[production] |
23:27 |
<bvibber@deploy1003> |
bvibber: Continuing with sync |
[production] |
23:25 |
<bvibber@deploy1003> |
bvibber: Backport for [[gerrit:1152167|Validation fix for saving Data: .chart pages with transforms (T395631)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
23:23 |
<bvibber@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1152167|Validation fix for saving Data: .chart pages with transforms (T395631)]] |
[production] |
22:18 |
<bd808> |
Submitted WikimediaDebug v3.1.0 to addons.mozilla.org for review (T395190, T315111) |
[releng] |
22:11 |
<bd808> |
Submitted WikimediaDebug v3.1.0 to Chrome Web Store for review (T395190, T315111) |
[releng] |
21:41 |
<cjming> |
end of UTC late backport window |
[production] |
21:38 |
<cjming@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1152115|ext.wikimediaEvents: Add XLab PageVisit instrument (T393918 T392313)]] (duration: 09m 54s) |
[production] |
21:31 |
<cjming@deploy1003> |
cjming: Continuing with sync |
[production] |
21:30 |
<cjming@deploy1003> |
cjming: Backport for [[gerrit:1152115|ext.wikimediaEvents: Add XLab PageVisit instrument (T393918 T392313)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
21:28 |
<cjming@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1152115|ext.wikimediaEvents: Add XLab PageVisit instrument (T393918 T392313)]] |
[production] |
21:20 |
<cjming@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1151148|EventStreamConfig: Remove xLab development streams (T393918)]] (duration: 09m 34s) |
[production] |
21:12 |
<cjming@deploy1003> |
cjming, phuedx: Continuing with sync |
[production] |
21:12 |
<cjming@deploy1003> |
cjming, phuedx: Backport for [[gerrit:1151148|EventStreamConfig: Remove xLab development streams (T393918)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
21:10 |
<cjming@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1151148|EventStreamConfig: Remove xLab development streams (T393918)]] |
[production] |
21:07 |
<cjming@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1151236|noc: Fix invalid `max-age: 300` syntax to `max-age=300` in fileserve.php (T341859)]] (duration: 10m 22s) |
[production] |
21:00 |
<cjming@deploy1003> |
cjming, krinkle: Continuing with sync |
[production] |
20:59 |
<cjming@deploy1003> |
cjming, krinkle: Backport for [[gerrit:1151236|noc: Fix invalid `max-age: 300` syntax to `max-age=300` in fileserve.php (T341859)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
20:57 |
<cjming@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1151236|noc: Fix invalid `max-age: 300` syntax to `max-age=300` in fileserve.php (T341859)]] |
[production] |
20:55 |
<cscott@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1152080|Campaign: Ensure `<div class="mw-parser-output">` wrapper is removed (T395023)]] (duration: 15m 01s) |
[production] |
20:48 |
<cscott@deploy1003> |
cscott: Continuing with sync |
[production] |
20:43 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1240.eqiad.wmnet with reason: Maintenance |
[production] |
20:42 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1212 (T395241)', diff saved to https://phabricator.wikimedia.org/P76700 and previous config saved to /var/cache/conftool/dbconfig/20250529-204251-fceratto.json |
[production] |
20:42 |
<cscott@deploy1003> |
cscott: Backport for [[gerrit:1152080|Campaign: Ensure `<div class="mw-parser-output">` wrapper is removed (T395023)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
20:40 |
<cscott@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1152080|Campaign: Ensure `<div class="mw-parser-output">` wrapper is removed (T395023)]] |
[production] |
20:28 |
<cjming@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1149720|Turn on glent m1 AB test (T262612)]] (duration: 10m 19s) |
[production] |
20:27 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1212', diff saved to https://phabricator.wikimedia.org/P76699 and previous config saved to /var/cache/conftool/dbconfig/20250529-202743-fceratto.json |
[production] |
20:21 |
<cjming@deploy1003> |
ebernhardson, cjming: Continuing with sync |
[production] |
20:20 |
<cjming@deploy1003> |
ebernhardson, cjming: Backport for [[gerrit:1149720|Turn on glent m1 AB test (T262612)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
20:18 |
<cjming@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1149720|Turn on glent m1 AB test (T262612)]] |
[production] |