2025-04-14
ยง
|
13:13 |
<elukey@deploy1003> |
Started deploy [docker-pkg/deploy@a555b7b]: Upgrade to 4.0.4 |
[production] |
13:13 |
<godog> |
remove old LVs from prometheus[12]00[56] - T383232 |
[production] |
13:07 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1189', diff saved to https://phabricator.wikimedia.org/P74952 and previous config saved to /var/cache/conftool/dbconfig/20250414-130703-fceratto.json |
[production] |
13:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repool pc5 T391454', diff saved to https://phabricator.wikimedia.org/P74951 and previous config saved to /var/cache/conftool/dbconfig/20250414-130222-marostegui.json |
[production] |
13:00 |
<moritzm> |
remove ganeti01.svc.eqiad.wmnet cert (replaced by cfssl cert) T357750 |
[production] |
12:56 |
<vgutierrez@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-text_ulsfo and not P{cp4037.ulsfo.wmnet} and A:cp |
[production] |
12:56 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on pc2015.codfw.wmnet,pc1015.eqiad.wmnet with reason: Maintenance |
[production] |
12:56 |
<jforrester@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1136049|Special pages: Don't just set userCanExecute() but actually run it (T391594)]], [[gerrit:1136050|Client mode: Provide WikiLambdaClientModeOffline for SRE to disable]], [[gerrit:1136051|Wikifunctions VE: Add loading and abort state to content editable (T391441)]], [[gerrit:1136126|logging: Allow through WikiLambdaClient logs at info level an |
[production] |
12:56 |
<vgutierrez@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-upload_ulsfo and not P{cp4047.ulsfo.wmnet} and not P{cp4045.ulsfo.wmnet} and A:cp |
[production] |
12:55 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool pc5 T391454', diff saved to https://phabricator.wikimedia.org/P74950 and previous config saved to /var/cache/conftool/dbconfig/20250414-125511-marostegui.json |
[production] |
12:53 |
<moritzm> |
remove ganeti01.svc.codfw.wmnet cert (replaced by cfssl cert) T357750 |
[production] |
12:51 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1189', diff saved to https://phabricator.wikimedia.org/P74949 and previous config saved to /var/cache/conftool/dbconfig/20250414-125156-fceratto.json |
[production] |
12:51 |
<godog> |
upgrade prometheus2007 to thanos 0.38.0 - T383966 |
[production] |
12:50 |
<godog> |
upgrade prometheus2005 to thanos 0.38.0 - T383966 |
[production] |
12:49 |
<moritzm> |
remove ganeti01.svc.esams.wmnet cert (replaced by cfssl cert) T357750 |
[production] |
12:46 |
<jforrester@deploy1003> |
jforrester: Continuing with sync |
[production] |
12:46 |
<moritzm> |
remove ganeti01.svc.ulsfo.wmnet cert (replaced by cfssl cert) T357750 |
[production] |
12:44 |
<jforrester@deploy1003> |
jforrester: Backport for [[gerrit:1136049|Special pages: Don't just set userCanExecute() but actually run it (T391594)]], [[gerrit:1136050|Client mode: Provide WikiLambdaClientModeOffline for SRE to disable]], [[gerrit:1136051|Wikifunctions VE: Add loading and abort state to content editable (T391441)]], [[gerrit:1136126|logging: Allow through WikiLambdaClient logs at info level and above]] sync |
[production] |
12:43 |
<moritzm> |
remove ganeti01.svc.eqsin.wmnet cert (replaced by cfssl cert) T357750 |
[production] |
12:36 |
<jforrester@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1136049|Special pages: Don't just set userCanExecute() but actually run it (T391594)]], [[gerrit:1136050|Client mode: Provide WikiLambdaClientModeOffline for SRE to disable]], [[gerrit:1136051|Wikifunctions VE: Add loading and abort state to content editable (T391441)]], [[gerrit:1136126|logging: Allow through WikiLambdaClient logs at info level and |
[production] |
12:36 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1189 (T391056)', diff saved to https://phabricator.wikimedia.org/P74948 and previous config saved to /var/cache/conftool/dbconfig/20250414-123649-fceratto.json |
[production] |
12:32 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1189 (T391056)', diff saved to https://phabricator.wikimedia.org/P74947 and previous config saved to /var/cache/conftool/dbconfig/20250414-123255-fceratto.json |
[production] |
12:32 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1189.eqiad.wmnet with reason: Maintenance |
[production] |
12:32 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1175 (T391056)', diff saved to https://phabricator.wikimedia.org/P74946 and previous config saved to /var/cache/conftool/dbconfig/20250414-123234-fceratto.json |
[production] |
12:25 |
<cgoubert@deploy1003> |
helmfile [ml-serve-eqiad] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . |
[production] |
12:24 |
<cgoubert@deploy1003> |
helmfile [ml-serve-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . |
[production] |
12:24 |
<cgoubert@deploy1003> |
helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . |
[production] |
12:23 |
<cgoubert@deploy1003> |
helmfile [ml-serve-eqiad] 'sync' command on namespace 'ores-legacy' for release 'main' . |
[production] |
12:22 |
<cgoubert@deploy1003> |
helmfile [ml-serve-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . |
[production] |
12:22 |
<cgoubert@deploy1003> |
helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . |
[production] |
12:17 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1175', diff saved to https://phabricator.wikimedia.org/P74945 and previous config saved to /var/cache/conftool/dbconfig/20250414-121726-fceratto.json |
[production] |
12:06 |
<jforrester@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1136049|Special pages: Don't just set userCanExecute() but actually run it (T391594)]], [[gerrit:1136050|Client mode: Provide WikiLambdaClientModeOffline for SRE to disable]], [[gerrit:1136051|Wikifunctions VE: Add loading and abort state to content editable (T391441)]], [[gerrit:1136126|logging: Allow through WikiLambdaClient logs at info level and |
[production] |
12:02 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1175', diff saved to https://phabricator.wikimedia.org/P74944 and previous config saved to /var/cache/conftool/dbconfig/20250414-120219-fceratto.json |
[production] |
11:47 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1175 (T391056)', diff saved to https://phabricator.wikimedia.org/P74943 and previous config saved to /var/cache/conftool/dbconfig/20250414-114711-fceratto.json |
[production] |
11:43 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1175 (T391056)', diff saved to https://phabricator.wikimedia.org/P74942 and previous config saved to /var/cache/conftool/dbconfig/20250414-114323-fceratto.json |
[production] |
11:43 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1175.eqiad.wmnet with reason: Maintenance |
[production] |
11:43 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1166 (T391056)', diff saved to https://phabricator.wikimedia.org/P74941 and previous config saved to /var/cache/conftool/dbconfig/20250414-114300-fceratto.json |
[production] |
11:40 |
<sukhe@cumin1002> |
END (PASS) - Cookbook sre.dns.roll-restart (exit_code=0) rolling restart_daemons on A:dnsbox |
[production] |
11:30 |
<vgutierrez@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on P{cp4045.ulsfo.wmnet} and A:cp |
[production] |
11:30 |
<vgutierrez@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on P{cp4037.ulsfo.wmnet} and A:cp |
[production] |
11:29 |
<fceratto@deploy1003> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
11:29 |
<fceratto@deploy1003> |
helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
11:28 |
<fceratto@deploy1003> |
helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
11:27 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1166', diff saved to https://phabricator.wikimedia.org/P74940 and previous config saved to /var/cache/conftool/dbconfig/20250414-112754-fceratto.json |
[production] |
11:27 |
<fceratto@deploy1003> |
helmfile [aux-k8s-codfw] START helmfile.d/admin 'apply'. |
[production] |
11:26 |
<hnowlan@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply |
[production] |
11:26 |
<hnowlan@deploy1003> |
helmfile [codfw] START helmfile.d/services/rest-gateway: apply |
[production] |
11:25 |
<vgutierrez@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on P{cp4045.ulsfo.wmnet} and A:cp |
[production] |
11:25 |
<hnowlan@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply |
[production] |
11:25 |
<vgutierrez@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on P{cp4037.ulsfo.wmnet} and A:cp |
[production] |