2025-05-12
§
|
13:01 |
<elukey> |
`puppet ca destroy thanos.discovery.wmnet` on puppetmaster1001 - old cert not used anymore |
[production] |
12:59 |
<tgr@deploy1003> |
tgr, mszabo: Continuing with sync |
[production] |
12:52 |
<tgr@deploy1003> |
tgr, mszabo: Backport for [[gerrit:1143962|Get rid of ancient session_name call (T124371)]], [[gerrit:1144495|Do not use $_SESSION (T29887 T124371)]], [[gerrit:1140725|Set wgPHPSessionHandling to 'warn' (T362324)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
12:47 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P75907 and previous config saved to /var/cache/conftool/dbconfig/20250512-124718-fceratto.json |
[production] |
12:36 |
<tgr@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1143962|Get rid of ancient session_name call (T124371)]], [[gerrit:1144495|Do not use $_SESSION (T29887 T124371)]], [[gerrit:1140725|Set wgPHPSessionHandling to 'warn' (T362324)]] |
[production] |
12:32 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1180 (T392806)', diff saved to https://phabricator.wikimedia.org/P75906 and previous config saved to /var/cache/conftool/dbconfig/20250512-123211-fceratto.json |
[production] |
12:30 |
<Krinkle> |
Disable publishing noise from rWSWF, T143162, T267223 |
[releng] |
12:26 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1180 (T392806)', diff saved to https://phabricator.wikimedia.org/P75905 and previous config saved to /var/cache/conftool/dbconfig/20250512-122626-fceratto.json |
[production] |
12:26 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1180.eqiad.wmnet with reason: Maintenance |
[production] |
12:26 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1168 (T392806)', diff saved to https://phabricator.wikimedia.org/P75904 and previous config saved to /var/cache/conftool/dbconfig/20250512-122600-fceratto.json |
[production] |
12:25 |
<kamila@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply |
[production] |
12:24 |
<kamila@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-cron: apply |
[production] |
12:18 |
<kamila@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-cron: apply |
[production] |
12:18 |
<kamila@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/mw-cron: apply |
[production] |
12:18 |
<kamila@deploy1003> |
helmfile [codfw] START helmfile.d/services/mw-cron: apply |
[production] |
12:16 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api |
[toolsbeta] |
12:10 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P75903 and previous config saved to /var/cache/conftool/dbconfig/20250512-121053-fceratto.json |
[production] |
12:05 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component components-api |
[toolsbeta] |
11:55 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P75902 and previous config saved to /var/cache/conftool/dbconfig/20250512-115545-fceratto.json |
[production] |
11:51 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component registry-admission |
[tools] |
11:45 |
<jgleeson> |
civicrm upgraded from dc096105 to 852c6ee6 |
[production] |
11:40 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1168 (T392806)', diff saved to https://phabricator.wikimedia.org/P75901 and previous config saved to /var/cache/conftool/dbconfig/20250512-114038-fceratto.json |
[production] |
11:39 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component registry-admission |
[tools] |
11:39 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component registry-admission |
[toolsbeta] |
11:33 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1168 (T392806)', diff saved to https://phabricator.wikimedia.org/P75900 and previous config saved to /var/cache/conftool/dbconfig/20250512-113350-fceratto.json |
[production] |
11:33 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1168.eqiad.wmnet with reason: Maintenance |
[production] |
11:33 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1165 (T392806)', diff saved to https://phabricator.wikimedia.org/P75899 and previous config saved to /var/cache/conftool/dbconfig/20250512-113324-fceratto.json |
[production] |
11:28 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component registry-admission |
[toolsbeta] |
11:26 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component ingress-admission |
[tools] |
11:25 |
<volans@cumin1003> |
END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) homer to cumin2002.codfw.wmnet,cumin[1002-1003].eqiad.wmnet with reason: Revert to v0.9.0 - volans@cumin1003 |
[production] |
11:22 |
<volans@cumin1003> |
START - Cookbook sre.deploy.python-code homer to cumin2002.codfw.wmnet,cumin[1002-1003].eqiad.wmnet with reason: Revert to v0.9.0 - volans@cumin1003 |
[production] |
11:18 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1165', diff saved to https://phabricator.wikimedia.org/P75898 and previous config saved to /var/cache/conftool/dbconfig/20250512-111817-fceratto.json |
[production] |
11:17 |
<volans@cumin1003> |
END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) homer to cumin2002.codfw.wmnet,cumin[1002-1003].eqiad.wmnet with reason: Revert to v0.9.0 - volans@cumin1003 |
[production] |
11:16 |
<volans@cumin1003> |
START - Cookbook sre.deploy.python-code homer to cumin2002.codfw.wmnet,cumin[1002-1003].eqiad.wmnet with reason: Revert to v0.9.0 - volans@cumin1003 |
[production] |
11:14 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component ingress-admission |
[tools] |
11:14 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component ingress-admission |
[toolsbeta] |
11:12 |
<volans@cumin1003> |
END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) homer to cumin2002.codfw.wmnet,cumin[1002-1003].eqiad.wmnet with reason: Release v0.10.0 - volans@cumin1003 |
[production] |
11:11 |
<volans@cumin1003> |
START - Cookbook sre.deploy.python-code homer to cumin2002.codfw.wmnet,cumin[1002-1003].eqiad.wmnet with reason: Release v0.10.0 - volans@cumin1003 |
[production] |
11:08 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox |
[production] |
11:08 |
<ayounsi@cumin1002> |
START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox |
[production] |
11:06 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox-canary |
[production] |
11:03 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be1088.eqiad.wmnet with OS bullseye |
[production] |
11:03 |
<ayounsi@cumin1002> |
START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary |
[production] |
11:03 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1165', diff saved to https://phabricator.wikimedia.org/P75897 and previous config saved to /var/cache/conftool/dbconfig/20250512-110310-fceratto.json |
[production] |
11:02 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component ingress-admission |
[toolsbeta] |
10:48 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1165 (T392806)', diff saved to https://phabricator.wikimedia.org/P75896 and previous config saved to /var/cache/conftool/dbconfig/20250512-104803-fceratto.json |
[production] |
10:47 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be1088.eqiad.wmnet with reason: host reimage |
[production] |
10:44 |
<taavi> |
fix security groups for frontproxy-nginx metricsinfra job |
[toolsbeta] |
10:44 |
<mvernon@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be1088.eqiad.wmnet with reason: host reimage |
[production] |
10:44 |
<taavi> |
add generic TargetDown rule for better detection of issues like T392889 |
[metricsinfra] |