|
2025-11-04
|
| 14:32 |
<lucaswerkmeister-wmde@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1198390|azwiktionary: use new wordmark and tagline (T408147)]], [[gerrit:1199751|Remove wmgULSPosition for special wikis (T400067)]] |
[production] |
| 14:31 |
<klausman@deploy2002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 14:31 |
<klausman@deploy2002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 14:30 |
<klausman@deploy2002> |
helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 14:30 |
<klausman@deploy2002> |
helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 14:30 |
<klausman@deploy2002> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 14:30 |
<klausman@deploy2002> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 14:27 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.clone of db2230.codfw.wmnet onto db-test2001.codfw.wmnet |
[production] |
| 14:20 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2211', diff saved to https://phabricator.wikimedia.org/P84759 and previous config saved to /var/cache/conftool/dbconfig/20251104-142010-marostegui.json |
[production] |
| 14:18 |
<andrew@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudnet2007-dev.codfw.wmnet with reason: host reimage |
[production] |
| 14:15 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudnet2007-dev.codfw.wmnet with reason: host reimage |
[production] |
| 14:05 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2211', diff saved to https://phabricator.wikimedia.org/P84758 and previous config saved to /var/cache/conftool/dbconfig/20251104-140503-marostegui.json |
[production] |
| 13:57 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.reimage for host cloudnet2007-dev.codfw.wmnet with OS trixie |
[production] |
| 13:53 |
<topranks> |
downgrade lsw1-c3-eqiad to SR-Linux v24.7.2 |
[production] |
| 13:49 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2211 (T407997)', diff saved to https://phabricator.wikimedia.org/P84757 and previous config saved to /var/cache/conftool/dbconfig/20251104-134955-marostegui.json |
[production] |
| 13:45 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2211 (T407997)', diff saved to https://phabricator.wikimedia.org/P84756 and previous config saved to /var/cache/conftool/dbconfig/20251104-134545-marostegui.json |
[production] |
| 13:45 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2211.codfw.wmnet with reason: Maintenance |
[production] |
| 13:43 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2201.codfw.wmnet with reason: Maintenance |
[production] |
| 13:43 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2178 (T407997)', diff saved to https://phabricator.wikimedia.org/P84755 and previous config saved to /var/cache/conftool/dbconfig/20251104-134314-marostegui.json |
[production] |
| 13:41 |
<moritzm> |
installing tiff security updates |
[production] |
| 13:35 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1220 (re)pooling @ 100%: After switchover', diff saved to https://phabricator.wikimedia.org/P84754 and previous config saved to /var/cache/conftool/dbconfig/20251104-133526-root.json |
[production] |
| 13:28 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2178', diff saved to https://phabricator.wikimedia.org/P84753 and previous config saved to /var/cache/conftool/dbconfig/20251104-132804-marostegui.json |
[production] |
| 13:20 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1220 (re)pooling @ 75%: After switchover', diff saved to https://phabricator.wikimedia.org/P84752 and previous config saved to /var/cache/conftool/dbconfig/20251104-132019-root.json |
[production] |
| 13:18 |
<hashar> |
gerrit: blocked pushes to refs/[^/]+ , first level references are considered invalid by cgit (funny refname) # T275946 T337508 |
[releng] |
| 13:12 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2178', diff saved to https://phabricator.wikimedia.org/P84750 and previous config saved to /var/cache/conftool/dbconfig/20251104-131254-marostegui.json |
[production] |
| 13:05 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1220 (re)pooling @ 50%: After switchover', diff saved to https://phabricator.wikimedia.org/P84749 and previous config saved to /var/cache/conftool/dbconfig/20251104-130512-root.json |
[production] |
| 12:59 |
<topranks> |
downgrade lsw1-d3-eqiad to SR-Linux v24.10.1 |
[production] |
| 12:57 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2178 (T407997)', diff saved to https://phabricator.wikimedia.org/P84748 and previous config saved to /var/cache/conftool/dbconfig/20251104-125745-marostegui.json |
[production] |
| 12:54 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2178 (T407997)', diff saved to https://phabricator.wikimedia.org/P84747 and previous config saved to /var/cache/conftool/dbconfig/20251104-125359-marostegui.json |
[production] |
| 12:53 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2178.codfw.wmnet with reason: Maintenance |
[production] |
| 12:53 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2171 (T407997)', diff saved to https://phabricator.wikimedia.org/P84746 and previous config saved to /var/cache/conftool/dbconfig/20251104-125335-marostegui.json |
[production] |
| 12:50 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1220 (re)pooling @ 25%: After switchover', diff saved to https://phabricator.wikimedia.org/P84745 and previous config saved to /var/cache/conftool/dbconfig/20251104-125005-root.json |
[production] |
| 12:48 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db1220 T409167', diff saved to https://phabricator.wikimedia.org/P84744 and previous config saved to /var/cache/conftool/dbconfig/20251104-124836-marostegui.json |
[production] |
| 12:48 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Promote db1237 to x1 primary T409167', diff saved to https://phabricator.wikimedia.org/P84743 and previous config saved to /var/cache/conftool/dbconfig/20251104-124803-marostegui.json |
[production] |
| 12:47 |
<marostegui> |
Starting x1 eqiad failover from db1220 to db1237 - T409167 |
[production] |
| 12:46 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 14 hosts with reason: Primary switchover x1 T409167 |
[production] |
| 12:45 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Set db1237 with weight 0 T409167', diff saved to https://phabricator.wikimedia.org/P84742 and previous config saved to /var/cache/conftool/dbconfig/20251104-124556-marostegui.json |
[production] |
| 12:38 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2171', diff saved to https://phabricator.wikimedia.org/P84741 and previous config saved to /var/cache/conftool/dbconfig/20251104-123827-marostegui.json |
[production] |
| 12:23 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2171', diff saved to https://phabricator.wikimedia.org/P84740 and previous config saved to /var/cache/conftool/dbconfig/20251104-122320-marostegui.json |
[production] |
| 12:18 |
<hashar> |
deployment-prep: deleted /var/run/lock/scap.srv_mediawiki-staging.lock . Lock file was left over by scap due to Jenkins being restarted |
[releng] |
| 12:17 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component api-gateway |
[tools] |
| 12:13 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component api-gateway |
[tools] |
| 12:08 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2171 (T407997)', diff saved to https://phabricator.wikimedia.org/P84739 and previous config saved to /var/cache/conftool/dbconfig/20251104-120812-marostegui.json |
[production] |
| 12:08 |
<fabfur> |
re-enable puppet on A:cp (T408060) |
[production] |
| 12:04 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2171 (T407997)', diff saved to https://phabricator.wikimedia.org/P84737 and previous config saved to /var/cache/conftool/dbconfig/20251104-120401-marostegui.json |
[production] |
| 12:03 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2171.codfw.wmnet with reason: Maintenance |
[production] |
| 12:03 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2157 (T407997)', diff saved to https://phabricator.wikimedia.org/P84736 and previous config saved to /var/cache/conftool/dbconfig/20251104-120338-marostegui.json |
[production] |
| 12:00 |
<topranks> |
upgrade lsw1-d3-eqiad to SR-Linux v24.10.3 |
[production] |
| 11:57 |
<fabfur> |
temporarily disable puppet on A:cp to apply https://gerrit.wikimedia.org/r/c/operations/puppet/+/1199247 (T408060) |
[production] |
| 11:52 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1192 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P84735 and previous config saved to /var/cache/conftool/dbconfig/20251104-115217-root.json |
[production] |