201-250 of 10000 results (95ms)
2026-06-04 ยง
07:08 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1297550|Revert "EventStreamConfig - webrequest.dumps.dev0 - enable canary events for hive ingestion"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
07:06 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1297550|Revert "EventStreamConfig - webrequest.dumps.dev0 - enable canary events for hive ingestion"]] [production]
07:04 <otto@deploy1003> Finished scap sync-world: Backport for [[gerrit:1297260|EventStreamConfig - webrequest.dumps.dev0 - enable canary events for hive ingestion (T425087)]] (duration: 399m 30s) [production]
07:03 <otto@deploy1003> otto: Rolling back deployment [production]
06:53 <cwilliams@cumin1003> START - Cookbook sre.mysql.pool pool db1255: Migration of db1255.eqiad.wmnet completed [production]
06:51 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1255.eqiad.wmnet with OS trixie [production]
06:38 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool db2191: Migration of db2191.codfw.wmnet completed [production]
06:35 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1255.eqiad.wmnet with reason: host reimage [production]
06:32 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2191.codfw.wmnet with OS trixie [production]
06:31 <cwilliams@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db1255.eqiad.wmnet with reason: host reimage [production]
06:16 <cwilliams@cumin1003> START - Cookbook sre.hosts.reimage for host db1255.eqiad.wmnet with OS trixie [production]
06:15 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2191.codfw.wmnet with reason: host reimage [production]
06:13 <cwilliams@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1255: Upgrading db1255.eqiad.wmnet [production]
06:12 <cwilliams@cumin1003> START - Cookbook sre.mysql.depool depool db1255: Upgrading db1255.eqiad.wmnet [production]
06:12 <cwilliams@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
06:11 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db2191.codfw.wmnet with reason: host reimage [production]
06:04 <cwilliams@cumin1003> dbctl commit (dc=all): 'Depool db1255 T427895', diff saved to https://phabricator.wikimedia.org/P93836 and previous config saved to /var/cache/conftool/dbconfig/20260604-060428-cwilliams.json [production]
06:03 <cwilliams@dns1004> END - running authdns-update [production]
06:02 <cwilliams@dns1004> START - running authdns-update [production]
05:54 <cwilliams@cumin1003> dbctl commit (dc=all): 'Promote db1258 to x3 primary and set section read-write T427895', diff saved to https://phabricator.wikimedia.org/P93835 and previous config saved to /var/cache/conftool/dbconfig/20260604-055429-cwilliams.json [production]
05:53 <cwilliams@cumin1003> dbctl commit (dc=all): 'Set x3 eqiad as read-only for maintenance - T427895', diff saved to https://phabricator.wikimedia.org/P93834 and previous config saved to /var/cache/conftool/dbconfig/20260604-055346-cwilliams.json [production]
05:53 <cezmunsta> Starting x3 eqiad failover from db1255 to db1258 - T427895 [production]
05:52 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host db2191.codfw.wmnet with OS trixie [production]
05:50 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db2191: Upgrading db2191.codfw.wmnet [production]
05:50 <marostegui@cumin1003> START - Cookbook sre.mysql.depool depool db2191: Upgrading db2191.codfw.wmnet [production]
05:50 <cwilliams@cumin1003> dbctl commit (dc=all): 'Set db1258 with weight 0 T427895', diff saved to https://phabricator.wikimedia.org/P93833 and previous config saved to /var/cache/conftool/dbconfig/20260604-055021-cwilliams.json [production]
05:50 <marostegui@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
05:50 <cwilliams@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 18 hosts with reason: Primary switchover x3 T427895 [production]
05:48 <kevinbazira@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
05:46 <marostegui@cumin1003> dbctl commit (dc=all): 'Depool db2191 T428120', diff saved to https://phabricator.wikimedia.org/P93832 and previous config saved to /var/cache/conftool/dbconfig/20260604-054614-marostegui.json [production]
05:45 <marostegui@cumin1003> dbctl commit (dc=all): 'Promote db2215 to x1 primary T428120', diff saved to https://phabricator.wikimedia.org/P93831 and previous config saved to /var/cache/conftool/dbconfig/20260604-054528-marostegui.json [production]
05:44 <marostegui> Starting x1 codfw failover from db2191 to db2215 - T428120 [production]
05:27 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 16 hosts with reason: Primary switchover x1 T428120 [production]
05:27 <marostegui@cumin1003> dbctl commit (dc=all): 'Set db2215 with weight 0 T428120', diff saved to https://phabricator.wikimedia.org/P93830 and previous config saved to /var/cache/conftool/dbconfig/20260604-052722-marostegui.json [production]
05:19 <kevinbazira@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
03:45 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1263 (T426633)', diff saved to https://phabricator.wikimedia.org/P93829 and previous config saved to /var/cache/conftool/dbconfig/20260604-034546-fceratto.json [production]
03:35 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1263', diff saved to https://phabricator.wikimedia.org/P93828 and previous config saved to /var/cache/conftool/dbconfig/20260604-033538-fceratto.json [production]
03:25 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1263', diff saved to https://phabricator.wikimedia.org/P93827 and previous config saved to /var/cache/conftool/dbconfig/20260604-032531-fceratto.json [production]
03:15 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1263 (T426633)', diff saved to https://phabricator.wikimedia.org/P93826 and previous config saved to /var/cache/conftool/dbconfig/20260604-031523-fceratto.json [production]
03:07 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1263 (T426633)', diff saved to https://phabricator.wikimedia.org/P93825 and previous config saved to /var/cache/conftool/dbconfig/20260604-030710-fceratto.json [production]
03:07 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1263.eqiad.wmnet with reason: Maintenance [production]
03:06 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1262 (T426633)', diff saved to https://phabricator.wikimedia.org/P93824 and previous config saved to /var/cache/conftool/dbconfig/20260604-030642-fceratto.json [production]
02:56 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1262', diff saved to https://phabricator.wikimedia.org/P93823 and previous config saved to /var/cache/conftool/dbconfig/20260604-025634-fceratto.json [production]
02:46 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1262', diff saved to https://phabricator.wikimedia.org/P93822 and previous config saved to /var/cache/conftool/dbconfig/20260604-024627-fceratto.json [production]
02:36 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1262 (T426633)', diff saved to https://phabricator.wikimedia.org/P93821 and previous config saved to /var/cache/conftool/dbconfig/20260604-023619-fceratto.json [production]
02:28 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1262 (T426633)', diff saved to https://phabricator.wikimedia.org/P93820 and previous config saved to /var/cache/conftool/dbconfig/20260604-022809-fceratto.json [production]
02:28 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1262.eqiad.wmnet with reason: Maintenance [production]
02:27 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1261 (T426633)', diff saved to https://phabricator.wikimedia.org/P93819 and previous config saved to /var/cache/conftool/dbconfig/20260604-022742-fceratto.json [production]
02:17 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1261', diff saved to https://phabricator.wikimedia.org/P93818 and previous config saved to /var/cache/conftool/dbconfig/20260604-021734-fceratto.json [production]
02:07 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1261', diff saved to https://phabricator.wikimedia.org/P93817 and previous config saved to /var/cache/conftool/dbconfig/20260604-020726-fceratto.json [production]