2024-11-25
23:09 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp7015.magru.wmnet with OS bullseye [production]
23:02 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp7015.magru.wmnet with OS bullseye [production]
23:01 <bking@cumin1002> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) [production]
23:01 <bking@cumin1002> START - Cookbook sre.wdqs.data-transfer [production]
23:01 <brett@cumin2002> END (FAIL) - Cookbook sre.dns.wipe-cache (exit_code=99) cp7015.magru.wmnet lvs7003.magru.wmnet cp7015.mgmt.magru.wmnet lvs7003.mgmt.magru.wmnet on all recursors [production]
23:00 <brett@cumin2002> START - Cookbook sre.dns.wipe-cache cp7015.magru.wmnet lvs7003.magru.wmnet cp7015.mgmt.magru.wmnet lvs7003.mgmt.magru.wmnet on all recursors [production]
23:00 <brett@cumin2002> END (FAIL) - Cookbook sre.dns.wipe-cache (exit_code=99) cp7015.magru.wmnet lvs7003.magru.wmnet on all recursors [production]
23:00 <brett@cumin2002> START - Cookbook sre.dns.wipe-cache cp7015.magru.wmnet lvs7003.magru.wmnet on all recursors [production]
22:56 <brett> Import varnish-modules 0.20.0-2~deb11u1 into varnish-staging apt component [production]
22:56 <bking@cumin1002> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T376150, initialize wdqs internal scholarly tier) xfer wikidata from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:56 <bking@cumin1002> START - Cookbook sre.wdqs.data-transfer (T376150, initialize wdqs internal scholarly tier) xfer wikidata from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:53 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T376150, initialize wdqs internal scholarly tier) xfer wikidata from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:53 <bking@cumin2002> START - Cookbook sre.wdqs.data-transfer (T376150, initialize wdqs internal scholarly tier) xfer wikidata from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:49 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db2191 (T380449)', diff saved to https://phabricator.wikimedia.org/P71158 and previous config saved to /var/cache/conftool/dbconfig/20241125-224949-ladsgroup.json [production]
22:49 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2191.codfw.wmnet with reason: Maintenance [production]
22:49 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 6:00:00 on db2191.codfw.wmnet with reason: Maintenance [production]
22:49 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2131 (T380449)', diff saved to https://phabricator.wikimedia.org/P71157 and previous config saved to /var/cache/conftool/dbconfig/20241125-224927-ladsgroup.json [production]
22:48 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T376150, initialize wdqs internal scholarly tier) xfer scholarly_articles from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:48 <bking@cumin2002> START - Cookbook sre.wdqs.data-transfer (T376150, initialize wdqs internal scholarly tier) xfer scholarly_articles from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:46 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp7015.magru.wmnet with OS bullseye [production]
22:43 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T376150, initialize wdqs internal scholarly tier) xfer scholarly_articles from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:43 <bking@cumin2002> START - Cookbook sre.wdqs.data-transfer (T376150, initialize wdqs internal scholarly tier) xfer scholarly_articles from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:38 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T376150, initialize wdqs internal scholarly tier) xfer scholarly_articles from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:38 <bking@cumin2002> START - Cookbook sre.wdqs.data-transfer (T376150, initialize wdqs internal scholarly tier) xfer scholarly_articles from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:37 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T376150, initialize wdqs internal scholarly tier) xfer scholarly_articles from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:37 <bking@cumin2002> START - Cookbook sre.wdqs.data-transfer (T376150, initialize wdqs internal scholarly tier) xfer scholarly_articles from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:37 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T376150, initialize wdqs internal scholarly tier) xfer scholarly_articles from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:37 <bking@cumin2002> START - Cookbook sre.wdqs.data-transfer (T376150, initialize wdqs internal scholarly tier) xfer scholarly_articles from wdqs2024.codfw.wmnet -> wdqs2026.codfw.wmnet, repooling source-only afterwards [production]
22:34 <tgr|away> UTC late deploys done [production]
22:34 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2131', diff saved to https://phabricator.wikimedia.org/P71156 and previous config saved to /var/cache/conftool/dbconfig/20241125-223420-ladsgroup.json [production]
22:33 <tgr@deploy2002> Finished scap sync-world: Backport for [[gerrit:1097327|SUL3: Sort overrides (T373737)]], [[gerrit:1097328|More authentication domain overrides (T373737)]], [[gerrit:1097322|Update private/readme.php to match production]] (duration: 12m 49s) [production]
22:32 <eileen> civicrm upgraded from b7bd670f to 190ea417 [production]
22:31 <brett@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host lvs7003.magru.wmnet with OS bullseye [production]
22:27 <tgr@deploy2002> tgr: Continuing with sync [production]
22:25 <tgr@deploy2002> tgr: Backport for [[gerrit:1097327|SUL3: Sort overrides (T373737)]], [[gerrit:1097328|More authentication domain overrides (T373737)]], [[gerrit:1097322|Update private/readme.php to match production]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
22:21 <tgr@deploy2002> Started scap sync-world: Backport for [[gerrit:1097327|SUL3: Sort overrides (T373737)]], [[gerrit:1097328|More authentication domain overrides (T373737)]], [[gerrit:1097322|Update private/readme.php to match production]] [production]
22:19 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2131', diff saved to https://phabricator.wikimedia.org/P71155 and previous config saved to /var/cache/conftool/dbconfig/20241125-221913-ladsgroup.json [production]
22:19 <tgr@deploy2002> Finished scap sync-world: Backport for [[gerrit:1097518|Reader Survey: Increase coverage (T378660)]] (duration: 14m 08s) [production]
22:13 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs7003.magru.wmnet with reason: host reimage [production]
22:12 <tgr@deploy2002> tgr, dani: Continuing with sync [production]
22:09 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on lvs7003.magru.wmnet with reason: host reimage [production]
22:09 <tgr@deploy2002> tgr, dani: Backport for [[gerrit:1097518|Reader Survey: Increase coverage (T378660)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
22:09 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp7015.magru.wmnet with OS bullseye [production]
22:08 <brett@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp7015.magru.wmnet with OS bullseye [production]
22:04 <tgr@deploy2002> Started scap sync-world: Backport for [[gerrit:1097518|Reader Survey: Increase coverage (T378660)]] [production]
22:04 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2131 (T380449)', diff saved to https://phabricator.wikimedia.org/P71154 and previous config saved to /var/cache/conftool/dbconfig/20241125-220406-ladsgroup.json [production]
22:02 <tgr@deploy2002> Finished scap sync-world: Backport for [[gerrit:1097457|LoginCompleteHookHandler: onTempUserCreatedRedirect() should use getPrimaryInstance() (T380042)]] (duration: 12m 41s) [production]
21:56 <tgr@deploy2002> tgr, matmarex: Continuing with sync [production]
21:54 <tgr@deploy2002> tgr, matmarex: Backport for [[gerrit:1097457|LoginCompleteHookHandler: onTempUserCreatedRedirect() should use getPrimaryInstance() (T380042)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:50 <tgr@deploy2002> Started scap sync-world: Backport for [[gerrit:1097457|LoginCompleteHookHandler: onTempUserCreatedRedirect() should use getPrimaryInstance() (T380042)]] [production]