| 
      
        2024-11-25
      
      ยง
     | 
  
    
  | 22:33 | 
  <tgr@deploy2002> | 
  Finished scap sync-world: Backport for [[gerrit:1097327|SUL3: Sort overrides (T373737)]], [[gerrit:1097328|More authentication domain overrides (T373737)]], [[gerrit:1097322|Update private/readme.php to match production]] (duration: 12m 49s) | 
  [production] | 
            
  | 22:32 | 
  <eileen> | 
  civicrm upgraded from b7bd670f to 190ea417 | 
  [production] | 
            
  | 22:31 | 
  <brett@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host lvs7003.magru.wmnet with OS bullseye | 
  [production] | 
            
  | 22:27 | 
  <tgr@deploy2002> | 
  tgr: Continuing with sync | 
  [production] | 
            
  | 22:25 | 
  <tgr@deploy2002> | 
  tgr: Backport for [[gerrit:1097327|SUL3: Sort overrides (T373737)]], [[gerrit:1097328|More authentication domain overrides (T373737)]], [[gerrit:1097322|Update private/readme.php to match production]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 22:21 | 
  <tgr@deploy2002> | 
  Started scap sync-world: Backport for [[gerrit:1097327|SUL3: Sort overrides (T373737)]], [[gerrit:1097328|More authentication domain overrides (T373737)]], [[gerrit:1097322|Update private/readme.php to match production]] | 
  [production] | 
            
  | 22:19 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2131', diff saved to https://phabricator.wikimedia.org/P71155 and previous config saved to /var/cache/conftool/dbconfig/20241125-221913-ladsgroup.json | 
  [production] | 
            
  | 22:19 | 
  <tgr@deploy2002> | 
  Finished scap sync-world: Backport for [[gerrit:1097518|Reader Survey: Increase coverage (T378660)]] (duration: 14m 08s) | 
  [production] | 
            
  | 22:13 | 
  <brett@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs7003.magru.wmnet with reason: host reimage | 
  [production] | 
            
  | 22:12 | 
  <tgr@deploy2002> | 
  tgr, dani: Continuing with sync | 
  [production] | 
            
  | 22:09 | 
  <brett@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on lvs7003.magru.wmnet with reason: host reimage | 
  [production] | 
            
  | 22:09 | 
  <tgr@deploy2002> | 
  tgr, dani: Backport for [[gerrit:1097518|Reader Survey: Increase coverage (T378660)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 22:09 | 
  <brett@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host cp7015.magru.wmnet with OS bullseye | 
  [production] | 
            
  | 22:08 | 
  <brett@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp7015.magru.wmnet with OS bullseye | 
  [production] | 
            
  | 22:04 | 
  <tgr@deploy2002> | 
  Started scap sync-world: Backport for [[gerrit:1097518|Reader Survey: Increase coverage (T378660)]] | 
  [production] | 
            
  | 22:04 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2131 (T380449)', diff saved to https://phabricator.wikimedia.org/P71154 and previous config saved to /var/cache/conftool/dbconfig/20241125-220406-ladsgroup.json | 
  [production] | 
            
  | 22:02 | 
  <tgr@deploy2002> | 
  Finished scap sync-world: Backport for [[gerrit:1097457|LoginCompleteHookHandler: onTempUserCreatedRedirect() should use getPrimaryInstance() (T380042)]] (duration: 12m 41s) | 
  [production] | 
            
  | 21:56 | 
  <tgr@deploy2002> | 
  tgr, matmarex: Continuing with sync | 
  [production] | 
            
  | 21:54 | 
  <tgr@deploy2002> | 
  tgr, matmarex: Backport for [[gerrit:1097457|LoginCompleteHookHandler: onTempUserCreatedRedirect() should use getPrimaryInstance() (T380042)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 21:50 | 
  <tgr@deploy2002> | 
  Started scap sync-world: Backport for [[gerrit:1097457|LoginCompleteHookHandler: onTempUserCreatedRedirect() should use getPrimaryInstance() (T380042)]] | 
  [production] | 
            
  | 21:49 | 
  <tgr@deploy2002> | 
  Finished scap sync-world: Backport for [[gerrit:1094054|Reader Survey: Increase coverage on enwiki (T378660)]] (duration: 16m 06s) | 
  [production] | 
            
  | 21:46 | 
  <brett@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host cp7015.magru.wmnet with OS bullseye | 
  [production] | 
            
  | 21:45 | 
  <brett@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp7015.magru.wmnet with OS bullseye | 
  [production] | 
            
  | 21:42 | 
  <tgr@deploy2002> | 
  tgr, dani: Continuing with sync | 
  [production] | 
            
  | 21:39 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Depooling db2131 (T380449)', diff saved to https://phabricator.wikimedia.org/P71153 and previous config saved to /var/cache/conftool/dbconfig/20241125-213904-ladsgroup.json | 
  [production] | 
            
  | 21:38 | 
  <ladsgroup@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2131.codfw.wmnet with reason: Maintenance | 
  [production] | 
            
  | 21:38 | 
  <ladsgroup@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 6:00:00 on db2131.codfw.wmnet with reason: Maintenance | 
  [production] | 
            
  | 21:38 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2115 (T380449)', diff saved to https://phabricator.wikimedia.org/P71152 and previous config saved to /var/cache/conftool/dbconfig/20241125-213841-ladsgroup.json | 
  [production] | 
            
  | 21:37 | 
  <tgr@deploy2002> | 
  tgr, dani: Backport for [[gerrit:1094054|Reader Survey: Increase coverage on enwiki (T378660)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 21:33 | 
  <tgr@deploy2002> | 
  Started scap sync-world: Backport for [[gerrit:1094054|Reader Survey: Increase coverage on enwiki (T378660)]] | 
  [production] | 
            
  | 21:31 | 
  <brett@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host lvs7003.magru.wmnet with OS bullseye | 
  [production] | 
            
  | 21:30 | 
  <brett@cumin2002> | 
  END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host lvs7003.magru.wmnet with OS bullseye | 
  [production] | 
            
  | 21:29 | 
  <tgr@deploy2002> | 
  Finished scap sync-world: Backport for [[gerrit:1097417|Reader Survey: Fix yes/no messages (T378660)]] (duration: 16m 02s) | 
  [production] | 
            
  | 21:23 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2115', diff saved to https://phabricator.wikimedia.org/P71151 and previous config saved to /var/cache/conftool/dbconfig/20241125-212334-ladsgroup.json | 
  [production] | 
            
  | 21:22 | 
  <tgr@deploy2002> | 
  dani, tgr: Continuing with sync | 
  [production] | 
            
  | 21:18 | 
  <sukhe@cumin1002> | 
  END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "testing - sukhe@cumin1002" | 
  [production] | 
            
  | 21:18 | 
  <sukhe@cumin1002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "testing - sukhe@cumin1002" | 
  [production] | 
            
  | 21:17 | 
  <tgr@deploy2002> | 
  dani, tgr: Backport for [[gerrit:1097417|Reader Survey: Fix yes/no messages (T378660)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 21:13 | 
  <tgr@deploy2002> | 
  Started scap sync-world: Backport for [[gerrit:1097417|Reader Survey: Fix yes/no messages (T378660)]] | 
  [production] | 
            
  | 21:08 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2115', diff saved to https://phabricator.wikimedia.org/P71150 and previous config saved to /var/cache/conftool/dbconfig/20241125-210827-ladsgroup.json | 
  [production] | 
            
  | 21:04 | 
  <brett@cumin2002> | 
  END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Host reimage - brett@cumin2002 - brett@cumin2002" | 
  [production] | 
            
  | 21:04 | 
  <brett@cumin2002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Host reimage - brett@cumin2002 - brett@cumin2002" | 
  [production] | 
            
  | 21:03 | 
  <eevans@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts restbase[2021-2023].codfw.wmnet | 
  [production] | 
            
  | 21:03 | 
  <eevans@cumin1002> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 21:03 | 
  <eevans@cumin1002> | 
  END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: restbase[2021-2023].codfw.wmnet decommissioned, removing all IPs except the asset tag one - eevans@cumin1002" | 
  [production] | 
            
  | 21:03 | 
  <eevans@cumin1002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: restbase[2021-2023].codfw.wmnet decommissioned, removing all IPs except the asset tag one - eevans@cumin1002" | 
  [production] | 
            
  | 20:59 | 
  <eevans@cumin1002> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 20:57 | 
  <brett@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp7008.magru.wmnet with OS bullseye | 
  [production] | 
            
  | 20:57 | 
  <brett@cumin2002> | 
  END (FAIL) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=99) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - brett@cumin2002" | 
  [production] | 
            
  | 20:56 | 
  <robh@cumin2002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - robh@cumin2002" | 
  [production] |