| 2024-10-02 |
    
  | 15:33 | <cdanis@deploy1003> | helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 15:31 | <cdanis@deploy1003> | helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 15:31 | <cdanis@deploy1003> | helmfile [staging-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 15:30 | <kcvelaga@deploy2002> | Finished deploy [airflow-dags/analytics_product@3a7901e]: T375153 (duration: 01m 59s) | [production] | 
            
  | 15:28 | <swfrench@cumin1002> | END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) status all services in all: None - None | [production] | 
            
  | 15:28 | <swfrench@cumin1002> | START - Cookbook sre.discovery.datacenter status all services in all: None - None | [production] | 
            
  | 15:28 | <kcvelaga@deploy2002> | Started deploy [airflow-dags/analytics_product@3a7901e]: T375153 | [production] | 
            
  | 15:27 | <swfrench@cumin1002> | END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) pool all active/active services in eqiad: Datacenter Switchover - T370962 | [production] | 
            
  | 15:26 | <dancy@deploy2002> | Finished scap sync-world: Testing T370934 (duration: 03m 19s) | [production] | 
            
  | 15:24 | <aborrero@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan+apply for main branch | [admin] | 
            
  | 15:24 | <jelto@cumin1002> | END (FAIL) - Cookbook sre.gitlab.upgrade (exit_code=99) on GitLab host gitlab1003.wikimedia.org with reason: test I946dd0b73b6be2d6b8093f03550f78d76188b92b with dummy upgrade | [production] | 
            
  | 15:23 | <aborrero@cloudcumin1001> | START - Cookbook wmcs.openstack.tofu running tofu plan+apply for main branch | [admin] | 
            
  | 15:23 | <aborrero@cloudcumin1001> | END (ERROR) - Cookbook wmcs.openstack.tofu (exit_code=97) running tofu plan+apply for main branch | [admin] | 
            
  | 15:23 | <aborrero@cloudcumin1001> | START - Cookbook wmcs.openstack.tofu running tofu plan+apply for main branch | [admin] | 
            
  | 15:23 | <jelto@cumin1002> | START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: test I946dd0b73b6be2d6b8093f03550f78d76188b92b with dummy upgrade | [production] | 
            
  | 15:22 | <dancy@deploy2002> | Started scap sync-world: Testing T370934 | [production] | 
            
  | 15:18 | <dancy@deploy2002> | Installation of scap version "4.108.0" completed for 210 hosts | [production] | 
            
  | 15:14 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on registry1004.eqiad.wmnet with reason: testing | [production] | 
            
  | 15:14 | <elukey@cumin1002> | START - Cookbook sre.hosts.downtime for 1:00:00 on registry1004.eqiad.wmnet with reason: testing | [production] | 
            
  | 15:13 | <dancy@deploy2002> | Installing scap version "4.108.0" for 210 hosts | [production] | 
            
  | 15:12 | <cdanis@deploy1003> | helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 15:12 | <cdanis@deploy1003> | helmfile [staging-codfw] START helmfile.d/admin 'apply'. | [production] | 
            
  | 15:07 | <swfrench@cumin1002> | START - Cookbook sre.discovery.datacenter pool all active/active services in eqiad: Datacenter Switchover - T370962 | [production] | 
            
  | 15:07 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host logging-hd2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 15:04 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host logging-hd2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 15:00 | <swfrench@cumin1002> | END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) status all services in all: None - None | [production] | 
            
  | 15:00 | <swfrench@cumin1002> | START - Cookbook sre.discovery.datacenter status all services in all: None - None | [production] | 
            
  | 14:59 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host logging-hd2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 14:56 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host logging-hd2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 14:51 | <bking@cumin2002> | START - Cookbook sre.hosts.reimage for host wdqs-categories1001.eqiad.wmnet with OS bullseye | [production] | 
            
  | 14:46 | <bking@cumin2002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM wdqs-categories1001.eqiad.wmnet - bking@cumin2002" | [production] | 
            
  | 14:46 | <bking@cumin2002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM wdqs-categories1001.eqiad.wmnet - bking@cumin2002" | [production] | 
            
  | 14:45 | <bking@cumin2002> | END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wdqs-categories1001.eqiad.wmnet on all recursors | [production] | 
            
  | 14:45 | <bking@cumin2002> | START - Cookbook sre.dns.wipe-cache wdqs-categories1001.eqiad.wmnet on all recursors | [production] | 
            
  | 14:45 | <bking@cumin2002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 14:45 | <bking@cumin2002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM wdqs-categories1001.eqiad.wmnet - bking@cumin2002" | [production] | 
            
  | 14:44 | <bking@cumin2002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM wdqs-categories1001.eqiad.wmnet - bking@cumin2002" | [production] | 
            
  | 14:40 | <elukey@cumin1002> | END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host irc1004.wikimedia.org | [production] | 
            
  | 14:40 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host irc1004.wikimedia.org with OS bookworm | [production] | 
            
  | 14:30 | <bking@cumin2002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 14:30 | <bking@cumin2002> | START - Cookbook sre.ganeti.makevm for new host wdqs-categories1001.eqiad.wmnet | [production] | 
            
  | 14:29 | <jayme@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestage2001.codfw.wmnet with OS bookworm | [production] | 
            
  | 14:26 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on irc1004.wikimedia.org with reason: host reimage | [production] | 
            
  | 14:22 | <elukey@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on irc1004.wikimedia.org with reason: host reimage | [production] | 
            
  | 14:21 | <urbanecm@deploy2002> | Finished scap sync-world: Backport for [[gerrit:1077399|labswiki: Disallow account autocreation (T161859)]] (duration: 07m 38s) | [production] | 
            
  | 14:17 | <urbanecm@deploy2002> | urbanecm: Continuing with sync | [production] | 
            
  | 14:16 | <urbanecm@deploy2002> | urbanecm: Backport for [[gerrit:1077399|labswiki: Disallow account autocreation (T161859)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 14:14 | <urbanecm@deploy2002> | Started scap sync-world: Backport for [[gerrit:1077399|labswiki: Disallow account autocreation (T161859)]] | [production] | 
            
  | 14:12 | <elukey@cumin1002> | START - Cookbook sre.hosts.reimage for host irc1004.wikimedia.org with OS bookworm | [production] | 
            
  | 14:11 | <hashar@deploy2002> | Finished scap sync-world: Backport for [[gerrit:1077390|Remove Maintenance check (T376255)]] (duration: 07m 27s) | [production] |