2025-03-19
§
|
10:49 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti7001.magru.wmnet |
[production] |
10:48 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti7001.magru.wmnet |
[production] |
10:38 |
<ayounsi@dns1004> |
END - running authdns-update |
[production] |
10:35 |
<ayounsi@dns1004> |
START - running authdns-update |
[production] |
10:18 |
<XioNoX> |
trunk sandbox vlan to ganeti7001/3 - T385560 |
[production] |
10:13 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
10:13 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: pickup new magru sandbox includes files - ayounsi@cumin1002" |
[production] |
10:13 |
<ayounsi@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: pickup new magru sandbox includes files - ayounsi@cumin1002" |
[production] |
10:13 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host elastic1111.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
10:11 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-staging-ctrl2002.codfw.wmnet with OS bookworm |
[production] |
10:11 |
<elukey@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/kartotherian: sync |
[production] |
10:10 |
<vgutierrez> |
Upgrading cp4049 to Varnish 7 (T378737) |
[production] |
10:09 |
<ayounsi@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
10:09 |
<elukey@deploy2002> |
helmfile [eqiad] START helmfile.d/services/kartotherian: sync |
[production] |
10:06 |
<elukey@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/kartotherian: sync |
[production] |
10:04 |
<ayounsi@cumin1002> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
10:04 |
<elukey@deploy2002> |
helmfile [codfw] START helmfile.d/services/kartotherian: sync |
[production] |
10:02 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.provision for host elastic1111.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
10:00 |
<ayounsi@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
09:57 |
<elukey@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host elastic1111.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
09:55 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.provision for host elastic1111.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
09:52 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-staging-ctrl2002.codfw.wmnet with reason: host reimage |
[production] |
09:48 |
<XioNoX> |
add sandbox vlan on asw1-b3-magru - T385560 |
[production] |
09:48 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-staging-ctrl2002.codfw.wmnet with reason: host reimage |
[production] |
09:40 |
<jnuche@deploy2002> |
rebuilt and synchronized wikiversions files: group0 to 1.44.0-wmf.21 refs T386216 |
[production] |
09:32 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.reimage for host ml-staging-ctrl2002.codfw.wmnet with OS bookworm |
[production] |
09:31 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1248.eqiad.wmnet with reason: Maintenance |
[production] |
09:30 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-staging-ctrl2001.codfw.wmnet with OS bookworm |
[production] |
09:30 |
<vgutierrez> |
Upgrading cp4048 to Varnish 7 (T378737) |
[production] |
09:30 |
<marostegui> |
Stop MariaDB on db1248 T388837 |
[production] |
09:19 |
<jnuche@deploy2002> |
rebuilt and synchronized wikiversions files: group1 to 1.44.0-wmf.21 refs T386216 |
[production] |
09:13 |
<dcausse@deploy2002> |
Finished deploy [airflow-dags/search@e55954c]: publish search artifacts (duration: 00m 38s) |
[production] |
09:13 |
<dcausse@deploy2002> |
Started deploy [airflow-dags/search@e55954c]: publish search artifacts |
[production] |
09:12 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-staging-ctrl2001.codfw.wmnet with reason: host reimage |
[production] |
09:08 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-staging-ctrl2001.codfw.wmnet with reason: host reimage |
[production] |
09:06 |
<moritzm> |
installing pdns-recursor security updates on DoH hosts |
[production] |
09:00 |
<elukey> |
remove kartotherian.discovery.wmnet:{80,443} ports from LVS config (some extra noise may be registered) |
[production] |
08:51 |
<logmsgbot> |
kharlan Deployed security patch for T389235 |
[production] |
08:49 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.reimage for host ml-staging-ctrl2001.codfw.wmnet with OS bookworm |
[production] |
08:37 |
<logmsgbot> |
kharlan Deployed security patch for T389235 |
[production] |
08:20 |
<vgutierrez> |
Upgrading cp4046 to Varnish 7 (T378737) |
[production] |
07:54 |
<jmm@cumin2002> |
DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Jennifer Ebe out of all services on: 1294 hosts |
[production] |
07:53 |
<jmm@cumin2002> |
DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Jennifer Ebe out of all services on: 949 hosts |
[production] |
07:51 |
<moritzm> |
rebalance ganeti eqiad/B following reimages T382507 |
[production] |
07:38 |
<wmbot~melos@tools-bastion-13> |
stewardbots/StewardBot/manage.sh restart # Not reading RC |
[tools.stewardbots] |
06:52 |
<vgutierrez> |
Upgrading cp4045 to Varnish 7 (T378737) |
[production] |
06:37 |
<vgutierrez> |
Upgrading cp4042 to Varnish 7 (T378737) |
[production] |
06:24 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment codfw1dev for all services |
[admin] |
06:21 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.restart_openstack on deployment codfw1dev for all services |
[admin] |
06:14 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment codfw1dev for all services |
[admin] |