801-850 of 977 results (21ms)
2019-10-10 §
11:59 <arturo> network switch hardware is down affecting cloudvirt1025/1026 (T227536) VMs are supposed to be online but unreachable [admin]
2019-10-09 §
10:44 <arturo> cloudvirt1013 rebooted well [admin]
10:32 <arturo> cloudvirt1013 is rebooting [admin]
10:32 <arturo> cloudvirt1012 rebooted just fine (very slow, 35 VMs) [admin]
10:20 <arturo> cloudvirt1012 is rebooting [admin]
10:19 <arturo> cloudvirt1009 rebooted just fine (very slow though) [admin]
10:06 <arturo> cloudvirt1009 is rebooting [admin]
10:06 <arturo> cloudvirt1008 rebooted just fine (very slow though) [admin]
09:58 <arturo> cloudvirt1008 is rebooting [admin]
09:52 <arturo> icinga downtime toolschecker, paws, etc for 2h, because cloudvirt reboots [admin]
2019-10-07 §
14:07 <arturo> horizon is disabled for maintenance (T212302) [admin]
14:00 <arturo> starting scheduled maintenance: upgrading eqiad1 from openstack mitaka to newton [admin]
2019-10-02 §
15:23 <arturo> codfw1dev renaming net/subnet objects to a more modern naming scheme T233665 [admin]
12:49 <arturo> codfw1dev delete all floating ip allocations in the deployment for mangling the network config for testing T233665 [admin]
12:47 <arturo> codfw1dev deleting all VMs in the deployment for mangling the network config for testing T233665 [admin]
11:08 <arturo> codfw1dev rebooting cloudnet2002-dev and cloudnet2003-dev for testing T233665 [admin]
10:31 <arturo> codfw1dev: add cloudinstances2b-gw router to the l3 agent in cloudnet2003-dev [admin]
09:59 <arturo> codfw1dev: cleanup leftover "HA port tenant admin" in neutron (ports from missing servers) [admin]
09:46 <arturo> codfw1dev: cleanup leftover neutron agents [admin]
2019-09-30 §
10:21 <arturo> we installed ferm in every VM by mistake. Deleting it and forcing a puppet agent run to try to go back to a clean state. [admin]
09:38 <arturo> downtime toolschecker for 24h [admin]
09:33 <arturo> force update ferm cloud-wide (in all VMs) for T153468 [admin]
2019-08-18 §
10:39 <arturo> rebooting cloudvirt1023 for new interface names configuration [admin]
10:34 <arturo> downtimed cloudvirt1023 for 2 days [admin]
2019-08-05 §
17:17 <bd808> Set downtime on gridengine and kubernetes webservice checks in icinga until 2019-09-02 (flaky tests) [admin]
2019-07-29 §
20:14 <bd808> Restarted maintain-kubeusers on tools-k8s-master-01 (T194859) [admin]
2019-07-25 §
12:32 <arturo> eqiad1/glance: debian-9.9-stretch image deprecates debian-9.8-stretch (T228983) [admin]
09:59 <arturo> (codfw1dev) drop missing glance images (T228972) [admin]
09:32 <arturo> (codfw1dev) deleting a bunch of VMs that were running in now missing hypervisors [admin]
09:31 <arturo> (codfw1dev) deleting a bunch of VMs in ERROR and SHUTDOWN state [admin]
09:27 <arturo> last log entry refers to the codfw1dev deployment [admin]
09:27 <arturo> cleanup `nova service-list` from old hypervisors (labtest*) [admin]
09:23 <arturo> refreshed nova DB grants in clouddb2001-dev for the codfw1dev deployment [admin]
08:47 <arturo> cleanup the cloud-announce pending emails (spam) [admin]
2019-07-23 §
19:43 <andrewbogott> restarting rabbitmq-server on cloudcontrol1003 and 1004 [admin]
2019-07-22 §
23:44 <bd808> Restarted maintain-kubeusers on tools-k8s-master-01 (T228529) [admin]
2019-07-11 §
22:07 <bd808> Ran `sudo systemctl stop designate_floating_ip_ptr_records_updater.service` on cloudcontrol1003 [admin]
22:01 <bd808> `sudo apt-get install python2.7-dbg` on cloudcontrol1003 to debug hung python process [admin]
21:48 <bd808> Ran `sudo systemctl stop designate_floating_ip_ptr_records_updater.service` on cloudcontrol1004 [admin]
2019-06-25 §
16:05 <bstorm_> updated python3.4 to update4 wherever it was installed on Jessie VMs to prevent issues with broken update3. [admin]
14:56 <bstorm_> Updated python 3.4 on the labs-puppetmaster server [admin]
2019-06-03 §
15:55 <arturo> T221769 rebooting cloudservices1003 after bootstrapping is apparently completed [admin]
2019-05-28 §
21:42 <bstorm_> unmounting labstore1003-scratch on all cloud clients [admin]
18:14 <bstorm_> T209527 switched mounts from labstore1003 to cloudstore1008 for scratch [admin]
2019-05-20 §
17:25 <arturo> T223923 dropped compat-network config from /etc/network/interfaces in eqiad1/codfw1dev neutron nodes [admin]
17:22 <arturo> T223923 dropped br-compat bridges and vlan interfaces (1102 and 2102) in eqiad1/codfw1dev neutron nodes [admin]
17:07 <arturo> T223923 dropped compat-network configuration from the neutron database in eqiad1 [admin]
16:55 <arturo> T223923 dropped compat-network configuration from the neutron database in codfw1dev [admin]
2019-05-15 §
17:00 <andrewbogott> touching /root/firstboot_done on all VMs that cumin can reach. This will prevent firstboot.sh from running a second time if/when any of these are rebooted. T223370 [admin]
2019-04-26 §
15:51 <arturo> andrew updated dns servers for the cloud-instances2-b-eqiad subnet in neutron: 208.80.154.143 and 208.80.154.24 [admin]