2025-01-06 §
06:23 <marostegui@cumin1002> START - Cookbook sre.dns.netbox [production]
06:20 <marostegui@cumin1002> dbctl commit (dc=all): 'db1193 (re)pooling @ 50%: Repooling after schema change', diff saved to https://phabricator.wikimedia.org/P71790 and previous config saved to /var/cache/conftool/dbconfig/20250106-062040-root.json [production]
06:19 <marostegui@cumin1002> START - Cookbook sre.hosts.decommission for hosts es2020.codfw.wmnet [production]
06:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Remove es2020 from dbctl T382945', diff saved to https://phabricator.wikimedia.org/P71789 and previous config saved to /var/cache/conftool/dbconfig/20250106-061832-marostegui.json [production]
06:08 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts es2021.codfw.wmnet [production]
06:08 <marostegui@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
06:08 <marostegui@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: es2021.codfw.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1002" [production]
06:08 <marostegui@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: es2021.codfw.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1002" [production]
06:05 <marostegui@cumin1002> dbctl commit (dc=all): 'db1193 (re)pooling @ 25%: Repooling after schema change', diff saved to https://phabricator.wikimedia.org/P71787 and previous config saved to /var/cache/conftool/dbconfig/20250106-060534-root.json [production]
06:04 <marostegui@cumin1002> START - Cookbook sre.dns.netbox [production]
06:00 <marostegui@cumin1002> START - Cookbook sre.hosts.decommission for hosts es2021.codfw.wmnet [production]
05:57 <marostegui@cumin1002> dbctl commit (dc=all): 'Remove es2021 from dbctl T382944', diff saved to https://phabricator.wikimedia.org/P71786 and previous config saved to /var/cache/conftool/dbconfig/20250106-055726-marostegui.json [production]
05:50 <marostegui@cumin1002> dbctl commit (dc=all): 'db1193 (re)pooling @ 10%: Repooling after schema change', diff saved to https://phabricator.wikimedia.org/P71785 and previous config saved to /var/cache/conftool/dbconfig/20250106-055029-root.json [production]
03:39 <tstarling@deploy2002> Finished scap sync-world: Backport for [[gerrit:1101239|Enable canShellboxGetTempUrl everywhere (T292322)]] (duration: 21m 53s) [production]
03:31 <tstarling@deploy2002> tstarling: Continuing with sync [production]
03:31 <tstarling@deploy2002> tstarling: Backport for [[gerrit:1101239|Enable canShellboxGetTempUrl everywhere (T292322)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
03:17 <tstarling@deploy2002> Started scap sync-world: Backport for [[gerrit:1101239|Enable canShellboxGetTempUrl everywhere (T292322)]] [production]
2025-01-05 §
18:58 <lucaswerkmeister> remove /tmp/framer.txt on tools-bastion-13 (I notified the owner privately), and replace it with a root-owned file to prevent iTerm from leaking logs into it (https://iterm2.com/downloads/stable/iTerm2-3_5_11.changelog) on tools-sgebastion-10, tools-bastion-12 and tools-bastion-13 [tools]
2025-01-04 §
21:34 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud [traffic]
21:31 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud [traffic]
20:54 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud [traffic]
20:50 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud [traffic]
20:02 <dzahn@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on doc1003.eqiad.wmnet with reason: dz [production]
20:01 <dzahn@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on doc1003.eqiad.wmnet with reason: dz [production]
19:33 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud [traffic]
19:29 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud [traffic]
18:30 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud [traffic]
18:27 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud [traffic]
18:22 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud [traffic]
18:18 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud [traffic]
18:02 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud [traffic]
17:59 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on traffic-puppetserver-bookworm.traffic.eqiad1.wikimedia.cloud [traffic]
15:45 <wmbot~sakretsu@tools-bastion-13> implemented health checks for draftbot [tools.itwiki]
2025-01-03 §
21:46 <bd808@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for tools-k8s-worker-nfs-69 [tools]
21:41 <bd808@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-69 [tools]
21:40 <bd808@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.k8s.worker.drain (exit_code=99) for node tools-k8s-worker-nfs-69 [tools]
21:35 <bd808@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.worker.drain for node tools-k8s-worker-nfs-69 [tools]
21:01 <bd808> `sudo service maintain-dbusers restart` on cloudcontrol1005. Report of missing replica.my.cnf and journalctl output empty due to log rotation. (T382962) [admin]
19:09 <aokoth@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on doc2002.codfw.wmnet with reason: Disk Change [production]
19:09 <aokoth@cumin1002> START - Cookbook sre.hosts.downtime for 5 days, 0:00:00 on doc2002.codfw.wmnet with reason: Disk Change [production]
18:47 <aokoth@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on doc2002.codfw.wmnet with reason: Disk Change [production]
18:46 <aokoth@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on doc2002.codfw.wmnet with reason: Disk Change [production]
17:15 <thcipriani@deploy2002> Finished deploy [gerrit/gerrit@44854d4]: remove developer satisfaction survey banner (gerrit1003.wikimedia.org only) (duration: 00m 10s) [production]
17:15 <thcipriani@deploy2002> Started deploy [gerrit/gerrit@44854d4]: remove developer satisfaction survey banner (gerrit1003.wikimedia.org only) [production]
17:06 <thcipriani@deploy2002> Finished deploy [gerrit/gerrit@44854d4]: remove developer satisfaction survey banner (gerrit2002.wikimedia.org only) (duration: 00m 08s) [production]
17:06 <thcipriani@deploy2002> Started deploy [gerrit/gerrit@44854d4]: remove developer satisfaction survey banner (gerrit2002.wikimedia.org only) [production]
17:00 <isaranto@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
16:55 <thcipriani@deploy2002> Finished deploy [gerrit/gerrit@44854d4]: remove developer satisfaction survey banner (gerrit2003.wikimedia.org only) (duration: 00m 08s) [production]
16:55 <thcipriani@deploy2002> Started deploy [gerrit/gerrit@44854d4]: remove developer satisfaction survey banner (gerrit2003.wikimedia.org only) [production]
12:10 <moritzm> renewed internal Ganeti certs in eqsin (would have expired in two days) T382873 [production]