2025-01-15
§
|
19:21 |
<aqu@deploy2002> |
Finished deploy [airflow-dags/analytics@cd03eb7]: Cascading backfill under projectview hourly (duration: 01m 06s) |
[production] |
19:20 |
<aqu@deploy2002> |
Started deploy [airflow-dags/analytics@cd03eb7]: Cascading backfill under projectview hourly |
[production] |
19:15 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bookworm |
[production] |
19:15 |
<andrew@cumin1002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcephosd1012.eqiad.wmnet with OS bookworm |
[production] |
19:08 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bookworm |
[production] |
19:08 |
<andrew@cumin1002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bookworm |
[production] |
19:07 |
<sukhe@cumin1002> |
END (FAIL) - Cookbook sre.dns.admin (exit_code=99) DNS admin: depool site ulsfo [reason: this is a test, not actual depool, no task ID specified] |
[production] |
19:07 |
<sukhe@cumin1002> |
START - Cookbook sre.dns.admin DNS admin: depool site ulsfo [reason: this is a test, not actual depool, no task ID specified] |
[production] |
18:51 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bookworm |
[production] |
18:50 |
<andrew@cumin1002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bookworm |
[production] |
18:22 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bookworm |
[production] |
18:22 |
<andrew@cumin1002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bookworm |
[production] |
18:16 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1163 (T371742)', diff saved to https://phabricator.wikimedia.org/P72080 and previous config saved to /var/cache/conftool/dbconfig/20250115-181629-ladsgroup.json |
[production] |
18:16 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1163.eqiad.wmnet with reason: Maintenance |
[production] |
18:16 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db1163.eqiad.wmnet with reason: Maintenance |
[production] |
18:09 |
<volans@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:05:00 on sretest1002.eqiad.wmnet with reason: testing cookbook |
[production] |
18:06 |
<volans@cumin2002> |
START - Cookbook sre.hosts.downtime for 0:05:00 on sretest1002.eqiad.wmnet with reason: testing cookbook |
[production] |
18:05 |
<volans@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:05:00 on sretest1001.eqiad.wmnet with reason: testing cookbook |
[production] |
17:59 |
<lucaswerkmeister-wmde@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1111166|Added new stream config for haproxy_requestctl (T383392)]] (duration: 13m 38s) |
[production] |
17:58 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bookworm |
[production] |
17:56 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.hosts.dhcp (exit_code=0) for host cloudcephosd1012.eqiad.wmnet |
[production] |
17:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1163 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72079 and previous config saved to /var/cache/conftool/dbconfig/20250115-175456-root.json |
[production] |
17:53 |
<cmooney@cumin1002> |
START - Cookbook sre.hosts.dhcp for host cloudcephosd1012.eqiad.wmnet |
[production] |
17:53 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1102-1106].eqiad.wmnet |
[production] |
17:53 |
<kamila@cumin1002> |
START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1102-1106].eqiad.wmnet |
[production] |
17:52 |
<lucaswerkmeister-wmde@deploy2002> |
lucaswerkmeister-wmde, fabfur: Continuing with sync |
[production] |
17:52 |
<lucaswerkmeister-wmde@deploy2002> |
lucaswerkmeister-wmde, fabfur: Backport for [[gerrit:1111166|Added new stream config for haproxy_requestctl (T383392)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
17:52 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1103.eqiad.wmnet with OS bookworm |
[production] |
17:51 |
<kamila_> |
homer cr*eqiad* commit T377876 |
[production] |
17:49 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1102.eqiad.wmnet with OS bookworm |
[production] |
17:47 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1105.eqiad.wmnet with OS bookworm |
[production] |
17:45 |
<lucaswerkmeister-wmde@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1111166|Added new stream config for haproxy_requestctl (T383392)]] |
[production] |
17:43 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1106.eqiad.wmnet with OS bookworm |
[production] |
17:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1163 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72077 and previous config saved to /var/cache/conftool/dbconfig/20250115-173951-root.json |
[production] |
17:36 |
<andrew@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1012.eqiad.wmnet with OS bookworm |
[production] |
17:34 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1103.eqiad.wmnet with reason: host reimage |
[production] |
17:30 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1102.eqiad.wmnet with reason: host reimage |
[production] |
17:26 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1105.eqiad.wmnet with reason: host reimage |
[production] |
17:26 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bookworm |
[production] |
17:26 |
<andrew@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1012.eqiad.wmnet with OS bookworm |
[production] |
17:25 |
<root@cumin1002> |
END (PASS) - Cookbook sre.puppet.renew-cert (exit_code=0) for dbprov1006.eqiad.wmnet: Renew puppet certificate - root@cumin1002 |
[production] |
17:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1163 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72076 and previous config saved to /var/cache/conftool/dbconfig/20250115-172445-root.json |
[production] |
17:24 |
<hnowlan> |
running `decommission` for 5 codfw jobrunners |
[production] |
17:24 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1106.eqiad.wmnet with reason: host reimage |
[production] |
17:22 |
<kamila@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1103.eqiad.wmnet with reason: host reimage |
[production] |
17:21 |
<kamila@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1105.eqiad.wmnet with reason: host reimage |
[production] |
17:21 |
<root@cumin1002> |
START - Cookbook sre.puppet.renew-cert for dbprov1006.eqiad.wmnet: Renew puppet certificate - root@cumin1002 |
[production] |
17:21 |
<kamila@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1102.eqiad.wmnet with reason: host reimage |
[production] |
17:21 |
<kamila@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1106.eqiad.wmnet with reason: host reimage |
[production] |
17:18 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:cassandra-dev: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 |
[production] |