2025-05-02
§
|
16:53 |
<sukhe@dns1004> |
END - running authdns-update |
[production] |
16:51 |
<sukhe@dns1004> |
START - running authdns-update |
[production] |
16:47 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.network.provision (exit_code=0) for device lsw1-f1-codfw.mgmt.codfw.wmnet |
[production] |
16:28 |
<mvernon@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 18:00:00 on ms-fe1016.eqiad.wmnet with reason: not yet in prod |
[production] |
16:28 |
<mvernon@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 18:00:00 on ms-fe1015.eqiad.wmnet with reason: not yet in prod |
[production] |
16:26 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
16:24 |
<pt1979@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
16:24 |
<pt1979@cumin2002> |
START - Cookbook sre.network.provision for device lsw1-f1-codfw.mgmt.codfw.wmnet |
[production] |
16:20 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.depool_and_destroy (T393196) |
[admin] |
16:20 |
<andrew@cloudcumin1001> |
END (ERROR) - Cookbook wmcs.ceph.osd.depool_and_destroy (exit_code=97) (T393196) |
[admin] |
16:14 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.depool_and_destroy (T393196) |
[admin] |
16:14 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.ceph.osd.depool_and_destroy (exit_code=99) |
[admin] |
16:14 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.depool_and_destroy |
[admin] |
15:45 |
<stevemunene> |
reboot an-worker1166 after the hardware swap for T390170 |
[analytics] |
15:45 |
<stevemunene@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host an-worker1166.eqiad.wmnet |
[production] |
15:11 |
<herron> |
power cycling prometheus200[78] via rac |
[production] |
15:09 |
<andrewbogott> |
sudo cumin --force O{*} "dpkg --list | grep puppetserver && systemctl restart puppetserver.service" # work around a package update that is causing some puppetservers to error out |
[admin] |
15:06 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1168.eqiad.wmnet |
[production] |
15:05 |
<jgleeson> |
SmashPig changed from 9b3c4587 to ddf64519 |
[production] |
15:04 |
<stevemunene@cumin1002> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1168.eqiad.wmnet |
[production] |
15:03 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1167.eqiad.wmnet |
[production] |
15:01 |
<stevemunene@cumin1002> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1167.eqiad.wmnet |
[production] |
15:01 |
<bking@cumin2002> |
conftool action : set/pooled=yes:weight=10; selector: name=cirrussearch2076.codfw.wmnet|cirrussearch2080.codfw.wmnet|cirrussearch2081.codfw.wmnet|cirrussearch2083.codfw.wmnet|cirrussearch2084.codfw.wmnet|cirrussearch2092.codfw.wmnet|cirrussearch2093.codfw.wmnet|cirrussearch2100.codfw.wmnet|cirrussearch2106.codfw.wmnet|cirrussearch2108.codfw.wmnet |
[production] |
15:01 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1166.eqiad.wmnet |
[production] |
14:55 |
<stevemunene@cumin1002> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1166.eqiad.wmnet |
[production] |
14:48 |
<dancy@deploy1003> |
Installation of scap version "4.159.0" completed for 2 hosts |
[production] |
14:46 |
<dancy@deploy1003> |
Installing scap version "4.159.0" for 2 host(s) |
[production] |
14:10 |
<inflatador> |
bking@localhost set search_codfw num_concurrent_incoming_recoveries from 20 back down to 4 after migration T391350 |
[production] |
13:55 |
<taavi> |
reboot tools-static-15 |
[tools] |
13:49 |
<moritzm> |
imported ruby-defaults 1:3.3~wmf13u1 to component/puppet7 for trixie-wikimedia T392790 |
[production] |
13:40 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host testvm2008.wikimedia.org |
[production] |
13:37 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host testvm2008.wikimedia.org |
[production] |
13:25 |
<urandom> |
invoked manual `garbagecollect`, Cassandra sessionstore — T390514 |
[production] |
13:21 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host testvm2007.codfw.wmnet |
[production] |
13:17 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host testvm2007.codfw.wmnet |
[production] |
12:51 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host testvm2006.codfw.wmnet |
[production] |
12:47 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host testvm2006.codfw.wmnet |
[production] |
12:30 |
<James_F> |
Zuul: [mediawiki/extensions/CodeEditor] Add BetaFeatures phan dependency, for T373711 |
[releng] |
12:18 |
<James_F> |
Zuul: [mediawiki/extensions/WikiLambda] Make Catalyst voting again |
[releng] |
10:06 |
<moritzm> |
imported ruby-concurrent 1.1.6+dfsg-5~wmf13u1 to component/puppet7 for trixie-wikimedia T392790 |
[production] |
10:00 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest2001.codfw.wmnet |
[production] |
09:55 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host sretest2001.codfw.wmnet |
[production] |
09:54 |
<stevemunene@cumin1002> |
END (FAIL) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=99) for hosts an-worker1167.eqiad.wmnet |
[production] |
09:42 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest1003.eqiad.wmnet |
[production] |
09:37 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host sretest1003.eqiad.wmnet |
[production] |
09:32 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest1002.eqiad.wmnet |
[production] |
09:31 |
<stevemunene@cumin1002> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1167.eqiad.wmnet |
[production] |
09:26 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host sretest1002.eqiad.wmnet |
[production] |
08:43 |
<hashar> |
Updating Quibble jobs to 1.14.0 | https://gerrit.wikimedia.org/r/c/integration/config/+/1140215 | T378797 T384927 T386691 |
[releng] |
08:34 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netbox-dev2003.codfw.wmnet |
[production] |