2025-05-02
§
|
15:45 |
<stevemunene> |
reboot an-worker1166 after the hardware swap for T390170 |
[analytics] |
15:45 |
<stevemunene@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host an-worker1166.eqiad.wmnet |
[production] |
15:11 |
<herron> |
power cycling prometheus200[78] via rac |
[production] |
15:09 |
<andrewbogott> |
sudo cumin --force O{*} "dpkg --list | grep puppetserver && systemctl restart puppetserver.service" # work around a package update that is causing some puppetservers to error out |
[admin] |
15:06 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1168.eqiad.wmnet |
[production] |
15:05 |
<jgleeson> |
SmashPig changed from 9b3c4587 to ddf64519 |
[production] |
15:04 |
<stevemunene@cumin1002> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1168.eqiad.wmnet |
[production] |
15:03 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1167.eqiad.wmnet |
[production] |
15:01 |
<stevemunene@cumin1002> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1167.eqiad.wmnet |
[production] |
15:01 |
<bking@cumin2002> |
conftool action : set/pooled=yes:weight=10; selector: name=cirrussearch2076.codfw.wmnet|cirrussearch2080.codfw.wmnet|cirrussearch2081.codfw.wmnet|cirrussearch2083.codfw.wmnet|cirrussearch2084.codfw.wmnet|cirrussearch2092.codfw.wmnet|cirrussearch2093.codfw.wmnet|cirrussearch2100.codfw.wmnet|cirrussearch2106.codfw.wmnet|cirrussearch2108.codfw.wmnet |
[production] |
15:01 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1166.eqiad.wmnet |
[production] |
14:55 |
<stevemunene@cumin1002> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1166.eqiad.wmnet |
[production] |
14:48 |
<dancy@deploy1003> |
Installation of scap version "4.159.0" completed for 2 hosts |
[production] |
14:46 |
<dancy@deploy1003> |
Installing scap version "4.159.0" for 2 host(s) |
[production] |
14:10 |
<inflatador> |
bking@localhost set search_codfw num_concurrent_incoming_recoveries from 20 back down to 4 after migration T391350 |
[production] |
13:55 |
<taavi> |
reboot tools-static-15 |
[tools] |
13:49 |
<moritzm> |
imported ruby-defaults 1:3.3~wmf13u1 to component/puppet7 for trixie-wikimedia T392790 |
[production] |
13:40 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host testvm2008.wikimedia.org |
[production] |
13:37 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host testvm2008.wikimedia.org |
[production] |
13:25 |
<urandom> |
invoked manual `garbagecollect`, Cassandra sessionstore — T390514 |
[production] |
13:21 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host testvm2007.codfw.wmnet |
[production] |
13:17 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host testvm2007.codfw.wmnet |
[production] |
12:51 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host testvm2006.codfw.wmnet |
[production] |
12:47 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host testvm2006.codfw.wmnet |
[production] |
12:30 |
<James_F> |
Zuul: [mediawiki/extensions/CodeEditor] Add BetaFeatures phan dependency, for T373711 |
[releng] |
12:18 |
<James_F> |
Zuul: [mediawiki/extensions/WikiLambda] Make Catalyst voting again |
[releng] |
10:06 |
<moritzm> |
imported ruby-concurrent 1.1.6+dfsg-5~wmf13u1 to component/puppet7 for trixie-wikimedia T392790 |
[production] |
10:00 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest2001.codfw.wmnet |
[production] |
09:55 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host sretest2001.codfw.wmnet |
[production] |
09:54 |
<stevemunene@cumin1002> |
END (FAIL) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=99) for hosts an-worker1167.eqiad.wmnet |
[production] |
09:42 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest1003.eqiad.wmnet |
[production] |
09:37 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host sretest1003.eqiad.wmnet |
[production] |
09:32 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest1002.eqiad.wmnet |
[production] |
09:31 |
<stevemunene@cumin1002> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1167.eqiad.wmnet |
[production] |
09:26 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host sretest1002.eqiad.wmnet |
[production] |
08:43 |
<hashar> |
Updating Quibble jobs to 1.14.0 | https://gerrit.wikimedia.org/r/c/integration/config/+/1140215 | T378797 T384927 T386691 |
[releng] |
08:34 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netbox-dev2003.codfw.wmnet |
[production] |
08:30 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host netbox-dev2003.codfw.wmnet |
[production] |
08:29 |
<XioNoX> |
update codfw pfw NAT - T392843 |
[production] |
08:16 |
<jmm@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host bast2003.wikimedia.org |
[production] |
08:13 |
<XioNoX> |
push pfw policies - T393098 |
[production] |
08:09 |
<jmm@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host bast2003.wikimedia.org |
[production] |
07:00 |
<James_F> |
Zuul: [mediawiki/extensions/WikimediaMessages] Add cldr as full CI dep too, for T391230 |
[releng] |
06:52 |
<James_F> |
Zuul: [mediawiki/extensions/WikimediaMessages] Add cldr as phan dependency, for T391230 |
[releng] |
06:46 |
<stevemunene@cumin1002> |
END (FAIL) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=99) for hosts an-worker1167.eqiad.wmnet |
[production] |
06:42 |
<stevemunene@cumin1002> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1167.eqiad.wmnet |
[production] |
06:30 |
<slyngshede@cumin1002> |
DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging MarkTraceur out of all services on: 2404 hosts |
[production] |
06:21 |
<stevemunene@cumin1002> |
END (FAIL) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=99) for hosts an-worker1167.eqiad.wmnet |
[production] |
06:18 |
<stevemunene@cumin1002> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1167.eqiad.wmnet |
[production] |
06:14 |
<stevemunene@cumin1002> |
END (FAIL) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=99) for hosts an-worker1166.eqiad.wmnet |
[production] |