251-300 of 10000 results (25ms)
2024-12-04 ยง
18:47 <cjming@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply [production]
18:47 <ryankemper> T379330 `wdqs-internal-main` and `wdqs-internal-scholarly` pools created [production]
18:46 <ryankemper@cumin2002> conftool action : set/pooled=yes:weight=10; selector: cluster=wdqs-internal-main,service=wdqs-main [production]
18:46 <ryankemper@cumin2002> conftool action : set/pooled=yes:weight=10; selector: cluster=wdqs-internal-scholarly,service=wdqs-scholarly [production]
18:35 <dbrant@deploy2002> helmfile [codfw] DONE helmfile.d/services/push-notifications: apply [production]
18:35 <dbrant@deploy2002> helmfile [codfw] START helmfile.d/services/push-notifications: apply [production]
18:34 <dbrant@deploy2002> helmfile [eqiad] DONE helmfile.d/services/push-notifications: apply [production]
18:33 <dbrant@deploy2002> helmfile [eqiad] START helmfile.d/services/push-notifications: apply [production]
18:30 <dbrant@deploy2002> helmfile [staging] DONE helmfile.d/services/push-notifications: apply [production]
18:30 <dbrant@deploy2002> helmfile [staging] START helmfile.d/services/push-notifications: apply [production]
18:13 <swfrench@deploy2002> Finished scap sync-world: Deployment to clear noop chart diff from 1081449 - T377040 (duration: 02m 07s) [production]
18:11 <swfrench@deploy2002> Started scap sync-world: Deployment to clear noop chart diff from 1081449 - T377040 [production]
18:04 <cjming@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
18:04 <cjming@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
18:01 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1178 (T371742)', diff saved to https://phabricator.wikimedia.org/P71556 and previous config saved to /var/cache/conftool/dbconfig/20241204-180114-ladsgroup.json [production]
18:01 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1178.eqiad.wmnet with reason: Maintenance [production]
18:00 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 12:00:00 on db1178.eqiad.wmnet with reason: Maintenance [production]
18:00 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177 (T371742)', diff saved to https://phabricator.wikimedia.org/P71555 and previous config saved to /var/cache/conftool/dbconfig/20241204-180052-ladsgroup.json [production]
17:59 <joal> Rerun cassandra_load_pageview_top_articles_monthly after refinery patch deployed [analytics]
17:56 <joal> Deploying refinery onto HDFS [analytics]
17:55 <joal@deploy2002> Finished deploy [analytics/refinery@6e3ee14] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@6e3ee14b] (duration: 00m 31s) [production]
17:54 <joal@deploy2002> Started deploy [analytics/refinery@6e3ee14] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@6e3ee14b] [production]
17:54 <joal@deploy2002> Finished deploy [analytics/refinery@6e3ee14] (thin): Regular analytics weekly train THIN [analytics/refinery@6e3ee14b] (duration: 00m 37s) [production]
17:54 <joal@deploy2002> Started deploy [analytics/refinery@6e3ee14] (thin): Regular analytics weekly train THIN [analytics/refinery@6e3ee14b] [production]
17:52 <joal@deploy2002> Finished deploy [analytics/refinery@6e3ee14]: Regular analytics weekly train [analytics/refinery@6e3ee14b] (duration: 02m 05s) [production]
17:50 <bd808> Moved SAL fediverse posts to https://wikimedia.social/@sal. Many thanks to botsin.space for providing hosting for so long. [production]
17:50 <joal> Deploying refinery with scap [analytics]
17:50 <joal@deploy2002> Started deploy [analytics/refinery@6e3ee14]: Regular analytics weekly train [analytics/refinery@6e3ee14b] [production]
17:48 <wmbot~bd808@tools-bastion-12> Moved fediverse posting to https://wikimedia.social/@sal (T378571) [tools.stashbot]
17:46 <andrewbogott> rebooting tools-legacy-redirector-2, many probes failing [tools]
17:45 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177', diff saved to https://phabricator.wikimedia.org/P71554 and previous config saved to /var/cache/conftool/dbconfig/20241204-174544-ladsgroup.json [production]
17:38 <sstefanova@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component envvars-admission [tools]
17:30 <sstefanova@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component envvars-admission [tools]
17:30 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177', diff saved to https://phabricator.wikimedia.org/P71553 and previous config saved to /var/cache/conftool/dbconfig/20241204-173037-ladsgroup.json [production]
17:30 <sstefanova@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component envvars-admission [toolsbeta]
17:23 <sstefanova@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component envvars-admission [toolsbeta]
17:18 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=0) on host 'cloudvirt1035.eqiad.wmnet' [admin]
17:16 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1035.eqiad.wmnet' [admin]
17:16 <andrew@cloudcumin1001> END (ERROR) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=97) on host 'cloudvirt1035.eqiad.wmnet' (T380893) [admin]
17:16 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1035.eqiad.wmnet' (T380893) [admin]
17:15 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177 (T371742)', diff saved to https://phabricator.wikimedia.org/P71551 and previous config saved to /var/cache/conftool/dbconfig/20241204-171530-ladsgroup.json [production]
17:11 <sstefanova@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component ingress-admission [tools]
17:10 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cloudcephmon1003.eqiad.wmnet [production]
17:10 <andrew@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:10 <andrew@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cloudcephmon1003.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - andrew@cumin1002" [production]
17:08 <andrew@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cloudcephmon1003.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - andrew@cumin1002" [production]
17:04 <andrew@cumin1002> START - Cookbook sre.dns.netbox [production]
17:03 <sstefanova@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component ingress-admission [tools]
17:02 <sstefanova@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component ingress-admission [toolsbeta]
17:00 <andrew@cumin1002> START - Cookbook sre.hosts.decommission for hosts cloudcephmon1003.eqiad.wmnet [production]