2024-01-02
18:29 <mutante> confctl select 'name=mw2394.codfw.wmnet' set/pooled=inactive | T354193#9430654 - seems like 2396 was previously depooled instead of this 2394 [production]
17:29 <dancy@deploy2002> Installation of scap version "4.65.1" completed for 566 hosts [production]
17:28 <dancy@deploy2002> Installing scap version "4.65.1" for 566 hosts [production]
17:26 <dancy@deploy2002> Installing scap version "4.65.1" for 567 hosts [production]
16:18 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.unset_maintenance (exit_code=99) (T353408) [admin]
16:18 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.unset_maintenance (T353408) [admin]
16:15 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=0) [cloudvirt-canary]
16:15 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary [cloudvirt-canary]
15:36 <btullis> migrating analytics-hive.eqiad.wmnet to an-coord1003 for T336045 [analytics]
14:59 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbstore1008.eqiad.wmnet with OS bookworm [production]
14:58 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbstore1009.eqiad.wmnet with OS bookworm [production]
14:44 <urbanecm> [urbanecm@mwmaint2002 ~]$ mwscript namespaceDupes.php --wiki=csbwiktionary --fix # T354114 [production]
14:43 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbstore1009.eqiad.wmnet with reason: host reimage [production]
14:40 <btullis@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on dbstore1009.eqiad.wmnet with reason: host reimage [production]
14:37 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbstore1008.eqiad.wmnet with reason: host reimage [production]
14:34 <btullis@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on dbstore1008.eqiad.wmnet with reason: host reimage [production]
14:32 <_joe_> confctl select 'name=mw2396.codfw.wmnet' set/pooled=inactive [production]
14:26 <btullis@cumin1001> START - Cookbook sre.hosts.reimage for host dbstore1009.eqiad.wmnet with OS bookworm [production]
14:20 <btullis@cumin1001> START - Cookbook sre.hosts.reimage for host dbstore1008.eqiad.wmnet with OS bookworm [production]
14:16 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:985384|cswiki: Grant patrolmarks to autopatrolled (T354004)]], [[gerrit:986640|csbwiktionary: Set MetaNamespaceName to Wikisłowôrz (T354114)]] (duration: 13m 46s) [production]
14:04 <urbanecm@deploy2002> urbanecm: Continuing with sync [production]
14:04 <urbanecm@deploy2002> urbanecm: Backport for [[gerrit:985384|cswiki: Grant patrolmarks to autopatrolled (T354004)]], [[gerrit:986640|csbwiktionary: Set MetaNamespaceName to Wikisłowôrz (T354114)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:02 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:985384|cswiki: Grant patrolmarks to autopatrolled (T354004)]], [[gerrit:986640|csbwiktionary: Set MetaNamespaceName to Wikisłowôrz (T354114)]] [production]
11:06 <dcaro> restart toolsdb database to flush connections (T354176) [tools]
10:56 <brouberol> configuring [eqiad,codfw].mediawiki.cirrussearch.page_rerender.v1 as compacted topics on jumbo-eqiad - T353715 [analytics]
10:55 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on P{cp4044.ulsfo.wmnet,cp4050.ulsfo.wmnet} and A:cp [production]
10:50 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on P{cp4044.ulsfo.wmnet,cp4050.ulsfo.wmnet} and A:cp [production]
10:42 <dcaro> flushed the redis db on tools-harbor-1 (T354176) [tools]
10:38 <vgutierrez> fetching haproxy 2.6.16 for thirdparty/haproxy26 bullseye-wikimedia (apt.wm.o) [production]
10:37 <dcaro> hard reboot tools-harbor-1 [tools]
10:22 <wm-bot2> fran@wmf3169 END (FAIL) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=99) [admin]
10:22 <wm-bot2> fran@wmf3169 START - Cookbook wmcs.openstack.cloudvirt.vm_console [admin]
10:13 <dhinus> hard reboot tools-harbor-1 [tools]
09:24 <btullis> adding three days' downtime to dbstore1008, prior to switching its role to `mariadb::analytics_replica` for T351921 [analytics]
09:23 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on dbstore1008.eqiad.wmnet with reason: Commissioning new database server [production]
09:23 <btullis@cumin1001> START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on dbstore1008.eqiad.wmnet with reason: Commissioning new database server [production]
09:17 <pfischer@deploy2002> Finished scap: Backport for [[gerrit:987028|configure message_key_fields for update_pipeline]] (duration: 15m 35s) [production]
09:05 <pfischer@deploy2002> pfischer: Continuing with sync [production]
09:04 <pfischer@deploy2002> pfischer: Backport for [[gerrit:987028|configure message_key_fields for update_pipeline]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
09:02 <moritzm> installing nodejs security updates on bookworm [production]
09:02 <pfischer@deploy2002> Started scap: Backport for [[gerrit:987028|configure message_key_fields for update_pipeline]] [production]
08:33 <akosiaris@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host mw2448.mgmt.codfw.wmnet with reboot policy GRACEFUL [production]
08:27 <jayme> restart prometheus@k8s prometheus@k8s-aux in eqiad - T343529 [production]
08:26 <akosiaris@cumin1001> START - Cookbook sre.hosts.provision for host mw2448.mgmt.codfw.wmnet with reboot policy GRACEFUL [production]
06:45 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2144.codfw.wmnet with OS bookworm [production]
06:27 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2144.codfw.wmnet with reason: host reimage [production]
06:24 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db2144.codfw.wmnet with reason: host reimage [production]
06:06 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host db2144.codfw.wmnet with OS bookworm [production]
05:00 <mwpresync@deploy2002> Finished scap: testwikis wikis to 1.42.0-wmf.12 refs T350088 (duration: 56m 48s) [production]
04:03 <mwpresync@deploy2002> Started scap: testwikis wikis to 1.42.0-wmf.12 refs T350088 [production]