7401-7450 of 10000 results (54ms)
2024-06-25 §
18:41 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=0) on codfw1dev, with recreate True, for hosts list: ['cloudvirt2004-dev'] [cloudvirt-canary]
18:41 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary on codfw1dev, with recreate True, for hosts list: ['cloudvirt2004-dev'] [cloudvirt-canary]
18:31 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5017.eqsin.wmnet with OS bullseye [production]
18:28 <brett@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp5017.eqsin.wmnet [production]
18:22 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt2004-dev.codfw.wmnet with OS bookworm [production]
18:14 <jhuneidi@deploy1002> rebuilt and synchronized wikiversions files: group0 wikis to 1.43.0-wmf.11 refs T366956 [production]
18:06 <topranks> bringing up link from ssw1-a1-codfw to ssw1-d1-codfw T364095 [production]
17:57 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt2004-dev.codfw.wmnet with reason: host reimage [production]
17:55 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt2004-dev.codfw.wmnet with reason: host reimage [production]
17:51 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore2004.codfw.wmnet: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 [production]
17:44 <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching sessionstore2004.codfw.wmnet: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 [production]
17:43 <brett> Re-re-pooling lvs2011 - T368165 [production]
17:37 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudvirt2004-dev.codfw.wmnet with OS bookworm [production]
17:36 <brett> Depooling lvs2011 due to elevated socket/tcp errors - T368165 [production]
17:28 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt2004-dev.codfw.wmnet with OS bookworm [production]
17:28 <brett> Pooling lvs2011 - T368165 [production]
17:25 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1177 (T364069)', diff saved to https://phabricator.wikimedia.org/P65424 and previous config saved to /var/cache/conftool/dbconfig/20240625-172502-marostegui.json [production]
17:24 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1177.eqiad.wmnet with reason: Maintenance [production]
17:24 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1177.eqiad.wmnet with reason: Maintenance [production]
17:24 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1172 (T364069)', diff saved to https://phabricator.wikimedia.org/P65423 and previous config saved to /var/cache/conftool/dbconfig/20240625-172440-marostegui.json [production]
17:20 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.openstack.cloudvirt.vm_console [redirects]
17:19 <dcaro> rebooted redirects-nginx02 as it was non-responsive [redirects]
17:15 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=0) [redirects]
17:15 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.openstack.cloudvirt.vm_console [redirects]
17:09 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P65422 and previous config saved to /var/cache/conftool/dbconfig/20240625-170933-marostegui.json [production]
17:08 <wmbot~dcaro@urcuchillay> END (ERROR) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=255) [redirects]
17:08 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.openstack.cloudvirt.vm_console [redirects]
17:06 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-codfw: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 [production]
17:04 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt2004-dev.codfw.wmnet with reason: host reimage [production]
17:02 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt2004-dev.codfw.wmnet with reason: host reimage [production]
17:01 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:aqs-codfw: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 [production]
16:54 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P65421 and previous config saved to /var/cache/conftool/dbconfig/20240625-165426-marostegui.json [production]
16:49 <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 [production]
16:43 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudvirt2004-dev.codfw.wmnet with OS bookworm [production]
16:39 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1172 (T364069)', diff saved to https://phabricator.wikimedia.org/P65420 and previous config saved to /var/cache/conftool/dbconfig/20240625-163919-marostegui.json [production]
16:37 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 [production]
16:34 <Lucas_WMDE> wikibase-product-testing-2022: shut down instance, probably no longer needed and can be removed later [wikidata-dev]
16:33 <arnaudb@cumin1002> dbctl commit (dc=all): 'es1035 (re)pooling @ 100%: post T365986 repool', diff saved to https://phabricator.wikimedia.org/P65419 and previous config saved to /var/cache/conftool/dbconfig/20240625-163330-arnaudb.json [production]
16:31 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw1437.eqiad.wmnet [production]
16:31 <cgoubert@cumin1002> START - Cookbook sre.hosts.remove-downtime for mw1437.eqiad.wmnet [production]
16:27 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mw1437.eqiad.wmnet with reason: Resizing disk [production]
16:27 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 1:00:00 on mw1437.eqiad.wmnet with reason: Resizing disk [production]
16:26 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=0) on host 'cloudvirt2004-dev.codfw.wmnet' [admin]
16:23 <bvibber> running requeueTranscodes for missing audio files on commons (mwmaint1002) cf T368364 [production]
16:23 <claime> depooling mw1437 [production]
16:20 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt2004-dev.codfw.wmnet' [admin]
16:19 <claime> cleaning up shellbox leftover files on mw1437.eqiad.wmnet [production]
16:19 <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-eqiad: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 [production]
16:18 <arnaudb@cumin1002> dbctl commit (dc=all): 'es1035 (re)pooling @ 75%: post T365986 repool', diff saved to https://phabricator.wikimedia.org/P65418 and previous config saved to /var/cache/conftool/dbconfig/20240625-161824-arnaudb.json [production]
16:15 <andrew@cloudcumin1001> END (ERROR) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=97) on host 'cloudvirt2006-dev.codfw.wmnet' [admin]