2024-06-25
|
19:28 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5017.eqsin.wmnet with reason: host reimage |
[production] |
19:25 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp5017.eqsin.wmnet with reason: host reimage |
[production] |
19:23 |
<sukhe> |
re-enable puppet on lvs2011 |
[production] |
19:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2125', diff saved to https://phabricator.wikimedia.org/P65428 and previous config saved to /var/cache/conftool/dbconfig/20240625-191403-marostegui.json |
[production] |
18:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2125', diff saved to https://phabricator.wikimedia.org/P65426 and previous config saved to /var/cache/conftool/dbconfig/20240625-185856-marostegui.json |
[production] |
18:49 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5017.eqsin.wmnet with OS bullseye |
[production] |
18:49 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5017.eqsin.wmnet with OS bullseye |
[production] |
18:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2125 (T367856)', diff saved to https://phabricator.wikimedia.org/P65425 and previous config saved to /var/cache/conftool/dbconfig/20240625-184349-marostegui.json |
[production] |
18:31 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5017.eqsin.wmnet with OS bullseye |
[production] |
18:28 |
<brett@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5017.eqsin.wmnet |
[production] |
18:22 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt2004-dev.codfw.wmnet with OS bookworm |
[production] |
18:14 |
<jhuneidi@deploy1002> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.43.0-wmf.11 refs T366956 |
[production] |
18:06 |
<topranks> |
bringing up link from ssw1-a1-codfw to ssw1-d1-codfw T364095 |
[production] |
17:57 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt2004-dev.codfw.wmnet with reason: host reimage |
[production] |
17:55 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt2004-dev.codfw.wmnet with reason: host reimage |
[production] |
17:51 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore2004.codfw.wmnet: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
17:44 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching sessionstore2004.codfw.wmnet: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
17:43 |
<brett> |
Re-re-pooling lvs2011 - T368165 |
[production] |
17:37 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudvirt2004-dev.codfw.wmnet with OS bookworm |
[production] |
17:36 |
<brett> |
Depooling lvs2011 due to elevated socket/tcp errors - T368165 |
[production] |
17:28 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt2004-dev.codfw.wmnet with OS bookworm |
[production] |
17:28 |
<brett> |
Pooling lvs2011 - T368165 |
[production] |
17:25 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1177 (T364069)', diff saved to https://phabricator.wikimedia.org/P65424 and previous config saved to /var/cache/conftool/dbconfig/20240625-172502-marostegui.json |
[production] |
17:24 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1177.eqiad.wmnet with reason: Maintenance |
[production] |
17:24 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1177.eqiad.wmnet with reason: Maintenance |
[production] |
17:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1172 (T364069)', diff saved to https://phabricator.wikimedia.org/P65423 and previous config saved to /var/cache/conftool/dbconfig/20240625-172440-marostegui.json |
[production] |
17:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P65422 and previous config saved to /var/cache/conftool/dbconfig/20240625-170933-marostegui.json |
[production] |
17:06 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-codfw: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
17:04 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt2004-dev.codfw.wmnet with reason: host reimage |
[production] |
17:02 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt2004-dev.codfw.wmnet with reason: host reimage |
[production] |
17:01 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:aqs-codfw: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
16:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P65421 and previous config saved to /var/cache/conftool/dbconfig/20240625-165426-marostegui.json |
[production] |
16:49 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
16:43 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudvirt2004-dev.codfw.wmnet with OS bookworm |
[production] |
16:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1172 (T364069)', diff saved to https://phabricator.wikimedia.org/P65420 and previous config saved to /var/cache/conftool/dbconfig/20240625-163919-marostegui.json |
[production] |
16:37 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
16:33 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'es1035 (re)pooling @ 100%: post T365986 repool', diff saved to https://phabricator.wikimedia.org/P65419 and previous config saved to /var/cache/conftool/dbconfig/20240625-163330-arnaudb.json |
[production] |
16:31 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw1437.eqiad.wmnet |
[production] |
16:31 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.remove-downtime for mw1437.eqiad.wmnet |
[production] |
16:27 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mw1437.eqiad.wmnet with reason: Resizing disk |
[production] |
16:27 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on mw1437.eqiad.wmnet with reason: Resizing disk |
[production] |
16:23 |
<bvibber> |
running requeueTranscodes for missing audio files on commons (mwmaint1002) cf T368364 |
[production] |
16:23 |
<claime> |
depooling mw1437 |
[production] |
16:19 |
<claime> |
cleaning up shellbox leftover files on mw1437.eqiad.wmnet |
[production] |
16:19 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-eqiad: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
16:18 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'es1035 (re)pooling @ 75%: post T365986 repool', diff saved to https://phabricator.wikimedia.org/P65418 and previous config saved to /var/cache/conftool/dbconfig/20240625-161824-arnaudb.json |
[production] |
16:15 |
<claime> |
Extending vg-srv on mw1437 |
[production] |
16:10 |
<brennen@deploy1002> |
Finished deploy [phabricator/deployment@72ad841]: deploy phab1004 for T368392 - followup T364728 (duration: 00m 39s) |
[production] |
16:10 |
<brennen@deploy1002> |
Started deploy [phabricator/deployment@72ad841]: deploy phab1004 for T368392 - followup T364728 |
[production] |
16:09 |
<brennen@deploy1002> |
Finished deploy [phabricator/deployment@72ad841]: deploy phab2002 for T368392 - followup T364728 (duration: 00m 33s) |
[production] |