2024-06-25
§
|
22:10 |
<bvibber> |
a webVideoTranscode job reported 'No space left on device' from a failed ffmpeg run on mw1446 recently |
[production] |
22:09 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt2005-dev.codfw.wmnet with reason: host reimage |
[production] |
22:05 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt2005-dev.codfw.wmnet with reason: host reimage |
[production] |
21:57 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1177 (T364069)', diff saved to https://phabricator.wikimedia.org/P65431 and previous config saved to /var/cache/conftool/dbconfig/20240625-215705-marostegui.json |
[production] |
21:47 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudvirt2005-dev.codfw.wmnet with OS bookworm |
[production] |
20:44 |
<cjming> |
end of UTC late backport window |
[production] |
20:41 |
<cjming@deploy1002> |
Finished scap: Backport for [[gerrit:1043880|Cleanup: Remove wgNavigationTimingSurveyName (T367128)]] (duration: 08m 29s) |
[production] |
20:36 |
<cjming@deploy1002> |
jdlrobson, cjming: Continuing with sync |
[production] |
20:35 |
<cjming@deploy1002> |
jdlrobson, cjming: Backport for [[gerrit:1043880|Cleanup: Remove wgNavigationTimingSurveyName (T367128)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:32 |
<cjming@deploy1002> |
Started scap: Backport for [[gerrit:1043880|Cleanup: Remove wgNavigationTimingSurveyName (T367128)]] |
[production] |
20:31 |
<cjming@deploy1002> |
Finished scap: Backport for [[gerrit:1041250|Enable dark mode on more pages (T366378 T367374 T366373 T366520 T366373)]] (duration: 15m 04s) |
[production] |
20:26 |
<cjming@deploy1002> |
jdlrobson, cjming: Continuing with sync |
[production] |
20:19 |
<cjming@deploy1002> |
jdlrobson, cjming: Backport for [[gerrit:1041250|Enable dark mode on more pages (T366378 T367374 T366373 T366520 T366373)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:16 |
<cjming@deploy1002> |
Started scap: Backport for [[gerrit:1041250|Enable dark mode on more pages (T366378 T367374 T366373 T366520 T366373)]] |
[production] |
20:14 |
<cjming@deploy1002> |
Finished scap: Backport for [[gerrit:1049608|Temporarily disable '4K' 2160p and mid 1440p transcodes (T368433)]] (duration: 08m 36s) |
[production] |
20:11 |
<Emperor> |
restart swift-proxy on ms-fe2010 ms-fe1011 T360913 |
[production] |
20:09 |
<cjming@deploy1002> |
cjming, bvibber: Continuing with sync |
[production] |
20:08 |
<cjming@deploy1002> |
cjming, bvibber: Backport for [[gerrit:1049608|Temporarily disable '4K' 2160p and mid 1440p transcodes (T368433)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:05 |
<cjming@deploy1002> |
Started scap: Backport for [[gerrit:1049608|Temporarily disable '4K' 2160p and mid 1440p transcodes (T368433)]] |
[production] |
20:03 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5017.eqsin.wmnet with OS bullseye |
[production] |
20:01 |
<hashar@deploy1002> |
Finished deploy [integration/docroot@1eb5f4c]: remove CollaborationKit T368092 (duration: 00m 07s) |
[production] |
20:01 |
<hashar@deploy1002> |
Started deploy [integration/docroot@1eb5f4c]: remove CollaborationKit T368092 |
[production] |
19:29 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2126 (T367856)', diff saved to https://phabricator.wikimedia.org/P65430 and previous config saved to /var/cache/conftool/dbconfig/20240625-192947-marostegui.json |
[production] |
19:29 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance |
[production] |
19:29 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance |
[production] |
19:29 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2126.codfw.wmnet with reason: Maintenance |
[production] |
19:29 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2126.codfw.wmnet with reason: Maintenance |
[production] |
19:29 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2125 (T367856)', diff saved to https://phabricator.wikimedia.org/P65429 and previous config saved to /var/cache/conftool/dbconfig/20240625-192910-marostegui.json |
[production] |
19:28 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5017.eqsin.wmnet with reason: host reimage |
[production] |
19:25 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp5017.eqsin.wmnet with reason: host reimage |
[production] |
19:23 |
<sukhe> |
re-enable puppet on lvs2011 |
[production] |
19:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2125', diff saved to https://phabricator.wikimedia.org/P65428 and previous config saved to /var/cache/conftool/dbconfig/20240625-191403-marostegui.json |
[production] |
18:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2125', diff saved to https://phabricator.wikimedia.org/P65426 and previous config saved to /var/cache/conftool/dbconfig/20240625-185856-marostegui.json |
[production] |
18:49 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5017.eqsin.wmnet with OS bullseye |
[production] |
18:49 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5017.eqsin.wmnet with OS bullseye |
[production] |
18:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2125 (T367856)', diff saved to https://phabricator.wikimedia.org/P65425 and previous config saved to /var/cache/conftool/dbconfig/20240625-184349-marostegui.json |
[production] |
18:31 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5017.eqsin.wmnet with OS bullseye |
[production] |
18:28 |
<brett@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5017.eqsin.wmnet |
[production] |
18:22 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt2004-dev.codfw.wmnet with OS bookworm |
[production] |
18:14 |
<jhuneidi@deploy1002> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.43.0-wmf.11 refs T366956 |
[production] |
18:06 |
<topranks> |
bringing up link from ssw1-a1-codfw to ssw1-d1-codfw T364095 |
[production] |
17:57 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt2004-dev.codfw.wmnet with reason: host reimage |
[production] |
17:55 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt2004-dev.codfw.wmnet with reason: host reimage |
[production] |
17:51 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore2004.codfw.wmnet: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
17:44 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching sessionstore2004.codfw.wmnet: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
17:43 |
<brett> |
Re-re-pooling lvs2011 - T368165 |
[production] |
17:37 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudvirt2004-dev.codfw.wmnet with OS bookworm |
[production] |
17:36 |
<brett> |
Depooling lvs2011 due to elevated socket/tcp errors - T368165 |
[production] |
17:28 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt2004-dev.codfw.wmnet with OS bookworm |
[production] |
17:28 |
<brett> |
Pooling lvs2011 - T368165 |
[production] |