2023-03-15
§
|
09:36 |
<vgutierrez@cumin1001> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_ulsfo |
[production] |
09:26 |
<moritzm> |
rolling restart of FPM/Apache to pick up gnutls28 security updates |
[production] |
09:22 |
<moritzm> |
installing gnutls28 security updates |
[production] |
09:05 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Remove db1106 from dbctl T331875', diff saved to https://phabricator.wikimedia.org/P45872 and previous config saved to /var/cache/conftool/dbconfig/20230315-090515-root.json |
[production] |
08:40 |
<hashar@deploy2002> |
Finished deploy [integration/docroot@5abe9c6]: Link Groovy doc of PipelineLib - T222199 (duration: 00m 19s) |
[production] |
08:40 |
<hashar@deploy2002> |
Started deploy [integration/docroot@5abe9c6]: Link Groovy doc of PipelineLib - T222199 |
[production] |
08:15 |
<vgutierrez@cumin1001> |
END (FAIL) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=1) rolling upgrade of HAProxy on A:cp-upload_ulsfo |
[production] |
08:15 |
<vgutierrez@cumin1001> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_ulsfo |
[production] |
07:40 |
<tgr_> |
UTC morning deploys done |
[production] |
07:39 |
<mvernon@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host ms-be2067.codfw.wmnet |
[production] |
07:36 |
<tgr@deploy2002> |
Finished scap: Backport for [[gerrit:898869|LevelingUpManager: Ensure that $suggestions is a TaskSet]] (duration: 07m 54s) |
[production] |
07:30 |
<tgr@deploy2002> |
tgr: Backport for [[gerrit:898869|LevelingUpManager: Ensure that $suggestions is a TaskSet]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet |
[production] |
07:28 |
<tgr@deploy2002> |
Started scap: Backport for [[gerrit:898869|LevelingUpManager: Ensure that $suggestions is a TaskSet]] |
[production] |
06:26 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1105 (s1,s2) T331874', diff saved to https://phabricator.wikimedia.org/P45870 and previous config saved to /var/cache/conftool/dbconfig/20230315-062643-root.json |
[production] |
06:20 |
<marostegui> |
Remove pki2001 from m1 grants T332018 |
[production] |
2023-03-14
§
|
23:29 |
<brennen@deploy2002> |
Finished scap: Backport for [[gerrit:898867|action: Restrict action.delete.js to action=delete pages (T330205)]] (duration: 10m 32s) |
[production] |
23:20 |
<brennen@deploy2002> |
brennen and umherirrender: Backport for [[gerrit:898867|action: Restrict action.delete.js to action=delete pages (T330205)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet |
[production] |
23:19 |
<brennen@deploy2002> |
Started scap: Backport for [[gerrit:898867|action: Restrict action.delete.js to action=delete pages (T330205)]] |
[production] |
22:50 |
<jhathaway@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
22:34 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
22:34 |
<jhathaway@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
22:25 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
22:08 |
<jhathaway@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
21:38 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
21:38 |
<jhathaway@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
21:20 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
21:17 |
<jhathaway@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
21:16 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
21:11 |
<jhathaway@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
21:11 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
21:11 |
<jhathaway@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
20:47 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
20:47 |
<jhathaway@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
20:43 |
<ejegg> |
payments-wiki upgraded from 61c30a4f to 1532b107 |
[production] |
20:35 |
<zabe@deploy2002> |
Finished scap: Backport for [[gerrit:897997|dewiki: Allow 'crats to remove sysopship and manage importers (T331921)]] (duration: 08m 36s) |
[production] |
20:28 |
<zabe@deploy2002> |
zabe: Backport for [[gerrit:897997|dewiki: Allow 'crats to remove sysopship and manage importers (T331921)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet |
[production] |
20:27 |
<zabe@deploy2002> |
Started scap: Backport for [[gerrit:897997|dewiki: Allow 'crats to remove sysopship and manage importers (T331921)]] |
[production] |
20:04 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
20:03 |
<jhathaway@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
19:47 |
<topranks> |
Reboot cloudsw1-b1-codfw to upgrade JunOS version T327919 |
[production] |
19:44 |
<cmooney@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on cloudsw1-b1-codfw,cloudsw1-b1-codfw IPv6,cloudsw1-b1-codfw.mgmt with reason: cloudsw1-b1-codfw OS upgrade |
[production] |
19:44 |
<cmooney@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:30:00 on cloudsw1-b1-codfw,cloudsw1-b1-codfw IPv6,cloudsw1-b1-codfw.mgmt with reason: cloudsw1-b1-codfw OS upgrade |
[production] |
19:32 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
19:30 |
<brennen> |
1.40.0-wmf.27 train (T330205): uneventful at group0. i'm afk for about an hour. |
[production] |
19:13 |
<ejegg> |
civicrm upgraded from dbe3b716 to 68fa85cf |
[production] |
18:51 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-logging2002.codfw.wmnet with OS bullseye |
[production] |
18:32 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-logging2002.codfw.wmnet with reason: host reimage |
[production] |
18:28 |
<fab@deploy2002> |
Finished deploy [airflow-dags/research@5edcd7b]: (no justification provided) (duration: 00m 11s) |
[production] |
18:27 |
<fab@deploy2002> |
Started deploy [airflow-dags/research@5edcd7b]: (no justification provided) |
[production] |
18:27 |
<herron@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-logging2002.codfw.wmnet with reason: host reimage |
[production] |