2023-03-15
09:36 <vgutierrez@cumin1001> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_ulsfo [production]
09:26 <moritzm> rolling restart of FPM/Apache to pick up gnutls28 security updates [production]
09:22 <moritzm> installing gnutls28 security updates [production]
09:05 <marostegui@cumin1001> dbctl commit (dc=all): 'Remove db1106 from dbctl T331875', diff saved to https://phabricator.wikimedia.org/P45872 and previous config saved to /var/cache/conftool/dbconfig/20230315-090515-root.json [production]
08:40 <hashar@deploy2002> Finished deploy [integration/docroot@5abe9c6]: Link Groovy doc of PipelineLib - T222199 (duration: 00m 19s) [production]
08:40 <hashar@deploy2002> Started deploy [integration/docroot@5abe9c6]: Link Groovy doc of PipelineLib - T222199 [production]
08:15 <vgutierrez@cumin1001> END (FAIL) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=1) rolling upgrade of HAProxy on A:cp-upload_ulsfo [production]
08:15 <vgutierrez@cumin1001> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_ulsfo [production]
07:40 <tgr_> UTC morning deploys done [production]
07:39 <mvernon@cumin2002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host ms-be2067.codfw.wmnet [production]
07:36 <tgr@deploy2002> Finished scap: Backport for [[gerrit:898869|LevelingUpManager: Ensure that $suggestions is a TaskSet]] (duration: 07m 54s) [production]
07:30 <tgr@deploy2002> tgr: Backport for [[gerrit:898869|LevelingUpManager: Ensure that $suggestions is a TaskSet]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet [production]
07:28 <tgr@deploy2002> Started scap: Backport for [[gerrit:898869|LevelingUpManager: Ensure that $suggestions is a TaskSet]] [production]
06:26 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1105 (s1,s2) T331874', diff saved to https://phabricator.wikimedia.org/P45870 and previous config saved to /var/cache/conftool/dbconfig/20230315-062643-root.json [production]
06:20 <marostegui> Remove pki2001 from m1 grants T332018 [production]
2023-03-14
23:29 <brennen@deploy2002> Finished scap: Backport for [[gerrit:898867|action: Restrict action.delete.js to action=delete pages (T330205)]] (duration: 10m 32s) [production]
23:20 <brennen@deploy2002> brennen and umherirrender: Backport for [[gerrit:898867|action: Restrict action.delete.js to action=delete pages (T330205)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
23:19 <brennen@deploy2002> Started scap: Backport for [[gerrit:898867|action: Restrict action.delete.js to action=delete pages (T330205)]] [production]
22:50 <jhathaway@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm [production]
22:34 <jhathaway@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
22:34 <jhathaway@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm [production]
22:25 <jhathaway@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
22:08 <jhathaway@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm [production]
21:38 <jhathaway@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
21:38 <jhathaway@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm [production]
21:20 <jhathaway@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
21:17 <jhathaway@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm [production]
21:16 <jhathaway@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
21:11 <jhathaway@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm [production]
21:11 <jhathaway@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
21:11 <jhathaway@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm [production]
20:47 <jhathaway@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
20:47 <jhathaway@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm [production]
20:43 <ejegg> payments-wiki upgraded from 61c30a4f to 1532b107 [production]
20:35 <zabe@deploy2002> Finished scap: Backport for [[gerrit:897997|dewiki: Allow 'crats to remove sysopship and manage importers (T331921)]] (duration: 08m 36s) [production]
20:28 <zabe@deploy2002> zabe: Backport for [[gerrit:897997|dewiki: Allow 'crats to remove sysopship and manage importers (T331921)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet [production]
20:27 <zabe@deploy2002> Started scap: Backport for [[gerrit:897997|dewiki: Allow 'crats to remove sysopship and manage importers (T331921)]] [production]
20:04 <jhathaway@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
20:03 <jhathaway@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm [production]
19:47 <topranks> Reboot cloudsw1-b1-codfw to upgrade JunOS version T327919 [production]
19:44 <cmooney@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on cloudsw1-b1-codfw,cloudsw1-b1-codfw IPv6,cloudsw1-b1-codfw.mgmt with reason: cloudsw1-b1-codfw OS upgrade [production]
19:44 <cmooney@cumin1001> START - Cookbook sre.hosts.downtime for 0:30:00 on cloudsw1-b1-codfw,cloudsw1-b1-codfw IPv6,cloudsw1-b1-codfw.mgmt with reason: cloudsw1-b1-codfw OS upgrade [production]
19:32 <jhathaway@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
19:30 <brennen> 1.40.0-wmf.27 train (T330205): uneventful at group0. i'm afk for about an hour. [production]
19:13 <ejegg> civicrm upgraded from dbe3b716 to 68fa85cf [production]
18:51 <herron@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-logging2002.codfw.wmnet with OS bullseye [production]
18:32 <herron@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-logging2002.codfw.wmnet with reason: host reimage [production]
18:28 <fab@deploy2002> Finished deploy [airflow-dags/research@5edcd7b]: (no justification provided) (duration: 00m 11s) [production]
18:27 <fab@deploy2002> Started deploy [airflow-dags/research@5edcd7b]: (no justification provided) [production]
18:27 <herron@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-logging2002.codfw.wmnet with reason: host reimage [production]