2021-03-26
§
|
20:08 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: REIMAGE |
[production] |
17:44 |
<hashar@deploy1002> |
Synchronized php-1.36.0-wmf.36/includes/changes/RecentChange.php: RecentChange: directly build the user identity if we have the data - T277795 (duration: 01m 06s) |
[production] |
17:42 |
<hashar@deploy1002> |
Finished scap: Revert "Add change tags for media additions/removals" - T266067 T278429 (duration: 31m 43s) |
[production] |
17:10 |
<hashar@deploy1002> |
Started scap: Revert "Add change tags for media additions/removals" - T266067 T278429 |
[production] |
15:40 |
<Urbanecm> |
Delete `commonswiki:ip-autoblock:whitelist` cache key from memcached (wmf.36 moves the autoblock whitelist source, and it was deployed on commonswiki for a while, resulting in the cache key being empty) |
[production] |
15:37 |
<hnowlan> |
importing imposm3_0.11.0+git20201104.4758cf4-1_amd64.changes on apt1001 |
[production] |
14:40 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1016.eqiad.wmnet |
[production] |
14:33 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ganeti1016.eqiad.wmnet |
[production] |
14:05 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1015.eqiad.wmnet |
[production] |
13:58 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ganeti1015.eqiad.wmnet |
[production] |
13:10 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1014.eqiad.wmnet |
[production] |
13:02 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ganeti1014.eqiad.wmnet |
[production] |
13:02 |
<moritzm> |
reimaging theemin T275873 |
[production] |
12:56 |
<moritzm> |
drain ganeti1014 |
[production] |
12:49 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1013.eqiad.wmnet |
[production] |
12:42 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ganeti1013.eqiad.wmnet |
[production] |
12:37 |
<moritzm> |
drain ganeti1013 |
[production] |
12:35 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1012.eqiad.wmnet |
[production] |
12:27 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ganeti1012.eqiad.wmnet |
[production] |
10:55 |
<Urbanecm> |
Move `Help talk:Getting Started --> Help talk:Getting started` on enwiki with `[urbanecm@mwmaint1002 ~]$ mwscript moveBatch.php --wiki=enwiki -r 'sysadmin action: fixing [[:phab:T278350]]' -u 'Martin Urbanec' batch.txt` (T278350) |
[production] |
10:49 |
<Urbanecm> |
Move `User talk:TheAafi/Help talk` to `Help talk:Getting Started` via `[urbanecm@mwmaint1002 ~]$ mwscript moveBatch.php --wiki=enwiki -r 'sysadmin action: fixing [[:phab:T278350]]' -u 'Martin Urbanec' batch.txt` to fix an UBN task (T278350) |
[production] |
10:10 |
<akosiaris@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts chlorine.eqiad.wmnet |
[production] |
10:02 |
<akosiaris@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts chlorine.eqiad.wmnet |
[production] |
10:00 |
<akosiaris@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts argon.eqiad.wmnet |
[production] |
09:49 |
<filippo@deploy1002> |
Finished deploy [librenms/librenms@63e862a]: deploy I955cbfc244 (duration: 00m 08s) |
[production] |
09:49 |
<filippo@deploy1002> |
Started deploy [librenms/librenms@63e862a]: deploy I955cbfc244 |
[production] |
09:46 |
<akosiaris@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts argon.eqiad.wmnet |
[production] |
09:45 |
<akosiaris@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts acrab.codfw.wmnet |
[production] |
09:43 |
<moritzm> |
delete fermium in Ganeti (was still around, but powered down) T224586 |
[production] |
09:38 |
<akosiaris@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts acrux.codfw.wmnet |
[production] |
09:36 |
<akosiaris@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts acrab.codfw.wmnet |
[production] |
09:32 |
<akosiaris@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts acrux.codfw.wmnet |
[production] |
09:31 |
<filippo@deploy1002> |
Finished deploy [librenms/librenms@e7727e3]: deploy I12ac21d877c (duration: 00m 12s) |
[production] |
09:31 |
<filippo@deploy1002> |
Started deploy [librenms/librenms@e7727e3]: deploy I12ac21d877c |
[production] |
09:28 |
<moritzm> |
drain ganeti1012 |
[production] |
09:27 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1010.eqiad.wmnet |
[production] |
09:20 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ganeti1010.eqiad.wmnet |
[production] |
08:38 |
<moritzm> |
drain ganeti1010 |
[production] |
08:38 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1009.eqiad.wmnet |
[production] |
08:30 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ganeti1009.eqiad.wmnet |
[production] |
06:11 |
<ryankemper> |
[WDQS Deploy] Deploy complete. Successful test query placed on query.wikidata.org, there's no relevant criticals in Icinga, and Grafana looks good |
[production] |
06:09 |
<ryankemper> |
[WDQS Deploy] Restarted `wdqs-updater` across all hosts, 4 hosts at a time: `sudo -E cumin -b 4 'A:wdqs-all' 'systemctl restart wdqs-updater'` |
[production] |
06:09 |
<ryankemper> |
[WDQS Deploy] Restarted `wdqs-categories` across all test hosts simultaneously: `sudo -E cumin 'A:wdqs-test' 'systemctl restart wdqs-categories'` |
[production] |
06:09 |
<ryankemper> |
[WDQS Deploy] Restarted `wdqs-categories` across lvs-managed hosts, one node at a time: `sudo -E cumin -b 1 'A:wdqs-all and not A:wdqs-test' 'depool && sleep 45 && systemctl restart wdqs-categories && sleep 45 && pool'` |
[production] |
05:06 |
<ryankemper@deploy1002> |
Finished deploy [wdqs/wdqs@bb5a072]: 0.3.68 (duration: 07m 31s) |
[production] |
05:00 |
<ryankemper> |
[WDQS Deploy] Tests passing following deploy of `0.3.68` on canary `wdqs1003`; proceeding to rest of fleet |
[production] |
04:58 |
<ryankemper@deploy1002> |
Started deploy [wdqs/wdqs@bb5a072]: 0.3.68 |
[production] |
04:58 |
<ryankemper> |
[WDQS Deploy] Gearing up for deploy of wdqs `0.3.68`. Pre-deploy tests passing on canary `wdqs1003` |
[production] |