2021-07-28
§
|
11:03 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 5 days, 8:00:00 on planet1002.eqiad.wmnet with reason: known issue |
[production] |
10:11 |
<moritzm> |
installing remaining nginx security updates on stretch |
[production] |
10:09 |
<godog> |
temp fix prometheus-icinga-am on alert1001 |
[production] |
09:40 |
<dcausse@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'rdf-streaming-updater' for release 'main' . |
[production] |
09:40 |
<urbanecm> |
Start server-side upload for 1 video file (T287482) |
[production] |
09:29 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
09:29 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
09:28 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
09:24 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
09:24 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
08:33 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1122.eqiad.wmnet with reason: REIMAGE |
[production] |
08:31 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1122.eqiad.wmnet with reason: REIMAGE |
[production] |
08:27 |
<Amir1> |
running several long-running queries against pc1007 |
[production] |
08:13 |
<oblivian@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
08:01 |
<dcausse@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'rdf-streaming-updater' for release 'main' . |
[production] |
07:53 |
<moritzm> |
installing aspell security updates on stretch |
[production] |
07:20 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on 29 hosts with reason: T287559 |
[production] |
07:20 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.downtime for 5:00:00 on 29 hosts with reason: T287559 |
[production] |
07:20 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on 40 hosts with reason: T287559 |
[production] |
07:20 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.downtime for 5:00:00 on 40 hosts with reason: T287559 |
[production] |
07:20 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on 6 hosts with reason: T287559 |
[production] |
07:20 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.downtime for 5:00:00 on 6 hosts with reason: T287559 |
[production] |
07:07 |
<godog> |
remove cloud*/syslog.log from centrallog2001 - T287559 |
[production] |
07:06 |
<godog> |
remove node_pinger.prom from node-pinger hosts |
[production] |
06:42 |
<godog> |
remove obsolete user.log.manual-rotation from centrallog1001 to free disk space |
[production] |
02:43 |
<TimStarling> |
on mwmaint2002 fixing T286273 broken files using eval.php |
[production] |
2021-07-27
§
|
23:53 |
<thcipriani@deploy1002> |
Synchronized php-1.37.0-wmf.16/skins/Vector: Backport: [[gerrit:708220|Restore print, links, table and message box styles (T278896)]] (duration: 01m 07s) |
[production] |
23:15 |
<cjming@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:708152|Enable user links on office + test wikis (T287391)]] (duration: 02m 00s) |
[production] |
20:44 |
<ryankemper> |
[WDQS] Returning `wdqs` dns discovery to the expected status of `(eqiad, codfw) = (depooled, pooled)`: `sudo confctl --object-type discovery select 'dnsdisc=wdqs,name=eqiad' set/pooled=false` |
[production] |
20:44 |
<legoktm> |
legoktm@wtp1025:~$ sudo systemctl restart php7.2-fpm # restart php-fpm, opcache hit ratio was warning |
[production] |
20:43 |
<ryankemper@puppetmaster1001> |
conftool action : set/pooled=false; selector: dnsdisc=wdqs,name=eqiad |
[production] |
20:35 |
<twentyafterfour@deploy1002> |
Pruned MediaWiki: 1.37.0-wmf.14 (duration: 03m 12s) |
[production] |
20:25 |
<twentyafterfour@deploy1002> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.37.0-wmf.16 |
[production] |
20:02 |
<twentyafterfour@deploy1002> |
Finished scap: testwikis wikis to 1.37.0-wmf.16 (duration: 36m 06s) |
[production] |
19:26 |
<twentyafterfour@deploy1002> |
Started scap: testwikis wikis to 1.37.0-wmf.16 |
[production] |
18:49 |
<otto@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Migrate EchoMail and EchoInteraction to EventGate - T287210 (duration: 02m 28s) |
[production] |
18:17 |
<twentyafterfour> |
MediaWiki branch `1.37.0-wmf.16` prepped and patched in preparation for the upcoming deployment window. |
[production] |
17:50 |
<legoktm@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
17:47 |
<legoktm@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
17:28 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts logstash2022.codfw.wmnet |
[production] |
17:18 |
<herron@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts logstash2022.codfw.wmnet |
[production] |
17:17 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts logstash2021.codfw.wmnet |
[production] |
16:54 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1003.eqiad.wmnet |
[production] |
16:49 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host ml-serve1003.eqiad.wmnet |
[production] |
16:49 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1004.eqiad.wmnet |
[production] |
16:47 |
<herron@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts logstash2021.codfw.wmnet |
[production] |
16:44 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts logstash2020.codfw.wmnet |
[production] |
16:43 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host ml-serve1004.eqiad.wmnet |
[production] |
16:42 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1002.eqiad.wmnet |
[production] |
16:37 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host ml-serve1002.eqiad.wmnet |
[production] |