2022-12-13
§
|
10:01 |
<oblivian@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply |
[production] |
10:00 |
<oblivian@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-debug: apply |
[production] |
09:54 |
<moritzm> |
installing libhttp-daemon-perl security updates |
[production] |
09:17 |
<hashar@deploy1002> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.40.0-wmf.14 refs T320519 |
[production] |
09:07 |
<claime> |
Repooled parse1002.eqiad.wmnet in parsoid service - T324949 |
[production] |
09:05 |
<cgoubert@cumin1001> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for parse1002.eqiad.wmnet |
[production] |
09:05 |
<cgoubert@cumin1001> |
START - Cookbook sre.hosts.remove-downtime for parse1002.eqiad.wmnet |
[production] |
09:01 |
<cgoubert@cumin1001> |
conftool action : set/pooled=no; selector: name=parse1002.eqiad.wmnet |
[production] |
08:58 |
<moritzm> |
installing libpgjava security updates |
[production] |
08:55 |
<moritzm> |
installing xen security updates |
[production] |
08:30 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 100%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42683 and previous config saved to /var/cache/conftool/dbconfig/20221213-083019-root.json |
[production] |
08:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 75%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42682 and previous config saved to /var/cache/conftool/dbconfig/20221213-081514-root.json |
[production] |
08:13 |
<kartik@deploy1002> |
Finished scap: Backport for [[gerrit:867002|Enable Section Translation in Chuvash Wikipedia (T319176)]] (duration: 10m 01s) |
[production] |
08:05 |
<kartik@deploy1002> |
kartik and kartik: Backport for [[gerrit:867002|Enable Section Translation in Chuvash Wikipedia (T319176)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet |
[production] |
08:03 |
<kartik@deploy1002> |
Started scap: Backport for [[gerrit:867002|Enable Section Translation in Chuvash Wikipedia (T319176)]] |
[production] |
08:00 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 50%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42681 and previous config saved to /var/cache/conftool/dbconfig/20221213-080009-root.json |
[production] |
07:52 |
<ladsgroup@deploy1002> |
Finished scap: Backport for [[gerrit:867262|Reduce PC writes from parsoid API to 1%]] (duration: 09m 35s) |
[production] |
07:45 |
<ladsgroup@deploy1002> |
ladsgroup and daniel: Backport for [[gerrit:867262|Reduce PC writes from parsoid API to 1%]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet |
[production] |
07:45 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 25%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42680 and previous config saved to /var/cache/conftool/dbconfig/20221213-074504-root.json |
[production] |
07:43 |
<ladsgroup@deploy1002> |
Started scap: Backport for [[gerrit:867262|Reduce PC writes from parsoid API to 1%]] |
[production] |
07:29 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 10%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42679 and previous config saved to /var/cache/conftool/dbconfig/20221213-072959-root.json |
[production] |
07:14 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 5%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42678 and previous config saved to /var/cache/conftool/dbconfig/20221213-071454-root.json |
[production] |
06:59 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 1%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42677 and previous config saved to /var/cache/conftool/dbconfig/20221213-065949-root.json |
[production] |
06:59 |
<kart_> |
Updated cxserver to 2022-12-06-121330-production (T321781, T324534) |
[production] |
06:57 |
<kartik@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
06:56 |
<kartik@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
06:54 |
<kartik@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
06:54 |
<marostegui> |
Reboot db1206 to test RAID controller |
[production] |
06:54 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1206', diff saved to https://phabricator.wikimedia.org/P42676 and previous config saved to /var/cache/conftool/dbconfig/20221213-065402-marostegui.json |
[production] |
06:53 |
<kartik@deploy1002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
06:51 |
<kartik@deploy1002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
06:50 |
<kartik@deploy1002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
05:23 |
<dwisehaupt> |
payments now using the new digicert certificate in eqiad |
[production] |
04:56 |
<mwpresync@deploy1002> |
Pruned MediaWiki: 1.40.0-wmf.12 (duration: 02m 15s) |
[production] |
04:54 |
<mwpresync@deploy1002> |
Finished scap: testwikis wikis to 1.40.0-wmf.14 refs T320519 (duration: 52m 11s) |
[production] |
04:02 |
<mwpresync@deploy1002> |
Started scap: testwikis wikis to 1.40.0-wmf.14 refs T320519 |
[production] |
03:16 |
<MacFan4000> |
locally applied fixes to stop the apache error log from filling up with spam |
[wm-bot] |
00:44 |
<bd808> |
Update demo server to cbc780 |
[toolhub] |
00:11 |
<bd808> |
Rotated elasticsearch password (T324637) |
[tools.sal] |
2022-12-12
§
|
23:49 |
<MacFan4000> |
clear apache logs to free up space |
[wm-bot] |
23:35 |
<tzatziki> |
removing 3 files for legal compliance |
[production] |
23:12 |
<tzatziki> |
removing 2 files for legal compliance |
[production] |
22:20 |
<wm-bot> |
<bd808> Deleted stuck updatetools pod launched by a CronJob object |
[tools.admin] |
22:04 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.UPGRADE (3 nodes at a time) for ElasticSearch cluster search_codfw: search_codfw elasticsearch and plugin upgrade - ryankemper@cumin2002 |
[production] |
21:57 |
<tzatziki> |
removing 1 file for legal compliance |
[production] |
21:51 |
<wm-bot> |
<root> Changed k8s cronjob send-daily-report schedule to "3 0 * * *" to see if this makes the job run again. |
[tools.anomiebot] |
21:50 |
<wm-bot> |
<root> Changed k8s cronjob send-daily-report schedule to 3 0 AnomieBOT.git bot botctl.sh botlogs bot-test generate-daily-report.sh grid jobs.yaml logs mysql2sqlite needed-perl-packges public_html README replica.my.cnf rotate-logs.pl send-daily-report.sh service.manifest task-status test-task-status.pl tmp www AnomieBOT.git bot botctl.sh botlogs bot-test generate-daily-report.sh grid jobs.yaml logs mysql2sqlite needed-perl- |
[tools.anomiebot] |
21:13 |
<ryankemper@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.UPGRADE (3 nodes at a time) for ElasticSearch cluster search_codfw: search_codfw elasticsearch and plugin upgrade - ryankemper@cumin2002 |
[production] |
21:08 |
<cstone> |
civicrm upgraded from 3ae68ab4 to 09925be0 |
[production] |
20:31 |
<ryankemper> |
[WDQS] `ryankemper@cumin2002:~$ sudo -E cumin -b 4 wdqs2* 'systemctl restart wdqs-blazegraph'` |
[production] |