2022-12-13
§
|
10:36 |
<vgutierrez> |
clean up stale prometheus target files in prometheus5001 |
[production] |
10:22 |
<claime> |
puppet run on cp4037 - T290536 |
[production] |
10:21 |
<claime> |
puppet disabled on cp hosts for T290536 |
[production] |
10:01 |
<oblivian@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply |
[production] |
10:00 |
<oblivian@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-debug: apply |
[production] |
09:54 |
<moritzm> |
installing libhttp-daemon-perl security updates |
[production] |
09:17 |
<hashar@deploy1002> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.40.0-wmf.14 refs T320519 |
[production] |
09:07 |
<claime> |
Repooled parse1002.eqiad.wmnet in parsoid service - T324949 |
[production] |
09:05 |
<cgoubert@cumin1001> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for parse1002.eqiad.wmnet |
[production] |
09:05 |
<cgoubert@cumin1001> |
START - Cookbook sre.hosts.remove-downtime for parse1002.eqiad.wmnet |
[production] |
09:01 |
<cgoubert@cumin1001> |
conftool action : set/pooled=no; selector: name=parse1002.eqiad.wmnet |
[production] |
08:58 |
<moritzm> |
installing libpgjava security updates |
[production] |
08:55 |
<moritzm> |
installing xen security updates |
[production] |
08:30 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 100%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42683 and previous config saved to /var/cache/conftool/dbconfig/20221213-083019-root.json |
[production] |
08:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 75%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42682 and previous config saved to /var/cache/conftool/dbconfig/20221213-081514-root.json |
[production] |
08:13 |
<kartik@deploy1002> |
Finished scap: Backport for [[gerrit:867002|Enable Section Translation in Chuvash Wikipedia (T319176)]] (duration: 10m 01s) |
[production] |
08:05 |
<kartik@deploy1002> |
kartik and kartik: Backport for [[gerrit:867002|Enable Section Translation in Chuvash Wikipedia (T319176)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet |
[production] |
08:03 |
<kartik@deploy1002> |
Started scap: Backport for [[gerrit:867002|Enable Section Translation in Chuvash Wikipedia (T319176)]] |
[production] |
08:00 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 50%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42681 and previous config saved to /var/cache/conftool/dbconfig/20221213-080009-root.json |
[production] |
07:52 |
<ladsgroup@deploy1002> |
Finished scap: Backport for [[gerrit:867262|Reduce PC writes from parsoid API to 1%]] (duration: 09m 35s) |
[production] |
07:45 |
<ladsgroup@deploy1002> |
ladsgroup and daniel: Backport for [[gerrit:867262|Reduce PC writes from parsoid API to 1%]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet |
[production] |
07:45 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 25%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42680 and previous config saved to /var/cache/conftool/dbconfig/20221213-074504-root.json |
[production] |
07:43 |
<ladsgroup@deploy1002> |
Started scap: Backport for [[gerrit:867262|Reduce PC writes from parsoid API to 1%]] |
[production] |
07:29 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 10%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42679 and previous config saved to /var/cache/conftool/dbconfig/20221213-072959-root.json |
[production] |
07:14 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 5%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42678 and previous config saved to /var/cache/conftool/dbconfig/20221213-071454-root.json |
[production] |
06:59 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1206 (re)pooling @ 1%: Testing new RAID controller', diff saved to https://phabricator.wikimedia.org/P42677 and previous config saved to /var/cache/conftool/dbconfig/20221213-065949-root.json |
[production] |
06:59 |
<kart_> |
Updated cxserver to 2022-12-06-121330-production (T321781, T324534) |
[production] |
06:57 |
<kartik@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
06:56 |
<kartik@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
06:54 |
<kartik@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
06:54 |
<marostegui> |
Reboot db1206 to test RAID controller |
[production] |
06:54 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1206', diff saved to https://phabricator.wikimedia.org/P42676 and previous config saved to /var/cache/conftool/dbconfig/20221213-065402-marostegui.json |
[production] |
06:53 |
<kartik@deploy1002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
06:51 |
<kartik@deploy1002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
06:50 |
<kartik@deploy1002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
05:23 |
<dwisehaupt> |
payments now using the new digicert certificate in eqiad |
[production] |
04:56 |
<mwpresync@deploy1002> |
Pruned MediaWiki: 1.40.0-wmf.12 (duration: 02m 15s) |
[production] |
04:54 |
<mwpresync@deploy1002> |
Finished scap: testwikis wikis to 1.40.0-wmf.14 refs T320519 (duration: 52m 11s) |
[production] |
04:02 |
<mwpresync@deploy1002> |
Started scap: testwikis wikis to 1.40.0-wmf.14 refs T320519 |
[production] |
2022-12-12
§
|
23:35 |
<tzatziki> |
removing 3 files for legal compliance |
[production] |
23:12 |
<tzatziki> |
removing 2 files for legal compliance |
[production] |
22:04 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.UPGRADE (3 nodes at a time) for ElasticSearch cluster search_codfw: search_codfw elasticsearch and plugin upgrade - ryankemper@cumin2002 |
[production] |
21:57 |
<tzatziki> |
removing 1 file for legal compliance |
[production] |
21:13 |
<ryankemper@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.UPGRADE (3 nodes at a time) for ElasticSearch cluster search_codfw: search_codfw elasticsearch and plugin upgrade - ryankemper@cumin2002 |
[production] |
21:08 |
<cstone> |
civicrm upgraded from 3ae68ab4 to 09925be0 |
[production] |
20:31 |
<ryankemper> |
[WDQS] `ryankemper@cumin2002:~$ sudo -E cumin -b 4 wdqs2* 'systemctl restart wdqs-blazegraph'` |
[production] |
17:59 |
<ladsgroup@deploy1002> |
Finished scap: Backport for [[gerrit:867202|Disable writing parsoid html to PC on commons and wikidata.]] (duration: 07m 45s) |
[production] |
17:53 |
<ladsgroup@deploy1002> |
ladsgroup and daniel: Backport for [[gerrit:867202|Disable writing parsoid html to PC on commons and wikidata.]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
17:52 |
<ladsgroup@deploy1002> |
Started scap: Backport for [[gerrit:867202|Disable writing parsoid html to PC on commons and wikidata.]] |
[production] |
16:18 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host thanos-be1004.eqiad.wmnet with OS bullseye |
[production] |