2022-10-18
§
|
11:06 |
<cgoubert@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
11:05 |
<cgoubert@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
10:59 |
<cgoubert@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
10:57 |
<claime> |
Disabling nutcracker on k8s-experimental mwdebug - T321042 |
[production] |
10:17 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:843906|Revert "Add multiple integration tests for Hooks.php" (T321041)]] (duration: 06m 24s) |
[production] |
10:11 |
<urbanecm@deploy1002> |
urbanecm and urbanecm: Backport for [[gerrit:843906|Revert "Add multiple integration tests for Hooks.php" (T321041)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet |
[production] |
10:11 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:843906|Revert "Add multiple integration tests for Hooks.php" (T321041)]] |
[production] |
08:50 |
<hashar@deploy1002> |
rebuilt and synchronized wikiversions files: Revert "group0 wikis to 1.40.0-wmf.6" # T320511 |
[production] |
08:35 |
<hashar@deploy1002> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.40.0-wmf.6 refs T320511 |
[production] |
08:28 |
<hashar@deploy1002> |
Pruned MediaWiki: 1.40.0-wmf.4 (duration: 02m 11s) |
[production] |
08:26 |
<hashar> |
scap clean auto # T320511 |
[production] |
08:23 |
<hashar@deploy1002> |
Finished scap: testwikis wikis to 1.40.0-wmf.6 refs T320511 (duration: 36m 04s) |
[production] |
07:47 |
<hashar@deploy1002> |
Started scap: testwikis wikis to 1.40.0-wmf.6 refs T320511 |
[production] |
07:39 |
<hashar> |
`scap stage-train 1.40.0-wmf.6` # T320511 |
[production] |
07:37 |
<hashar> |
Scratched /srv/mediawiki-staging/php-1.40.0-wmf.6 entirely and doing `scap prep` instead |
[production] |
07:35 |
<hashar> |
Rebased /srv/mediawiki-staging/php-1.40.0-wmf.6 for de15f77aa428e3aacf6b66938fb7bdb45ef91443 ( T321021 ) and 0f8be847d9d81882ad5c1e54c2b45cc4d918eb97 ( T319447 ) |
[production] |
2022-10-17
§
|
23:16 |
<bblack@puppetmaster2001> |
conftool action : set/pooled=yes; selector: service=git-ssh |
[production] |
23:16 |
<bblack@puppetmaster2001> |
conftool action : set/weight=100; selector: service=git-ssh |
[production] |
22:55 |
<dzahn@cumin2002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for otrs1001.eqiad.wmnet |
[production] |
22:55 |
<dzahn@cumin2002> |
START - Cookbook sre.hosts.remove-downtime for otrs1001.eqiad.wmnet |
[production] |
22:41 |
<mutante> |
otrs1001 - systemctl reset-failed (clear alert for ifup@ens13.service) |
[production] |
22:36 |
<bblack> |
ganeti1027 - gnt-instance reboot otrs1001.eqiad.wmnet |
[production] |
22:36 |
<dzahn@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on otrs1001.eqiad.wmnet with reason: reboot |
[production] |
22:35 |
<dzahn@cumin2002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on otrs1001.eqiad.wmnet with reason: reboot |
[production] |
22:34 |
<bblack> |
ganeti1027: executing gnt-instance modify -B maxmem=8192 -B memory=8192 otrs1001.eqiad.wmnet |
[production] |
21:33 |
<mutante> |
otrs1001 - after local exim queue has been drained, set MaxThreads for clamav to 12 again, restarted clamav |
[production] |
21:33 |
<mstyles@deploy1002> |
Synchronized php-1.40.0-wmf.5/extensions/CheckUser/src/Api/ApiQueryCheckUser.php: (no justification provided) (duration: 03m 37s) |
[production] |
21:20 |
<mutante> |
otrs1001 - re-enabling puppet, running puppet |
[production] |
21:09 |
<mutante> |
otrs1001 - changing MaxThreads from 6 to 1 in /etc/clamav/clamd.conf, starting clamav |
[production] |
21:02 |
<mutante> |
otrs1001 - temp disabled puppet, changing MaxThreads from 12 to 6 in /etc/clamav/clamd.conf |
[production] |
20:40 |
<mutante> |
mx1001 - exim4 -qf - trying to re-deliver mail in queue for info@ OTRS queue |
[production] |
20:18 |
<urbanecm@deploy1002> |
Finished scap: 6762292a4: e320d48c8: 6762292a4: DicsussionTools/WikimediaEvents backports (T315688, T315689, T320938) (duration: 04m 35s) |
[production] |
20:13 |
<urbanecm@deploy1002> |
Started scap: 6762292a4: e320d48c8: 6762292a4: DicsussionTools/WikimediaEvents backports (T315688, T315689, T320938) |
[production] |
19:58 |
<dzahn@cumin2002> |
conftool action : set/pooled=yes; selector: dc=eqiad,name=phab1001-vcs.eqiad.wmnet |
[production] |
19:57 |
<dzahn@cumin2002> |
conftool action : set/pooled=yes; selector: dc=codfw,name=phab2001-vcs.codfw.wmnet |
[production] |
19:20 |
<mutante> |
otrs1001 - started failed clamav-daemon service |
[production] |
18:57 |
<mutante> |
puppetmaster2001 - deleted confd-template .err files |
[production] |
18:56 |
<mutante> |
puppetmaster1001 - deleted confd-template .err files |
[production] |
18:49 |
<dzahn@cumin2002> |
conftool action : set/pooled=inactive; selector: name=phab1001-vcs.eqiad.wmnet |
[production] |
18:48 |
<dzahn@cumin2002> |
conftool action : set/pooled=inactive; selector: name=phab2001-vcs.codfw.wmnet |
[production] |
18:12 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2177 (T318955)', diff saved to https://phabricator.wikimedia.org/P35544 and previous config saved to /var/cache/conftool/dbconfig/20221017-181217-ladsgroup.json |
[production] |
17:57 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P35543 and previous config saved to /var/cache/conftool/dbconfig/20221017-175711-ladsgroup.json |
[production] |
17:42 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P35542 and previous config saved to /var/cache/conftool/dbconfig/20221017-174204-ladsgroup.json |
[production] |
17:26 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2177 (T318955)', diff saved to https://phabricator.wikimedia.org/P35541 and previous config saved to /var/cache/conftool/dbconfig/20221017-172658-ladsgroup.json |
[production] |
17:19 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 32787 |
[production] |
17:16 |
<ayounsi@cumin1001> |
START - Cookbook sre.network.peering with action 'configure' for AS: 32787 |
[production] |
17:12 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2177 (T318955)', diff saved to https://phabricator.wikimedia.org/P35540 and previous config saved to /var/cache/conftool/dbconfig/20221017-171229-ladsgroup.json |
[production] |
17:12 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2177.codfw.wmnet with reason: Maintenance |
[production] |
17:12 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2177.codfw.wmnet with reason: Maintenance |
[production] |
17:11 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2156 (T318955)', diff saved to https://phabricator.wikimedia.org/P35539 and previous config saved to /var/cache/conftool/dbconfig/20221017-171156-ladsgroup.json |
[production] |