2022-10-17
§
|
23:16 |
<bblack@puppetmaster2001> |
conftool action : set/pooled=yes; selector: service=git-ssh |
[production] |
23:16 |
<bblack@puppetmaster2001> |
conftool action : set/weight=100; selector: service=git-ssh |
[production] |
22:55 |
<dzahn@cumin2002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for otrs1001.eqiad.wmnet |
[production] |
22:55 |
<dzahn@cumin2002> |
START - Cookbook sre.hosts.remove-downtime for otrs1001.eqiad.wmnet |
[production] |
22:41 |
<mutante> |
otrs1001 - systemctl reset-failed (clear alert for ifup@ens13.service) |
[production] |
22:36 |
<bblack> |
ganeti1027 - gnt-instance reboot otrs1001.eqiad.wmnet |
[production] |
22:36 |
<dzahn@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on otrs1001.eqiad.wmnet with reason: reboot |
[production] |
22:35 |
<dzahn@cumin2002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on otrs1001.eqiad.wmnet with reason: reboot |
[production] |
22:34 |
<bblack> |
ganeti1027: executing gnt-instance modify -B maxmem=8192 -B memory=8192 otrs1001.eqiad.wmnet |
[production] |
21:33 |
<mutante> |
otrs1001 - after local exim queue has been drained, set MaxThreads for clamav to 12 again, restarted clamav |
[production] |
21:33 |
<mstyles@deploy1002> |
Synchronized php-1.40.0-wmf.5/extensions/CheckUser/src/Api/ApiQueryCheckUser.php: (no justification provided) (duration: 03m 37s) |
[production] |
21:20 |
<mutante> |
otrs1001 - re-enabling puppet, running puppet |
[production] |
21:09 |
<mutante> |
otrs1001 - changing MaxThreads from 6 to 1 in /etc/clamav/clamd.conf, starting clamav |
[production] |
21:02 |
<mutante> |
otrs1001 - temp disabled puppet, changing MaxThreads from 12 to 6 in /etc/clamav/clamd.conf |
[production] |
20:40 |
<mutante> |
mx1001 - exim4 -qf - trying to re-deliver mail in queue for info@ OTRS queue |
[production] |
20:18 |
<urbanecm@deploy1002> |
Finished scap: 6762292a4: e320d48c8: 6762292a4: DicsussionTools/WikimediaEvents backports (T315688, T315689, T320938) (duration: 04m 35s) |
[production] |
20:13 |
<urbanecm@deploy1002> |
Started scap: 6762292a4: e320d48c8: 6762292a4: DicsussionTools/WikimediaEvents backports (T315688, T315689, T320938) |
[production] |
19:58 |
<dzahn@cumin2002> |
conftool action : set/pooled=yes; selector: dc=eqiad,name=phab1001-vcs.eqiad.wmnet |
[production] |
19:57 |
<dzahn@cumin2002> |
conftool action : set/pooled=yes; selector: dc=codfw,name=phab2001-vcs.codfw.wmnet |
[production] |
19:20 |
<mutante> |
otrs1001 - started failed clamav-daemon service |
[production] |
18:57 |
<mutante> |
puppetmaster2001 - deleted confd-template .err files |
[production] |
18:56 |
<mutante> |
puppetmaster1001 - deleted confd-template .err files |
[production] |
18:49 |
<dzahn@cumin2002> |
conftool action : set/pooled=inactive; selector: name=phab1001-vcs.eqiad.wmnet |
[production] |
18:48 |
<dzahn@cumin2002> |
conftool action : set/pooled=inactive; selector: name=phab2001-vcs.codfw.wmnet |
[production] |
18:12 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2177 (T318955)', diff saved to https://phabricator.wikimedia.org/P35544 and previous config saved to /var/cache/conftool/dbconfig/20221017-181217-ladsgroup.json |
[production] |
17:57 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P35543 and previous config saved to /var/cache/conftool/dbconfig/20221017-175711-ladsgroup.json |
[production] |
17:42 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P35542 and previous config saved to /var/cache/conftool/dbconfig/20221017-174204-ladsgroup.json |
[production] |
17:26 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2177 (T318955)', diff saved to https://phabricator.wikimedia.org/P35541 and previous config saved to /var/cache/conftool/dbconfig/20221017-172658-ladsgroup.json |
[production] |
17:19 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 32787 |
[production] |
17:16 |
<ayounsi@cumin1001> |
START - Cookbook sre.network.peering with action 'configure' for AS: 32787 |
[production] |
17:12 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2177 (T318955)', diff saved to https://phabricator.wikimedia.org/P35540 and previous config saved to /var/cache/conftool/dbconfig/20221017-171229-ladsgroup.json |
[production] |
17:12 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2177.codfw.wmnet with reason: Maintenance |
[production] |
17:12 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2177.codfw.wmnet with reason: Maintenance |
[production] |
17:11 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2156 (T318955)', diff saved to https://phabricator.wikimedia.org/P35539 and previous config saved to /var/cache/conftool/dbconfig/20221017-171156-ladsgroup.json |
[production] |
16:56 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P35538 and previous config saved to /var/cache/conftool/dbconfig/20221017-165649-ladsgroup.json |
[production] |
16:41 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P35537 and previous config saved to /var/cache/conftool/dbconfig/20221017-164143-ladsgroup.json |
[production] |
16:26 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2156 (T318955)', diff saved to https://phabricator.wikimedia.org/P35536 and previous config saved to /var/cache/conftool/dbconfig/20221017-162636-ladsgroup.json |
[production] |
16:18 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2156 (T318955)', diff saved to https://phabricator.wikimedia.org/P35535 and previous config saved to /var/cache/conftool/dbconfig/20221017-161843-ladsgroup.json |
[production] |
16:18 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 16:00:00 on db2094.codfw.wmnet with reason: Maintenance |
[production] |
16:18 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 16:00:00 on db2094.codfw.wmnet with reason: Maintenance |
[production] |
16:18 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2156.codfw.wmnet with reason: Maintenance |
[production] |