2020-10-02
§
|
22:00 |
<mutante> |
depooling mw2271 because Icinga alerts about memcached and SAL shows there were ongoing tests of some kind on it |
[production] |
21:59 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: dc=codfw,name=mw2271.codfw.wmnet |
[production] |
21:35 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
21:32 |
<dzahn@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
21:26 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
21:22 |
<dzahn@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
21:09 |
<bstorm> |
rebooting tools-k8s-worker-70 because it seems to be unable to recover from an old NFS disconnect |
[tools] |
19:14 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) |
[production] |
18:35 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
18:27 |
<effie> |
enable puppet on mw2271 |
[production] |
18:16 |
<ebernhardson@deploy1001> |
Finished deploy [wikimedia/discovery/analytics@da6a098]: oozie: query_clicks_hourly needs to wait on codfw events (duration: 02m 01s) |
[production] |
18:14 |
<ebernhardson@deploy1001> |
Started deploy [wikimedia/discovery/analytics@da6a098]: oozie: query_clicks_hourly needs to wait on codfw events |
[production] |
17:37 |
<andrewbogott> |
stopping tools-prometheus-03 to attempt a snapshot |
[tools] |
17:15 |
<mutante> |
submitted puppet refactoring change on maps servers |
[production] |
16:49 |
<effie> |
disable puppet on mw2271 and briefly depool it |
[production] |
16:43 |
<joal> |
Rerun mediawiki-history-denormalize-wf-2020-09 after failed instance |
[analytics] |
16:03 |
<bstorm> |
shutting down tools-prometheus-04 to try to fsck the disk |
[tools] |
15:39 |
<_joe_> |
restarting redis on rdb2003, instance 6380 |
[production] |
15:28 |
<hnowlan> |
bootstrapping restbase1030-a |
[production] |
15:25 |
<cdanis@cumin1001> |
conftool action : set/pooled=true; selector: dnsdisc=ores,name=eqiad |
[production] |
14:45 |
<cdanis@deploy1001> |
Synchronized docroot/wikimediafoundation.org: Separate foundation.wikimedia.org docroot & add .well-known/matrix/server T261531 4573776bd 2fb4c20ae (duration: 01m 01s) |
[production] |
14:39 |
<hashar> |
Successfully tagged docker-registry.discovery.wmnet/releng/operations-puppet:0.7.7 # T263728 |
[releng] |
14:23 |
<elukey> |
live patch refinery-drop-older-than on stat1007 to unblock timer (patch https://gerrit.wikimedia.org/r/6317800) |
[analytics] |
14:19 |
<moritzm> |
installing LLVM 7 bugfix updates from Buster point release |
[production] |
14:09 |
<hashar> |
Successfully tagged docker-registry.discovery.wmnet/releng/helm-linter:0.2.7 for jayme # T264157 |
[releng] |
14:08 |
<effie> |
enable puppet on mwdebug1001 |
[production] |
14:08 |
<moritzm> |
purging some unused kernels on ping* (these only have 3GB "disks") |
[production] |
13:37 |
<Urbanecm> |
Create bot_passwords table at fishbowl wikis (T258356) |
[production] |
13:35 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'db2140 (re)pooling @ 100%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12905 and previous config saved to /var/cache/conftool/dbconfig/20201002-133545-kormat.json |
[production] |
13:20 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'db2140 (re)pooling @ 75%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12904 and previous config saved to /var/cache/conftool/dbconfig/20201002-132042-kormat.json |
[production] |
13:05 |
<hashar> |
Successfully tagged docker-registry.wikimedia.org/releng/helm-linter:0.2.7 T264157 |
[releng] |
13:00 |
<elukey> |
add an-worker110[6-9] to the Hadoop cluster |
[analytics] |
13:00 |
<moritzm> |
installing Linux 4.19.146 on Buster updates (from latest Buster point release, at this point only installing the updates, no reboots (yet)) |
[production] |
12:26 |
<effie> |
disable puppet on mwdebug1001 |
[production] |
12:19 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'db2140 depooling: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12903 and previous config saved to /var/cache/conftool/dbconfig/20201002-121830-kormat.json |
[production] |
12:18 |
<kormat@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |