2020-10-05
§
|
10:34 |
<elukey@cumin1001> |
START - Cookbook sre.aqs.roll-restart |
[production] |
10:32 |
<ema> |
cp3052: pool with varnish 5.1.3-1wm15 T264398 |
[production] |
10:28 |
<ema> |
cp3052: depool and downgrade varnish to 5.1.3-1wm15 T264398 |
[production] |
10:08 |
<moritzm> |
installing ldap-replica1002 T264390 |
[production] |
09:52 |
<moritzm> |
installing ldap-replica1001 T264390 |
[production] |
09:22 |
<moritzm> |
installing ldap-replica2003 T264390 |
[production] |
09:02 |
<hnowlan> |
bootstrapping restbase1030-b |
[production] |
08:57 |
<moritzm> |
installing ldap-replica2004 T264390 |
[production] |
08:40 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'db2073 depooling: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12918 and previous config saved to /var/cache/conftool/dbconfig/20201005-084022-kormat.json |
[production] |
08:39 |
<kormat@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:39 |
<kormat@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:38 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'Add db2119 to s4 dump/vslow temporarily T259831', diff saved to https://phabricator.wikimedia.org/P12917 and previous config saved to /var/cache/conftool/dbconfig/20201005-083822-kormat.json |
[production] |
08:23 |
<godog> |
prometheus codfw/ops, add 100G to the LV |
[production] |
08:06 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
07:46 |
<marostegui> |
Stop mysql on es2017 T264386 |
[production] |
07:30 |
<jmm@cumin2001> |
START - Cookbook sre.ganeti.makevm |
[production] |
06:52 |
<XioNoX> |
add static NAT to pfw3-eqiad - T264356 |
[production] |
06:33 |
<elukey> |
reboot stat1005 to resolve weird GPU state (scheduled last week) |
[production] |
05:06 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool es2017 T264386 ', diff saved to https://phabricator.wikimedia.org/P12916 and previous config saved to /var/cache/conftool/dbconfig/20201005-050636-marostegui.json |
[production] |
2020-10-02
§
|
22:00 |
<mutante> |
depooling mw2271 because Icinga alerts about memcached and SAL shows there were ongoing tests of some kind on it |
[production] |
21:59 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: dc=codfw,name=mw2271.codfw.wmnet |
[production] |
21:35 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
21:32 |
<dzahn@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
21:26 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
21:22 |
<dzahn@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
19:14 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) |
[production] |
18:35 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
18:27 |
<effie> |
enable puppet on mw2271 |
[production] |
18:16 |
<ebernhardson@deploy1001> |
Finished deploy [wikimedia/discovery/analytics@da6a098]: oozie: query_clicks_hourly needs to wait on codfw events (duration: 02m 01s) |
[production] |
18:14 |
<ebernhardson@deploy1001> |
Started deploy [wikimedia/discovery/analytics@da6a098]: oozie: query_clicks_hourly needs to wait on codfw events |
[production] |
17:15 |
<mutante> |
submitted puppet refactoring change on maps servers |
[production] |
16:49 |
<effie> |
disable puppet on mw2271 and briefly depool it |
[production] |
15:39 |
<_joe_> |
restarting redis on rdb2003, instance 6380 |
[production] |
15:28 |
<hnowlan> |
bootstrapping restbase1030-a |
[production] |
15:25 |
<cdanis@cumin1001> |
conftool action : set/pooled=true; selector: dnsdisc=ores,name=eqiad |
[production] |
14:45 |
<cdanis@deploy1001> |
Synchronized docroot/wikimediafoundation.org: Separate foundation.wikimedia.org docroot & add .well-known/matrix/server T261531 4573776bd 2fb4c20ae (duration: 01m 01s) |
[production] |
14:19 |
<moritzm> |
installing LLVM 7 bugfix updates from Buster point release |
[production] |
14:08 |
<effie> |
enable puppet on mwdebug1001 |
[production] |
14:08 |
<moritzm> |
purging some unused kernels on ping* (these only have 3GB "disks") |
[production] |
13:37 |
<Urbanecm> |
Create bot_passwords table at fishbowl wikis (T258356) |
[production] |
13:35 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'db2140 (re)pooling @ 100%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12905 and previous config saved to /var/cache/conftool/dbconfig/20201002-133545-kormat.json |
[production] |
13:20 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'db2140 (re)pooling @ 75%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12904 and previous config saved to /var/cache/conftool/dbconfig/20201002-132042-kormat.json |
[production] |
13:00 |
<moritzm> |
installing Linux 4.19.146 on Buster updates (from latest Buster point release, at this point only installing the updates, no reboots (yet)) |
[production] |
12:26 |
<effie> |
disable puppet on mwdebug1001 |
[production] |
12:19 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'db2140 depooling: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12903 and previous config saved to /var/cache/conftool/dbconfig/20201002-121830-kormat.json |
[production] |
12:18 |
<kormat@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
12:18 |
<kormat@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
12:08 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'db2110 (re)pooling @ 100%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12902 and previous config saved to /var/cache/conftool/dbconfig/20201002-120825-kormat.json |
[production] |