701-750 of 10000 results (60ms)
2019-12-04 ยง
14:53 <marostegui@cumin1001> dbctl commit (dc=all): 'Set db2135 as master for s10 in codfw', diff saved to https://phabricator.wikimedia.org/P9806 and previous config saved to /var/cache/conftool/dbconfig/20191204-145349-marostegui.json [production]
14:51 <marostegui@cumin1001> dbctl commit (dc=all): 'Pool db2135 in m5 codfw', diff saved to https://phabricator.wikimedia.org/P9805 and previous config saved to /var/cache/conftool/dbconfig/20191204-145145-marostegui.json [production]
14:40 <rzl@cumin1001> conftool action : set/pooled=yes; selector: service=apache2,cluster=appserver,dc=codfw,name=mw2274.codfw.wmnet [production]
14:40 <rzl@cumin1001> conftool action : set/pooled=yes; selector: service=nginx,cluster=appserver,dc=codfw,name=mw2274.codfw.wmnet [production]
14:31 <moritzm> test ldap-corp2001 as LDAP server on mx2001 [production]
14:24 <bblack> ns2 authdns: re-route from ganeti3003 to dns3001 - T236479 [production]
14:10 <bblack@cumin1001> conftool action : set/pooled=yes; selector: name=dns5001.wikimedia.org [production]
14:04 <bblack@cumin1001> conftool action : set/pooled=yes; selector: name=dns[34]001.wikimedia.org [production]
13:59 <rzl@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:57 <rzl@cumin1001> START - Cookbook sre.hosts.downtime [production]
13:54 <bblack@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:52 <bblack@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:52 <bblack@cumin1001> START - Cookbook sre.hosts.downtime [production]
13:50 <bblack@cumin1001> START - Cookbook sre.hosts.downtime [production]
13:45 <bblack@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:43 <bblack@cumin1001> START - Cookbook sre.hosts.downtime [production]
13:24 <bblack@cumin1001> conftool action : set/pooled=no; selector: name=dns[345]001.wikimedia.org [production]
13:24 <onimisionipe> downtimed maps1004 - T239728 [production]
13:23 <bblack> dns[345]001 - starting downtimes/etc for reimage to buster... [production]
12:31 <filippo@cumin1001> conftool action : set/pooled=no; selector: name=ms-fe2007.codfw.wmnet [production]
12:29 <Urbanecm> EU SWAT done [production]
12:28 <urbanecm@deploy1001> Synchronized php-1.35.0-wmf.5/extensions/WikimediaMessages/: SWAT: bbf2a33: Change Schema Revision of WMDEBannerEvents (T239430) (duration: 01m 02s) [production]
12:26 <urbanecm@deploy1001> Synchronized php-1.35.0-wmf.8/extensions/WikimediaMessages/: SWAT: b3ef5cd: Change Schema Revision of WMDEBannerEvents (T239430) (duration: 01m 04s) [production]
11:38 <jbond42> puppet enabled accross the fleet and new CA certificate installed [production]
11:31 <akosiaris> drain kubernetes1002 for test of nf_conntrack changes [production]
11:23 <jbond42> enable puppet in eqiad and deploy updated CA [production]
11:13 <gehel@cumin1001> END (FAIL) - Cookbook sre.wdqs.restart (exit_code=99) [production]
10:54 <jbond42> enable puppet in codfw and deploy updated CA [production]
10:46 <jbond42> enable puppet in esams and deploy updated CA [production]
10:42 <jbond42> enable puppet in ulsfo and deploy updated CA [production]
10:31 <gehel@cumin1001> START - Cookbook sre.wdqs.restart [production]
10:31 <gehel> rolling restart of wdqs for config change (event logging) - T101013 [production]
10:30 <jbond42> enable puppet in eqsin and deploy updated CA [production]
10:24 <marostegui> stop replication and mysql on db2107 (s2 codfw master) to test puppet CA changes [production]
10:21 <marostegui> stop replication and mysql on db2071 to test puppet CA changes [production]
10:02 <jbond42> disabling puppet accros the fleet to start CA update change 548241 [production]
09:29 <godog> roll-restart logstash7 in codfw/eqiad after https://gerrit.wikimedia.org/r/c/operations/puppet/+/554472 [production]
09:15 <marostegui> Reload labsdb1010 after reimporting wikidatawiki.page - T238399 [production]
09:06 <moritzm> updated jenkins on apt.wikimedia.org to 2.190.3 (T239586) [production]
08:05 <effie> Restart php7-fpm on mw1348 [production]
07:09 <marostegui> Depool labsdb1010 to reimport wikidatawiki.page - T238399 [production]
07:02 <marostegui> Repool labsdb1011 [production]
06:36 <mutante> removed LVS IP for git-ssh from interface on phab1003 [production]
06:25 <dzahn@cumin1001> conftool action : set/weight=10; selector: name=phab1001-vcs.eqiad.wmnet [production]
06:13 <mutante> phab1001 - running rsync of /srv/repos with --delete because it's larger than the source by about 5GB - deleting objects to match phab1003, former prod server. now both 50G (T238956) [production]
06:04 <marostegui> Depool labsdb1011 [production]
06:01 <mutante> rsyncing /srv/repos data once again. pulling from phab1003 to phab1001 (T238956) [production]
05:51 <marostegui> Deploy schema change on s3 primary master (db1123) [production]
04:59 <mutante> removed downtime for phabricator.wikimedia.org meta service (paging) [production]
04:58 <mutante> phabricator maintenance ended for today - now running on phab1001 (buster) [production]