1501-1550 of 10000 results (61ms)
2020-02-19 ยง
18:26 <mutante> phab2001 - upgraded ssh-server, kept locally modified config; apt autoremove removes python3-debconf [production]
18:23 <mutante> phab2001 - installing package upgrades, incl. openssh, PHP version [production]
18:22 <mutante> phab2001 - upgrading mariadb client package versions [production]
18:19 <mutante> removing problem ACK from Icinga alerts for wikitech-static MediaWiki version. comments were about things in 2019 [production]
17:48 <robh> cp1089 cp1090 returned to service via T243167 [production]
17:40 <jynus> starting data check between db1078 and db1140:3313 T244958 [production]
17:39 <addshore@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Start reading for the new term store for clients up to Q4000 (T225057) (just incase of cache issue) (duration: 01m 04s) [production]
17:26 <addshore@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Start reading for the new term store for clients up to Q4000 (T225057) (duration: 01m 01s) [production]
17:14 <ema> cp4026: repool after probe Connection:keep-alive experiment revert https://gerrit.wikimedia.org/r/573337 [production]
17:12 <robh> cp1088 returned to service, cp1089 & cp1090 offline for firmware update via T243167 [production]
16:44 <papaul> replacing ps1-a8-codfw mgmt in rack A8 will go down [production]
16:37 <otto@deploy1001> Finished deploy [analytics/refinery@e23918a]: Updating eventgate-analytics port (T245203) and also eventlogging whitelist (duration: 12m 27s) [production]
16:32 <ema> depool cp4026, 5xx [production]
16:24 <otto@deploy1001> Started deploy [analytics/refinery@e23918a]: Updating eventgate-analytics port (T245203) and also eventlogging whitelist [production]
16:13 <marostegui> Depool labsdb1011 to help replication to catch up [production]
16:05 <elukey> Update analytics-in4 filter term eventgate for T245203 on cr1/cr2 eqiad [production]
15:48 <ariel@deploy1001> Finished deploy [dumps/dumps@b42acb5]: fix temp stub generation, add pagerangeinfo cache, some unit tests (duration: 00m 03s) [production]
15:48 <ariel@deploy1001> Started deploy [dumps/dumps@b42acb5]: fix temp stub generation, add pagerangeinfo cache, some unit tests [production]
14:59 <marostegui> Stop mysql on es2021 - T243052 [production]
14:31 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
14:29 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
14:29 <marostegui> Data checksum on db1084 T245621 [production]
14:07 <marostegui> Upgrade and reboot db1084 - T245621 [production]
14:02 <marostegui> Start mysql on db1084 without replication - T245621 [production]
13:53 <jbond42> disable puppet to upgrade postgresql [production]
13:30 <jynus@cumin1001> dbctl commit (dc=all): 'Depool db1084, lots of connection errors', diff saved to https://phabricator.wikimedia.org/P10458 and previous config saved to /var/cache/conftool/dbconfig/20200219-133057-jynus.json [production]
12:25 <ladsgroup@deploy1001> Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:573236|Start reading for the new term store for clients up to Q2000 (T225057)]], take II, the cache issue (duration: 01m 04s) [production]
12:22 <ladsgroup@deploy1001> Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:573236|Start reading for the new term store for clients up to Q2000 (T225057)]] (duration: 01m 06s) [production]
11:56 <volans> better splay of periodic scripts that interact with Netbox - T244291 [production]
11:43 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
11:41 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
11:08 <ladsgroup@deploy1001> Synchronized php-1.35.0-wmf.20/extensions/Wikibase/lib/includes/Store: Get rid of useless metrics in EntityTermLookupBase (T245592) (duration: 01m 04s) [production]
11:06 <ladsgroup@deploy1001> Synchronized php-1.35.0-wmf.19/extensions/Wikibase/lib/includes/Store: Get rid of useless metrics in EntityTermLookupBase (T245592) (duration: 01m 12s) [production]
11:01 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
10:58 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
10:58 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
10:58 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:58 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:58 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:45 <jynus> upgrading mariadb client on cumin hosts [production]
10:38 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db2089:3315, db2089:3316 after new package testing', diff saved to https://phabricator.wikimedia.org/P10457 and previous config saved to /var/cache/conftool/dbconfig/20200219-103806-marostegui.json [production]
10:26 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
10:24 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:17 <jynus> stopping db2089 mariadb@s5 [production]
10:12 <jiji@cumin1001> conftool action : set/weight=30; selector: dc=eqiad,cluster=appserver,service=apache2,name=mw135[0-5]*.eqiad.wmnet [production]
10:12 <jiji@cumin1001> conftool action : set/weight=30; selector: dc=eqiad,cluster=appserver,service=nginx,name=mw135[0-5]*.eqiad.wmnet [production]
10:11 <jiji@cumin1001> conftool action : set/weight=30; selector: dc=eqiad,cluster=appserver,service=nginx,name=mw1349.eqiad.wmnet [production]
10:11 <jiji@cumin1001> conftool action : set/weight=30; selector: dc=eqiad,cluster=appserver,service=apache2,name=mw1349.eqiad.wmnet [production]
10:09 <moritzm> updated tftpboot environment for stretch-bootif for the 9.12 point release T241359 [production]
09:53 <jynus> stopping and upgrading db1140 instances [production]