2020-08-04
06:12 <marostegui@cumin1001> dbctl commit (dc=all): 'More weight to db1089 on main traffic', diff saved to https://phabricator.wikimedia.org/P12149 and previous config saved to /var/cache/conftool/dbconfig/20200804-061255-marostegui.json [production]
06:12 <marostegui@cumin1001> dbctl commit (dc=all): 'Slowly repool db1119', diff saved to https://phabricator.wikimedia.org/P12148 and previous config saved to /var/cache/conftool/dbconfig/20200804-061209-marostegui.json [production]
06:10 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1098:3317 for MCR', diff saved to https://phabricator.wikimedia.org/P12147 and previous config saved to /var/cache/conftool/dbconfig/20200804-061003-marostegui.json [production]
05:45 <wm-bot> <root> restarted webservice (T259560) [tools.sal]
05:37 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
05:35 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
05:18 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1119 for reimage', diff saved to https://phabricator.wikimedia.org/P12146 and previous config saved to /var/cache/conftool/dbconfig/20200804-051843-marostegui.json [production]
05:04 <marostegui> Reboot db1107 to pick up the last kernel [production]
05:01 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1089 into API', diff saved to https://phabricator.wikimedia.org/P12145 and previous config saved to /var/cache/conftool/dbconfig/20200804-050150-marostegui.json [production]
03:56 <legoktm> added Arlo to wmf-deployment Gerrit group [production]
03:53 <legoktm> added subbu to wmf-deployment Gerrit group [production]
2020-08-03
23:43 <mutante> mwdebug1001 - temp installing apt-file for debugging an issue on mwmaint [production]
23:14 <catrope@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Enable GrowthExperiments on fawiki (T253291) (duration: 00m 59s) [production]
21:35 <sbassett> Deployed mitigations for T115888 [production]
21:14 <sbassett@deploy1001> Synchronized php-1.36.0-wmf.2/resources/src/mediawiki.jqueryMsg/mediawiki.jqueryMsg.js: (no justification provided) (duration: 01m 00s) [production]
20:08 <hashar> Updating various jobs to fix a cache pollution caused by Chromium. https://gerrit.wikimedia.org/r/618144 [releng]
19:11 <hashar> Reloaded Zuul for I215ee6238932be041bff6fa6cc453dc4cfa9512f [releng]
18:15 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
18:13 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
18:13 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]
18:13 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]
18:09 <dcausse@deploy1001> Finished deploy [wdqs/wdqs@20dcff3]: deploy 0.3.43 and gui update (duration: 15m 53s) [production]
17:53 <dcausse@deploy1001> Started deploy [wdqs/wdqs@20dcff3]: deploy 0.3.43 and gui update [production]
17:33 <liw@deploy1001> rebuilt and synchronized wikiversions files: all wikis to 1.36.0-wmf.2 [production]
17:28 <dcausse@deploy1001> Finished deploy [wdqs/wdqs@20dcff3]: (no justification provided) (duration: 00m 35s) [production]
17:28 <dcausse@deploy1001> Started deploy [wdqs/wdqs@20dcff3]: (no justification provided) [production]
17:02 <bstorm> increased db connection limit to 800 across galera cluster because we were clearly hovering at limit [admin]
16:58 <liw@deploy1001> rebuilt and synchronized wikiversions files: Revert "group2 wikis to 1.36.0-wmf.1" [production]
16:21 <oblivian@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
16:16 <oblivian@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
16:02 <oblivian@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'blubberoid' for release 'staging' . [production]
15:55 <_joe_> regenerating the TLS certs for blubberoid [production]
15:46 <bd808> `service puppetdb start` on deployment-puppetdb03.deployment-prep.eqiad.wmflabs. Looks like it died from OOM [releng]
15:33 <XioNoX> standardize all routers routing-options config [production]
15:27 <marostegui> Change PK on frwiktionary.revision on db2087:3317, db2129, db2121 db2086:3317 T259524 [production]
15:16 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:14 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
15:12 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:12 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:12 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:12 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:12 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:12 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
15:11 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:11 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:11 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:11 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:11 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:11 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:11 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]