2020-05-29
|
22:07 |
<wm-bot> |
<rhinosf1> purge old *.*db files, tar+gzip logs/* and nuke the __pycache__ dirs |
[tools.zppixbot-test] |
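The cleanup logged above (purge `*.*db` files, tar+gzip `logs/*`, remove the `__pycache__` directories) could be sketched roughly as follows. This is only an illustration, not the commands that were actually run: the function name `clean_tool_dir`, the archive name `logs.tar.gz`, and the choice to leave the original log files in place after archiving are all assumptions.

```python
import shutil
import tarfile
from pathlib import Path


def clean_tool_dir(root: Path, archive_name: str = "logs.tar.gz") -> Path:
    """Hypothetical helper: purge *.*db files, archive logs/, drop __pycache__.

    Returns the path the archive is written to (it only exists if logs/ does).
    """
    # 1. Delete files matching the *.*db glob at the top level of root.
    for db_file in root.glob("*.*db"):
        if db_file.is_file():
            db_file.unlink()

    # 2. Pack every regular file in logs/ into one gzip-compressed tarball.
    #    Assumption: the originals are kept; the log entry does not say.
    logs_dir = root / "logs"
    archive_path = root / archive_name
    if logs_dir.is_dir():
        log_files = sorted(p for p in logs_dir.iterdir() if p.is_file())
        with tarfile.open(archive_path, "w:gz") as tar:
            for log_file in log_files:
                tar.add(log_file, arcname=f"logs/{log_file.name}")

    # 3. Remove every __pycache__ directory below root.
    for cache_dir in list(root.rglob("__pycache__")):
        if cache_dir.is_dir():
            shutil.rmtree(cache_dir)

    return archive_path
```

Calling `clean_tool_dir(Path("/path/to/tool"))` on a copy of the tool directory first would be a sensible way to check what the globs match before running it for real.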
21:24 |
<wm-bot> |
<rhinosf1> sync done |
[tools.zppixbot] |
21:23 |
<wm-bot> |
<rhinosf1> syncing to deploy 599873 -- T233993 |
[tools.zppixbot] |
20:55 |
<bstorm_> |
updating views on labsdb1011 T252219 |
[production] |
19:39 |
<bstorm_> |
switch deployment to the openresty version to try it out T252217 |
[tools.paws-public] |
19:37 |
<bstorm_> |
adding docker image for paws-public docker-registry.tools.wmflabs.org/paws-public-nginx:openresty T252217 |
[tools] |
19:27 |
<ryankemper> |
Successfully finished a rolling restart of the `cloudelastic` clusters (chi, psi, omega) as part of elasticsearch plugins upgrade. Host and service checks re-enabled. |
[production] |
18:42 |
<hauskatze> |
gerrit: replication start mediawiki/extensions/WikiShare --wait refs. T250400 |
[releng] |
18:17 |
<hauskatze> |
GitHub: Deleted mirror wikimedia/mediawiki-extensions-PopupPages refs. T251000 |
[releng] |
18:12 |
<bstorm_> |
applied in-place fix for non-ASCII usernames and applied this to my own version of the image T252217 |
[tools.paws-public] |
17:28 |
<bstorm_> |
updating views on labsdb1009 T252219 |
[production] |
16:50 |
<ryankemper> |
Performing a rolling restart of the `cloudelastic` clusters (chi, psi, omega) as part of elasticsearch plugins upgrade. Host and service checks disabled. |
[production] |
16:00 |
<bstorm_> |
Updating views on labsdb1012 T252219 |
[production] |
15:59 |
<ryankemper> |
Concluded rolling restart of the `relforge` clusters as part of elasticsearch plugins upgrade. Both hosts `relforge1001` and `relforge1002` are back up. Downtime lifted. |
[production] |
15:29 |
<ryankemper> |
Performing a rolling restart of the `relforge` clusters as part of elasticsearch plugins upgrade |
[production] |
14:59 |
<cdanis> |
disabling puppet on netflow* to deploy Ic71e96f0 T253128 |
[production] |
14:47 |
<akosiaris@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
14:47 |
<akosiaris@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'kube-system' for release 'coredns' . |
[production] |
14:41 |
<akosiaris@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
14:41 |
<akosiaris@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'kube-system' for release 'coredns' . |
[production] |
14:36 |
<hashar> |
Built and successfully tagged docker-registry.discovery.wmnet/releng/java8:0.6.4 |
[releng] |
14:35 |
<akosiaris@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
14:35 |
<akosiaris@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'kube-system' for release 'coredns' . |
[production] |
14:27 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:24 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:15 |
<mdholloway> |
ran extensions/MachineVision/maintenance/removeBlacklistedSuggestions.php on commonswiki (T253821) |
[production] |
13:19 |
<elukey> |
re-run druid webrequest hourly 29/05T11 (failed due to a host reimage in progress) |
[analytics] |
12:49 |
<hnowlan> |
reimaging restbase2009 after disk replacement |
[production] |
12:37 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
12:35 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
12:19 |
<elukey> |
reimage druid1001 to Debian Buster |
[analytics] |
12:15 |
<godog> |
roll-restart to upgrade thanos to 0.13.0rc0 - T252186 T233956 |
[production] |
11:32 |
<moritzm> |
installing cups security updates (client-side libs/tools) |
[production] |
11:01 |
<ema> |
upload prometheus-rdkafka-exporter 0.2 to buster-wikimedia T253551 |
[production] |
10:53 |
<moritzm> |
updating PHP on mwdebug2002 to 7.2.31 |
[production] |
10:05 |
<elukey> |
move el2druid config from druid1001 to an-druid1001 |
[analytics] |
10:02 |
<marostegui> |
Compress InnoDB on db1138 T232446 |
[production] |
09:09 |
<hashar> |
Updating all Jenkins jobs: tox -e jenkins-jobs -- --flush-cache --conf jenkins_jobs.ini update --workers 2 ./jjb |
[releng] |
08:30 |
<godog> |
update swift uid/gid on thanos hosts - T123918 |
[production] |
08:04 |
<mutante> |
phabricator - restarted apache2 - back for me now |
[production] |
08:03 |
<XioNoX> |
add new AMS-IX link to LACP bundle |
[production] |
08:01 |
<mutante> |
phabricator - broken due to "PhabricatorRepositoryMirrorEngine::pushToGitRepository" starting git process that uses 100% CPU, stopped phd service |
[production] |
07:56 |
<mutante> |
phabricator - killed pid 25070 (git) which used 100% of CPU, restarted phd service |
[production] |
07:25 |
<moritzm> |
updating perf on buster systems to new version from 10.4 point release |
[production] |
07:18 |
<hashar> |
Deleted the last php5.6 job, hurrah and thanks James! https://integration.wikimedia.org/ci/job/composer-php56-docker/ # T224906 |
[releng] |
07:15 |
<moritzm> |
installing el-api update from latest Buster point release |
[production] |
07:12 |
<moritzm> |
installing xdg-utils update from latest Buster point release |
[production] |
07:11 |
<mutante> |
mw1293 (canary jobrunner): replaced apache2.conf with the version from mwdebug1001 and restarted apache to debug T190111 |
[production] |
07:00 |
<moritzm> |
installing rake security updates |
[production] |