101-150 of 10000 results (26ms)
2020-06-15 §
08:34 <moritzm> reimaging cumin2001 T245114 [production]
08:22 <marostegui> Switchover m3-master from dbproxy1008 to dbproxy1016 - T202367 [production]
08:17 <marostegui> Deploy schema change on db1131 (s6 master) - T250066 [production]
08:09 <moritzm> installing libexif security updates [production]
07:51 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
07:49 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
07:46 <XioNoX> standardize ae device-count on all routers [production]
07:36 <XioNoX> push new pfw firewall policies - T255185 [production]
07:28 <marostegui> Deploy schema change on db1093 [production]
07:28 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1093 for schema change', diff saved to https://phabricator.wikimedia.org/P11492 and previous config saved to /var/cache/conftool/dbconfig/20200615-072835-marostegui.json [production]
07:27 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db2092', diff saved to https://phabricator.wikimedia.org/P11491 and previous config saved to /var/cache/conftool/dbconfig/20200615-072742-marostegui.json [production]
06:21 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
06:19 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
2020-06-14 §
13:51 <qchris> Disabling puppet on gerrit1002 (test instance) to do some more upgrade testing [production]
2020-06-13 §
21:12 <qchris> Enabling puppet on gerrit1002 (test instance). Done with testing for today. [production]
12:51 <herron> restarted logstash service on logstash1007, logstash1009 [production]
12:34 <qchris> Disabling puppet on gerrit1002 (test instance) to do some more upgrade testing [production]
12:33 <godog> bounce logstash on logstash1008, GC death [production]
2020-06-12 §
17:44 <herron> restarting logstash1011 elasticsearch instance [production]
16:49 <elukey> restart php-fpm and pool mw1384 - T255282 [production]
16:33 <elukey> (correct) depool again mw1384 - investigation will follow up in a task [production]
16:32 <elukey> depool again mw1348 - investigation will follow up in a task [production]
15:49 <akosiaris@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
15:44 <akosiaris@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:40 <hnowlan@deploy1001> helmfile [CODFW] Ran 'sync' command on namespace 'changeprop' for release 'production' . [production]
15:40 <pt1979@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
15:37 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime [production]
15:36 <hnowlan@deploy1001> helmfile [EQIAD] Ran 'sync' command on namespace 'changeprop' for release 'production' . [production]
15:27 <akosiaris@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
15:25 <hnowlan@deploy1001> helmfile [EQIAD] Ran 'sync' command on namespace 'changeprop' for release 'production' . [production]
15:24 <akosiaris@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:24 <akosiaris@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:24 <akosiaris@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:24 <akosiaris@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
15:22 <akosiaris@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:22 <akosiaris@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:22 <akosiaris@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:22 <akosiaris@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:22 <akosiaris@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:22 <akosiaris@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:22 <akosiaris@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:22 <akosiaris@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:22 <akosiaris@cumin1001> START - Cookbook sre.hosts.downtime [production]
14:51 <elukey> repool mw1384 as test [production]
14:31 <akosiaris@deploy1001> helmfile [CODFW] Ran 'sync' command on namespace 'changeprop' for release 'production' . [production]
14:30 <akosiaris> bump cpu limits for changeprop another 50% [production]
14:30 <akosiaris@deploy1001> helmfile [EQIAD] Ran 'sync' command on namespace 'changeprop' for release 'production' . [production]
13:36 <akosiaris@deploy1001> helmfile [CODFW] Ran 'sync' command on namespace 'changeprop' for release 'production' . [production]
13:34 <akosiaris> update changeprop in eqiad+codfw for higher CPU limits [production]
13:34 <akosiaris@deploy1001> helmfile [EQIAD] Ran 'sync' command on namespace 'changeprop' for release 'production' . [production]