2019-07-29
17:05 <XioNoX> reprepro copy buster-wikimedia stretch-wikimedia anycast-healthchecker [production]
16:47 <godog> add anycast syslog to wezen/centrallog1001 [production]
16:19 <elukey> manually stopped the sre.kafka.roll-restart-brokers cookbook after 4 broker restarts since the sleep interval (10 mins) is too tight. [production]
16:17 <elukey@cumin1001> END (ERROR) - Cookbook sre.kafka.roll-restart-brokers (exit_code=97) [production]
15:35 <otto@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Retry - Produce resource_change stream to eventgate-main - T211248 (duration: 00m 46s) [production]
15:34 <elukey@cumin1001> START - Cookbook sre.kafka.roll-restart-brokers [production]
15:30 <otto@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Produce resource_change stream to eventgate-main - T211248 (duration: 00m 47s) [production]
14:35 <papaul> shutting down pc2010 for maintenance [production]
13:57 <cdanis@cumin1001> dbctl commit of MediaWiki config (dc=all), diff saved to 'https://phabricator.wikimedia.org/P8816', previous config saved to /var/cache/conftool/dbconfig/20190729-135730-cdanis.json [production]
13:30 <elukey@cumin1001> END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) [production]
13:28 <marostegui> Stop MySQL on pc2010 - T227552 [production]
13:23 <arturo> T228870 reboot cloudvirt1007.eqiad.wmnet for kernel updates [production]
13:23 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:23 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
13:09 <arturo> T228870 reboot cloudvirt1006.eqiad.wmnet for kernel updates [production]
13:09 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:09 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
13:01 <elukey@cumin1001> START - Cookbook sre.druid.roll-restart-workers [production]
12:45 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Provision db2128 into s5 api T221533 (duration: 00m 47s) [production]
12:45 <marostegui> Provision db2128 into s5 codfw - T228969 [production]
12:44 <marostegui@deploy1001> Synchronized wmf-config/db-codfw.php: Provision db2128 into s5 api T221533 (duration: 00m 47s) [production]
12:39 <arturo> T228870 reboot cloudvirt1005.eqiad.wmnet for kernel updates [production]
12:38 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
12:38 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
12:20 <arturo> T228870 reboot cloudvirt1004.eqiad.wmnet for kernel updates [production]
12:20 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
12:20 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
11:58 <arturo> T228870 reboot cloudvirt1003.eqiad.wmnet for kernel updates [production]
11:57 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
11:57 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
11:36 <arturo> icinga downtime toolschecker for 6h [production]
11:31 <arturo> T228870 reboot cloudvirt1002.eqiad.wmnet for kernel updates [production]
11:31 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
11:31 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
11:14 <arturo> T228870 reboot cloudvirt1001.eqiad.wmnet for kernel updates [production]
11:14 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
11:13 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
11:13 <aborrero@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
11:13 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
11:11 <dcausse> EU SWAT done [production]
11:10 <dcausse@deploy1001> Synchronized wmf-config/SearchSettingsForWikidata.php: [cirrus] Use correct factory declaration for EntityFullTextQueryBuilder (duration: 00m 47s) [production]
10:37 <jdrewniak@deploy1001> Synchronized portals: Wikimedia Portals Update: [[gerrit:526125| Bumping portals to master (T128546)]] (duration: 00m 47s) [production]
10:36 <jdrewniak@deploy1001> Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:526125| Bumping portals to master (T128546)]] (duration: 00m 47s) [production]
09:49 <marostegui> Add db2128 to tendril and zarcillo - T228969 [production]
09:24 <elukey@cumin1001> END (FAIL) - Cookbook sre.druid.roll-restart-workers (exit_code=99) [production]
09:22 <elukey@cumin1001> START - Cookbook sre.druid.roll-restart-workers [production]
09:21 <elukey@cumin1001> END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) [production]
08:55 <elukey@cumin1001> START - Cookbook sre.druid.roll-restart-workers [production]
08:51 <root@> helmfile [STAGING] Ran 'apply' command on namespace 'kube-system' for release 'calico-policy-controller' . [production]
08:47 <elukey> set mcrouter async behavior for codfw replication to all mw app/api servers (changes will be picked up when puppet runs on the hosts) - T225642 [production]