5001-5050 of 10000 results (36ms)
2020-09-28 ยง
16:34 <cdanis@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-logging-external' for release 'production' . [production]
16:27 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
16:25 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
16:24 <cdanis@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-logging-external' for release 'canary' . [production]
16:23 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime [production]
16:23 <hnowlan@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
16:23 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime [production]
16:23 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime [production]
16:20 <nskaggs@cumin1001> END (FAIL) - Cookbook wmcs.wikireplicas.add_wiki (exit_code=99) [production]
16:20 <nskaggs@cumin1001> START - Cookbook wmcs.wikireplicas.add_wiki [production]
16:20 <cdanis@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-logging-external' for release 'production' . [production]
16:20 <cdanis@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-logging-external' for release 'canary' . [production]
16:08 <hnowlan> reimaging new restbase hosts - restbase1028, restbase1029, restbase1030 [production]
16:08 <XioNoX> push pfw policies - T264013 [production]
15:53 <addshore> reload zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/630607 [releng]
15:51 <papaul> poweroff elastic2037 for DIMM replacing [production]
15:26 <kormat@cumin1001> dbctl commit (dc=all): 'Repool db1114 T196487', diff saved to https://phabricator.wikimedia.org/P12818 and previous config saved to /var/cache/conftool/dbconfig/20200928-152635-kormat.json [production]
15:25 <hashar> Restarting CI Jenkins for plugins uninstallation T260565 [production]
15:15 <hnowlan@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'api-gateway' for release 'staging' . [production]
15:15 <hnowlan@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'api-gateway' for release 'production' . [production]
15:13 <hnowlan@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'api-gateway' for release 'production' . [production]
15:13 <hnowlan@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'api-gateway' for release 'staging' . [production]
15:12 <cdanis@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'eventgate-logging-external' for release 'production' . [production]
15:12 <cdanis@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'eventgate-logging-external' for release 'canary' . [production]
15:09 <elukey> execute set global max_connections=200 on an-coord1001's mariadb (hue reporting too many conns, but in reality the fault is from superset) [analytics]
15:08 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'api-gateway' for release 'staging' . [production]
15:08 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'api-gateway' for release 'production' . [production]
15:03 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
15:01 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
15:00 <elukey@cumin1001> START - Cookbook sre.hosts.downtime [production]
14:59 <elukey@cumin1001> START - Cookbook sre.hosts.downtime [production]
14:55 <arturo> [jbond42] upgraded facter to v3 across the VM fleet [admin]
14:49 <moritzm> installing glib-networking security updates [production]
14:44 <hnowlan@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'api-gateway' for release 'staging' . [production]
14:44 <hnowlan@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'api-gateway' for release 'production' . [production]
14:40 <elukey@puppetmaster1001> conftool action : set/pooled=yes; selector: name=aqs1006.eqiad.wmnet [production]
14:33 <XioNoX> repool eqiad [production]
14:27 <hnowlan@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'api-gateway' for release 'staging' . [production]
14:27 <hnowlan@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'api-gateway' for release 'production' . [production]
14:05 <moritzm> uploaded libdbi-perl 1.631-3+wmf1 for jessie-wikimedia T259102 [production]
13:58 <XioNoX> asw2-d-eqiad# run request system power-off member 4 [production]
13:54 <andrewbogott> moving cloudvirt1035 from aggregate 'spare' to 'ceph'. We're going to need all the capacity we can get while converting older cloudvirts to ceph [admin]
13:51 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:47 <hashar> deployment-snapshot01: freed from texlive packages. No more installed by puppet after https://gerrit.wikimedia.org/r/c/operations/puppet/+/540154 [releng]
13:46 <elukey@puppetmaster1001> conftool action : set/pooled=no; selector: name=aqs1006.eqiad.wmnet [production]
13:45 <XioNoX> downtiming all eqiad row D hosts - T196487 [production]
13:42 <cmjohnson@cumin1001> START - Cookbook sre.dns.netbox [production]
13:39 <hashar> deployment-snapshot01 removing large texlive packages. Superseded by mathoid [releng]
13:38 <godog> roll restart object-replicator on ms-be2* for higher concurrency - T261633 [production]
13:35 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]