2151-2200 of 10000 results (26ms)
2020-07-14 ยง
13:18 <jbond@cumin1001> START - Cookbook sre.hosts.downtime [production]
13:18 <jbond@cumin1001> conftool action : set/pooled=no; selector: name=dns1002.wikimedia.org [production]
13:16 <jbond@cumin1001> conftool action : set/pooled=yes; selector: name=dns2002.wikimedia.org [production]
13:13 <jbond42> reboot dns2002 [production]
13:13 <jbond@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:13 <jbond@cumin1001> START - Cookbook sre.hosts.downtime [production]
13:13 <jbond@cumin1001> conftool action : set/pooled=no; selector: name=dns2002.wikimedia.org [production]
13:13 <jbond@cumin1001> conftool action : set/pooled=yes; selector: name=dns2001.wikimedia.org [production]
13:10 <jbond42> reboot dns2001 [production]
13:10 <jbond@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:10 <jbond@cumin1001> START - Cookbook sre.hosts.downtime [production]
13:10 <jbond@cumin1001> conftool action : set/pooled=no; selector: name=dns2001.wikimedia.org [production]
13:09 <elukey@cumin1001> START - Cookbook sre.hadoop.change-distro [production]
13:06 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.stop-cluster (exit_code=0) [production]
13:01 <jbond42> rebooting dns3002 [production]
13:01 <jbond@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:01 <jbond@cumin1001> START - Cookbook sre.hosts.downtime [production]
12:58 <elukey@cumin1001> START - Cookbook sre.hadoop.stop-cluster [production]
12:57 <oblivian@deploy1001> Synchronized wmf-config/InitialiseSettings.php: revert forcehttps after fixing T257887 (duration: 01m 02s) [production]
12:31 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.change-distro (exit_code=0) [production]
12:24 <jbond42> route ns0.wikimedia.org to codfw for reboot [production]
12:20 <moritzm> installing xen security updates (client-side tools/libs) [production]
12:19 <jbond42> re-enable puppet fleet [production]
12:07 <jbond42> disable puppet fleet wide to reboot puppetdb's [production]
12:07 <jbond42> disable puppet ro reboot puppetdb's [production]
12:01 <jforrester@deploy1001> rebuilt and synchronized wikiversions files: group0 wikis to 1.35.0-wmf.41 [production]
11:36 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1130 for query plan checks T238966 ', diff saved to https://phabricator.wikimedia.org/P11898 and previous config saved to /var/cache/conftool/dbconfig/20200714-113612-marostegui.json [production]
11:35 <_joe_> restart pybal on lvs2009 T257887 [production]
11:31 <_joe_> restart pybal on lvs2010 T257887 [production]
11:25 <_joe_> restart pybal on lvs1015 T257887 [production]
11:22 <_joe_> restart pybal on lvs1016 [production]
11:15 <jayme@deploy1001> helmfile [EQIAD] Ran 'sync' command on namespace 'wikifeeds' for release 'production' . [production]
11:03 <jayme@deploy1001> helmfile [CODFW] Ran 'sync' command on namespace 'wikifeeds' for release 'production' . [production]
10:59 <jayme@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'wikifeeds' for release 'staging' . [production]
10:56 <volans@cumin1001> conftool action : set/pooled=inactive; selector: name=wtp2005.codfw.wmnet [production]
10:52 <volans> powerdown wtp2005, hardware issue - T257903 [production]
10:47 <volans@cumin1001> conftool action : set/pooled=no; selector: name=wtp2005.codfw.wmnet [production]
10:45 <jiji@cumin1001> conftool action : set/pooled=no; selector: name=wtp2005.codfw.wmnet,service=parsoid-php [production]
10:45 <jiji@cumin1001> conftool action : set/pooled=no; selector: name=wtp2005.codfw.wmnet,service=parsoid [production]
10:45 <effie> depool wtp2005 [production]
10:42 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
10:42 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:39 <elukey@cumin1001> START - Cookbook sre.hadoop.change-distro [production]
10:39 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.stop-cluster (exit_code=0) [production]
10:32 <elukey@cumin1001> START - Cookbook sre.hadoop.stop-cluster [production]
10:18 <oblivian@deploy1001> helmfile [EQIAD] Ran 'sync' command on namespace 'termbox' for release 'production' . [production]
10:14 <James_F> Running AbuseFilter's updateVarDumps for group1 T246539 [production]
10:13 <oblivian@deploy1001> helmfile [CODFW] Ran 'sync' command on namespace 'termbox' for release 'production' . [production]
10:10 <oblivian@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
10:10 <oblivian@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'termbox' for release 'test' . [production]