8501-8550 of 10000 results (35ms)
2020-06-24 ยง
15:24 <ppchelko@deploy1001> deploy aborted: Release updates to PCS endpoints (duration: 05m 04s) [production]
15:20 <jayme> rolling restart of swift-proxy on thanos-fe[2001-2003].codfw.wmnet,thanos-fe[1001-1003].eqiad.wmnet - T256020 [production]
15:19 <ppchelko@deploy1001> Started deploy [restbase/deploy@9686627]: Release updates to PCS endpoints [production]
15:06 <brennen> merging backports and running a full scap sync for UBN at T256151 [production]
15:00 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) [production]
14:57 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single [production]
14:57 <moritzm> rebooting deneb for kernel update [production]
14:57 <ema> rmlist teampractices T255525 [production]
14:42 <otto@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Migrate SearchSatisfaction from EventLogging to EventGate on group0 - T249261 (duration: 01m 06s) [production]
13:28 <nikerabbit@deploy1001> Synchronized wmf-config/CommonSettings.php: [config] 603167 Remove TranslationNotifications user settings 1/2 (2nd attempt, now with correct file) (duration: 01m 06s) [production]
13:23 <marostegui> Deploy schema change on s6 eqiad primary master - T238966 [production]
12:59 <jbond42> update metamonitoring to use icinga-extmon.wikimedia.org [production]
12:23 <akosiaris@cumin1001> conftool action : set/pooled=yes; selector: name=kubernetes1005.eqiad.wmnet [production]
12:23 <akosiaris@cumin1001> conftool action : set/pooled=yes; selector: name=kubernetes1006.eqiad.wmnet [production]
12:19 <akosiaris@cumin1001> conftool action : set/pooled=no; selector: name=kubernetes1006.eqiad.wmnet [production]
12:19 <akosiaris@cumin1001> conftool action : set/pooled=no; selector: name=kubernetes1005.eqiad.wmnet [production]
12:19 <akosiaris@cumin1001> conftool action : set/pooled=yes; selector: name=kubernetes2005.codfw.wmnet [production]
12:19 <akosiaris@cumin1001> conftool action : set/pooled=yes; selector: name=kubernetes2006.codfw.wmnet [production]
12:17 <akosiaris> depool/drain/reboot/pool kubernetes1005,6 for CPU capacity increase T256236 [production]
12:14 <akosiaris> reboot kubernetes2005,6 for CPU capacity increase T256236 [production]
12:11 <akosiaris> depool kubernetes2005,kubernetes2006 for CPU capacity increase T256236 [production]
12:10 <akosiaris> depool kubernetes2005,kubernetes2006 for CPU capacity increase [production]
12:05 <akosiaris@cumin1001> conftool action : set/pooled=no; selector: name=kubernetes2006.codfw.wmnet [production]
12:05 <akosiaris@cumin1001> conftool action : set/pooled=no; selector: name=kubernetes2005.codfw.wmnet [production]
12:04 <awight> EU vegan BACON cooked [production]
12:03 <awight@deploy1001> Synchronized php-1.35.0-wmf.38/extensions/GrowthExperiments: BACON: [[gerrit:607453|Help panel home screen menu item fixes (T255254)]] (duration: 01m 06s) [production]
11:40 <nikerabbit@deploy1001> Synchronized private/PrivateSettings.php: Remove TranslationNotifications user settings 3/2 (duration: 01m 06s) [production]
11:35 <nikerabbit@deploy1001> Synchronized private/readme.php: [config] 607414 Remove TranslationNotifications user settings 2/2 (duration: 01m 04s) [production]
11:28 <nikerabbit@deploy1001> Synchronized wmf-config/InitialiseSettings.php: [config] 603167 Remove TranslationNotifications user settings 1/2 (duration: 01m 03s) [production]
11:09 <awight@deploy1001> Synchronized wmf-config/CommonSettings.php: BACON: [[gerrit:605255|TwoColConflict: Talk page small deployment CommonSettings.php (T254458)]] (duration: 01m 17s) [production]
10:45 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) [production]
10:39 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single [production]
10:38 <marostegui> Stop haproxy on dbproxy1003 T256216 [production]
10:36 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
10:36 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
10:01 <volans> Production management IP allocation must be done from Netbox from now on, see https://wikitech.wikimedia.org/wiki/DNS/Netbox#Cutoff_dates [production]
09:55 <volans@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
09:53 <kormat@cumin1001> dbctl commit (dc=all): 'Pool db1088 @ 75% into s6 T255927', diff saved to https://phabricator.wikimedia.org/P11648 and previous config saved to /var/cache/conftool/dbconfig/20200624-095338-kormat.json [production]
09:50 <volans@cumin1001> START - Cookbook sre.dns.netbox [production]
09:36 <kormat@cumin1001> dbctl commit (dc=all): 'Pool db1088 @ 50% into s6 T255927', diff saved to https://phabricator.wikimedia.org/P11647 and previous config saved to /var/cache/conftool/dbconfig/20200624-093624-kormat.json [production]
09:13 <marostegui@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:10 <marostegui@cumin2001> START - Cookbook sre.hosts.downtime [production]
08:40 <moritzm> prune remaining nginx packages on mw* servers T255565 [production]
08:31 <kormat@cumin1001> dbctl commit (dc=all): 'Pool db1088 @ 20% into s6 T255927', diff saved to https://phabricator.wikimedia.org/P11645 and previous config saved to /var/cache/conftool/dbconfig/20200624-083120-kormat.json [production]
08:06 <moritzm> re-enable puppet in eqiad [production]
08:04 <marostegui@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
08:04 <marostegui@cumin2001> START - Cookbook sre.hosts.downtime [production]
08:00 <moritzm> disable puppet in eqiad to unblock puppetdb1002 VM migration [production]
07:22 <gehel> restarting blazegraph on wdqs1007 [production]
06:53 <moritzm> draining ganeti1009 for eventual reboot [production]