2020-03-03
16:51 <bblack> lvs5003 - restart pybal, back to normal operations [production]
16:51 <hnowlan@deploy1001> helmfile [EQIAD] Ran 'sync' command on namespace 'changeprop' for release 'production' . [production]
16:51 <krinkle@deploy1001> Synchronized multiversion/MWWikiversions.php: I9d658ff41b78 (duration: 01m 04s) [production]
16:50 <hnowlan@deploy1001> helmfile [STAGING] Ran 'apply' command on namespace 'changeprop' for release 'staging' . [production]
16:49 <hnowlan@deploy1001> helmfile [STAGING] Ran 'apply' command on namespace 'changeprop' for release 'staging' . [production]
16:49 <bblack> reload icinga config on icinga1001 [production]
16:48 <krinkle@deploy1001> Synchronized wmf-config/import.php: I9d658ff41b78 (duration: 01m 03s) [production]
16:47 <hnowlan@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . [production]
16:47 <otto@deploy1001> Started restart [restbase/deploy@bfdd342]: Restart to pick up new LVS TLS port for eventgate T242224 [production]
16:47 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
16:45 <hnowlan@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . [production]
16:44 <vgutierrez@cumin1001> START - Cookbook sre.hosts.downtime [production]
16:35 <otto@deploy1001> Started restart [restbase/deploy@bfdd342] (dev-cluster): Restart (dev-cluster) to pick up new LVS TLS port for eventgate T242224 [production]
16:34 <krinkle@deploy1001> Synchronized multiversion/MWWikiversions.php: I8815be28d6a26a1 - T169821 (duration: 01m 04s) [production]
16:32 <vgutierrez> reimage lvs1013 with buster - T245984 [production]
16:28 <bblack> stopping pybal on lvs5003 to test the new icinga checks (will cause a BGP alert, among others) [production]
16:17 <Pchelolo> restart restbase on 2009 for T242224 [production]
16:14 <ottomata> switching restbase & change prop to new eventgate-main LVS TLS ports [production]
16:13 <vgutierrez> Re-enable BGP in lvs1014 - T245984 [production]
16:05 <vgutierrez> Starting pybal on lvs2009 - T246686 [production]
16:04 <marostegui@cumin1001> dbctl commit (dc=all): 'Fully repool db1096:3315 and db1096:3316 after reimage to buster T246604', diff saved to https://phabricator.wikimedia.org/P10597 and previous config saved to /var/cache/conftool/dbconfig/20200303-160433-marostegui.json [production]
15:59 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
15:56 <vgutierrez@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:55 <liw@deploy1001> Finished scap: group0 to 1.35.0-wmf.22 (duration: 24m 29s) [production]
15:49 <marostegui@cumin1001> dbctl commit (dc=all): 'Slowly repool db1096:3315 and db1096:3316 after reimage to buster T246604', diff saved to https://phabricator.wikimedia.org/P10596 and previous config saved to /var/cache/conftool/dbconfig/20200303-154913-marostegui.json [production]
15:47 <hnowlan@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . [production]
15:45 <vgutierrez> Stopping pybal on lvs2009 to let lvs2010 get its traffic - T246686 [production]
15:45 <mutante> wtp1025 - scap pull as user cscott - testing sudo privs issue [production]
15:44 <vgutierrez> reimage lvs1014 with buster - T245984 [production]
15:43 <mutante> wtp1025 - scap pull [production]
15:35 <hnowlan@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . [production]
15:34 <hnowlan@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . [production]
15:31 <vgutierrez> Re-enable BGP in lvs1015 - T245984 [production]
15:31 <liw@deploy1001> Started scap: group0 to 1.35.0-wmf.22 [production]
15:30 <hnowlan@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . [production]
15:22 <liw@deploy1001> Finished scap: testwiki to php-1.35.0-wmf.22 and rebuild l10n cache (duration: 71m 23s) [production]
15:20 <hnowlan@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . [production]
15:18 <marostegui@cumin1001> dbctl commit (dc=all): 'Slowly repool db1096:3315 and db1096:3316 after reimage to buster T246604', diff saved to https://phabricator.wikimedia.org/P10595 and previous config saved to /var/cache/conftool/dbconfig/20200303-151805-marostegui.json [production]
15:15 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
15:15 <elukey@cumin1001> END (PASS) - Cookbook sre.elasticsearch.rolling-restart (exit_code=0) [production]
15:13 <vgutierrez@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:07 <marostegui@cumin1001> dbctl commit (dc=all): 'Decrease a bit the weight for db1126', diff saved to https://phabricator.wikimedia.org/P10594 and previous config saved to /var/cache/conftool/dbconfig/20200303-150712-marostegui.json [production]
15:00 <vgutierrez> reimage lvs1015 with buster - T245984 [production]
14:52 <marostegui@cumin1001> dbctl commit (dc=all): 'Slowly repool db1096:3315 and db1096:3316 after reimage to buster T246604', diff saved to https://phabricator.wikimedia.org/P10591 and previous config saved to /var/cache/conftool/dbconfig/20200303-145230-marostegui.json [production]
14:44 <vgutierrez@cumin2001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
14:43 <vgutierrez@cumin2001> START - Cookbook sre.hosts.decommission [production]
14:43 <vgutierrez> running the decommission cookbook against lvs2001.codfw.wmnet - T246779 [production]
14:42 <vgutierrez> replace lvs2001 with lvs2007 - T196560 [production]
14:41 <addshore> START warm cache for db1111 & db1126 for Q20-25 million T219123 (pass 2) [production]
14:29 <vgutierrez> update puppet compiler facts [production]