2020-05-19 §
13:09 <jayme> updated helm: 2.16.7-1 -> 2.16.7-2 on deploy[1,2]001 and contint[1,2]001 [production]
13:09 <elukey@cumin1001> START - Cookbook sre.ganeti.makevm [production]
13:03 <kormat@cumin1001> dbctl commit (dc=all): 'Pool db2136 into s4 T252985', diff saved to https://phabricator.wikimedia.org/P11233 and previous config saved to /var/cache/conftool/dbconfig/20200519-130313-kormat.json [production]
12:40 <ariel@deploy1001> Finished deploy [dumps/dumps@a329605]: make page content fixup script move inprog files into place if good (duration: 00m 04s) [production]
12:40 <ariel@deploy1001> Started deploy [dumps/dumps@a329605]: make page content fixup script move inprog files into place if good [production]
12:37 <jayme> imported helm 2.16.7-2 to main for buster-wikimedia, stretch-wikimedia, jessie-wikimedia [production]
12:17 <hnowlan@cumin1001> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) [production]
11:51 <jynus> starting backups of es1, es2, es3 on eqiad into backup1002 [production]
11:41 <jynus@cumin1001> dbctl commit (dc=all): 'Depool es1018, es1015, es1019', diff saved to https://phabricator.wikimedia.org/P11232 and previous config saved to /var/cache/conftool/dbconfig/20200519-114148-jynus.json [production]
11:12 <marostegui> Deploy schema change on db2124 (frwiki, jawiki, ruwiki) T238966 [production]
10:34 <mutante> releases2001 - restarted failed jenkins [production]
10:33 <mutante> releases2001 - Failed to restart jenkins.service: The name org.freedesktop.PolicyKit1 was not provided by any .service files [production]
10:32 <volans> flushed all Netbox caches (manage.py invalidate all) - T253091 [production]
10:29 <volans> start Netbox restore - T253091 [production]
10:18 <jayme@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'mathoid' for release 'staging' . [production]
10:13 <akosiaris> upgrade etherpad-lite to 1.8.4 on etherpad1002 [production]
09:58 <hnowlan> roll-restart of eqiad restbase hosts for java security updates [production]
09:58 <hnowlan@cumin1001> START - Cookbook sre.cassandra.roll-restart [production]
09:55 <jayme@deploy1001> helmfile [EQIAD] Ran 'sync' command on namespace 'mathoid' for release 'production' . [production]
09:55 <jayme@deploy1001> helmfile [CODFW] Ran 'sync' command on namespace 'mathoid' for release 'canary' . [production]
09:55 <jayme@deploy1001> helmfile [CODFW] Ran 'sync' command on namespace 'mathoid' for release 'production' . [production]
09:54 <jayme@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'mathoid' for release 'staging' . [production]
09:10 <godog> eqiad-prod: decom ms-be101[678] - T252008 [production]
08:07 <XioNoX> Push 596597: BGP: standardize fixed part of IX4/IX6 groups - eqsin [production]
08:04 <XioNoX> Push 596597: BGP: standardize fixed part of IX4/IX6 groups - esams [production]
08:01 <XioNoX> Push 596597: BGP: standardize fixed part of IX4/IX6 groups - eqiad [production]
07:55 <volker-e@deploy1001> Finished deploy [design/style-guide@37c67dd]: Deploy design/style-guide: (duration: 00m 06s) [production]
07:54 <volker-e@deploy1001> Started deploy [design/style-guide@37c67dd]: Deploy design/style-guide: [production]
07:52 <XioNoX> Push 596597: BGP: standardize fixed part of IX4/IX6 groups - *dfw [production]
07:49 <XioNoX> Push 596597: BGP: standardize fixed part of IX4/IX6 groups - ulsfo [production]
07:45 <vgutierrez> rolling upgrade to trafficserver 8.0.7-1wm10 with puppet disabled on cp hosts [production]
07:09 <jynus> starting es4 & es5 eqiad backups with low concurrency [production]
06:35 <elukey@cumin1001> END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) [production]
06:29 <elukey@cumin1001> START - Cookbook sre.zookeeper.roll-restart-zookeeper [production]
06:24 <elukey@cumin1001> END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) [production]
06:17 <elukey@cumin1001> START - Cookbook sre.zookeeper.roll-restart-zookeeper [production]
05:57 <volker-e@deploy1001> Finished deploy [design/style-guide@7bfbd2a]: Deploy design/style-guide: (duration: 00m 06s) [production]
05:57 <volker-e@deploy1001> Started deploy [design/style-guide@7bfbd2a]: Deploy design/style-guide: [production]
05:03 <marostegui@cumin1001> dbctl commit (dc=all): 'Set s2 and s8 as read-only=off for maintenance T251981', diff saved to https://phabricator.wikimedia.org/P11227 and previous config saved to /var/cache/conftool/dbconfig/20200519-050346-marostegui.json [production]
05:00 <marostegui@cumin1001> dbctl commit (dc=all): 'Set s2 and s8 as read-only for maintenance T251981', diff saved to https://phabricator.wikimedia.org/P11226 and previous config saved to /var/cache/conftool/dbconfig/20200519-050043-marostegui.json [production]
04:27 <marostegui> Repool labsdb1011 T249188 [production]
03:29 <volker-e@deploy1001> Finished deploy [design/style-guide@4b4bc51]: Deploy design/style-guide: (duration: 00m 07s) [production]
03:28 <volker-e@deploy1001> Started deploy [design/style-guide@4b4bc51]: Deploy design/style-guide: [production]
2020-05-18 §
23:50 <pt1979@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
23:47 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime [production]
23:25 <pt1979@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
23:23 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime [production]
23:12 <ryankemper> Restarted `wdqs-updater` across all wdqs nodes and restarted `wdqs-categories` across all nodes except 1010 (test wdqs server) and 1009 (automated deployment server) [production]
22:55 <Krinkle> Clear module_deps on dewiki (group2, old mw version, s5) to monitor regeneration [production]
22:48 <Krinkle> Clear module_deps on group0 (mostly s3) to monitor regeneration [production]