2021-01-22
17:55 <dzahn@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2364.codfw.wmnet with reason: REIMAGE [production]
17:54 <dzahn@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2360.codfw.wmnet with reason: REIMAGE [production]
17:53 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2358.codfw.wmnet with reason: REIMAGE [production]
17:52 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2362.codfw.wmnet with reason: REIMAGE [production]
17:52 <mutante> releases2001 - create new partition table with fdisk, make ext4 filesystem on /dev/vdb1 [production]
17:50 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2364.codfw.wmnet with reason: REIMAGE [production]
17:50 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2362.codfw.wmnet with reason: REIMAGE [production]
17:49 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2360.codfw.wmnet with reason: REIMAGE [production]
17:49 <ppchelko@deploy1001> Finished deploy [restbase/deploy@e54225d]: T270411 T270415 T270281 T270277 (duration: 65m 37s) [production]
17:49 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2358.codfw.wmnet with reason: REIMAGE [production]
17:29 <mforns@deploy1001> Finished deploy [analytics/refinery@eea071d] (thin): Extra bug-fix train THIN [analytics/refinery@eea071def90a8a856b1e04dda23b77a850134253] (duration: 00m 07s) [production]
17:29 <mforns@deploy1001> Started deploy [analytics/refinery@eea071d] (thin): Extra bug-fix train THIN [analytics/refinery@eea071def90a8a856b1e04dda23b77a850134253] [production]
17:23 <mforns@deploy1001> Finished deploy [analytics/refinery@eea071d]: Extra bug-fix train [analytics/refinery@eea071def90a8a856b1e04dda23b77a850134253] (duration: 10m 03s) [production]
17:13 <mforns@deploy1001> Started deploy [analytics/refinery@eea071d]: Extra bug-fix train [analytics/refinery@eea071def90a8a856b1e04dda23b77a850134253] [production]
16:44 <ppchelko@deploy1001> Started deploy [restbase/deploy@e54225d]: T270411 T270415 T270281 T270277 [production]
16:40 <cmjohnson1> replacing optics/fiber pfw3a-eqiad:xe-0/0/17 and fasw-c1a-eqiad:xe-0/2/0 T271295 [production]
16:19 <jynus> restart of backup source hosts on codfw T271913 [production]
15:54 <otto@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'eventstreams-internal' for release 'main' . [production]
15:40 <moritzm> installing puppetboard1002 [production]
15:24 <moritzm> installing puppetboard2002 [production]
13:44 <kormat@cumin1001> dbctl commit (dc=all): 'db1149 (re)pooling @ 100%: Reboot T272255', diff saved to https://phabricator.wikimedia.org/P13932 and previous config saved to /var/cache/conftool/dbconfig/20210122-134444-kormat.json [production]
13:33 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1121', diff saved to https://phabricator.wikimedia.org/P13931 and previous config saved to /var/cache/conftool/dbconfig/20210122-133341-marostegui.json [production]
13:31 <marostegui> Stop replication on db1121 [production]
13:30 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1121', diff saved to https://phabricator.wikimedia.org/P13930 and previous config saved to /var/cache/conftool/dbconfig/20210122-133044-marostegui.json [production]
13:29 <kormat@cumin1001> dbctl commit (dc=all): 'db1149 (re)pooling @ 75%: Reboot T272255', diff saved to https://phabricator.wikimedia.org/P13929 and previous config saved to /var/cache/conftool/dbconfig/20210122-132939-kormat.json [production]
13:21 <jmm@cumin2001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host puppetboard2002.codfw.wmnet [production]
13:20 <kormat@cumin1001> dbctl commit (dc=all): 'es1023 (re)pooling @ 100%: Reboot T272121', diff saved to https://phabricator.wikimedia.org/P13927 and previous config saved to /var/cache/conftool/dbconfig/20210122-132028-kormat.json [production]
13:14 <kormat@cumin1001> dbctl commit (dc=all): 'db1149 (re)pooling @ 50%: Reboot T272255', diff saved to https://phabricator.wikimedia.org/P13926 and previous config saved to /var/cache/conftool/dbconfig/20210122-131436-kormat.json [production]
13:05 <kormat@cumin1001> dbctl commit (dc=all): 'es1023 (re)pooling @ 75%: Reboot T272121', diff saved to https://phabricator.wikimedia.org/P13925 and previous config saved to /var/cache/conftool/dbconfig/20210122-130525-kormat.json [production]
12:59 <kormat@cumin1001> dbctl commit (dc=all): 'db1149 (re)pooling @ 25%: Reboot T272255', diff saved to https://phabricator.wikimedia.org/P13924 and previous config saved to /var/cache/conftool/dbconfig/20210122-125932-kormat.json [production]
12:54 <jmm@cumin2001> START - Cookbook sre.ganeti.makevm for new host puppetboard2002.codfw.wmnet [production]
12:53 <jmm@cumin2001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host puppetboard1002.eqiad.wmnet [production]
12:50 <kormat@cumin1001> dbctl commit (dc=all): 'es1023 (re)pooling @ 50%: Reboot T272121', diff saved to https://phabricator.wikimedia.org/P13923 and previous config saved to /var/cache/conftool/dbconfig/20210122-125021-kormat.json [production]
12:47 <kormat@cumin1001> dbctl commit (dc=all): 'db1149 depooling: Rebooting for T272255', diff saved to https://phabricator.wikimedia.org/P13922 and previous config saved to /var/cache/conftool/dbconfig/20210122-124748-kormat.json [production]
12:47 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:30:00 on db1149.eqiad.wmnet with reason: Rebooting for T272255 [production]
12:47 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 1:30:00 on db1149.eqiad.wmnet with reason: Rebooting for T272255 [production]
12:43 <kormat@cumin1001> dbctl commit (dc=all): 'Remove db1110 from api group T272255', diff saved to https://phabricator.wikimedia.org/P13921 and previous config saved to /var/cache/conftool/dbconfig/20210122-124310-kormat.json [production]
12:38 <jmm@cumin2001> START - Cookbook sre.ganeti.makevm for new host puppetboard1002.eqiad.wmnet [production]
12:38 <kormat@cumin1001> dbctl commit (dc=all): 'Remove db1127 from api group T272255', diff saved to https://phabricator.wikimedia.org/P13920 and previous config saved to /var/cache/conftool/dbconfig/20210122-123832-kormat.json [production]
12:35 <kormat@cumin1001> dbctl commit (dc=all): 'es1023 (re)pooling @ 25%: Reboot T272121', diff saved to https://phabricator.wikimedia.org/P13919 and previous config saved to /var/cache/conftool/dbconfig/20210122-123518-kormat.json [production]
12:33 <volker-e@deploy1001> Finished deploy [design/style-guide@9a811b8]: Deploy design/style-guide: 9a811b8 Add Language selectors to component overview Sketch document (#424) (duration: 00m 07s) [production]
12:33 <volker-e@deploy1001> Started deploy [design/style-guide@9a811b8]: Deploy design/style-guide: 9a811b8 Add Language selectors to component overview Sketch document (#424) [production]
12:10 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker[1135,1137].eqiad.wmnet [production]
12:08 <elukey@cumin1001> START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker[1135,1137].eqiad.wmnet [production]
12:00 <kormat@cumin1001> dbctl commit (dc=all): 'db1141 (re)pooling @ 100%: Reboot T272255', diff saved to https://phabricator.wikimedia.org/P13918 and previous config saved to /var/cache/conftool/dbconfig/20210122-120011-kormat.json [production]
11:54 <hnowlan@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'similar-users' for release 'main' . [production]
11:51 <kormat@cumin1001> dbctl commit (dc=all): 'db1136 (re)pooling @ 100%: Reboot T272255', diff saved to https://phabricator.wikimedia.org/P13917 and previous config saved to /var/cache/conftool/dbconfig/20210122-115113-kormat.json [production]
11:50 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on es1023.eqiad.wmnet with reason: Extended reboot for T272121 [production]
11:50 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on es1023.eqiad.wmnet with reason: Extended reboot for T272121 [production]
11:46 <kormat@cumin1001> dbctl commit (dc=all): 'db1134 (re)pooling @ 100%: Reboot T272255', diff saved to https://phabricator.wikimedia.org/P13916 and previous config saved to /var/cache/conftool/dbconfig/20210122-114642-kormat.json [production]