2021-02-04
07:52 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 7%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14190 and previous config saved to /var/cache/conftool/dbconfig/20210204-075236-root.json [production]
07:45 <marostegui@cumin1001> dbctl commit (dc=all): 'db1137 (re)pooling @ 50%: Repool db1137 after daemon restart', diff saved to https://phabricator.wikimedia.org/P14189 and previous config saved to /var/cache/conftool/dbconfig/20210204-074558-root.json [production]
07:37 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 5%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14188 and previous config saved to /var/cache/conftool/dbconfig/20210204-073733-root.json [production]
07:30 <marostegui@cumin1001> dbctl commit (dc=all): 'db1137 (re)pooling @ 25%: Repool db1137 after daemon restart', diff saved to https://phabricator.wikimedia.org/P14187 and previous config saved to /var/cache/conftool/dbconfig/20210204-073054-root.json [production]
07:22 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 3%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14186 and previous config saved to /var/cache/conftool/dbconfig/20210204-072229-root.json [production]
07:16 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1117.eqiad.wmnet [production]
07:15 <marostegui@cumin1001> dbctl commit (dc=all): 'db1137 (re)pooling @ 20%: Repool db1137 after daemon restart', diff saved to https://phabricator.wikimedia.org/P14185 and previous config saved to /var/cache/conftool/dbconfig/20210204-071551-root.json [production]
07:13 <elukey@cumin1001> START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1117.eqiad.wmnet [production]
07:07 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 2%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14184 and previous config saved to /var/cache/conftool/dbconfig/20210204-070726-root.json [production]
07:00 <marostegui@cumin1001> dbctl commit (dc=all): 'db1137 (re)pooling @ 10%: Repool db1137 after daemon restart', diff saved to https://phabricator.wikimedia.org/P14183 and previous config saved to /var/cache/conftool/dbconfig/20210204-070047-root.json [production]
06:45 <marostegui@cumin1001> dbctl commit (dc=all): 'db1137 (re)pooling @ 5%: Repool db1137 after daemon restart', diff saved to https://phabricator.wikimedia.org/P14182 and previous config saved to /var/cache/conftool/dbconfig/20210204-064544-root.json [production]
06:42 <marostegui> Restart mysql on db1137 [production]
06:41 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1137 T266483', diff saved to https://phabricator.wikimedia.org/P14181 and previous config saved to /var/cache/conftool/dbconfig/20210204-064157-marostegui.json [production]
06:30 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 1%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14180 and previous config saved to /var/cache/conftool/dbconfig/20210204-063033-root.json [production]
06:28 <marostegui@cumin1001> dbctl commit (dc=all): 'Add db1173 to dbctl - depooled T258361', diff saved to https://phabricator.wikimedia.org/P14179 and previous config saved to /var/cache/conftool/dbconfig/20210204-062836-marostegui.json [production]
02:02 <legoktm@deploy1001> Synchronized logos/config.yaml: Update and recompress logos for nowiki, cawiki, fiwiki, ukwiki, cswiki, huwiki, trwiki (2/2) (duration: 01m 06s) [production]
02:00 <legoktm@deploy1001> Synchronized static/images/project-logos/: Update and recompress logos for nowiki, cawiki, fiwiki, ukwiki, cswiki, huwiki, trwiki (1/2) (duration: 01m 10s) [production]
01:15 <ebernhardson@deploy1001> Finished deploy [wikimedia/discovery/analytics@4b4872d]: transfer_to_es: Increase timeout waiting for source data to three hours (duration: 01m 16s) [production]
01:13 <ebernhardson@deploy1001> Started deploy [wikimedia/discovery/analytics@4b4872d]: transfer_to_es: Increase timeout waiting for source data to three hours [production]
01:04 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1310.eqiad.wmnet [production]
00:55 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1318.eqiad.wmnet [production]
00:51 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1310.eqiad.wmnet [production]
00:44 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1318.eqiad.wmnet [production]
00:22 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw2279.codfw.wmnet [production]
00:19 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw2280.codfw.wmnet [production]
00:17 <eileen> civicrm revision changed from dfb2ea2148 to 1e9a86dd6e, config revision is 01ea3062f4 [production]
00:12 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw2279.codfw.wmnet [production]
00:11 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw2280.codfw.wmnet [production]
00:05 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1310.eqiad.wmnet with reason: REIMAGE [production]
00:03 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1310.eqiad.wmnet with reason: REIMAGE [production]
2021-02-03
23:59 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1318.eqiad.wmnet with reason: REIMAGE [production]
23:56 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1318.eqiad.wmnet with reason: REIMAGE [production]
23:50 <mutante> installservers: replacing squid proxy logrotate cron with systemd timer [production]
23:50 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2279.codfw.wmnet with reason: REIMAGE [production]
23:48 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2280.codfw.wmnet with reason: REIMAGE [production]
23:48 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2279.codfw.wmnet with reason: REIMAGE [production]
23:46 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2280.codfw.wmnet with reason: REIMAGE [production]
22:53 <razzi@cumin1001> END (PASS) - Cookbook sre.kafka.reboot-workers (exit_code=0) for Kafka test cluster: Reboot kafka nodes - razzi@cumin1001 [production]
22:06 <crusnov@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netbox1001.wikimedia.org [production]
21:53 <crusnov@cumin1001> START - Cookbook sre.hosts.reboot-single for host netbox1001.wikimedia.org [production]
21:53 <crusnov@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netboxdb1001.eqiad.wmnet [production]
21:46 <crusnov@cumin1001> START - Cookbook sre.hosts.reboot-single for host netboxdb1001.eqiad.wmnet [production]
21:44 <crusnov@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netbox2001.wikimedia.org [production]
21:40 <crusnov@cumin1001> START - Cookbook sre.hosts.reboot-single for host netbox2001.wikimedia.org [production]
21:39 <crusnov@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netboxdb2001.codfw.wmnet [production]
21:34 <crusnov@cumin1001> START - Cookbook sre.hosts.reboot-single for host netboxdb2001.codfw.wmnet [production]
21:33 <chaomodus> rebooting Netbox cluster [production]
21:05 <razzi@cumin1001> START - Cookbook sre.kafka.reboot-workers for Kafka test cluster: Reboot kafka nodes - razzi@cumin1001 [production]
20:34 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:24 <cmjohnson@cumin1001> START - Cookbook sre.dns.netbox [production]