2021-02-04
|
07:07 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 2%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14184 and previous config saved to /var/cache/conftool/dbconfig/20210204-070726-root.json |
[production] |
07:00 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1137 (re)pooling @ 10%: Repool db1137 after daemon restart', diff saved to https://phabricator.wikimedia.org/P14183 and previous config saved to /var/cache/conftool/dbconfig/20210204-070047-root.json |
[production] |
06:45 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1137 (re)pooling @ 5%: Repool db1137 after daemon restart', diff saved to https://phabricator.wikimedia.org/P14182 and previous config saved to /var/cache/conftool/dbconfig/20210204-064544-root.json |
[production] |
06:42 |
<marostegui> |
Restart mysql on db1137 |
[production] |
06:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1137 T266483', diff saved to https://phabricator.wikimedia.org/P14181 and previous config saved to /var/cache/conftool/dbconfig/20210204-064157-marostegui.json |
[production] |
06:30 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 1%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14180 and previous config saved to /var/cache/conftool/dbconfig/20210204-063033-root.json |
[production] |
06:28 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Add db1173 to dbctl - depooled T258361', diff saved to https://phabricator.wikimedia.org/P14179 and previous config saved to /var/cache/conftool/dbconfig/20210204-062836-marostegui.json |
[production] |
02:02 |
<legoktm@deploy1001> |
Synchronized logos/config.yaml: Update and recompress logos for nowiki, cawiki, fiwiki, ukwiki, cswiki, huwiki, trwiki (2/2) (duration: 01m 06s) |
[production] |
02:00 |
<legoktm@deploy1001> |
Synchronized static/images/project-logos/: Update and recompress logos for nowiki, cawiki, fiwiki, ukwiki, cswiki, huwiki, trwiki (1/2) (duration: 01m 10s) |
[production] |
01:15 |
<ebernhardson@deploy1001> |
Finished deploy [wikimedia/discovery/analytics@4b4872d]: transfer_to_es: Increase timeout waiting for source data to three hours (duration: 01m 16s) |
[production] |
01:13 |
<ebernhardson@deploy1001> |
Started deploy [wikimedia/discovery/analytics@4b4872d]: transfer_to_es: Increase timeout waiting for source data to three hours |
[production] |
01:04 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1310.eqiad.wmnet |
[production] |
00:55 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1318.eqiad.wmnet |
[production] |
00:51 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1310.eqiad.wmnet |
[production] |
00:44 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1318.eqiad.wmnet |
[production] |
00:22 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2279.codfw.wmnet |
[production] |
00:19 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2280.codfw.wmnet |
[production] |
00:17 |
<eileen> |
civicrm revision changed from dfb2ea2148 to 1e9a86dd6e, config revision is 01ea3062f4 |
[production] |
00:12 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2279.codfw.wmnet |
[production] |
00:11 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2280.codfw.wmnet |
[production] |
00:05 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1310.eqiad.wmnet with reason: REIMAGE |
[production] |
00:03 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1310.eqiad.wmnet with reason: REIMAGE |
[production] |
2021-02-03
|
23:59 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1318.eqiad.wmnet with reason: REIMAGE |
[production] |
23:56 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1318.eqiad.wmnet with reason: REIMAGE |
[production] |
23:50 |
<mutante> |
installservers: replacing squid proxy logrotate cron with systemd timer |
[production] |
23:50 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2279.codfw.wmnet with reason: REIMAGE |
[production] |
23:48 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2280.codfw.wmnet with reason: REIMAGE |
[production] |
23:48 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2279.codfw.wmnet with reason: REIMAGE |
[production] |
23:46 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2280.codfw.wmnet with reason: REIMAGE |
[production] |
22:53 |
<razzi@cumin1001> |
END (PASS) - Cookbook sre.kafka.reboot-workers (exit_code=0) for Kafka test cluster: Reboot kafka nodes - razzi@cumin1001 |
[production] |
22:06 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netbox1001.wikimedia.org |
[production] |
21:53 |
<crusnov@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host netbox1001.wikimedia.org |
[production] |
21:53 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netboxdb1001.eqiad.wmnet |
[production] |
21:46 |
<crusnov@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host netboxdb1001.eqiad.wmnet |
[production] |
21:44 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netbox2001.wikimedia.org |
[production] |
21:40 |
<crusnov@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host netbox2001.wikimedia.org |
[production] |
21:39 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netboxdb2001.codfw.wmnet |
[production] |
21:34 |
<crusnov@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host netboxdb2001.codfw.wmnet |
[production] |
21:33 |
<chaomodus> |
rebooting Netbox cluster |
[production] |
21:05 |
<razzi@cumin1001> |
START - Cookbook sre.kafka.reboot-workers for Kafka test cluster: Reboot kafka nodes - razzi@cumin1001 |
[production] |
20:34 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
20:24 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
20:03 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1334.eqiad.wmnet |
[production] |
20:01 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1334.eqiad.wmnet |
[production] |
19:34 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2281.codfw.wmnet |
[production] |
19:31 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2282.codfw.wmnet |
[production] |
19:21 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2281.codfw.wmnet |
[production] |
19:17 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2282.codfw.wmnet |
[production] |
19:12 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1334.eqiad.wmnet with reason: REIMAGE |
[production] |
19:10 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1334.eqiad.wmnet with reason: REIMAGE |
[production] |