2021-04-27
ยง
|
23:58 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on snapshot1015.eqiad.wmnet with reason: REIMAGE |
[production] |
23:57 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on snapshot1013.eqiad.wmnet with reason: REIMAGE |
[production] |
23:57 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on snapshot1014.eqiad.wmnet with reason: REIMAGE |
[production] |
23:55 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on snapshot1012.eqiad.wmnet with reason: REIMAGE |
[production] |
23:54 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on snapshot1013.eqiad.wmnet with reason: REIMAGE |
[production] |
23:53 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on snapshot1011.eqiad.wmnet with reason: REIMAGE |
[production] |
23:52 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on snapshot1012.eqiad.wmnet with reason: REIMAGE |
[production] |
23:51 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on snapshot1011.eqiad.wmnet with reason: REIMAGE |
[production] |
21:07 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts rdb[2005-2006].codfw.wmnet |
[production] |
20:55 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts rdb[2005-2006].codfw.wmnet |
[production] |
20:54 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts rdb[2003-2004].codfw.wmnet |
[production] |
20:42 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts rdb[2003-2004].codfw.wmnet |
[production] |
20:32 |
<bblack> |
re-pooling codfw public traffic - T279457 |
[production] |
20:11 |
<jhuneidi@deploy1002> |
Synchronized php-1.37.0-wmf.3/includes/rcfeed/IRCColourfulRCFeedFormatter.php: Backport rcfeed: Remove reference assignment (T281226) to 1.37.0-wmf.3 (duration: 01m 12s) |
[production] |
20:08 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-main2005.codfw.wmnet with reason: REIMAGE |
[production] |
20:06 |
<herron@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-main2005.codfw.wmnet with reason: REIMAGE |
[production] |
19:44 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host people1003.eqiad.wmnet |
[production] |
19:37 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-main2004.codfw.wmnet with reason: REIMAGE |
[production] |
19:35 |
<papaul> |
powerdown ms-backup2001 for maintenance |
[production] |
19:35 |
<herron@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-main2004.codfw.wmnet with reason: REIMAGE |
[production] |
19:07 |
<papaul> |
powerdown logstash2035 for maintenance |
[production] |
19:03 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm for new host people1003.eqiad.wmnet |
[production] |
19:00 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts people1003.eqiad.wmnet |
[production] |
18:50 |
<mutante> |
people1003 - destroying VM and recreating again from scratch to test if issue of no console and no access is repeatable |
[production] |
18:50 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts people1003.eqiad.wmnet |
[production] |
18:37 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-main1005.eqiad.wmnet with reason: REIMAGE |
[production] |
18:35 |
<herron@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-main1005.eqiad.wmnet with reason: REIMAGE |
[production] |
18:33 |
<mutante> |
people1003 - rebooting, trying to get new VM to work |
[production] |
18:33 |
<Urbanecm> |
Morning B&C window done |
[production] |
18:32 |
<urbanecm@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: 91a85f2: ac770bf: Enable language in header for office and testwiki users (T280526) (duration: 01m 19s) |
[production] |
18:32 |
<bblack> |
lvs2009 - restart pybal + re-run puppet agent - T279457 |
[production] |
18:23 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
18:20 |
<bblack@cumin1001> |
conftool action : set/pooled=yes; selector: name=cp203[56].codfw.wmnet |
[production] |
18:20 |
<bblack> |
cp203[56] - repooling in etcd - T279457 |
[production] |
18:19 |
<robh@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:17 |
<robh@cumin1001> |
END (ERROR) - Cookbook sre.dns.netbox (exit_code=97) |
[production] |
18:17 |
<robh@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:16 |
<robh@cumin1001> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
18:12 |
<robh@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:11 |
<bblack> |
dns2001 - restarting bird to repool, then re-enabling puppet - T279457 |
[production] |
18:04 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
18:02 |
<pt1979@cumin2001> |
START - Cookbook sre.dns.netbox |
[production] |
18:02 |
<ejegg> |
update payments-wiki from 9a4eef1375 to 44570561f2 |
[production] |
18:00 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-main1004.eqiad.wmnet with reason: REIMAGE |
[production] |
17:58 |
<herron@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-main1004.eqiad.wmnet with reason: REIMAGE |
[production] |
17:34 |
<papaul> |
powerdown moss-fe2001 for maintenance |
[production] |
17:32 |
<robh@cumin1001> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
17:29 |
<robh@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
17:25 |
<mbsantos@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mobileapps' for release 'production' . |
[production] |
17:23 |
<mbsantos@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mobileapps' for release 'production' . |
[production] |