2021-03-04
ยง
|
13:52 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1064.eqiad.wmnet with reason: REIMAGE |
[production] |
13:50 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1063.eqiad.wmnet with reason: REIMAGE |
[production] |
13:48 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest1001.eqiad.wmnet |
[production] |
13:45 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2116', diff saved to https://phabricator.wikimedia.org/P14632 and previous config saved to /var/cache/conftool/dbconfig/20210304-134521-marostegui.json |
[production] |
13:44 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host sretest1001.eqiad.wmnet |
[production] |
13:44 |
<volans> |
uploaded spicerack_0.0.49 to apt.wikimedia.org buster-wikimedia |
[production] |
13:35 |
<moritzm> |
restarting mw canaries for libzstd update |
[production] |
13:32 |
<elukey> |
drain + reimage analytics10[63,64] to Debian Buster |
[production] |
13:29 |
<moritzm> |
installing libzstd security updates on Buster |
[production] |
13:13 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Add db2146 to dbctl T275633', diff saved to https://phabricator.wikimedia.org/P14631 and previous config saved to /var/cache/conftool/dbconfig/20210304-131301-marostegui.json |
[production] |
13:10 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1062.eqiad.wmnet with reason: REIMAGE |
[production] |
13:08 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1061.eqiad.wmnet with reason: REIMAGE |
[production] |
13:07 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1062.eqiad.wmnet with reason: REIMAGE |
[production] |
13:06 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1061.eqiad.wmnet with reason: REIMAGE |
[production] |
12:48 |
<elukey> |
drain + reimage analytics10[61,62] to Debian Buster |
[production] |
12:45 |
<jakob@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . |
[production] |
12:40 |
<mbsantos@deploy1002> |
Finished deploy [tilerator/deploy@6fcbb9f]: (no justification provided) (duration: 00m 14s) |
[production] |
12:40 |
<wmde-fisch@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:668108|Remove conflicting gadget configuration for hewiki (T276330)]] (duration: 01m 12s) |
[production] |
12:40 |
<mbsantos@deploy1002> |
Started deploy [tilerator/deploy@6fcbb9f]: (no justification provided) |
[production] |
12:34 |
<jakob@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . |
[production] |
12:11 |
<kormat@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on db1115.eqiad.wmnet,dbmonitor1001.wikimedia.org with reason: Restart db1115 to fix memory leak |
[production] |
12:11 |
<kormat@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:30:00 on db1115.eqiad.wmnet,dbmonitor1001.wikimedia.org with reason: Restart db1115 to fix memory leak |
[production] |
12:10 |
<jakob@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . |
[production] |
12:00 |
<marostegui> |
Stop mysql on db1117:3321 to clone db1159 |
[production] |
11:42 |
<jakob@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . |
[production] |
11:40 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Add db2145 to s1 (and repool db2116) - T275633', diff saved to https://phabricator.wikimedia.org/P14625 and previous config saved to /var/cache/conftool/dbconfig/20210304-114052-marostegui.json |
[production] |
11:28 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Add db2145 into dbctl depooled - T275633', diff saved to https://phabricator.wikimedia.org/P14624 and previous config saved to /var/cache/conftool/dbconfig/20210304-112848-marostegui.json |
[production] |
11:27 |
<_joe_> |
restarted redis on mc2027 to pick up the replication change |
[production] |
11:14 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1059.eqiad.wmnet with reason: REIMAGE |
[production] |
11:11 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1059.eqiad.wmnet with reason: REIMAGE |
[production] |
11:10 |
<kormat@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 6 hosts with reason: Needs fixing after T274472 |
[production] |
11:10 |
<kormat@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 6 hosts with reason: Needs fixing after T274472 |
[production] |
11:08 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1022.eqiad.wmnet |
[production] |
11:04 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1060.eqiad.wmnet with reason: REIMAGE |
[production] |
11:02 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host cloudvirt1022.eqiad.wmnet |
[production] |
11:02 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1060.eqiad.wmnet with reason: REIMAGE |
[production] |
10:40 |
<elukey> |
drain + reimage analytics1059/1060 to Debian Buster |
[production] |
10:32 |
<moritzm> |
uploaded screen 4.2.1-3+deb8u1+wmf1 to jessie-wikimedia |
[production] |
09:32 |
<elukey> |
install linux 5.10 on an-worker[1097-1101] (GPU workers) and reboot them |
[production] |
09:30 |
<kormat> |
disabling puppet on all db hosts while deploying a puppet monitoring change T275497 |
[production] |
09:19 |
<moritzm> |
uploaded udplog 1.8.5+deb10u1 to buster-wikimedia |
[production] |
08:45 |
<elukey@deploy1002> |
Finished deploy [analytics/refinery@605f8b8]: Fix for geoeditors monthly job (duration: 11m 03s) |
[production] |
08:33 |
<elukey@deploy1002> |
Started deploy [analytics/refinery@605f8b8]: Fix for geoeditors monthly job |
[production] |
07:38 |
<elukey> |
reboot an-worker1096 to pick up 5.10 kernel |
[production] |
06:25 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1088 T276025', diff saved to https://phabricator.wikimedia.org/P14622 and previous config saved to /var/cache/conftool/dbconfig/20210304-062503-marostegui.json |
[production] |
06:11 |
<marostegui> |
Stop MySQL on db2116 to clone db2145 T275633 |
[production] |
06:11 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2116 T275633', diff saved to https://phabricator.wikimedia.org/P14621 and previous config saved to /var/cache/conftool/dbconfig/20210304-061134-marostegui.json |
[production] |
05:20 |
<kart_> |
Updated apertium to 2021-03-03-170806-production (T274262) |
[production] |
05:15 |
<kartik@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'production' . |
[production] |
05:11 |
<kartik@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'production' . |
[production] |