2019-02-25
|
15:20 |
<vgutierrez> |
shutting down certcentral VMs for decommission - T207389 |
[production] |
15:18 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Increase API traffic for db1085 after MySQL upgrade (duration: 00m 45s) |
[production] |
15:04 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Increase traffic for db1085 after MySQL upgrade (duration: 00m 45s) |
[production] |
14:50 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Increase traffic for db1085 after MySQL upgrade (duration: 00m 45s) |
[production] |
14:49 |
<jiji@deploy1001> |
Finished deploy [3d2png/deploy@ca39432]: (no justification provided) (duration: 00m 15s) |
[production] |
14:49 |
<jiji@deploy1001> |
Started deploy [3d2png/deploy@ca39432]: (no justification provided) |
[production] |
14:47 |
<jiji@deploy1001> |
deploy aborted: (no justification provided) (duration: 00m 19s) |
[production] |
14:46 |
<jiji@deploy1001> |
Started deploy [3d2png/deploy@ca39432]: (no justification provided) |
[production] |
14:32 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Slowly repool into API db1085 after MySQL upgrade (duration: 00m 45s) |
[production] |
14:15 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Slowly repool db1085 after MySQL upgrade (duration: 00m 45s) |
[production] |
14:04 |
<marostegui> |
Stop MySQL on db1085 for MySQL upgrade |
[production] |
13:55 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Depool db1085 for MySQL upgrade and schema change (duration: 00m 46s) |
[production] |
13:39 |
<hashar> |
Rebuilding some CI Docker images using PHP sury.org to switch the sury.org component from jessie to stretch ( https://gerrit.wikimedia.org/r/#/c/integration/config/+/492666/ ) |
[releng] |
13:32 |
<akosiaris> |
upgrade etherpad-lite to 1.7.5 |
[production] |
13:11 |
<chicocvenancio> |
PAWS: Stopped AABot notebook pod T217010 |
[tools] |
12:54 |
<chicocvenancio> |
PAWS: Restarted Criscod notebook pod T217010 |
[tools] |
12:38 |
<gilles@deploy1001> |
Finished deploy [3d2png/deploy@ca39432]: (no justification provided) (duration: 00m 07s) |
[production] |
12:38 |
<gilles@deploy1001> |
Started deploy [3d2png/deploy@ca39432]: (no justification provided) |
[production] |
12:27 |
<gilles@deploy1001> |
Finished deploy [3d2png/deploy@ca39432]: (no justification provided) (duration: 00m 05s) |
[production] |
12:27 |
<gilles@deploy1001> |
Started deploy [3d2png/deploy@ca39432]: (no justification provided) |
[production] |
12:22 |
<gilles@deploy1001> |
Finished deploy [3d2png/deploy@ca39432]: (no justification provided) (duration: 01m 15s) |
[production] |
12:21 |
<gilles@deploy1001> |
Started deploy [3d2png/deploy@ca39432]: (no justification provided) |
[production] |
12:21 |
<chicocvenancio> |
PAWS: killed proxy and hub pods to try to get the proxy to see routes to open notebook servers, to no avail. Restarted BernhardHumm's notebook pod T217010 |
[tools] |
12:19 |
<gtirloni> |
deleted local crontab on tools-bastion-03 (T217019) |
[tools.wikiloves] |
11:49 |
<moritzm> |
rolling out intel-microcode 3.20180807a.2 on all jessie/stretch servers, tests on a number of previously unsupported servers with Westmere CPU were successful and I've verified that all other microcode files are identical compared to the current 3.20180807a.1 microcode |
[production] |
11:19 |
<jijiki> |
Reimaging thumbor1001 - T214597 |
[production] |
10:40 |
<jdrewniak@deploy1001> |
Synchronized portals: Wikimedia Portals Update: [[gerrit:492636| Bumping portals to master (T128546, T202497)]] (duration: 00m 46s) |
[production] |
10:39 |
<jdrewniak@deploy1001> |
Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:492636| Bumping portals to master (T128546, T202497)]] (duration: 00m 46s) |
[production] |
10:32 |
<gtirloni> |
restarted nfsd on labstore1004 |
[admin] |
10:31 |
<gtirloni> |
labstore1004: restarted nfsd and killed stuck rpc.mountd.real processes (T216988) |
[production] |
10:16 |
<jijiki> |
Depooling thumbor1001 to reimage - T214597 |
[production] |
09:54 |
<marostegui> |
Deploy schema change on db1074, this will generate lag on labsdb:s2 - T187295 |
[production] |
09:50 |
<gtirloni> |
rebooted tools-sgeexec-09{16,22,40} (T216988) |
[tools] |
09:41 |
<gtirloni> |
rebooted tools-sgeexec-09{16,22,40} |
[tools] |
09:31 |
<gtirloni> |
commented out cronjobs, stopped webservices and truncated Worker*.err files (T216988) |
[tools.iabot] |
09:07 |
<marostegui@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Increase ParserCache TTL from 24 days to 30 - T210992 (duration: 00m 46s) |
[production] |
08:52 |
<marostegui> |
Deploy schema change on s2 on codfw master - lag will happen on s2 codfw - T187295 |
[production] |
08:49 |
<_joe_> |
generating mcrouter certificate for mw2151 T192457 |
[production] |
08:37 |
<zhuyifei1999_> |
uncordon tools-worker-1015.tools.eqiad.wmflabs |
[tools] |
08:34 |
<legoktm> |
hard rebooted tools-worker-1015 via horizon |
[tools] |
07:48 |
<zhuyifei1999_> |
systemd stuck in D state. :( |
[tools] |
07:44 |
<zhuyifei1999_> |
I saved dmesg and process list to a few files in /root if that helps debugging |
[tools] |
07:43 |
<zhuyifei1999_> |
D states are not responding to SIGKILL. Will reboot. |
[tools] |
07:37 |
<zhuyifei1999_> |
tools-worker-1015.tools.eqiad.wmflabs having severe NFS issues (all NFS accessing processes are stuck in D state). Draining. |
[tools] |
07:09 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Fully repool db1104 after MySQL upgrade (duration: 00m 45s) |
[production] |
06:28 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Repool db1104 in API after MySQL upgrade (duration: 00m 45s) |
[production] |
06:13 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Slowly repool db1104 after MySQL upgrade (duration: 00m 45s) |
[production] |
06:02 |
<marostegui> |
Stop MySQL on db1104 for MySQL upgrade |
[production] |
06:02 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Depool db1104 for MySQL upgrade (duration: 00m 50s) |
[production] |