2021-05-09
|
21:44 |
<legoktm> |
restarted mailman3 again (T282348): pymysql.err.InternalError: (1205, 'Lock wait timeout exceeded; try restarting transaction') |
[production] |
18:28 |
<legoktm> |
systemctl restart mailman3, bounce runner died again (T282348) |
[production] |
10:52 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 180 days, 0:00:00 on cloudmetrics1002.eqiad.wmnet with reason: T275605 |
[production] |
10:52 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime for 180 days, 0:00:00 on cloudmetrics1002.eqiad.wmnet with reason: T275605 |
[production] |
09:16 |
<legoktm> |
live-hacked mailman3 with the patch at https://phabricator.wikimedia.org/T282348#7072358 to fix bounce queue |
[production] |
06:21 |
<legoktm> |
restarting mailman3 service, bounce runner died |
[production] |
04:27 |
<Amir1> |
starting upgrade of batch H of mailing lists (T280322) |
[production] |
2021-05-07
|
21:40 |
<legoktm> |
deleted education@ from MM3, didn't import properly |
[production] |
21:35 |
<legoktm> |
deleted festivalsommer-teilnehmer from MM3, didn't import properly |
[production] |
21:33 |
<legoktm> |
fixed owner for wdqs-gui-build list |
[production] |
19:48 |
<pt1979@cumin2001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:42 |
<pt1979@cumin2001> |
START - Cookbook sre.dns.netbox |
[production] |
18:55 |
<legoktm> |
deleted daily-article-l from mailman3 after failed import |
[production] |
18:33 |
<brennen@deploy1002> |
rebuilt and synchronized wikiversions files: all wikis to 1.37.0-wmf.4 |
[production] |
18:28 |
<brennen@deploy1002> |
Synchronized php: group1 wikis to 1.37.0-wmf.4 (duration: 01m 07s) |
[production] |
18:27 |
<brennen@deploy1002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.37.0-wmf.4 |
[production] |
18:23 |
<brennen> |
1.37.0-wmf.4 train status (T281145): blockers appear resolved, going ahead in the interest of not having a split deploy over weekend |
[production] |
17:50 |
<brennen@deploy1002> |
Synchronized php-1.37.0-wmf.4/includes/cache/LinkBatch.php: Backport: [[gerrit:685901|LinkBatch: skip bad input (T282180 T282070)]] (duration: 01m 06s) |
[production] |
17:25 |
<andrew@deploy1002> |
Finished deploy [horizon/deploy@20f479e]: updated trove -> codfw1dev (duration: 01m 55s) |
[production] |
17:23 |
<andrew@deploy1002> |
Started deploy [horizon/deploy@20f479e]: updated trove -> codfw1dev |
[production] |
15:10 |
<andrew@deploy1002> |
Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 01m 24s) |
[production] |
15:08 |
<andrew@deploy1002> |
Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev |
[production] |
15:03 |
<andrew@deploy1002> |
Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 01m 11s) |
[production] |
15:02 |
<andrew@deploy1002> |
Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev |
[production] |
15:02 |
<andrew@deploy1002> |
Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 01m 26s) |
[production] |
15:00 |
<andrew@deploy1002> |
Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev |
[production] |
15:00 |
<andrew@deploy1002> |
Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 01m 29s) |
[production] |
14:58 |
<andrew@deploy1002> |
Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev |
[production] |
14:57 |
<andrew@deploy1002> |
Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 01m 22s) |
[production] |
14:56 |
<andrew@deploy1002> |
Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev |
[production] |
14:41 |
<bblack@cumin1001> |
conftool action : set/pooled=yes; selector: name=cp203[34].codfw.wmnet |
[production] |
14:40 |
<andrew@deploy1002> |
Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 01m 19s) |
[production] |
14:38 |
<andrew@deploy1002> |
Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev |
[production] |
14:38 |
<andrew@deploy1002> |
Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 00m 50s) |
[production] |
14:37 |
<andrew@deploy1002> |
Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev |
[production] |
13:04 |
<Urbanecm> |
Start server-side upload for 1 video file (T281927) |
[production] |
12:19 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 100%: reimaged to buster T280751', diff saved to https://phabricator.wikimedia.org/P15856 and previous config saved to /var/cache/conftool/dbconfig/20210507-121908-kormat.json |
[production] |
12:04 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 75%: reimaged to buster T280751', diff saved to https://phabricator.wikimedia.org/P15855 and previous config saved to /var/cache/conftool/dbconfig/20210507-120404-kormat.json |
[production] |
11:49 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 50%: reimaged to buster T280751', diff saved to https://phabricator.wikimedia.org/P15854 and previous config saved to /var/cache/conftool/dbconfig/20210507-114859-kormat.json |
[production] |
11:33 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 25%: reimaged to buster T280751', diff saved to https://phabricator.wikimedia.org/P15853 and previous config saved to /var/cache/conftool/dbconfig/20210507-113355-kormat.json |
[production] |
09:55 |
<dcausse> |
depooling wdqs1012 T280382, T282222 |
[production] |
09:44 |
<vgutierrez> |
Enforce Puppet Internal CA validation on trafficserver@codfw - T281673 |
[production] |
08:50 |
<jmm@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=ldap-replica2005.wikimedia.org |
[production] |
08:19 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2057.codfw.wmnet |
[production] |