2251-2300 of 10000 results (43ms)
2021-05-10 §
05:46 <marostegui@cumin1001> dbctl commit (dc=all): 'db1129 (re)pooling @ 25%: Repool db1129', diff saved to https://phabricator.wikimedia.org/P15872 and previous config saved to /var/cache/conftool/dbconfig/20210510-054610-root.json [production]
05:13 <marostegui@cumin1001> dbctl commit (dc=all): 'Remove db1082 from dbctl T281794', diff saved to https://phabricator.wikimedia.org/P15871 and previous config saved to /var/cache/conftool/dbconfig/20210510-051334-marostegui.json [production]
05:07 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1129 for schema change', diff saved to https://phabricator.wikimedia.org/P15870 and previous config saved to /var/cache/conftool/dbconfig/20210510-050727-marostegui.json [production]
2021-05-09 §
21:44 <legoktm> restarted mailman3 again (T282348) pymysql.err.InternalError: (1205, 'Lock wait timeout exceeded; try restarting transaction') [production]
18:28 <legoktm> systemctl restart mailman3, bounce runner died again (T282348) [production]
10:52 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 180 days, 0:00:00 on cloudmetrics1002.eqiad.wmnet with reason: T275605 [production]
10:52 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime for 180 days, 0:00:00 on cloudmetrics1002.eqiad.wmnet with reason: T275605 [production]
09:16 <legoktm> mailman3 live hacked patch at https://phabricator.wikimedia.org/T282348#7072358 to fix bounce queue [production]
06:21 <legoktm> restarting mailman3 service, bounce runner died [production]
04:27 <Amir1> starting upgrade of batch H of mailing lists (T280322) [production]
2021-05-08 §
17:17 <Amir1> starting upgrade of batch G of mailing lists (T280322) [production]
2021-05-07 §
21:40 <legoktm> deleted education@ from MM3, didn't import properly [production]
21:35 <legoktm> deleted festivalsommer-teilnehmer from MM3, didn't import properly [production]
21:33 <legoktm> fixed owner for wdqs-gui-build list [production]
19:48 <pt1979@cumin2001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
19:42 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
18:55 <legoktm> deleted daily-article-l from mailman3 after failed import [production]
18:33 <brennen@deploy1002> rebuilt and synchronized wikiversions files: all wikis to 1.37.0-wmf.4 [production]
18:28 <brennen@deploy1002> Synchronized php: group1 wikis to 1.37.0-wmf.4 (duration: 01m 07s) [production]
18:27 <brennen@deploy1002> rebuilt and synchronized wikiversions files: group1 wikis to 1.37.0-wmf.4 [production]
18:23 <brennen> 1.37.0-wmf.4 train status (T281145): blockers appear resolved, going ahead in the interest of not having a split deploy over weekend [production]
17:50 <brennen@deploy1002> Synchronized php-1.37.0-wmf.4/includes/cache/LinkBatch.php: Backport: [[gerrit:685901|LinkBatch: skip bad input (T282180 T282070)]] (duration: 01m 06s) [production]
17:25 <andrew@deploy1002> Finished deploy [horizon/deploy@20f479e]: updated trove -> codfw1dev (duration: 01m 55s) [production]
17:23 <andrew@deploy1002> Started deploy [horizon/deploy@20f479e]: updated trove -> codfw1dev [production]
15:10 <andrew@deploy1002> Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 01m 24s) [production]
15:08 <andrew@deploy1002> Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev [production]
15:03 <andrew@deploy1002> Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 01m 11s) [production]
15:02 <andrew@deploy1002> Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev [production]
15:02 <andrew@deploy1002> Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 01m 26s) [production]
15:00 <andrew@deploy1002> Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev [production]
15:00 <andrew@deploy1002> Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 01m 29s) [production]
14:58 <andrew@deploy1002> Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev [production]
14:57 <andrew@deploy1002> Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 01m 22s) [production]
14:56 <andrew@deploy1002> Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev [production]
14:41 <bblack@cumin1001> conftool action : set/pooled=yes; selector: name=cp203[34].codfw.wmnet [production]
14:40 <andrew@deploy1002> Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 01m 19s) [production]
14:38 <andrew@deploy1002> Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev [production]
14:38 <andrew@deploy1002> Finished deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev (duration: 00m 50s) [production]
14:37 <andrew@deploy1002> Started deploy [horizon/deploy@71f273c]: updated trove -> codfw1dev [production]
13:04 <Urbanecm> Start server-side upload for 1 video file (T281927) [production]
12:19 <kormat@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 100%: reimaged to buster T280751', diff saved to https://phabricator.wikimedia.org/P15856 and previous config saved to /var/cache/conftool/dbconfig/20210507-121908-kormat.json [production]
12:04 <kormat@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 75%: reimaged to buster T280751', diff saved to https://phabricator.wikimedia.org/P15855 and previous config saved to /var/cache/conftool/dbconfig/20210507-120404-kormat.json [production]
11:49 <kormat@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 50%: reimaged to buster T280751', diff saved to https://phabricator.wikimedia.org/P15854 and previous config saved to /var/cache/conftool/dbconfig/20210507-114859-kormat.json [production]
11:33 <kormat@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 25%: reimaged to buster T280751', diff saved to https://phabricator.wikimedia.org/P15853 and previous config saved to /var/cache/conftool/dbconfig/20210507-113355-kormat.json [production]
09:55 <dcausse> depooling wdqs1012 T280382, T282222 [production]
09:44 <vgutierrez> Enforce Puppet Internal CA validation on trafficserver@codfw - T281673 [production]
08:50 <jmm@puppetmaster1001> conftool action : set/pooled=yes; selector: name=ldap-replica2005.wikimedia.org [production]
08:19 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2057.codfw.wmnet [production]
08:15 <vgutierrez> Enforce Puppet Internal CA validation on trafficserver@eqsin - T281673 [production]
08:10 <filippo@cumin1001> START - Cookbook sre.hosts.reboot-single for host ms-be2057.codfw.wmnet [production]