1401-1450 of 10000 results (40ms)
2021-03-18 §
11:37 <mvolz@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'zotero' for release 'production' . [production]
11:34 <mvolz@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'zotero' for release 'production' . [production]
11:25 <mvolz@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'zotero' for release 'staging' . [production]
11:24 <urbanecm@deploy1002> Synchronized wmf-config/flaggedrevs.php: 896c9f019b17d1ad3a1589d377158ca2fb91ebaa: flaggedrevs: Disable multiple dimensions in hewikisource (duration: 01m 09s) [production]
11:20 <urbanecm@deploy1002> Synchronized php-1.36.0-wmf.35/extensions/GrowthExperiments/includes/HomepageHooks.php: 3b2aa1aa28e9d204f32ae937a84ec211137cbb2e: Remove variant C from list of valid variants (T277727) (duration: 01m 09s) [production]
11:16 <mvolz@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'citoid' for release 'production' . [production]
11:14 <mvolz@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'citoid' for release 'production' . [production]
11:11 <mvolz@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'citoid' for release 'staging' . [production]
11:11 <urbanecm@deploy1002> Synchronized wmf-config/InitialiseSettings.php: 0005676e704cad907655a4a0bca7bd2164714b1c: GrowthExperiments: set $wgGEHomepageNewAccountVariants to D only (T277727) (duration: 01m 10s) [production]
11:08 <urbanecm@deploy1002> Synchronized wmf-config/CommonSettings.php: NOOP: e7f5eac: Enable CentralAuth IRC feed in beta cluster (T277432) (duration: 01m 12s) [production]
09:13 <_joe_> hard reboot of snapshot1005 [production]
09:04 <_joe_> attempted reboot of snapshot1005, read-only filesystem and probably disks are broken beyond repair [production]
08:27 <godog> swift eqiad-prod: less weight for ms-be[1019-1026] - T272836 [production]
08:18 <akosiaris@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve1004.eqiad.wmnet with reason: REIMAGE [production]
08:16 <akosiaris@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve1004.eqiad.wmnet with reason: REIMAGE [production]
08:03 <marostegui@cumin1001> dbctl commit (dc=all): 'db1126 (re)pooling @ 100%: Slowly repool db1126', diff saved to https://phabricator.wikimedia.org/P14946 and previous config saved to /var/cache/conftool/dbconfig/20210318-080258-root.json [production]
08:02 <akosiaris> reimage ml-serve1004 to debug a docker volume_group issue [production]
07:47 <marostegui@cumin1001> dbctl commit (dc=all): 'db1126 (re)pooling @ 75%: Slowly repool db1126', diff saved to https://phabricator.wikimedia.org/P14945 and previous config saved to /var/cache/conftool/dbconfig/20210318-074754-root.json [production]
07:32 <marostegui@cumin1001> dbctl commit (dc=all): 'db1126 (re)pooling @ 50%: Slowly repool db1126', diff saved to https://phabricator.wikimedia.org/P14944 and previous config saved to /var/cache/conftool/dbconfig/20210318-073250-root.json [production]
07:20 <dcausse> depooling & restarting blazegraph on wdqs1005 [production]
07:19 <marostegui> Deploy schema change on s4 codfw master, lag will appear - T276150 T276156 [production]
07:17 <marostegui@cumin1001> dbctl commit (dc=all): 'db1126 (re)pooling @ 25%: Slowly repool db1126', diff saved to https://phabricator.wikimedia.org/P14943 and previous config saved to /var/cache/conftool/dbconfig/20210318-071747-root.json [production]
07:15 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1156.eqiad.wmnet with reason: REIMAGE [production]
07:13 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1156.eqiad.wmnet with reason: REIMAGE [production]
06:32 <marostegui@cumin1001> dbctl commit (dc=all): 'Add db1161 to dbctl, depooled T258361', diff saved to https://phabricator.wikimedia.org/P14942 and previous config saved to /var/cache/conftool/dbconfig/20210318-063241-marostegui.json [production]
06:22 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db2120', diff saved to https://phabricator.wikimedia.org/P14941 and previous config saved to /var/cache/conftool/dbconfig/20210318-062201-marostegui.json [production]
06:04 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1126 for schema change', diff saved to https://phabricator.wikimedia.org/P14940 and previous config saved to /var/cache/conftool/dbconfig/20210318-060445-marostegui.json [production]
03:46 <andrewbogott> restarting slapd on seaborgium, serpens, and r-o ldap replicas (we're getting irregular connection failures) [production]
00:05 <eileen> tools revision changed from b7b4060c30 to ef54260b0d [production]
2021-03-17 §
23:42 <urbanecm@deploy1002> Synchronized wmf-config/InitialiseSettings.php: c730dd5feb865a8325279cd4e76c133512f14251: idwiki: Deploy Growth features to newcomers (T259024) (duration: 01m 08s) [production]
23:40 <urbanecm@deploy1002> Synchronized wmf-config/CommonSettings.php: 5c14e7d2045f0905f7e85b249e821bbe8d69c600: Define confirmed group in MediaWikiServices hook (T275334, T277704, T275310, T275333) (duration: 01m 08s) [production]
23:30 <ebernhardson@deploy1002> Synchronized php-1.36.0-wmf.35/extensions/CirrusSearch/profiles/FallbackProfiles.config.php: Add fallback profile including glent m1 (duration: 01m 42s) [production]
22:27 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1038.eqiad.wmnet with reason: REIMAGE [production]
22:25 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1037.eqiad.wmnet with reason: REIMAGE [production]
22:25 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mc1038.eqiad.wmnet with reason: REIMAGE [production]
22:23 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mc1037.eqiad.wmnet with reason: REIMAGE [production]
20:52 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1184.eqiad.wmnet with reason: REIMAGE [production]
20:50 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1183.eqiad.wmnet with reason: REIMAGE [production]
20:48 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1184.eqiad.wmnet with reason: REIMAGE [production]
20:48 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1182.eqiad.wmnet with reason: REIMAGE [production]
20:47 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1183.eqiad.wmnet with reason: REIMAGE [production]
20:46 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1181.eqiad.wmnet with reason: REIMAGE [production]
20:45 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1182.eqiad.wmnet with reason: REIMAGE [production]
20:44 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1180.eqiad.wmnet with reason: REIMAGE [production]
20:43 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1181.eqiad.wmnet with reason: REIMAGE [production]
20:42 <andrew@deploy1002> Finished deploy [horizon/deploy@17ea780]: display volume usage summaries (duration: 03m 34s) [production]
20:42 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1179.eqiad.wmnet with reason: REIMAGE [production]
20:41 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1180.eqiad.wmnet with reason: REIMAGE [production]
20:40 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1178.eqiad.wmnet with reason: REIMAGE [production]
20:39 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1179.eqiad.wmnet with reason: REIMAGE [production]