1151-1200 of 10000 results (60ms)
2019-11-14 §
08:37 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1110 after schema change', diff saved to https://phabricator.wikimedia.org/P9633 and previous config saved to /var/cache/conftool/dbconfig/20191114-083729-marostegui.json [production]
08:03 <eileen> process-control config revision is 6adc66a20b re-enable backfill [production]
08:00 <marostegui@cumin1001> dbctl commit (dc=all): 'Pool a non partitioned slave db1089 on special groups for s1 T223151', diff saved to https://phabricator.wikimedia.org/P9632 and previous config saved to /var/cache/conftool/dbconfig/20191114-080038-marostegui.json [production]
07:54 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1103:3312 T235599', diff saved to https://phabricator.wikimedia.org/P9631 and previous config saved to /var/cache/conftool/dbconfig/20191114-075449-marostegui.json [production]
07:41 <eileen> process-control config revision is b7c2cf7227 - disabled backfill again - some error? [production]
07:29 <eileen> process-control config revision is 909108622d re-enable omnirecipient date repair job [production]
07:25 <eileen> process-control config revision is d3ebeddcc1 (I renabled the old back fill job) [production]
07:12 <moritzm> installing intel-microcode updates [production]
06:53 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1067', diff saved to https://phabricator.wikimedia.org/P9630 and previous config saved to /var/cache/conftool/dbconfig/20191114-065309-marostegui.json [production]
06:16 <marostegui> Stop replication on db1067 [production]
06:01 <marostegui@cumin2001> dbctl commit (dc=all): 'Promote db1083 to s1 master and remove read-only from s1 T234800', diff saved to https://phabricator.wikimedia.org/P9629 and previous config saved to /var/cache/conftool/dbconfig/20191114-060138-marostegui.json [production]
06:00 <marostegui@cumin2001> dbctl commit (dc=all): 'Set s1 as read-only for maintenance T234800', diff saved to https://phabricator.wikimedia.org/P9628 and previous config saved to /var/cache/conftool/dbconfig/20191114-060026-marostegui.json [production]
06:00 <marostegui> Starting s1 failover from db1067 to db1083 - T234800 [production]
05:51 <jynus> stopping db1114 replication [production]
05:34 <marostegui> Compress db2089:3316 - T235599 [production]
05:24 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1110 for schema change', diff saved to https://phabricator.wikimedia.org/P9627 and previous config saved to /var/cache/conftool/dbconfig/20191114-052400-marostegui.json [production]
05:23 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1130 after schema change', diff saved to https://phabricator.wikimedia.org/P9626 and previous config saved to /var/cache/conftool/dbconfig/20191114-052303-marostegui.json [production]
05:13 <marostegui> Move replicas from db1067 to db1083 T234800 [production]
05:09 <marostegui@cumin1001> dbctl commit (dc=all): 'Set db1083 with weight 0 T234800', diff saved to https://phabricator.wikimedia.org/P9625 and previous config saved to /var/cache/conftool/dbconfig/20191114-050940-marostegui.json [production]
05:08 <vgutierrez> Repooling cp1077 - T238289 [production]
05:07 <marostegui> Start pre-failover steps T234800 [production]
05:01 <kart_> Updated cxserver to 2019-11-13-111130-production tag (T237379, T235748, T236906) [production]
04:56 <kartik@deploy1001> helmfile [EQIAD] Ran 'apply' command on namespace 'cxserver' for release 'production' . [production]
04:51 <kartik@deploy1001> helmfile [CODFW] Ran 'apply' command on namespace 'cxserver' for release 'production' . [production]
04:49 <kartik@deploy1001> helmfile [STAGING] Ran 'apply' command on namespace 'cxserver' for release 'staging' . [production]
03:49 <vgutierrez> power cycling cp0177 - T238289 [production]
03:49 <vgutierrez@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp1077.eqiad.wmnet [production]
03:49 <vgutierrez> depooling cp1077 - T238289 [production]
00:41 <ebernhardson> T237849 Start CirrusSearch forceSearchIndex.php commonswiki 2019-10-20T00:00:00 - 2019-11-14T01:00:00 pushing into jobqueue [production]
00:40 <crusnov@deploy1001> Finished deploy [netbox/deploy@56df4a5]: deploy netbox for script update (duration: 00m 49s) [production]
00:39 <crusnov@deploy1001> Started deploy [netbox/deploy@56df4a5]: deploy netbox for script update [production]
00:39 <crusnov@deploy1001> Finished deploy [netbox/deploy@56df4a5]: deploy netbox for script update (duration: 00m 44s) [production]
00:38 <crusnov@deploy1001> Started deploy [netbox/deploy@56df4a5]: deploy netbox for script update [production]
00:36 <ebernhardson@deploy1001> Synchronized php-1.35.0-wmf.5/extensions/CirrusSearch/includes/BuildDocument/BuildDocument.php: T237849: Restore CirrusSearchBuildDocumentParse hook (duration: 00m 54s) [production]
2019-11-13 §
23:00 <jeh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
22:58 <jeh@cumin1001> START - Cookbook sre.hosts.downtime [production]
22:25 <catrope@deploy1001> Finished scap: For some reason that limited i18n sync didn't work, trying a full scap (duration: 18m 33s) [production]
22:07 <catrope@deploy1001> Started scap: For some reason that limited i18n sync didn't work, trying a full scap [production]
22:04 <catrope@deploy1001> scap sync-l10n completed (1.35.0-wmf.5) (duration: 02m 54s) [production]
22:00 <catrope@deploy1001> Synchronized php-1.35.0-wmf.5/extensions/GrowthExperiments/: Update to master (b937dce) (duration: 00m 54s) [production]
20:17 <XioNoX> delete unused asw2-esams:ae1 [production]
19:37 <mholloway-shell@deploy1001> Synchronized wmf-config/InitialiseSettings.php: MachineVision: Update WD item blacklist (again) (duration: 00m 52s) [production]
18:49 <Jeff_Green> authdns-update to remove host alnilam [production]
17:49 <mholloway-shell@deploy1001> Synchronized wmf-config/InitialiseSettings.php: MachineVision: Update WD item blacklist (duration: 00m 53s) [production]
16:40 <gehel> depool wdqs1005 - T238232 [production]
16:36 <gehel> restart blazegraph on wdqs1005 [production]
16:21 <ema> pool cp3054 with ATS backend T227432 [production]
16:21 <gehel> draining elastic1017-1031 to prepare for decommission - T230746 [production]
16:02 <ema@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
16:00 <ema@cumin1001> START - Cookbook sre.hosts.downtime [production]