production SAL

1151-1200 of 10000 results (64ms)

2019-11-14 §
08:37	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repool db1110 after schema change', diff saved to https://phabricator.wikimedia.org/P9633 and previous config saved to /var/cache/conftool/dbconfig/20191114-083729-marostegui.json	[production]
08:03	<eileen>	process-control config revision is 6adc66a20b re-enable backfill	[production]
08:00	<marostegui@cumin1001>	dbctl commit (dc=all): 'Pool a non partitioned slave db1089 on special groups for s1 T223151', diff saved to https://phabricator.wikimedia.org/P9632 and previous config saved to /var/cache/conftool/dbconfig/20191114-080038-marostegui.json	[production]
07:54	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depool db1103:3312 T235599', diff saved to https://phabricator.wikimedia.org/P9631 and previous config saved to /var/cache/conftool/dbconfig/20191114-075449-marostegui.json	[production]
07:41	<eileen>	process-control config revision is b7c2cf7227 - disabled backfill again - some error?	[production]
07:29	<eileen>	process-control config revision is 909108622d re-enable omnirecipient date repair job	[production]
07:25	<eileen>	process-control config revision is d3ebeddcc1 (I renabled the old back fill job)	[production]
07:12	<moritzm>	installing intel-microcode updates	[production]
06:53	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depool db1067', diff saved to https://phabricator.wikimedia.org/P9630 and previous config saved to /var/cache/conftool/dbconfig/20191114-065309-marostegui.json	[production]
06:16	<marostegui>	Stop replication on db1067	[production]
06:01	<marostegui@cumin2001>	dbctl commit (dc=all): 'Promote db1083 to s1 master and remove read-only from s1 T234800', diff saved to https://phabricator.wikimedia.org/P9629 and previous config saved to /var/cache/conftool/dbconfig/20191114-060138-marostegui.json	[production]
06:00	<marostegui@cumin2001>	dbctl commit (dc=all): 'Set s1 as read-only for maintenance T234800', diff saved to https://phabricator.wikimedia.org/P9628 and previous config saved to /var/cache/conftool/dbconfig/20191114-060026-marostegui.json	[production]
06:00	<marostegui>	Starting s1 failover from db1067 to db1083 - T234800	[production]
05:51	<jynus>	stopping db1114 replication	[production]
05:34	<marostegui>	Compress db2089:3316 - T235599	[production]
05:24	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depool db1110 for schema change', diff saved to https://phabricator.wikimedia.org/P9627 and previous config saved to /var/cache/conftool/dbconfig/20191114-052400-marostegui.json	[production]
05:23	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repool db1130 after schema change', diff saved to https://phabricator.wikimedia.org/P9626 and previous config saved to /var/cache/conftool/dbconfig/20191114-052303-marostegui.json	[production]
05:13	<marostegui>	Move replicas from db1067 to db1083 T234800	[production]
05:09	<marostegui@cumin1001>	dbctl commit (dc=all): 'Set db1083 with weight 0 T234800', diff saved to https://phabricator.wikimedia.org/P9625 and previous config saved to /var/cache/conftool/dbconfig/20191114-050940-marostegui.json	[production]
05:08	<vgutierrez>	Repooling cp1077 - T238289	[production]
05:07	<marostegui>	Start pre-failover steps T234800	[production]
05:01	<kart_>	Updated cxserver to 2019-11-13-111130-production tag (T237379, T235748, T236906)	[production]
04:56	<kartik@deploy1001>	helmfile [EQIAD] Ran 'apply' command on namespace 'cxserver' for release 'production' .	[production]
04:51	<kartik@deploy1001>	helmfile [CODFW] Ran 'apply' command on namespace 'cxserver' for release 'production' .	[production]
04:49	<kartik@deploy1001>	helmfile [STAGING] Ran 'apply' command on namespace 'cxserver' for release 'staging' .	[production]
03:49	<vgutierrez>	power cycling cp0177 - T238289	[production]
03:49	<vgutierrez@puppetmaster1001>	conftool action : set/pooled=no; selector: name=cp1077.eqiad.wmnet	[production]
03:49	<vgutierrez>	depooling cp1077 - T238289	[production]
00:41	<ebernhardson>	T237849 Start CirrusSearch forceSearchIndex.php commonswiki 2019-10-20T00:00:00 - 2019-11-14T01:00:00 pushing into jobqueue	[production]
00:40	<crusnov@deploy1001>	Finished deploy [netbox/deploy@56df4a5]: deploy netbox for script update (duration: 00m 49s)	[production]
00:39	<crusnov@deploy1001>	Started deploy [netbox/deploy@56df4a5]: deploy netbox for script update	[production]
00:39	<crusnov@deploy1001>	Finished deploy [netbox/deploy@56df4a5]: deploy netbox for script update (duration: 00m 44s)	[production]
00:38	<crusnov@deploy1001>	Started deploy [netbox/deploy@56df4a5]: deploy netbox for script update	[production]
00:36	<ebernhardson@deploy1001>	Synchronized php-1.35.0-wmf.5/extensions/CirrusSearch/includes/BuildDocument/BuildDocument.php: T237849: Restore CirrusSearchBuildDocumentParse hook (duration: 00m 54s)	[production]
2019-11-13 §
23:00	<jeh@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
22:58	<jeh@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
22:25	<catrope@deploy1001>	Finished scap: For some reason that limited i18n sync didn't work, trying a full scap (duration: 18m 33s)	[production]
22:07	<catrope@deploy1001>	Started scap: For some reason that limited i18n sync didn't work, trying a full scap	[production]
22:04	<catrope@deploy1001>	scap sync-l10n completed (1.35.0-wmf.5) (duration: 02m 54s)	[production]
22:00	<catrope@deploy1001>	Synchronized php-1.35.0-wmf.5/extensions/GrowthExperiments/: Update to master (b937dce) (duration: 00m 54s)	[production]
20:17	<XioNoX>	delete unused asw2-esams:ae1	[production]
19:37	<mholloway-shell@deploy1001>	Synchronized wmf-config/InitialiseSettings.php: MachineVision: Update WD item blacklist (again) (duration: 00m 52s)	[production]
18:49	<Jeff_Green>	authdns-update to remove host alnilam	[production]
17:49	<mholloway-shell@deploy1001>	Synchronized wmf-config/InitialiseSettings.php: MachineVision: Update WD item blacklist (duration: 00m 53s)	[production]
16:40	<gehel>	depool wdqs1005 - T238232	[production]
16:36	<gehel>	restart blazegraph on wdqs1005	[production]
16:21	<ema>	pool cp3054 with ATS backend T227432	[production]
16:21	<gehel>	draining elastic1017-1031 to prepare for decommission - T230746	[production]
16:02	<ema@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
16:00	<ema@cumin1001>	START - Cookbook sre.hosts.downtime	[production]