2018-09-12
|
08:00 |
<mobrovac@deploy1001> |
Started restart [proton/deploy@ecb9a0e]: (no justification provided) |
[production] |
07:53 |
<jynus@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Depool db1098 (duration: 00m 50s) |
[production] |
07:45 |
<addshore@deploy1001> |
Synchronized php-1.32.0-wmf.20/extensions/Wikibase/lib/includes/Store/: REVERT: Debug logging for T97368 [[gerrit:459905]] (duration: 00m 51s) |
[production] |
07:38 |
<addshore@deploy1001> |
Synchronized php-1.32.0-wmf.20/extensions/Wikibase/lib/includes/Store/: Debug logging for T97368 [[gerrit:459905]] (duration: 00m 53s) |
[production] |
06:57 |
<mobrovac@deploy1001> |
Finished deploy [proton/deploy@ecb9a0e]: Update to Puppeteer v1.7.0 and fix browser connection abort handling - T181623 (duration: 00m 58s) |
[production] |
06:56 |
<mobrovac@deploy1001> |
Started deploy [proton/deploy@ecb9a0e]: Update to Puppeteer v1.7.0 and fix browser connection abort handling - T181623 |
[production] |
06:35 |
<moritzm> |
installing confuse security updates |
[production] |
06:21 |
<elukey> |
re-run webrequest-load-wf-text-2018-9-12-4, failed due to sql exceptions/timeouts to the database |
[analytics] |
02:38 |
<l10nupdate@deploy1001> |
ResourceLoader cache refresh completed at Wed Sep 12 02:38:53 UTC 2018 (duration 10m 44s) |
[production] |
02:28 |
<l10nupdate@deploy1001> |
scap sync-l10n completed (1.32.0-wmf.20) (duration: 08m 30s) |
[production] |
00:14 |
<marxarelli> |
deleting instance integration-slave-docker-1029 |
[releng] |
2018-09-11
|
22:43 |
<marxarelli> |
launching m1.xlarge integration-slave-docker-1029 using stretch image |
[releng] |
22:40 |
<marxarelli> |
deleting integration-slave-docker-1028 in favor of trying a stretch instance |
[releng] |
22:06 |
<legoktm> |
legoktm@mwmaint1001:~$ mwscript deleteEqualMessages.php --wiki fixcopyrightwiki --delete --lang-code='*' |
[production] |
22:05 |
<marxarelli> |
launching replacement instance integration-slave-docker-1028 |
[releng] |
21:56 |
<legoktm@deploy1001> |
Finished scap: i18n updates for fixcopyright (duration: 32m 21s) |
[production] |
21:49 |
<marxarelli> |
removing unresponsive jenkins node integration-slave-docker-1025 |
[releng] |
21:39 |
<marxarelli> |
cherry-pick https://gerrit.wikimedia.org/r/c/operations/puppet/+/459850 on integration-puppetmaster01 |
[releng] |
21:23 |
<legoktm@deploy1001> |
Started scap: i18n updates for fixcopyright |
[production] |
21:22 |
<legoktm@deploy1001> |
Synchronized php-1.32.0-wmf.20/skins/EUCopyrightCampaignSkin/: add og:image meta tag - https://gerrit.wikimedia.org/r/459836 (duration: 00m 51s) |
[production] |
21:03 |
<mutante> |
restarted apache on mwdebug1002, running puppet |
[production] |
19:38 |
<ema> |
switch all services to codfw only |
[production] |
19:31 |
<ema> |
switch restbase to active/active |
[production] |
19:20 |
<ema> |
depool eqiad from edge traffic |
[production] |
19:06 |
<ema> |
route esams via codfw |
[production] |
17:42 |
<bd808> |
Update to b71fd65 (Revert "Add a cache-busting parameter when fetching from noc.wm.o") |
[tools.versions] |
17:22 |
<zhuyifei1999_> |
doing another backup of main db: `sudo mysqldump quarry | sudo tee /data/project/dump-$(date '+%Y-%m-%d').sql > /dev/null` T202588 |
[quarry] |
17:14 |
<zhuyifei1999_> |
disabling puppet on quarry-main-01, quarry-runner-0{1,2} T202588 |
[quarry] |
17:10 |
<XioNoX> |
delete BGP sessions with old AS10089 router on cr1-eqsin |
[production] |
16:58 |
<arturo> |
add myself as project admin |
[bastion] |
16:53 |
<godog> |
repair sdd on ms-be1043 - T199198 |
[production] |
16:52 |
<arturo> |
again, restarted nova-network after killing all dnsmasq procs in labnet1001 for T202636 |
[admin] |
16:27 |
<mutante> |
added gtirloni to acl*sre-team on Phabricator (T203489) |
[production] |
16:17 |
<godog> |
correction, sdk1 on ms-be1041 - T199198 |
[production] |
16:16 |
<godog> |
repair sdd1 on ms-be1043 - T199198 |
[production] |
16:13 |
<arturo> |
force-reboot bastion-restricted-01 bc it lost the IP address, and to force a new DHCP query |
[bastion] |
16:08 |
<arturo> |
restarted nova-network after killing all dnsmasq procs in labnet1001 for T202636 |
[admin] |
15:06 |
<godog> |
switch serving of originals and thumbs to codfw only |
[production] |
15:00 |
<godog> |
begin switching swift to codfw |
[production] |
14:40 |
<END> |
(PASS) - Cookbook sre.switchdc.services.02-restore-ttl (exit_code=0) (akosiaris@sarin) |
[production] |
14:40 |
<START> |
- Cookbook sre.switchdc.services.02-restore-ttl (akosiaris@sarin) |
[production] |
14:38 |
<END> |
(PASS) - Cookbook sre.switchdc.services.01-switch-dc (exit_code=0) (akosiaris@sarin) |
[production] |
14:38 |
<Switching> |
services parsoid, restbase, restbase-async, mobileapps, apertium, citoid, cxserver, eventstreams, graphoid, mathoid, proton, pdfrender, recommendation-api, zotero, eventbus, ores, wdqs, wdqs-internal: eqiad => codfw (akosiaris@sarin) |
[production] |
14:38 |
<START> |
- Cookbook sre.switchdc.services.01-switch-dc (akosiaris@sarin) |
[production] |
14:38 |
<END> |
(PASS) - Cookbook sre.switchdc.services.00-reduce-ttl-and-sleep (exit_code=0) (akosiaris@sarin) |
[production] |
14:32 |
<START> |
- Cookbook sre.switchdc.services.00-reduce-ttl-and-sleep (akosiaris@sarin) |
[production] |
14:31 |
<END> |
(FAIL) - Cookbook sre.switchdc.services.00-reduce-ttl-and-sleep (exit_code=99) (akosiaris@sarin) |
[production] |
14:31 |
<START> |
- Cookbook sre.switchdc.services.00-reduce-ttl-and-sleep (akosiaris@sarin) |
[production] |
14:31 |
<END> |
(FAIL) - Cookbook sre.switchdc.services.00-reduce-ttl-and-sleep (exit_code=99) (akosiaris@sarin) |
[production] |
14:31 |
<START> |
- Cookbook sre.switchdc.services.00-reduce-ttl-and-sleep (akosiaris@sarin) |
[production] |