2021-11-30
17:35 <jynus@cumin1001> dbctl commit (dc=all): 'Repool db1163 at 25%', diff saved to https://phabricator.wikimedia.org/P17910 and previous config saved to /var/cache/conftool/dbconfig/20211130-173517-jynus.json [production]
17:34 <moritzm> installing libvorbis security updates [production]
17:15 <jynus@cumin1001> dbctl commit (dc=all): 'Repool db1163 at 5%', diff saved to https://phabricator.wikimedia.org/P17908 and previous config saved to /var/cache/conftool/dbconfig/20211130-171550-jynus.json [production]
17:00 <jynus> move db1139:s1 under db1118 [production]
16:57 <jynus@cumin1001> dbctl commit (dc=all): 'Depool db1163 fully', diff saved to https://phabricator.wikimedia.org/P17907 and previous config saved to /var/cache/conftool/dbconfig/20211130-165718-jynus.json [production]
16:29 <XioNoX> Move cr2-codfw lumen transit link to BO cable - T289241 [production]
16:26 <XioNoX> Move cr2-codfw eqord link to BO cable - T289241 [production]
16:23 <XioNoX> Move cr2-codfw pfw3 link to BO cable - T289241 [production]
16:20 <Emperor> reboot ms-be2059 to fix device enumeration order re T295563 [production]
16:14 <jynus@cumin1001> dbctl commit (dc=all): 'Depool db1163 at 25%', diff saved to https://phabricator.wikimedia.org/P17906 and previous config saved to /var/cache/conftool/dbconfig/20211130-161457-jynus.json [production]
16:13 <XioNoX> cr2-codfw bounce fpc 1 pic 0 (vrrp backup) - T289241 [production]
16:07 <jynus@cumin1001> dbctl commit (dc=all): 'Depool db1163 at 50%', diff saved to https://phabricator.wikimedia.org/P17905 and previous config saved to /var/cache/conftool/dbconfig/20211130-160748-jynus.json [production]
16:06 <bblack> lvs2007 - repooling into service [production]
16:01 <bblack> lvs2007 - depooling for network maint - do not push LVS config changes please! [production]
15:41 <jbond@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) for hosts puppetboard2001.codfw.wmnet [production]
15:41 <jbond@cumin1001> START - Cookbook sre.hosts.decommission for hosts puppetboard2001.codfw.wmnet [production]
15:38 <jbond@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) for hosts puppetboard2001.codfw.wmnet [production]
15:37 <jbond@cumin1001> START - Cookbook sre.hosts.decommission for hosts puppetboard2001.codfw.wmnet [production]
15:32 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
15:29 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
15:23 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
15:22 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
15:16 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
15:15 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
15:12 <jforrester@deploy1002> Synchronized multiversion/MWMultiVersion.php: Add wikifunctions hard-coded value to setSiteInfoForWiki for Beta Cluster T284162 (duration: 00m 56s) [production]
15:09 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
15:08 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
13:45 <elukey@cumin1001> END (PASS) - Cookbook sre.kafka.roll-restart-brokers (exit_code=0) for Kafka A:kafka-test-eqiad cluster: Roll restart of jvm daemons for openjdk upgrade. [production]
13:25 <elukey@cumin1001> START - Cookbook sre.kafka.roll-restart-brokers for Kafka A:kafka-test-eqiad cluster: Roll restart of jvm daemons for openjdk upgrade. [production]
13:11 <marostegui@cumin1001> dbctl commit (dc=all): 'After maintenance db2114 (T277354)', diff saved to https://phabricator.wikimedia.org/P17904 and previous config saved to /var/cache/conftool/dbconfig/20211130-131124-marostegui.json [production]
13:05 <topranks> Running homer against CR routers to adjust loopback4 filter enabling local NTP queries for status. T296623 [production]
12:56 <marostegui@cumin1001> dbctl commit (dc=all): 'After maintenance db2114 (T277354)', diff saved to https://phabricator.wikimedia.org/P17903 and previous config saved to /var/cache/conftool/dbconfig/20211130-125620-marostegui.json [production]
12:41 <marostegui@cumin1001> dbctl commit (dc=all): 'After maintenance db2114 (T277354)', diff saved to https://phabricator.wikimedia.org/P17902 and previous config saved to /var/cache/conftool/dbconfig/20211130-124115-marostegui.json [production]
12:26 <marostegui@cumin1001> dbctl commit (dc=all): 'After maintenance db2114 (T277354)', diff saved to https://phabricator.wikimedia.org/P17901 and previous config saved to /var/cache/conftool/dbconfig/20211130-122610-marostegui.json [production]
12:25 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db2114 (T277354)', diff saved to https://phabricator.wikimedia.org/P17900 and previous config saved to /var/cache/conftool/dbconfig/20211130-122555-marostegui.json [production]
12:25 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2114.codfw.wmnet with reason: Maintenance T277354 [production]
12:25 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 4:00:00 on db2114.codfw.wmnet with reason: Maintenance T277354 [production]
12:09 <jbond@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) for hosts puppetboard1001.eqiad.wmnet [production]
12:02 <jbond@cumin1001> START - Cookbook sre.hosts.decommission for hosts puppetboard1001.eqiad.wmnet [production]
11:50 <moritzm> running "sudo gnt-cluster renew-crypto --new-node-certificates --new-rapi-certificate --new-spice-certificate" for Ganeti codfw cluster T296622 [production]
11:01 <hnowlan> restarting tilerator, kartotherian and tileratorui for updates in eqiad [production]
11:01 <hnowlan> restarting tilerator, kartotherian and tileratorui in codfw [production]
10:39 <elukey> rollout wmf-certificates 0~20211129-1 fleet wide (add group/others permissions to the cert bundle) [production]
10:30 <lucaswerkmeister-wmde@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'termbox' for release 'production' . [production]
10:29 <lucaswerkmeister-wmde@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'termbox' for release 'production' . [production]
09:58 <moritzm> installing remaining ICU security updates [production]
09:06 <Amir1> dropping wikiadmin@localhost from all pooled replicas of s6 (T296511) [production]
08:24 <dcausse> restarting blazegraph on wdqs1006 (jvm stuck for 6hours) [production]
08:14 <Amir1> revoking DROP from wikiadmin on all pooled replicas (T249683) [production]
03:46 <ejegg> updated payments-wiki from dbc92132 to 4a4ef51d [production]