2023-07-11
15:17 <eevans@cumin1001> END (FAIL) - Cookbook sre.cassandra.roll-restart (exit_code=99) for nodes matching A:restbase-codfw: Applying JVM update - eevans@cumin1001 [production]
15:09 <eevans@cumin1001> START - Cookbook sre.cassandra.roll-restart for nodes matching A:restbase-codfw: Applying JVM update - eevans@cumin1001 [production]
14:49 <btullis@cumin1001> END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) for Druid public cluster: Roll restart of Druid jvm daemons. [production]
14:21 <btullis@deploy1002> helmfile [eqiad] DONE helmfile.d/services/datahub: sync on main [production]
14:19 <btullis@cumin1001> END (PASS) - Cookbook sre.kafka.roll-restart-mirror-maker (exit_code=0) restart MirrorMaker for Kafka A:kafka-mirror-maker-jumbo-eqiad cluster: Roll restart of jvm daemons. [production]
14:17 <moritzm> restarting apache on mw canaries [production]
14:17 <btullis@deploy1002> helmfile [eqiad] START helmfile.d/services/datahub: apply on main [production]
14:15 <btullis@deploy1002> helmfile [codfw] DONE helmfile.d/services/datahub: sync on main [production]
14:12 <btullis@deploy1002> helmfile [codfw] START helmfile.d/services/datahub: apply on main [production]
14:02 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
14:00 <btullis@cumin1001> START - Cookbook sre.kafka.roll-restart-mirror-maker restart MirrorMaker for Kafka A:kafka-mirror-maker-jumbo-eqiad cluster: Roll restart of jvm daemons. [production]
13:59 <moritzm> installing yajl security updates [production]
13:59 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
13:57 <btullis@cumin1001> START - Cookbook sre.druid.roll-restart-workers for Druid public cluster: Roll restart of Druid jvm daemons. [production]
13:49 <moritzm> rebalance ganeti group eqiad/d after reboots [production]
13:42 <jgiannelos@deploy1002> Finished deploy [restbase/deploy@930f075]: (no justification provided) (duration: 19m 50s) [production]
13:33 <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:936826|Enable tabs for non loggedin mobile users on knwikisource (T340276)]] (duration: 11m 33s) [production]
13:23 <urbanecm@deploy1002> urbanecm and anzx: Backport for [[gerrit:936826|Enable tabs for non loggedin mobile users on knwikisource (T340276)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
13:22 <jgiannelos@deploy1002> Started deploy [restbase/deploy@930f075]: (no justification provided) [production]
13:21 <urbanecm@deploy1002> Started scap: Backport for [[gerrit:936826|Enable tabs for non loggedin mobile users on knwikisource (T340276)]] [production]
13:21 <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:936639|Growth: Increase mentorship percentage to 25% on enwiki (T341399)]] (duration: 07m 15s) [production]
13:14 <urbanecm@deploy1002> Started scap: Backport for [[gerrit:936639|Growth: Increase mentorship percentage to 25% on enwiki (T341399)]] [production]
13:13 <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:935723|GrowthExperiments: Enable backend of link recommendation 10, 11, 12th round wikis (T308135 T308136 T308137)]] (duration: 09m 45s) [production]
13:04 <urbanecm@deploy1002> sgimeno and urbanecm: Backport for [[gerrit:935723|GrowthExperiments: Enable backend of link recommendation 10, 11, 12th round wikis (T308135 T308136 T308137)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet [production]
13:03 <urbanecm@deploy1002> Started scap: Backport for [[gerrit:935723|GrowthExperiments: Enable backend of link recommendation 10, 11, 12th round wikis (T308135 T308136 T308137)]] [production]
13:00 <jbond@cumin1001> END (FAIL) - Cookbook sre.postgresql.postgres-init (exit_code=99) [production]
12:59 <jbond@cumin1001> START - Cookbook sre.postgresql.postgres-init [production]
12:59 <jbond@cumin1001> END (ERROR) - Cookbook sre.postgresql.postgres-init (exit_code=97) [production]
12:53 <jbond@cumin1001> START - Cookbook sre.postgresql.postgres-init [production]
12:00 <XioNoX> decom datahop in knams - T340049 [production]
11:42 <btullis@deploy1002> helmfile [codfw] DONE helmfile.d/services/datahub: sync on main [production]
11:38 <btullis@deploy1002> helmfile [codfw] START helmfile.d/services/datahub: apply on main [production]
11:37 <btullis@deploy1002> helmfile [eqiad] DONE helmfile.d/services/datahub: sync on main [production]
11:27 <btullis@deploy1002> helmfile [eqiad] START helmfile.d/services/datahub: apply on main [production]
11:17 <btullis@deploy1002> helmfile [eqiad] DONE helmfile.d/services/datahub: sync on main [production]
11:06 <btullis@deploy1002> helmfile [eqiad] START helmfile.d/services/datahub: apply on main [production]
10:46 <moritzm> installing libx11 security updates [production]
10:44 <isaranto@deploy1002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . [production]
10:44 <ladsgroup@deploy1002> Sync cancelled. [production]
10:39 <ladsgroup@deploy1002> ladsgroup: Backport for [[gerrit:936733|ExternalLinks: Make oneWildcard avoid adding wildcard to domain (T326251)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
10:38 <isaranto@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . [production]
10:37 <ladsgroup@deploy1002> Started scap: Backport for [[gerrit:936733|ExternalLinks: Make oneWildcard avoid adding wildcard to domain (T326251)]] [production]
10:19 <moritzm> rebalance ganeti group codfw/C after reboots [production]
10:03 <ladsgroup@deploy1002> Finished scap: Backport for [[gerrit:936796|Override liftwing hostname (T319170)]] (duration: 14m 34s) [production]
09:52 <ladsgroup@deploy1002> ladsgroup: Backport for [[gerrit:936796|Override liftwing hostname (T319170)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet [production]
09:52 <jbond> disable puppet fleet wide to deploy 936273 [production]
09:49 <ladsgroup@deploy1002> Started scap: Backport for [[gerrit:936796|Override liftwing hostname (T319170)]] [production]
09:47 <jbond> re-enable puppet [production]
09:43 <hashar> Updating Zuul configuration, which was stale at a version from March 29th after the switchover from contint2001 to contint2002 | T324659 T341556 [production]
09:36 <jbond> deploy gerrit:936273 enable submitting data to puppetdb7 [production]