5251-5300 of 10000 results (39ms)
2021-08-25 §
07:01 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on db1160.eqiad.wmnet with reason: REIMAGE [production]
06:59 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1160.eqiad.wmnet with reason: REIMAGE [production]
06:43 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1160 for reimage T288803', diff saved to https://phabricator.wikimedia.org/P17078 and previous config saved to /var/cache/conftool/dbconfig/20210825-064319-marostegui.json [production]
06:08 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2118.codfw.wmnet with reason: Reimaging T288244 [production]
06:08 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on db2118.codfw.wmnet with reason: Reimaging T288244 [production]
06:07 <kormat@cumin1001> dbctl commit (dc=all): 'Depool db2118 until it's reimaged to buster T289129', diff saved to https://phabricator.wikimedia.org/P17077 and previous config saved to /var/cache/conftool/dbconfig/20210825-060742-kormat.json [production]
06:02 <kormat@cumin1001> dbctl commit (dc=all): 'Promote db2121 to s7 primary and set section read-write T289129', diff saved to https://phabricator.wikimedia.org/P17076 and previous config saved to /var/cache/conftool/dbconfig/20210825-060222-kormat.json [production]
06:01 <kormat@cumin1001> dbctl commit (dc=all): 'Set s7 codfw as read-only for maintenance - T289129', diff saved to https://phabricator.wikimedia.org/P17075 and previous config saved to /var/cache/conftool/dbconfig/20210825-060112-kormat.json [production]
06:00 <kormat> Starting s7 codfw failover from db2118 to db2121 - T289129 [production]
05:33 <eileen> civicrm revision changed from a4ce949828 to 42bb64c608, config revision is 1afcea7f5b [production]
05:28 <kormat> Moving s7 codfw replicas under db2121 - T289129 [production]
05:27 <kormat@cumin1001> dbctl commit (dc=all): 'Set db2121 with weight 0 T289129', diff saved to Unable to send diff to phaste and previous config saved to /var/cache/conftool/dbconfig/20210825-052741-kormat.json [production]
05:27 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:04:00 on 27 hosts with reason: Primary switchover s7 T289129 [production]
05:27 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 1:04:00 on 27 hosts with reason: Primary switchover s7 T289129 [production]
02:06 <eileen> civicrm revision changed from 8ed303f2d1 to a4ce949828, config revision is ac2d75d4a8 [production]
00:53 <legoktm@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'shellbox' for release 'main' . [production]
00:50 <legoktm@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'shellbox' for release 'main' . [production]
00:47 <legoktm@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'shellbox' for release 'main' . [production]
2021-08-24 §
22:05 <legoktm@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'shellbox' for release 'main' . [production]
22:04 <legoktm@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'shellbox-constraints' for release 'main' . [production]
21:10 <tgr> running extensions/GrowthExperiments/maintenance/revalidateLinkRecommendations.php on various wikis per T282873#7303828 [production]
20:59 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
20:55 <urbanecm@deploy1002> Synchronized wmf-config/InitialiseSettings.php: a6fd96b15e6e3c068c2faac60208b9722d32af0f: Growth features: Promote 9 wikis out of dark mode (T287871; T287874; T287872; T287880; T287868; T287873; T287879; T287875; T287876) (duration: 01m 25s) [production]
20:54 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
20:43 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
20:35 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
20:35 <dancy@deploy1002> Pruned MediaWiki: 1.37.0-wmf.17 (duration: 01m 48s) [production]
20:33 <dancy@deploy1002> Pruned MediaWiki: 1.37.0-wmf.18 (duration: 03m 26s) [production]
20:27 <dancy@deploy1002> rebuilt and synchronized wikiversions files: group0 wikis to 1.37.0-wmf.20 [production]
20:18 <dancy@deploy1002> Finished scap: testwikis wikis to 1.37.0-wmf.20 (duration: 36m 32s) [production]
20:16 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
20:09 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
19:41 <dancy@deploy1002> Started scap: testwikis wikis to 1.37.0-wmf.20 [production]
17:23 <mbsantos@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mobileapps' for release 'production' . [production]
17:19 <mbsantos@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mobileapps' for release 'production' . [production]
17:17 <mbsantos@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'mobileapps' for release 'staging' . [production]
15:26 <dcausse@deploy1002> Finished deploy [wikimedia/discovery/analytics@e02c602]: transfer_to_es: stop adding data to article_topics (duration: 02m 17s) [production]
15:23 <dcausse@deploy1002> Started deploy [wikimedia/discovery/analytics@e02c602]: transfer_to_es: stop adding data to article_topics [production]
15:15 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
15:13 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
14:55 <jayme@deploy1002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:54 <jayme@deploy1002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
14:50 <jayme@deploy1002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
14:49 <jayme@deploy1002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
14:23 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2031.codfw.wmnet with reason: REIMAGE [production]
14:19 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mc2031.codfw.wmnet with reason: REIMAGE [production]
13:12 <XioNoX> push pfw policies - T289353 [production]
12:45 <vgutierrez> enable puppet on P:tlsproxy::envoy hosts - merging https://gerrit.wikimedia.org/r/c/operations/puppet/+/710507/9 [production]
12:37 <vgutierrez> disable puppet on P:tlsproxy::envoy hosts - merging https://gerrit.wikimedia.org/r/c/operations/puppet/+/710507/9 [production]
12:33 <godog> test patched python3-eventlet on thanos-fe1003 - T283714 [production]