3001-3050 of 10000 results (38ms)
2021-08-25 ยง
12:21 <kormat@cumin1001> START - Cookbook sre.dns.netbox [production]
11:39 <jayme> slowly restarting all pods in kube-system namespace in eqiad k8s cluster - T289131 [production]
11:38 <btullis@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host an-test-coord1002.eqiad.wmnet [production]
11:32 <kharlan@deploy1002> Synchronized php-1.37.0-wmf.20/extensions/VisualEditor/includes/ApiVisualEditorEdit.php: Backport: [[gerrit:714670|ApiVisualEditorEdit: data-{plugin} is not multi (T289652)]] (duration: 01m 06s) [production]
11:30 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
11:28 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
11:18 <volans> uploaded spicerack_0.0.58 to apt.wikimedia.org buster-wikimedia,bullseye-wikimedia [production]
11:02 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host rdb2010.codfw.wmnet [production]
10:57 <jiji@cumin1001> START - Cookbook sre.hosts.reboot-single for host rdb2010.codfw.wmnet [production]
10:54 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host rdb2008.codfw.wmnet [production]
10:49 <ladsgroup@deploy1002> Synchronized php-1.37.0-wmf.19/includes/Storage/DerivedPageDataUpdater.php: Backport: [[gerrit:714672|Introduce concept of generateHTMLOnEdit() for ContentHandler (T285987)]], Part II (duration: 01m 04s) [production]
10:47 <ladsgroup@deploy1002> Synchronized php-1.37.0-wmf.19/includes/content/ContentHandler.php: Backport: [[gerrit:714672|Introduce concept of generateHTMLOnEdit() for ContentHandler (T285987)]], Part I (duration: 01m 08s) [production]
10:46 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
10:45 <jiji@cumin1001> START - Cookbook sre.hosts.reboot-single for host rdb2008.codfw.wmnet [production]
10:45 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
10:21 <jbond> rolling out openssl updates [production]
10:07 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
10:05 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
10:03 <ladsgroup@deploy1002> Synchronized php-1.37.0-wmf.20/includes: Backport: [[gerrit:714671|Introduce concept of generateHTMLOnEdit() for ContentHandler (T285987)]] (duration: 02m 17s) [production]
10:01 <mutante> - removed jmads from wmf group [production]
09:59 <btullis@cumin1001> START - Cookbook sre.ganeti.makevm for new host an-test-coord1002.eqiad.wmnet [production]
09:49 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host rdb1011.eqiad.wmnet [production]
09:44 <jiji@cumin1001> START - Cookbook sre.hosts.reboot-single for host rdb1011.eqiad.wmnet [production]
09:35 <jayme@deploy1002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
09:35 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host rdb1012.eqiad.wmnet [production]
09:35 <jayme@deploy1002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
09:35 <jayme@deploy1002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
09:34 <jayme@deploy1002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
09:30 <jiji@cumin1001> START - Cookbook sre.hosts.reboot-single for host rdb1012.eqiad.wmnet [production]
08:59 <jiji@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mc2033.codfw.wmnet with reason: REIMAGE [production]
08:57 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mc2033.codfw.wmnet with reason: REIMAGE [production]
08:17 <godog> swift codfw add ms-be20[62-65] with initial weight - T288458 [production]
07:01 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on db1160.eqiad.wmnet with reason: REIMAGE [production]
06:59 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1160.eqiad.wmnet with reason: REIMAGE [production]
06:43 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1160 for reimage T288803', diff saved to https://phabricator.wikimedia.org/P17078 and previous config saved to /var/cache/conftool/dbconfig/20210825-064319-marostegui.json [production]
06:08 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2118.codfw.wmnet with reason: Reimaging T288244 [production]
06:08 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on db2118.codfw.wmnet with reason: Reimaging T288244 [production]
06:07 <kormat@cumin1001> dbctl commit (dc=all): 'Depool db2118 until it's reimaged to buster T289129', diff saved to https://phabricator.wikimedia.org/P17077 and previous config saved to /var/cache/conftool/dbconfig/20210825-060742-kormat.json [production]
06:02 <kormat@cumin1001> dbctl commit (dc=all): 'Promote db2121 to s7 primary and set section read-write T289129', diff saved to https://phabricator.wikimedia.org/P17076 and previous config saved to /var/cache/conftool/dbconfig/20210825-060222-kormat.json [production]
06:01 <kormat@cumin1001> dbctl commit (dc=all): 'Set s7 codfw as read-only for maintenance - T289129', diff saved to https://phabricator.wikimedia.org/P17075 and previous config saved to /var/cache/conftool/dbconfig/20210825-060112-kormat.json [production]
06:00 <kormat> Starting s7 codfw failover from db2118 to db2121 - T289129 [production]
05:33 <eileen> civicrm revision changed from a4ce949828 to 42bb64c608, config revision is 1afcea7f5b [production]
05:28 <kormat> Moving s7 codfw replicas under db2121 - T289129 [production]
05:27 <kormat@cumin1001> dbctl commit (dc=all): 'Set db2121 with weight 0 T289129', diff saved to Unable to send diff to phaste and previous config saved to /var/cache/conftool/dbconfig/20210825-052741-kormat.json [production]
05:27 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:04:00 on 27 hosts with reason: Primary switchover s7 T289129 [production]
05:27 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 1:04:00 on 27 hosts with reason: Primary switchover s7 T289129 [production]
02:06 <eileen> civicrm revision changed from 8ed303f2d1 to a4ce949828, config revision is ac2d75d4a8 [production]
00:53 <legoktm@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'shellbox' for release 'main' . [production]
00:50 <legoktm@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'shellbox' for release 'main' . [production]
00:47 <legoktm@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'shellbox' for release 'main' . [production]