951-1000 of 10000 results (69ms)
2022-07-07 ยง
12:22 <moritzm> draining ganeti2015 T311686 [production]
11:53 <jayme@deploy1002> helmfile [staging] DONE helmfile.d/services/tegola-vector-tiles: apply [production]
11:49 <jayme> rolling back helm release eventstreams-internal/main to revision 3 on eqiad and codfw clusters because it's pending-upgrade since Mon Mar 21 21:36:56 2022 / Mon Mar 21 16:05:54 2022 [production]
11:42 <jayme@deploy1002> helmfile [staging] START helmfile.d/services/tegola-vector-tiles: apply [production]
11:42 <jayme@deploy1002> helmfile [staging] DONE helmfile.d/services/tegola-vector-tiles: apply [production]
11:41 <jayme@deploy1002> helmfile [staging] START helmfile.d/services/tegola-vector-tiles: apply [production]
11:40 <jayme> rolling back helm release tegola-vector-tiles/main to revision 11 on staging-eqiad because it's pending-upgrade since Mon Jun 27 09:45:56 2022 [production]
11:04 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1160.eqiad.wmnet with reason: Maintenance [production]
11:04 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1160.eqiad.wmnet with reason: Maintenance [production]
11:00 <moritzm> installing intel-microcode security updates [production]
10:59 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/api-gateway: sync [production]
10:59 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/services/api-gateway: sync [production]
10:57 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/api-gateway: sync [production]
10:56 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/api-gateway: sync [production]
10:54 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
10:49 <hnowlan@deploy1002> helmfile [staging] DONE helmfile.d/services/api-gateway: sync [production]
10:48 <hnowlan@deploy1002> helmfile [staging] START helmfile.d/services/api-gateway: sync [production]
10:46 <cmjohnson@cumin1001> START - Cookbook sre.dns.netbox [production]
10:32 <moritzm> draining ganeti2010 T311686 [production]
10:13 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
10:12 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
10:12 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
10:11 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
10:06 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
10:05 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
10:05 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
10:04 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
09:47 <cmooney@cumin1001> START - Cookbook sre.hosts.reimage for host cloudnet1006.eqiad.wmnet with OS bullseye [production]
09:44 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
09:44 <moritzm> installing 5.10.120-1~bpo10+1 kernels on buster hosts running Linux 5.10 [production]
09:43 <urbanecm@deploy1002> Synchronized wmf-config/InitialiseSettings.php: 8599f395bd3af2b27aa06cdc318d44e97efc8119: Declare mediawiki.editgrowthconfig schema (T312148) (duration: 03m 37s) [production]
09:43 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
09:43 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
09:42 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
09:38 <klausman@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
09:37 <klausman@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
09:35 <klausman@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . [production]
09:33 <marostegui> dbmaint s3@eqiad T312285 [production]
09:33 <marostegui> dbmaint s7@eqiad T312285 [production]
09:33 <marostegui> dbmaint s2@eqiad T312285 [production]
09:32 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dbstore1007.eqiad.wmnet [production]
09:31 <marostegui> dbmaint s6@eqiad T312285 [production]
09:24 <marostegui@cumin1001> dbctl commit (dc=all): 'Add db2161 to dbctl T311493', diff saved to https://phabricator.wikimedia.org/P30940 and previous config saved to /var/cache/conftool/dbconfig/20220707-092424-marostegui.json [production]
09:22 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host dbstore1007.eqiad.wmnet [production]
09:21 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dbstore1005.eqiad.wmnet [production]
09:17 <moritzm> draining ganeti2009 T311686 [production]
09:14 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host dbstore1005.eqiad.wmnet [production]
09:11 <btullis@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host dbstore1003.eqiad.wmnet [production]
09:10 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kubetcd2004.codfw.wmnet with reason: Switch disk type back to plain [production]
09:09 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 1:00:00 on kubetcd2004.codfw.wmnet with reason: Switch disk type back to plain [production]