2021-08-20
§
|
23:19 |
<bstorm> |
tried renewing all the certs to get certs working again in kubernetes |
[toolsbeta] |
23:17 |
<legoktm> |
deployed patch for T289385 |
[production] |
21:11 |
<urbanecm> |
urbanecm@deployment-deploy01:/srv/mediawiki-staging/private$ rm mwblocker.log # remove weird blank log file |
[releng] |
19:55 |
<wm-bot> |
<lucaswerkmeister> ran "php extensions/WikiLambda/maintenance/reloadBuiltinData.php" at ArthurPSmith’s request |
[tools.notwikilambda] |
19:46 |
<wm-bot> |
<lucaswerkmeister> ran "git -C extensions/WikiLambda/ submodule update --init --recursive", apparently the function-schemata submodule was missing? |
[tools.notwikilambda] |
19:10 |
<majavah> |
rebuilding node12-sssd/{base,web} to use debian packaged npm 7 |
[tools] |
18:56 |
<jeena> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/713439 |
[releng] |
18:55 |
<urbanecm> |
urbanecm@deployment-mwmaint01:/srv/mediawiki/php-master$ mwscript extensions/GrowthExperiments/maintenance/updateMenteeData.php --wiki=cswiki |
[releng] |
18:54 |
<urbanecm> |
urbanecm@deployment-mwmaint01:~$ for i in {1..20}; do echo "test $i" | mwscript edit.php --wiki={cswiki,enwiki} --user="Martin Urbanec (test $i)" --summary="test" Sandbox; done |
[releng] |
18:49 |
<urbanecm> |
urbanecm@deployment-mwmaint01:/srv/mediawiki/php-master$ mwscript extensions/GrowthExperiments/maintenance/updateMenteeData.php --wiki=enwiki |
[releng] |
18:49 |
<urbanecm> |
urbanecm@deployment-mwmaint01:/srv/mediawiki/php-master$ mwscript extensions/GrowthExperiments/maintenance/updateMenteeData.php --wiki=cswiki |
[releng] |
18:46 |
<urbanecm> |
urbanecm@deployment-mwmaint01:/srv/mediawiki/php-master$ for i in {1..20}; do mwscript extensions/CentralAuth/maintenance/createLocalAccount.php --wiki=enwiki "Martin Urbanec (test $i)"; done |
[releng] |
18:42 |
<majavah> |
rebuilding php74-sssd/{base,web} to use composer 2 |
[tools] |
18:40 |
<urbanecm> |
urbanecm@deployment-mwmaint01:~$ for i in {1..20}; do mwscript createAndPromote.php --wiki=cswiki "Martin Urbanec (test $i)" "$password"; done # to test a feature that needs a lot of different accounts |
[releng] |
17:03 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1141.eqiad.wmnet |
[production] |
17:01 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1141.eqiad.wmnet |
[production] |
16:58 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1140.eqiad.wmnet |
[production] |
16:56 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1140.eqiad.wmnet |
[production] |
16:56 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1139.eqiad.wmnet |
[production] |
16:54 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1139.eqiad.wmnet |
[production] |
16:45 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1134.eqiad.wmnet |
[production] |
16:43 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1134.eqiad.wmnet |
[production] |
16:38 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1133.eqiad.wmnet |
[production] |
16:36 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1133.eqiad.wmnet |
[production] |
16:30 |
<majavah> |
restart sssd on deployment-cache-text06, T286502? |
[releng] |
16:24 |
<majavah> |
deployment-prep: configure wikifunctions.beta.wmflabs.org dns zones and add to acme-chief T284162 |
[releng] |
15:56 |
<Globgor> |
Bacchus instance rebuilded |
[wikisp] |
15:37 |
<jayme> |
deleting various pods from staging to have them recreated with priorities - T289131 |
[production] |
15:25 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1129.eqiad.wmnet |
[production] |
15:23 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1129.eqiad.wmnet |
[production] |
15:14 |
<jgiannelos@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'tegola-vector-tiles' for release 'main' . |
[production] |
14:41 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2021.codfw.wmnet with reason: REIMAGE |
[production] |
14:39 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc2021.codfw.wmnet with reason: REIMAGE |
[production] |
13:54 |
<jelto@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'miscweb' for release 'main' . |
[production] |
13:48 |
<jgiannelos@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'tegola-vector-tiles' for release 'main' . |
[production] |
12:00 |
<jayme> |
enabled priority admission plugin on k8s staging, rolling restart all pods in kube-system namespace - T289131 |
[production] |
11:55 |
<jgiannelos@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'tegola-vector-tiles' for release 'main' . |
[production] |
10:35 |
<dzahn@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'miscweb' for release 'main' . |
[production] |
09:33 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts druid1001.eqiad.wmnet |
[production] |
09:32 |
<dzahn@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'miscweb' for release 'main' . |
[production] |
09:23 |
<btullis@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts druid1001.eqiad.wmnet |
[production] |
08:48 |
<godog> |
roll depool/pool thanos-fe to apply swift change - T288815 |
[production] |
08:46 |
<btullis> |
btullis@druid1001:~$ sudo systemctl stop druid-broker druid-coordinator druid-historical druid-middlemanager druid-overlord |
[analytics] |
08:43 |
<godog> |
temp depool thanos-fe2003 to test https://gerrit.wikimedia.org/r/c/operations/puppet/+/713815 |
[production] |
08:43 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on druid1001.eqiad.wmnet with reason: decommissioning druid1001 |
[production] |
08:43 |
<btullis@cumin1001> |
START - Cookbook sre.hosts.downtime for 5 days, 0:00:00 on druid1001.eqiad.wmnet with reason: decommissioning druid1001 |
[production] |
07:14 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mc2019.codfw.wmnet with reason: REIMAGE |
[production] |
07:12 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1038.eqiad.wmnet with reason: REIMAGE |
[production] |
07:10 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1037.eqiad.wmnet with reason: REIMAGE |
[production] |
07:10 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc1038.eqiad.wmnet with reason: REIMAGE |
[production] |