2021-03-08
ยง
|
20:44 |
<legoktm> |
legoktm@registry1004:~$ sudo systemctl reset-failed # to fix icinga warning |
[production] |
20:43 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-druid1003.eqiad.wmnet with reason: REIMAGE |
[production] |
20:41 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-druid1003.eqiad.wmnet with reason: REIMAGE |
[production] |
20:38 |
<urbanecm@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: 5ce7b4602d2b109adfb86bef6795a4d07a1208b9: Set wgGEHelpPanelAskMentor to true by default (T275908) (duration: 01m 07s) |
[production] |
20:32 |
<bblack> |
miscweb[12]002 - re-enabled puppet and deployed new cert |
[production] |
20:23 |
<bblack> |
miscweb[12]002 - disabling puppet to remake cergen cert... |
[production] |
19:55 |
<otto@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Migrate Editing schemas to Event Platform on testwiki - T267343, T267353 (duration: 00m 57s) |
[production] |
19:47 |
<dduvall@deploy1002> |
Synchronized php-1.36.0-wmf.33/maintenance/: maintenance: aa6f291: 4893ddb: fa97162: 380c448: DB_NONE offline maintenance improvements (duration: 00m 58s) |
[production] |
19:37 |
<dduvall@deploy1002> |
Synchronized wmf-config/: wmf-config/env.php,CommonSettings.php: f70049b: e53dc3a: f9b9ea1: WMF_DATACENTER, WMF_MAINTENANCE_OFFLINE handling (duration: 01m 00s) |
[production] |
19:37 |
<bblack> |
cp-text: banning varnish-fe for req.http.host == ( 7 wikis from T274784 ) |
[production] |
19:21 |
<urbanecm@deploy1002> |
Synchronized wmf-config/config/: 1c46d0b: 1aad60b: vector: Expand Desktop Improvements pilot wiki group (T273090) (duration: 00m 58s) |
[production] |
19:20 |
<urbanecm@deploy1002> |
Synchronized dblists/desktop-improvements.dblist: 1c46d0b: 1aad60b: vector: Expand Desktop Improvements pilot wiki group (T273090) (duration: 00m 57s) |
[production] |
19:14 |
<bblack> |
cp-text: disabling puppet ahead of T274784 changes - https://gerrit.wikimedia.org/r/c/operations/puppet/+/669840 |
[production] |
19:10 |
<urbanecm@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: e1cb98890fd4ad0ed25670de2fff6db6e59d7132: Enable flood flag on hrwiki (T276560) (duration: 00m 58s) |
[production] |
18:58 |
<urbanecm@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: a85580030027ca5b879688ed5d76123454164001: Fix sqwiki help panel links description (T275550) (duration: 00m 58s) |
[production] |
18:47 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
18:40 |
<urbanecm@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: dfd95883ed15c532e6345d1dfacfc274b87fcd80: hiwiki: Add missing help panel link descriptions (T276450) (duration: 00m 58s) |
[production] |
18:37 |
<robh@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:36 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1116.eqiad.wmnet with reason: REIMAGE |
[production] |
18:34 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1116.eqiad.wmnet with reason: REIMAGE |
[production] |
18:33 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1115.eqiad.wmnet with reason: REIMAGE |
[production] |
18:33 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
18:31 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1115.eqiad.wmnet with reason: REIMAGE |
[production] |
18:29 |
<robh@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:11 |
<elukey> |
drain + reimage an-worker11[15,16] to Buster |
[production] |
17:40 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1114.eqiad.wmnet with reason: REIMAGE |
[production] |
17:38 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1113.eqiad.wmnet with reason: REIMAGE |
[production] |
17:37 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1114.eqiad.wmnet with reason: REIMAGE |
[production] |
17:36 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1113.eqiad.wmnet with reason: REIMAGE |
[production] |
17:12 |
<elukey> |
drain + reimage an-worker11[13,14] to Buster |
[production] |
16:45 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1110.eqiad.wmnet with reason: REIMAGE |
[production] |
16:43 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1109.eqiad.wmnet with reason: REIMAGE |
[production] |
16:43 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1110.eqiad.wmnet with reason: REIMAGE |
[production] |
16:41 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1109.eqiad.wmnet with reason: REIMAGE |
[production] |
16:17 |
<elukey> |
drain + reimage an-worker1109/1110 to Buster |
[production] |
15:55 |
<marostegui> |
Restart db1115 (tendril host) |
[production] |
15:55 |
<marostegui> |
Restar db11115 |
[production] |
15:47 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1166 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P14669 and previous config saved to /var/cache/conftool/dbconfig/20210308-154710-root.json |
[production] |
15:32 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1166 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P14666 and previous config saved to /var/cache/conftool/dbconfig/20210308-153207-root.json |
[production] |
15:18 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1108.eqiad.wmnet with reason: REIMAGE |
[production] |
15:17 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1166 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P14665 and previous config saved to /var/cache/conftool/dbconfig/20210308-151703-root.json |
[production] |
15:16 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1108.eqiad.wmnet with reason: REIMAGE |
[production] |
15:16 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1107.eqiad.wmnet with reason: REIMAGE |
[production] |
15:14 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1107.eqiad.wmnet with reason: REIMAGE |
[production] |
15:07 |
<otto@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Migrate PrefUpdate to EventGate on all wikis - T267348 (duration: 00m 59s) |
[production] |
15:02 |
<otto@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Remove wgEventLoggingSchemas overrides for Growth and WMDE Tech wishes schemas - T267333, etc. (duration: 00m 59s) |
[production] |
15:02 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1166 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P14664 and previous config saved to /var/cache/conftool/dbconfig/20210308-150159-root.json |
[production] |
14:54 |
<elukey> |
drain + reimage an-worker110[7,8] to Buster |
[production] |
14:51 |
<otto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
14:51 |
<otto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |