2021-07-20
§
|
13:14 |
<kormat@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 13 hosts with reason: Deploying schema change to s5 T281058 |
[production] |
13:13 |
<gehel@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: name=elastic1039.eqiad.wmnet |
[production] |
12:44 |
<moritzm> |
installing systemd security updates on buster |
[production] |
12:23 |
<elukey> |
reboot ml-serve-ctrl vms to pick up new vcores settings |
[production] |
12:22 |
<elukey> |
bump vcpus from 2 to 4 on ml-serve-ctrl VMs on Ganeti (load/cpu usage increased steadily since we deployed kubelets on them) |
[production] |
11:58 |
<Lucas_WMDE> |
EU config+backport window done |
[production] |
11:58 |
<lucaswerkmeister-wmde@deploy1002> |
Synchronized wmf-config/CommonSettings-labs.php: Config: [[gerrit:705505|Avoid using User::newFrom* methods]] (3/3) (duration: 00m 56s) |
[production] |
11:58 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on maps1007.eqiad.wmnet with reason: Testing impact of tilerator |
[production] |
11:58 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on maps1007.eqiad.wmnet with reason: Testing impact of tilerator |
[production] |
11:56 |
<lucaswerkmeister-wmde@deploy1002> |
Synchronized wmf-config/CommonSettings.php: Config: [[gerrit:705505|Avoid using User::newFrom* methods]] (2/3) (duration: 00m 56s) |
[production] |
11:55 |
<lucaswerkmeister-wmde@deploy1002> |
Synchronized wmf-config/wikitech.php: Config: [[gerrit:705505|Avoid using User::newFrom* methods]] (1/3) (duration: 00m 56s) |
[production] |
11:48 |
<urbanecm@deploy1002> |
Synchronized logos/config.yaml: e52ae37dc2010ed2483328921a274e4934940791: otrs_wikiwiki: Update logo to use VRT instead of OTRS (T280400; 3/3) (duration: 00m 56s) |
[production] |
11:47 |
<urbanecm@deploy1002> |
Synchronized wmf-config/logos.php: e52ae37dc2010ed2483328921a274e4934940791: otrs_wikiwiki: Update logo to use VRT instead of OTRS (T280400; 2/3) (duration: 00m 56s) |
[production] |
11:46 |
<urbanecm@deploy1002> |
Synchronized static/images/project-logos: e52ae37dc2010ed2483328921a274e4934940791: otrs_wikiwiki: Update logo to use VRT instead of OTRS (T280400; 1/3) (duration: 00m 57s) |
[production] |
11:35 |
<lucaswerkmeister-wmde@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:705498|Add patroller group for ckbwiki (T285221)]] (duration: 00m 57s) |
[production] |
11:23 |
<lucaswerkmeister-wmde@deploy1002> |
Synchronized wmf-config/CommonSettings-labs.php: Config: [[gerrit:705107|Typo fix: "the the" -> "the" (T201491)]] (2/2, beta) (duration: 00m 56s) |
[production] |
11:22 |
<lucaswerkmeister-wmde@deploy1002> |
Synchronized wmf-config/CommonSettings.php: Config: [[gerrit:705107|Typo fix: "the the" -> "the" (T201491)]] (1/2, prod) (duration: 00m 57s) |
[production] |
11:18 |
<lucaswerkmeister-wmde@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:704867|Update config for language switching on pilot wikis (T286459)]] (duration: 00m 59s) |
[production] |
11:06 |
<oblivian@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
11:03 |
<oblivian@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
10:58 |
<oblivian@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
10:57 |
<oblivian@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
10:53 |
<oblivian@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
10:43 |
<hnowlan@puppetmaster1001> |
conftool action : set/weight=10; selector: name=maps100[79].eqiad.wmnet |
[production] |
10:35 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=maps100[79].eqiad.wmnet |
[production] |
10:11 |
<jgiannelos@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'tegola-vector-tiles' for release 'main' . |
[production] |
09:39 |
<kormat@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 14 hosts with reason: Deploying schema change to s6 T281058 |
[production] |
09:39 |
<kormat@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 14 hosts with reason: Deploying schema change to s6 T281058 |
[production] |
08:27 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mw2352.codfw.wmnet |
[production] |
08:21 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host mw2352.codfw.wmnet |
[production] |
08:02 |
<btullis> |
racadm serveraction powercycle on an-worker1106 due to CPU soft lock-ups on host |
[production] |
07:54 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host idp-test2001.wikimedia.org |
[production] |
07:50 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host idp-test2001.wikimedia.org |
[production] |
07:10 |
<jmm@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=ldap-replica1004.wikimedia.org |
[production] |
03:17 |
<eileen> |
civicrm revision changed from 20e9ef6bbb to 819c11307d, config revision is bb405c5232 |
[production] |
2021-07-19
§
|
20:48 |
<urbanecm> |
Deploy security patch for T286884 |
[production] |
20:29 |
<vgutierrez> |
pool text@codfw - T286921 |
[production] |
20:23 |
<volans@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
20:18 |
<volans@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
20:08 |
<dancy@deploy1002> |
Synchronized php-1.37.0-wmf.14/includes/export/WikiExporter.php: Backport: [[gerrit:705467|prevent PageIdentity checks in RevisionStore from breaking xml dumps (T286877)]] (duration: 00m 58s) |
[production] |
19:21 |
<Jeff_Green> |
authdns-update to remove payments100[1-4].frack.eqiad.wmnet |
[production] |
19:14 |
<dancy@deploy1002> |
Synchronized php-1.37.0-wmf.14/includes/Revision/RevisionStore.php: Backport: [[gerrit:705448|Add sanity check to newRevisionFromRowAndSlots. (T286877)]] (duration: 00m 57s) |
[production] |
18:53 |
<vgutierrez> |
running puppet and restarting pybal on lvs2009 - T286921 |
[production] |
18:46 |
<topranks> |
Running homer to re-enable port xe-2/0/43 on asw2-a2-codfw (lvs2009) - T286921 |
[production] |
18:46 |
<brennen> |
gerrit1001: restarting gerrit |
[production] |
18:40 |
<vgutierrez> |
stop pybal on lvs2009 - T286921 |
[production] |
18:38 |
<brennen> |
re-enabling puppet on gerrit1001] |
[production] |
18:35 |
<vgutierrez> |
running puppet and restarting pybal on lvs2010 - T286921 |
[production] |
18:27 |
<ryankemper> |
T264053 Deploying fix for timer issue on relforge: `ryankemper@cumin1001:~$ sudo cumin -b 2 'P{relforge*}' 'sudo systemctl stop elasticsearch-disable-readahead.timer && sudo systemctl disable elasticsearch-disable-readahead.timer && rm -fv /etc/systemd/system/elasticsearch-disable-readahead.timer && rm -fv /usr/lib/systemd/system/elasticsearch-disable-readahead.timer && sudo run-puppet-agent'` |
[production] |
18:27 |
<topranks> |
Running homer to re-enable port xe-2/0/44 on asw2-a2-codfw (lvs2010) |
[production] |