2023-02-07
§
|
09:22 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on bast2002.wikimedia.org with reason: host reimage |
[production] |
09:20 |
<akosiaris@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/changeprop: sync |
[production] |
09:20 |
<akosiaris@deploy1002> |
helmfile [codfw] START helmfile.d/services/changeprop: sync |
[production] |
09:20 |
<akosiaris> |
add wiktionary to mobile-sections rerenders. T226931 |
[production] |
09:19 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on bast2002.wikimedia.org with reason: host reimage |
[production] |
09:19 |
<akosiaris@deploy1002> |
helmfile [staging] DONE helmfile.d/services/changeprop: sync |
[production] |
09:19 |
<akosiaris@deploy1002> |
helmfile [staging] START helmfile.d/services/changeprop: sync |
[production] |
09:08 |
<elukey@cumin1001> |
START - Cookbook sre.kafka.roll-restart-brokers for Kafka A:kafka-logging-codfw cluster: Roll restart of jvm daemons. |
[production] |
09:02 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reimage for host bast2002.wikimedia.org with OS bullseye |
[production] |
08:50 |
<vgutierrez> |
rolling upgrade to HAProxy 2.4.21 in cp nodes |
[production] |
08:48 |
<kostajh> |
UTC morning deploys done |
[production] |
08:48 |
<kharlan@deploy1002> |
Finished scap: Backport for [[gerrit:883236|[Growth] Remove mentor list variables (T321501)]], [[gerrit:883153|Remove GEMentorProvider (T321501)]] (duration: 12m 48s) |
[production] |
08:37 |
<kharlan@deploy1002> |
urbanecm and kharlan: Backport for [[gerrit:883236|[Growth] Remove mentor list variables (T321501)]], [[gerrit:883153|Remove GEMentorProvider (T321501)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
08:35 |
<kharlan@deploy1002> |
Started scap: Backport for [[gerrit:883236|[Growth] Remove mentor list variables (T321501)]], [[gerrit:883153|Remove GEMentorProvider (T321501)]] |
[production] |
08:30 |
<moritzm> |
installing imagemagick security updates on Thumbor T328901 |
[production] |
08:28 |
<kharlan@deploy1002> |
Finished scap: Backport for [[gerrit:886343|GrowthExperiments: Disable leveling up features in production (T328757)]] (duration: 12m 11s) |
[production] |
08:18 |
<kharlan@deploy1002> |
kharlan: Backport for [[gerrit:886343|GrowthExperiments: Disable leveling up features in production (T328757)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
08:16 |
<kharlan@deploy1002> |
Started scap: Backport for [[gerrit:886343|GrowthExperiments: Disable leveling up features in production (T328757)]] |
[production] |
08:14 |
<kharlan@deploy1002> |
backport aborted: (duration: 00m 07s) |
[production] |
07:00 |
<marostegui> |
Failover m3 from db1159 to db1164 - T328404 |
[production] |
06:31 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2110 in API', diff saved to https://phabricator.wikimedia.org/P43758 and previous config saved to /var/cache/conftool/dbconfig/20230207-063147-root.json |
[production] |
06:28 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1187', diff saved to https://phabricator.wikimedia.org/P43757 and previous config saved to /var/cache/conftool/dbconfig/20230207-062826-root.json |
[production] |
04:58 |
<mwpresync@deploy1002> |
Pruned MediaWiki: 1.40.0-wmf.20 (duration: 02m 20s) |
[production] |
04:55 |
<mwpresync@deploy1002> |
Finished scap: testwikis wikis to 1.40.0-wmf.22 refs T325585 (duration: 53m 11s) |
[production] |
04:02 |
<mwpresync@deploy1002> |
Started scap: testwikis wikis to 1.40.0-wmf.22 refs T325585 |
[production] |
2023-02-06
§
|
23:17 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host mw2421.mgmt.codfw.wmnet with reboot policy FORCED |
[production] |
23:01 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.provision for host mw2421.mgmt.codfw.wmnet with reboot policy FORCED |
[production] |
22:55 |
<ryankemper> |
T327925 Depooled codfw wdqs hosts: `ryankemper@cumin2002:~$ sudo -E cumin -b 3 'wdqs[2003-2004,2009]*' 'sudo depool'` |
[production] |
22:51 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on 13 hosts with reason: switch upgrade |
[production] |
22:51 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on 13 hosts with reason: switch upgrade |
[production] |
22:48 |
<ryankemper> |
T327925 Banned `elastic[2037-2040,2055-2056,2061-2062,2069,2073-2076]` on codfw elastic |
[production] |
22:42 |
<inflatador> |
bking@cumin2002 banning Elastic nodes from cluster in preparation for T327925 |
[production] |
22:17 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host mw2421.mgmt.codfw.wmnet with reboot policy FORCED |
[production] |
22:10 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.provision for host mw2421.mgmt.codfw.wmnet with reboot policy FORCED |
[production] |
22:08 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host mw2421 |
[production] |
22:07 |
<pt1979@cumin2002> |
START - Cookbook sre.network.configure-switch-interfaces for host mw2421 |
[production] |
22:06 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
22:06 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add mw2421 DNS - pt1979@cumin2002" |
[production] |
22:05 |
<pt1979@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add mw2421 DNS - pt1979@cumin2002" |
[production] |
22:05 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host mw2420.mgmt.codfw.wmnet with reboot policy FORCED |
[production] |
22:01 |
<pt1979@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
22:00 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.provision for host mw2420.mgmt.codfw.wmnet with reboot policy FORCED |
[production] |
19:44 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host mw2420.mgmt.codfw.wmnet with reboot policy FORCED |
[production] |
19:32 |
<zabe@deploy1002> |
say aborted: (duration: 00m 39s) |
[production] |
19:30 |
<zabe@deploy1002> |
backport aborted: (duration: 00m 00s) |
[production] |
19:29 |
<urbanecm> |
[urbanecm@mwmaint1002 ~]$ mwscript resetAuthenticationThrottle.php --wiki=metawiki --signup --ip 92.62.231.190 # T328929 |
[production] |
19:27 |
<zabe@deploy1002> |
backport aborted: (duration: 00m 23s) |
[production] |
19:25 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:886910|Add a new throttle rule (T328929)]] (duration: 07m 43s) |
[production] |
19:18 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:886910|Add a new throttle rule (T328929)]] |
[production] |
19:17 |
<urbanecm@deploy1002> |
backport aborted: (duration: 00m 01s) |
[production] |