2020-09-03
|
07:02 |
<marostegui> |
Stop db2100:3317 and db2121 in sync to reload metawiki.content T261869 |
[production] |
07:01 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2121 T261869', diff saved to https://phabricator.wikimedia.org/P12445 and previous config saved to /var/cache/conftool/dbconfig/20200903-070104-marostegui.json |
[production] |
06:56 |
<hashar> |
contint2001: restarting CI Jenkins |
[production] |
06:56 |
<oblivian@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'mobileapps' for release 'staging'. |
[production] |
06:56 |
<_joe_> |
deployment of mobileapps to pick up changes to envoy config, new helmfile layout |
[production] |
06:51 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Fully repool db2120 T261869', diff saved to https://phabricator.wikimedia.org/P12444 and previous config saved to /var/cache/conftool/dbconfig/20200903-065105-marostegui.json |
[production] |
06:48 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db2120 T261869', diff saved to https://phabricator.wikimedia.org/P12443 and previous config saved to /var/cache/conftool/dbconfig/20200903-064804-marostegui.json |
[production] |
06:46 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db2120 T261869', diff saved to https://phabricator.wikimedia.org/P12442 and previous config saved to /var/cache/conftool/dbconfig/20200903-064623-marostegui.json |
[production] |
06:43 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db2120 T261869', diff saved to https://phabricator.wikimedia.org/P12441 and previous config saved to /var/cache/conftool/dbconfig/20200903-064334-marostegui.json |
[production] |
06:24 |
<marostegui> |
Disconnect eqiad -> codfw replication |
[production] |
2020-09-02
|
22:55 |
<shdubsh> |
restart rsyslog on centrallog[12]001 |
[production] |
22:27 |
<ryankemper> |
`sudo cumin -b10 'P{wdqs2*} and not A:wdqs-test and not A:wdqs-internal' "sudo systemctl restart wdqs-blazegraph.service"` |
[production] |
22:26 |
<ryankemper> |
Puppet finished on all external wdqs codfw nodes, nginx automatically reloaded as intended |
[production] |
22:24 |
<ryankemper> |
`sudo cumin -b10 'P{wdqs2*} and not A:wdqs-test and not A:wdqs-internal' "sudo run-puppet-agent"` |
[production] |
21:47 |
<bd808@deploy1001> |
Finished deploy [striker/deploy@3c2090a]: Deploying r20200902 tag (T198114, T223610, T245804, T144111, T261810) (duration: 01m 34s) |
[production] |
21:46 |
<bd808@deploy1001> |
Started deploy [striker/deploy@3c2090a]: Deploying r20200902 tag (T198114, T223610, T245804, T144111, T261810) |
[production] |
21:10 |
<ryankemper> |
`sudo cumin -b10 'P{wdqs2*} and not A:wdqs-test and not A:wdqs-internal' "sudo systemctl restart wdqs-blazegraph.service"` |
[production] |
21:10 |
<ryankemper> |
`sudo cumin -b10 'P{wdqs2*} and not A:wdqs-test and not A:wdqs-internal' "sudo systemctl restart nginx.service"` |
[production] |
21:02 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
21:01 |
<ryankemper> |
Restarted nginx on `wdqs2007` |
[production] |
21:00 |
<cmjohnson@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
20:47 |
<ryankemper> |
restarted blazegraph on `wdqs2001` as well |
[production] |
20:46 |
<ryankemper> |
`sudo cumin -b10 'P{wdqs2*} and not A:wdqs-test and not A:wdqs-internal and not P{wdqs2001.codfw.wmnet}' "sudo systemctl restart wdqs-blazegraph.service"` (restarted everything but 2001, will restart 2001 next) |
[production] |
20:02 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:57 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
19:26 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:24 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
19:23 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:20 |
<robh> |
scs-c1-eqiad firmware update complete and back online T238036 |
[production] |
19:14 |
<robh> |
updating firmware on scs-c1-eqiad via T238036 |
[production] |
19:14 |
<urbanecm@deploy1001> |
Synchronized private/PrivateSettings.php: Revert "Update T250887 mitigations" (duration: 00m 32s) |
[production] |
19:14 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
19:12 |
<robh> |
updating firmware on scs-c1-eqiad via T238036 |
[production] |
19:09 |
<Urbanecm> |
21:08 <+logmsgbot> !log urbanecm@deploy1001 Synchronized private/PrivateSettings.php: Update T250887 mitigations (duration: 00m 54s) |
[production] |
19:08 |
<urbanecm@deploy1001> |
Synchronized private/PrivateSettings.php: Update T250887 mitigations (duration: 00m 54s) |
[production] |
18:58 |
<herron> |
freeing some disk space on centrallog1001 with 'tune2fs -m 0 /dev/centrallog1001-vg/data' |
[production] |
18:43 |
<ppchelko@deploy1001> |
Synchronized wmf-config/CommonSettings.php: gerrit:622898 Install OAuthRateLimiter III: Install where enabled, ouch, forgot to rebase (duration: 00m 55s) |
[production] |
18:40 |
<ppchelko@deploy1001> |
Synchronized wmf-config/CommonSettings.php: gerrit:622898 Install OAuthRateLimiter III: Install where enabled (duration: 00m 55s) |
[production] |
18:38 |
<ottomata> |
execute kafka topics --alter --topic codfw.resource_change --partitions 3 and kafka topics --alter --topic eqiad.resource_change --partitions 3 on kafka jumbo-eqiad (for consistency with main) - T261865 |
[production] |
18:37 |
<ottomata> |
execute kafka topics --alter --topic codfw.resource_change --partitions 3 and kafka topics --alter --topic eqiad.resource_change --partitions 3 on kafka main-codfw - T261865 |
[production] |
18:36 |
<ppchelko@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: gerrit:622897 Install OAuthRateLimiter extension II: Add flag to IS (duration: 00m 56s) |
[production] |
18:34 |
<ottomata> |
execute kafka topics --alter --topic codfw.resource_change --partitions 3 and kafka topics --alter --topic eqiad.resource_change --partitions 3 on kafka main-eqiad - T261865 |
[production] |
18:33 |
<ppchelko@deploy1001> |
Synchronized wmf-config/extension-list: (no justification provided) (duration: 00m 54s) |
[production] |
18:32 |
<ottomata> |
execute kafka topics --alter --topic codfw.resource-purge --partitions 3 and kafka topics --alter --topic eqiad.resource-purge --partitions 3 on kafka jumbo-eqiad (for consistency with main) - T261865 |
[production] |
18:28 |
<ppchelko@deploy1001> |
Synchronized php-1.36.0-wmf.6/extensions/DiscussionTools/: Backport [[gerrit:623561|Fix parsing localised digits in PHP discussion parser]] (duration: 00m 56s) |
[production] |
18:19 |
<ppchelko@deploy1001> |
Synchronized php-1.36.0-wmf.6/extensions/DiscussionTools/: Backport [[gerrit:623560|Re-apply new reply API patches (again)]] (duration: 00m 58s) |
[production] |
17:34 |
<bstorm> |
re-enabled puppet on labsdb10[09-12] |
[production] |
17:28 |
<bstorm> |
disabled puppet on labsdb10[09-12] |
[production] |
17:18 |
<herron> |
restarted elasticsearch on logstash1012 |
[production] |
16:39 |
<Pchelolo> |
creating oauth_ratelimit_client_tier table T258711 |
[production] |