2020-05-21
|
09:28 |
<mutante> |
deneb - sudo systemctl reset-failed to clear Icinga alerts about systemd degraded state |
[production] |
09:12 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1143 and db1091', diff saved to https://phabricator.wikimedia.org/P11266 and previous config saved to /var/cache/conftool/dbconfig/20200521-091245-marostegui.json |
[production] |
09:01 |
<mutante> |
LDAP - added lmata to wmf group (T253277) |
[production] |
08:55 |
<XioNoX> |
Advertise Anycast 198.35.27.0/24 from esams - T253196 |
[production] |
08:52 |
<XioNoX> |
Advertise Anycast 198.35.27.0/24 from eqsin - T253196 |
[production] |
08:49 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Pool db1143 with minimal weight for the first time T252512', diff saved to https://phabricator.wikimedia.org/P11265 and previous config saved to /var/cache/conftool/dbconfig/20200521-084933-marostegui.json |
[production] |
08:47 |
<XioNoX> |
Advertise Anycast 198.35.27.0/24 from eqiad/eqord - T253196 |
[production] |
08:42 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Add db1143 to the list of s4 hosts, depooled - T252512', diff saved to https://phabricator.wikimedia.org/P11264 and previous config saved to /var/cache/conftool/dbconfig/20200521-084226-marostegui.json |
[production] |
08:34 |
<XioNoX> |
Advertise Anycast 198.35.27.0/24 from dfw - T253196 |
[production] |
08:27 |
<XioNoX> |
Advertise Anycast 198.35.27.0/24 from ulsfo - T253196 |
[production] |
08:20 |
<XioNoX> |
Delete ARIN route object for 198.35.26.0/23 - T253196 |
[production] |
08:13 |
<XioNoX> |
Delete ROA for 198.35.26.0/23 - T253196 |
[production] |
08:10 |
<XioNoX> |
repool ulsfo - T253196 |
[production] |
08:03 |
<XioNoX> |
Shrink ulsfo's 198.35.26.0/23 to 198.35.26.0/24 - T253196 |
[production] |
07:29 |
<XioNoX> |
depool ulsfo - T253196 |
[production] |
07:22 |
<marostegui> |
Purge events from tendril.global_status_log older than 24h - T252331 |
[production] |
07:03 |
<jynus@cumin1001> |
dbctl commit (dc=all): 'Repool es1019 fully', diff saved to https://phabricator.wikimedia.org/P11263 and previous config saved to /var/cache/conftool/dbconfig/20200521-070335-jynus.json |
[production] |
06:58 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1091 - T252512', diff saved to https://phabricator.wikimedia.org/P11261 and previous config saved to /var/cache/conftool/dbconfig/20200521-065858-marostegui.json |
[production] |
06:28 |
<jynus@cumin1001> |
dbctl commit (dc=all): 'Repool es1019 with 50% weight', diff saved to https://phabricator.wikimedia.org/P11260 and previous config saved to /var/cache/conftool/dbconfig/20200521-062823-jynus.json |
[production] |
06:04 |
<vgutierrez> |
pool cp5012 - T251219 |
[production] |
05:42 |
<jynus@cumin1001> |
dbctl commit (dc=all): 'Repool es1019 with low weight', diff saved to https://phabricator.wikimedia.org/P11259 and previous config saved to /var/cache/conftool/dbconfig/20200521-054231-jynus.json |
[production] |
05:03 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Set enwiki as read-only=off after maintenance T251982', diff saved to https://phabricator.wikimedia.org/P11258 and previous config saved to /var/cache/conftool/dbconfig/20200521-050328-marostegui.json |
[production] |
05:00 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Set enwiki as read-only for maintenance T251982', diff saved to https://phabricator.wikimedia.org/P11257 and previous config saved to /var/cache/conftool/dbconfig/20200521-050029-marostegui.json |
[production] |
01:03 |
<krinkle@deploy1001> |
Synchronized wmf-config/mc.php: Ic9efa98312b (duration: 01m 08s) |
[production] |
2020-05-20
|
20:16 |
<herron> |
logstash1011:~# kafka-preferred-replica-election --zookeeper conf1004.eqiad.wmnet,conf1005.eqiad.wmnet,conf1006.eqiad.wmnet/kafka/logging-eqiad |
[production] |
19:27 |
<robh> |
cp5012 still offline for mem tests, "fast" testing complete without errors and extended testing in progress. system firmware was updated before testing. T251219 |
[production] |
18:10 |
<XioNoX> |
accept 198.35.27.0/24 from Anycast peers on all routers - T253196 |
[production] |
18:01 |
<XioNoX> |
add BGP between authdns2001 and cr1-codfw - T253196 |
[production] |
17:57 |
<XioNoX> |
accept 198.35.27.0/24 from Anycast peers on cr3-ulsfo - T253196 |
[production] |
17:44 |
<robh> |
cp5012 rebooting for troubleshooting |
[production] |
17:02 |
<bblack> |
dns* + authdns* - disabling puppet to test https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/597311/ |
[production] |
16:53 |
<bblack> |
kraz.wikimedia.org ( https://wikitech.wikimedia.org/wiki/IRCD ) - stopping ircecho then ircd, then restarting them in reverse order - T239993 |
[production] |
16:01 |
<jayme@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'mathoid' for release 'production' . |
[production] |
16:01 |
<jayme@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'mathoid' for release 'canary' . |
[production] |
15:42 |
<elukey> |
update puppet compiler's facts |
[production] |
15:21 |
<moritzm> |
installing libssh security updates |
[production] |
15:15 |
<jayme@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'mathoid' for release 'staging' . |
[production] |
15:00 |
<jforrester@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: T253096 [itwikivoyage] Undeploy Insider and Listings extensions (duration: 01m 08s) |
[production] |
14:43 |
<marostegui> |
Replace tendril_purge_global_status_log_5m event with the new one (purging every 2d of data and with a higher limit of rows) - T252331 |
[production] |
14:34 |
<hnowlan@deploy1001> |
Finished deploy [restbase/deploy@6d2f88c]: Add awa.wikipedia.org to wikipedia list (duration: 19m 49s) |
[production] |
14:15 |
<hnowlan@deploy1001> |
Started deploy [restbase/deploy@6d2f88c]: Add awa.wikipedia.org to wikipedia list |
[production] |
14:06 |
<XioNoX> |
special-ranges6, remove 4000::/2 and 8000::/1 |
[production] |
14:03 |
<bblack> |
authdns1001 - poweroff for T241770 |
[production] |
14:00 |
<bblack> |
cr2-eqiad - re-routing ns[01] public IPs from authdns1001 (going offline for hw work) to dns1002 - T241770 (redo from earlier, commit didn't take for whatever reason) |
[production] |
13:52 |
<bblack> |
cr[12]-eqiad - re-routing ns[01] public IPs from authdns1001 (going offline for hw work) to dns1002 - T241770 |
[production] |
13:51 |
<bblack> |
authdns1001 - downtimed for physical work - T241770 |
[production] |
13:24 |
<milimetric@deploy1001> |
Finished deploy [analytics/refinery@a891999] (thin): Regular analytics weekly train THIN [analytics/refinery@a891999] (duration: 00m 10s) |
[production] |
13:23 |
<milimetric@deploy1001> |
Started deploy [analytics/refinery@a891999] (thin): Regular analytics weekly train THIN [analytics/refinery@a891999] |
[production] |
13:23 |
<milimetric@deploy1001> |
Finished deploy [analytics/refinery@a891999]: Regular analytics weekly train [analytics/refinery@a891999] (duration: 38m 33s) |
[production] |
13:23 |
<godog> |
remove stale tcp service on lvs codfw low-traffic 10.2.1.53:10902 |
[production] |