2020-09-30
ยง
|
11:33 |
<arturo> |
disabling puppet and downtiming every virt/net server in the fleet in preparation for merging https://gerrit.wikimedia.org/r/c/operations/puppet/+/631167 (T262979) |
[admin] |
11:27 |
<arturo> |
syncing facts from puppetmaster1001 |
[puppet-diffs] |
11:26 |
<arturo> |
trying a simple `webservice restart` |
[tools.sal] |
11:24 |
<arturo> |
tool webservice detected to be misbehaving, several uncaught exceptions in the source code |
[tools.sal] |
11:21 |
<nikerabbit@deploy1001> |
Synchronized wmf-config/CommonSettings.php: Config: [[gerrit:627744|Enable Special:TranslationStats (T263004)]] (duration: 00m 59s) |
[production] |
11:06 |
<effie> |
disable puppet on P:mediawiki::mcrouter_wancache for 630845 - T244340 |
[production] |
10:57 |
<moritzm> |
installing librsvg security updates |
[production] |
10:47 |
<hnowlan@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'api-gateway' for release 'staging' . |
[production] |
10:47 |
<hnowlan@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'api-gateway' for release 'production' . |
[production] |
10:44 |
<hnowlan@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'api-gateway' for release 'production' . |
[production] |
10:44 |
<hnowlan@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'api-gateway' for release 'staging' . |
[production] |
10:34 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'api-gateway' for release 'staging' . |
[production] |
10:34 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'api-gateway' for release 'production' . |
[production] |
10:24 |
<jayme@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'citoid' for release 'production' . |
[production] |
10:21 |
<jayme@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'citoid' for release 'production' . |
[production] |
10:07 |
<kormat> |
deploying schema change to s4/eqiad T259831 |
[production] |
10:07 |
<kormat@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:07 |
<kormat@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:59 |
<jayme@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'citoid' for release 'staging' . |
[production] |
09:50 |
<jayme> |
imported envoyproxy 1.15.1 to buster-wikimedia component/envoy-future - T264157 |
[production] |
09:32 |
<arturo> |
rebooting cloudvirt1012 to investigate linuxbridge agent issues |
[admin] |
09:12 |
<gehel@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:10 |
<gehel@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:45 |
<kormat> |
deploying schema change to s7/eqiad T259831 |
[production] |
08:45 |
<kormat@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:45 |
<kormat@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:08 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Remove es2016 from dbctl T264156', diff saved to https://phabricator.wikimedia.org/P12853 and previous config saved to /var/cache/conftool/dbconfig/20200930-080817-marostegui.json |
[production] |
08:06 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'termbox' for release 'production' . |
[production] |
08:00 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'termbox' for release 'production' . |
[production] |
07:56 |
<akosiaris> |
upgrade termbox to latest chart, fixing various prometheus-statsd-export configuration minor issues. |
[production] |
07:56 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . |
[production] |
07:55 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'test' . |
[production] |
07:44 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Promote db1131 on s6 eqiad master T263227, also give weight to db1093 as new API host', diff saved to https://phabricator.wikimedia.org/P12852 and previous config saved to /var/cache/conftool/dbconfig/20200930-074417-marostegui.json |
[production] |
07:41 |
<marostegui> |
Starting s6 eqiad failover from db1093 to db1131 - T263227 |
[production] |
07:29 |
<elukey> |
execute "alter table superset_production.alerts drop key ix_alerts_active;" on db1108's analytics-meta instance to fix replication after Superset upgrade - T262162 |
[analytics] |
07:18 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Set db1131 with weight 0 T263227', diff saved to https://phabricator.wikimedia.org/P12851 and previous config saved to /var/cache/conftool/dbconfig/20200930-071841-marostegui.json |
[production] |
07:05 |
<marostegui> |
Stop mysql on es2016 before decommissioning T264156 |
[production] |
07:04 |
<elukey> |
superset upgraded to 0.37.2 on analytics-tool1004 - T262162 |
[analytics] |
07:01 |
<elukey@deploy1001> |
Finished deploy [analytics/superset/deploy@7bdc414]: Upgrade to 0.37.2 (duration: 00m 49s) |
[production] |
07:00 |
<elukey@deploy1001> |
Started deploy [analytics/superset/deploy@7bdc414]: Upgrade to 0.37.2 |
[production] |
06:58 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool es2016 T264156', diff saved to https://phabricator.wikimedia.org/P12850 and previous config saved to /var/cache/conftool/dbconfig/20200930-065838-marostegui.json |
[production] |
06:21 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) |
[production] |
06:19 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers |
[production] |
06:10 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2082', diff saved to https://phabricator.wikimedia.org/P12849 and previous config saved to /var/cache/conftool/dbconfig/20200930-061036-marostegui.json |
[production] |
06:10 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2082', diff saved to https://phabricator.wikimedia.org/P12848 and previous config saved to /var/cache/conftool/dbconfig/20200930-061005-marostegui.json |
[production] |
06:07 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2085:3318', diff saved to https://phabricator.wikimedia.org/P12847 and previous config saved to /var/cache/conftool/dbconfig/20200930-060754-marostegui.json |
[production] |
06:07 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2085:3318', diff saved to https://phabricator.wikimedia.org/P12846 and previous config saved to /var/cache/conftool/dbconfig/20200930-060705-marostegui.json |
[production] |
05:47 |
<elukey> |
"PURGE BINARY LOGS BEFORE '2020-09-22 00:00:00';" on an-coord1001's mariadb - T264081 |
[analytics] |
05:43 |
<marostegui> |
Remove es2019 from tendril and zarcillo T264063 |
[production] |
05:40 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |