2019-08-13 |
11:56 |
<fsero@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'blubberoid' for release 'production' . |
[production] |
11:49 |
<fsero@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'blubberoid' for release 'production' . |
[production] |
11:44 |
<fsero> |
recreating cxserver blubber and sessionstore namespace - T228836 |
[production] |
11:39 |
<fsero@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'mathoid' for release 'production' . |
[production] |
11:35 |
<gehel> |
restart wdqs-blazegraph on wdqs2001 |
[production] |
11:34 |
<gehel> |
restart wdqs-updater on wdqs2001 |
[production] |
11:30 |
<fsero@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'eventgate-main' for release 'main' . |
[production] |
11:29 |
<fsero@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'eventgate-analytics' for release 'analytics' . |
[production] |
11:25 |
<fsero@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'citoid' for release 'production' . |
[production] |
11:21 |
<fsero> |
recreating citoid eventgate-analytics eventgate-main mathoid namespace - T228836 |
[production] |
11:20 |
<fsero@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'termbox' for release 'production' . |
[production] |
11:18 |
<raynor> |
EU SWAT finished |
[production] |
11:15 |
<pmiazga@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:529925|Undeploy editor gender surveys (T227793)]] (duration: 00m 48s) |
[production] |
11:13 |
<fsero> |
recreating termbox namespace - T228836 |
[production] |
11:06 |
<oblivian@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'zotero' for release 'production' . |
[production] |
11:04 |
<fsero> |
resetting net.netfilter.nf_conntrack_tcp_timeout_time_wait to 65 in kubernetes2006 |
[production] |
10:59 |
<_joe_> |
[eqiad] downtiming zotero on icinga for 10 minutes while recreating the deployment with helmfile |
[production] |
10:57 |
<oblivian@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
10:57 |
<oblivian@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:56 |
<oblivian@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
10:56 |
<oblivian@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:49 |
<oblivian@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'coredns' . |
[production] |
10:44 |
<oblivian@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'coredns' . |
[production] |
10:39 |
<oblivian@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'rbac-deploy-clusterrole' . |
[production] |
10:39 |
<_joe_> |
recreating rbac roles via helmfile |
[production] |
10:32 |
<oblivian@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
10:29 |
<_joe_> |
deleting calico deploy and configmap in kubernetes in eqiad, recreating with helmfile |
[production] |
10:25 |
<jbond42> |
rolling update of ghostscript |
[production] |
10:23 |
<fsero@puppetmaster1001> |
conftool action : set/pooled=false; selector: dnsdisc=sessionstore|citoid|cxserver|eventgate-analytics|eventgate-main|termbox|blubberoid|mathoid|zotero,name=eqiad |
[production] |
10:10 |
<fsero> |
initialize_cluster.sh kube-system kubemaster.svc.eqiad.wmnet 6443 - T228836 |
[production] |
10:10 |
<fsero> |
creating tiller in kube-system for helmfile T228836 |
[production] |
09:58 |
<vgutierrez> |
upgrading the rest of cache@upload to 8.0.3-1wm3 - T221594 |
[production] |
08:49 |
<marostegui> |
Stop MySQL on db2057 - T230394 |
[production] |
08:48 |
<marostegui> |
Remove db2057 from tendril and zarcillo T230394 |
[production] |
07:55 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Remove db2057 from config T230394 (duration: 00m 47s) |
[production] |
07:54 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-codfw.php: Remove db2057 from config T230394 (duration: 00m 48s) |
[production] |
06:59 |
<volans> |
upgrading spicerack to 0.0.26 on cumin2001 |
[production] |
06:49 |
<vgutierrez> |
Rolling restart of fifo-log-demux and atsmtail services across cache@upload |
[production] |
06:38 |
<vgutierrez> |
upgrading fifo-log-demux to version 0.5 in cache@upload |
[production] |
06:11 |
<vgutierrez> |
Upgrading ATS to 8.0.3-1wm3 in cp2002, cp1076, cp3034 and cp4021 - T221594 |
[production] |
05:47 |
<marostegui> |
Stop mysql on db2050 - T230391 |
[production] |
05:40 |
<marostegui> |
Remove db2050 from tendril and zarcillo T230391 |
[production] |
05:35 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Remove db2050 from config, host will be decommissioned T230391', diff saved to https://phabricator.wikimedia.org/P8904 and previous config saved to /var/cache/conftool/dbconfig/20190813-053514-marostegui.json |
[production] |
05:33 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-codfw.php: Remove db2050 from config T230391 (duration: 00m 48s) |
[production] |
05:32 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Remove db2050 from config T230391 (duration: 00m 48s) |
[production] |
05:12 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Provision db2122 into s7 T228969 (duration: 00m 47s) |
[production] |
05:11 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-codfw.php: Provision db2122 into s7 T228969 (duration: 00m 49s) |
[production] |
05:10 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Provision db2122 into s7 T228969', diff saved to https://phabricator.wikimedia.org/P8903 and previous config saved to /var/cache/conftool/dbconfig/20190813-051019-marostegui.json |
[production] |