2751-2800 of 10000 results (66ms)
2019-08-13 §
11:34 <gehel> restart wdqs-updater on wdqs2001 [production]
11:30 <fsero@> helmfile [EQIAD] Ran 'apply' command on namespace 'eventgate-main' for release 'main' . [production]
11:29 <fsero@> helmfile [EQIAD] Ran 'apply' command on namespace 'eventgate-analytics' for release 'analytics' . [production]
11:25 <fsero@> helmfile [EQIAD] Ran 'apply' command on namespace 'citoid' for release 'production' . [production]
11:21 <fsero> recreating citoid eventgate-analytics eventgate-main mathoid namespace - T228836 [production]
11:20 <fsero@> helmfile [EQIAD] Ran 'apply' command on namespace 'termbox' for release 'production' . [production]
11:18 <raynor> EU SWAT finished [production]
11:15 <pmiazga@deploy1001> Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:529925|Undeploy editor gender surveys (T227793)]] (duration: 00m 48s) [production]
11:13 <fsero> recreating termbox namespace - T228836 [production]
11:06 <oblivian@> helmfile [EQIAD] Ran 'apply' command on namespace 'zotero' for release 'production' . [production]
11:04 <fsero> resetting net.netfilter.nf_conntrack_tcp_timeout_time_wait to 65 in kubernetes2006 [production]
10:59 <_joe_> [eqiad] downtiming zotero on icinga for 10 minutes while recreating the deployment with helmfile [production]
10:57 <oblivian@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
10:57 <oblivian@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:56 <oblivian@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
10:56 <oblivian@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:49 <oblivian@> helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'coredns' . [production]
10:44 <oblivian@> helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'coredns' . [production]
10:39 <oblivian@> helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'rbac-deploy-clusterrole' . [production]
10:39 <_joe_> recreating rbac roles via helmfile [production]
10:32 <oblivian@> helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'calico-policy-controller' . [production]
10:29 <_joe_> deleting calico deploy and configmap in kubernetes in eqiad, recreating with helmfile [production]
10:25 <jbond42> rolling update of ghostscript [production]
10:23 <fsero@puppetmaster1001> conftool action : set/pooled=false; selector: dnsdisc=sessionstore|citoid|cxserver|eventgate-analytics|eventgate-main|termbox|blubberoid|mathoid|zotero,name=eqiad [production]
10:10 <fsero> initialize_cluster.sh kube-system kubemaster.svc.eqiad.wmnet 6443 - T228836 [production]
10:10 <fsero> creating tiller in kube-system for helmfile T228836 [production]
09:58 <vgutierrez> upgrading the rest of cache@upload to 8.0.3-1wm3 - T221594 [production]
08:49 <marostegui> Stop MySQL on db2057 - T230394 [production]
08:48 <marostegui> Remove db2057 from tendril and zarcillo T230394 [production]
07:55 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Remove db2057 from config T230394 (duration: 00m 47s) [production]
07:54 <marostegui@deploy1001> Synchronized wmf-config/db-codfw.php: Remove db2057 from config T230394 (duration: 00m 48s) [production]
06:59 <volans> upgrading spicerack to 0.0.26 on cumin2001 [production]
06:49 <vgutierrez> Rolling restart of fifo-log-demux and atsmtail services across cache@upload [production]
06:38 <vgutierrez> upgrading fifo-log-demux to version 0.5 in cache@upload [production]
06:11 <vgutierrez> Upgrading ATS to 8.0.3-1wm3 in cp2002, cp1076, cp3034 and cp4021 - T221594 [production]
05:47 <marostegui> Stop mysql on db2050 - T230391 [production]
05:40 <marostegui> Remove db2050 from tendril and zarcillo T230391 [production]
05:35 <marostegui@cumin1001> dbctl commit (dc=all): 'Remove db2050 from config, host will be decommissioned T230391', diff saved to https://phabricator.wikimedia.org/P8904 and previous config saved to /var/cache/conftool/dbconfig/20190813-053514-marostegui.json [production]
05:33 <marostegui@deploy1001> Synchronized wmf-config/db-codfw.php: Remove db2050 from config T230391 (duration: 00m 48s) [production]
05:32 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Remove db2050 from config T230391 (duration: 00m 48s) [production]
05:12 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Provision db2122 into s7 T228969 (duration: 00m 47s) [production]
05:11 <marostegui@deploy1001> Synchronized wmf-config/db-codfw.php: Provision db2122 into s7 T228969 (duration: 00m 49s) [production]
05:10 <marostegui@cumin1001> dbctl commit (dc=all): 'Provision db2122 into s7 T228969', diff saved to https://phabricator.wikimedia.org/P8903 and previous config saved to /var/cache/conftool/dbconfig/20190813-051019-marostegui.json [production]
2019-08-12 §
23:24 <XioNoX> add samplicator to buster-wikimedia repo [production]
21:33 <eileen> tools revision changed from 2a56e5e283 to 827ce3750e [production]
20:43 <eileen> civicrm revision changed from be5b5a150b to 569e52e23d, config revision is 1c76e94ac3 [production]
20:17 <mbsantos@deploy1001> Finished deploy [mobileapps/deploy@615004f]: Update service-mobileapp-node to f0a2847 (duration: 05m 05s) [production]
20:12 <mbsantos@deploy1001> Started deploy [mobileapps/deploy@615004f]: Update service-mobileapp-node to f0a2847 [production]
20:08 <gehel@cumin2001> END (ERROR) - Cookbook sre.elasticsearch.rolling-reboot (exit_code=97) [production]
19:15 <mforns@deploy1001> Finished deploy [analytics/refinery@5418d3b]: deploying analytics-refinery up to 5418d3be5f65f7325324d0c15c51b3ca722dde1c (duration: 39m 23s) [production]