2020-05-21
ยง
|
22:40 |
<bd808> |
Rebuilding all Docker containers for tools-webservice 0.70 (T252700) |
[tools] |
22:36 |
<bd808> |
Updated tools-webservice to 0.70 across instances (T252700) |
[tools] |
22:29 |
<bd808> |
Building tools-webservice 0.70 via wmcs-package-build.py |
[tools] |
22:22 |
<ZI_Jony> |
quit SULWatcher and SULWatcher2 on #cvn-unifications |
[cvn] |
22:14 |
<bd808> |
Building tools-webservice 0.70 via wmcs-package-build.py |
[toolsbeta] |
21:46 |
<eileen> |
civicrm revision changed from ed4c9522ac to b658fd8233, config revision is 9babae3954 |
[production] |
21:25 |
<wm-bot> |
<maurelio> "webservice --backend=kubernetes --canonical python3.7 start" refs. T253346 |
[tools.ldap] |
21:24 |
<wm-bot> |
<maurelio> refs. T253346 |
[tools.ldap] |
21:20 |
<wm-bot> |
<maurelio> Stopping webservice for T253346 |
[tools.ldap] |
21:10 |
<foks> |
removing two files for legal compliance |
[production] |
20:44 |
<bstorm_> |
labstore1005 is now running stretch and drbd devices are resyncing after several reboots and some significant effort T224582 |
[production] |
19:23 |
<andrewbogott> |
disabling puppet on cloudbackup2001 to prevent the backup job from starting during maintenance |
[admin] |
19:16 |
<andrewbogott> |
systemctl disable block_sync-tools-project.service on cloudbackup2001.codfw.wmnet to avoid stepping on current upgrade |
[admin] |
18:24 |
<twentyafterfour> |
restarting phabricator on phab1001 to deploy https://phabricator.wikimedia.org/rPHEX2687d08786a9dadcbaa96709de991f471f239830 |
[production] |
17:24 |
<elukey> |
add druid100[7,8] to the druid public cluster (not serving load balancer traffic for the moment, only joining the cluster) - T252771 |
[analytics] |
17:24 |
<bblack> |
anycast experiment done, all back to normal |
[production] |
17:20 |
<bblack> |
anycast experimentation commencing in ulsfo (test route withdrawal)... |
[production] |
17:04 |
<bstorm_> |
starting labstore1005 upgrades T224582 |
[production] |
16:44 |
<elukey> |
roll restart druid historical nodes on druid100[4-6] (public cluster) to pick up new settings - T252771 |
[analytics] |
16:42 |
<Reedy> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/597825 |
[releng] |
16:34 |
<Reedy> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/597820 |
[releng] |
16:19 |
<James_F> |
Zuul: [mediawiki/extensions/Bootstrap] Switch down to quibble-composer for now. |
[releng] |
16:14 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
16:12 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
16:04 |
<Urbanecm> |
Restart StewardBot |
[tools.stewardbots] |
16:04 |
<sbassett@deploy1001> |
Synchronized private/PrivateSettings.php: Update mitigations for T250887 (duration: 01m 08s) |
[production] |
16:01 |
<Urbanecm> |
Investigating StewardBot's outage |
[tools.stewardbots] |
15:55 |
<Reedy> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/597810 |
[releng] |
15:48 |
<andrewbogott> |
rebuilding cloudnet1003.eqiad.wmnet with Debian Buster for T253124 |
[production] |
15:48 |
<andrewbogott> |
re-imaging cloudnet1003 with Buster |
[admin] |
15:23 |
<ZI_Jony> |
staff restarted CVNBot21 on #cvn-mediawiki |
[cvn] |
15:22 |
<XioNoX> |
Add BGP between cr1/2-eqiad and authdns1001 - T253196 |
[production] |
15:09 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
15:09 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:08 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:08 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:07 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:07 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:59 |
<dzahn@cumin1001> |
conftool action : set/pooled=inactive; selector: name=mw217[0-2].codfw.wmnet |
[production] |
14:59 |
<dzahn@cumin1001> |
conftool action : set/pooled=inactive; selector: name=mw216[0-9].codfw.wmnet |
[production] |
14:58 |
<dzahn@cumin1001> |
conftool action : set/pooled=inactive; selector: name=mw215[8-9].codfw.wmnet |
[production] |
14:53 |
<bstorm_> |
adding the hiera values to horizon for bootstrapping k8s T211096 |
[paws] |
14:50 |
<bblack@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:47 |
<bblack@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:44 |
<akosiaris@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'mathoid' for release 'canary' . |
[production] |
14:39 |
<arturo> |
point record `k8s.svc.paws.eqiad1.wikimedia.cloud` to `172.16.1.186` (which is paws-k8s-control-1, for the initial bootstrap) (T211096) |
[paws] |
14:33 |
<akosiaris> |
upload helmfile 0.109.0 to apt.wikimedia.org/buster-wikimedia and stretch-wikimedia, component main |
[production] |
14:02 |
<elukey> |
restart druid kafka supervisor for wmf_netflow after maintenance |
[analytics] |
13:53 |
<elukey> |
restart druid-historical on an-druid100[1,2] to pick up new settings |
[analytics] |
13:51 |
<ZI_Jony> |
restarted Cubbie on #cvn-commons-uploads |
[cvn] |