2021-03-01
ยง
|
21:05 |
<mutante> |
re-enabling puppet on deploy1001 - running puppet on deploy*, switching eqiad scap master and deployment_server globally (T265963) |
[production] |
20:37 |
<mutante> |
deploy1001 - disable puppet and manually create scap-global-lock - NO DEPLOYMENTS |
[production] |
20:35 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1029.eqiad.wmnet |
[production] |
20:28 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host mc1029.eqiad.wmnet |
[production] |
20:28 |
<effie> |
upgrade mc1029, mc2029 to memcached 1.6 |
[production] |
20:12 |
<andrewbogott> |
removing novaadmin from all projects save 'admin' for T274385 |
[admin] |
19:55 |
<urbanecm@deploy1001> |
Synchronized wmf-config/config/hrwiki.yaml: d53834e: Enable Growth features on hrwiki in stealth modeEnable Growth features on hrwiki in stealth mode (3/3; T275684) (duration: 00m 54s) |
[production] |
19:54 |
<urbanecm@deploy1001> |
sync-file aborted: d53834e: Enable Growth features on hrwiki in stealth modeEnable Growth features on hrwiki in stealth mode (3/3; T275684) (duration: 00m 03s) |
[production] |
19:53 |
<urbanecm@deploy1001> |
Synchronized dblists/growthexperiments.dblist: d53834e: Enable Growth features on hrwiki in stealth modeEnable Growth features on hrwiki in stealth mode (2/3; T275684) (duration: 00m 56s) |
[production] |
19:52 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: d53834e: Enable Growth features on hrwiki in stealth modeEnable Growth features on hrwiki in stealth mode (1/3; T275684) (duration: 00m 55s) |
[production] |
19:51 |
<andrewbogott> |
removing novaobserver from all projects save 'observer' for T274385 |
[admin] |
19:50 |
<andrewbogott> |
adding inherited domain-wide roles to novaadmin and novaobserver as per T274385 |
[admin] |
19:41 |
<tgr@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:666842|EventLoggingSchemas: Bump HomepageVisit version (T275615)]] (duration: 00m 56s) |
[production] |
19:34 |
<phuedx@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:667680|Revert "Revert "vector: Stage 2 of WVUI search treatment A/B test"" (T249297)]] (duration: 00m 54s) |
[production] |
19:20 |
<pt1979@cumin2001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:02 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: 599b7390c840388d97dc4cdbf1796451d4024c22: Simplify deployment of Growth team features (3/3; T276091) (duration: 01m 00s) |
[production] |
19:01 |
<urbanecm@deploy1001> |
Synchronized wmf-config/CommonSettings.php: de0f74126eddafb5375b853d543b377e78544caa: Simplify deployment of Growth team features (2/3; T276091) (duration: 00m 57s) |
[production] |
18:56 |
<pt1979@cumin2001> |
START - Cookbook sre.dns.netbox |
[production] |
18:54 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: e991806eb9dc5ec018ebc59832d02e8a6563ba0a: Simplify deployment of Growth team features (1/3; T276091) (duration: 00m 57s) |
[production] |
18:42 |
<mutante> |
mwmaint2002.mgmt - racadm serveraction powerup |
[production] |
18:26 |
<ryankemper> |
[Relforge] Lifting downtime on `relforge1004` now that T275658 is done |
[production] |
18:25 |
<marxarelli> |
deleting unused docker-registry-uploader jenkins credential |
[releng] |
18:24 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1307.eqiad.wmnet |
[production] |
18:24 |
<mutante> |
mw1307 - back to stretch now |
[production] |
18:22 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1307.eqiad.wmnet |
[production] |
18:20 |
<mutante> |
mwmaint2002 - shutting down for maintenance |
[production] |
18:14 |
<razzi> |
restart timer that wasn't running on an-worker1101: sudo systemctl restart prometheus-debian-version-textfile.timer |
[analytics] |
18:12 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1098.eqiad.wmnet with reason: REIMAGE |
[production] |
18:10 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1098.eqiad.wmnet with reason: REIMAGE |
[production] |
18:03 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on mwmaint2002.codfw.wmnet with reason: new install |
[production] |
18:03 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on mwmaint2002.codfw.wmnet with reason: new install |
[production] |
18:00 |
<volans@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1002.eqiad.wmnet with reason: REIMAGE |
[production] |
17:59 |
<mutante> |
puppetmaster1001 - generating mcrouter cert for mwmaint2002 T275905 |
[production] |
17:58 |
<volans@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1002.eqiad.wmnet with reason: REIMAGE |
[production] |
17:43 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1307.eqiad.wmnet with reason: REIMAGE |
[production] |
17:41 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1307.eqiad.wmnet with reason: REIMAGE |
[production] |
17:40 |
<elukey> |
reimage an-worker1098 (GPU worker node) to Buster |
[analytics] |
17:17 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1307.eqiad.wmnet |
[production] |
17:07 |
<mutante> |
our latest Wikipedia language edition ready to move on from the incubator https://tay.wikipedia.org |
[production] |
17:05 |
<mutante> |
new Wikimedia project language - tay - Atayal is spoken by the Atayal people of Taiwan |
[production] |
17:03 |
<jayme@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'mathoid' for release 'production' . |
[production] |
16:38 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1097.eqiad.wmnet with reason: REIMAGE |
[production] |
16:35 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1097.eqiad.wmnet with reason: REIMAGE |
[production] |
16:20 |
<jayme@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'mathoid' for release 'production' . |
[production] |
15:57 |
<jayme@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'mathoid' for release 'staging' . |
[production] |
15:11 |
<vgutierrez> |
rolling restart of ats-tls on cp[5007-5011] |
[production] |
14:49 |
<marostegui> |
Failover m3 proxy back to dbproxy1020 |
[production] |
14:48 |
<elukey> |
reimage an-worker1097 (gpu node) to debian buster |
[analytics] |
14:41 |
<andrewbogott> |
changed profile::redis::multidc::discovery from 'false' to "" to comply with strict typing in the deployment-memc puppet prefix. |
[deployment-prep] |
14:41 |
<andrewbogott> |
changed profile::redis::multidc::discovery from 'false' to "" to comply with strict typing in the deployment-memc puppet prefix. |
[releng] |