2021-03-01
ยง
|
21:52 |
<mstyles@deploy1002> |
Started deploy [wikimedia/discovery/analytics@ca2c5b5]: import commons ttl dag fix (T270103) |
[production] |
21:49 |
<mutante> |
deploy1002 - removed scap-global-lock, unlocked scap |
[production] |
21:43 |
<phamhi> |
rebooted clouddb1013 for maintenance |
[production] |
21:38 |
<mutante> |
cumin 'mw*' 'grep master_rsync /etc/scap.cfg' showed all mw servers are now using deploy1002 (T265963) |
[production] |
21:30 |
<shdubsh> |
completed removal of kafka logging inputs to legacy logstash cluster - T234854 |
[production] |
21:18 |
<mutante> |
mw1262 - running puppet to switch to new deployment server, scap pull |
[production] |
21:16 |
<effie> |
pooling mw1262 back |
[production] |
21:08 |
<mutante> |
[mwdebug1001:~] $ /usr/local/lib/nagios/plugins/check_mw_versions --deployhost deploy1002.eqiad.wmnet - OKAY: wikiversions in sync (T265963) |
[production] |
21:05 |
<mutante> |
re-enabling puppet on deploy1001 - running puppet on deploy*, switching eqiad scap master and deployment_server globally (T265963) |
[production] |
20:37 |
<mutante> |
deploy1001 - disable puppet and manually create scap-global-lock - NO DEPLOYMENTS |
[production] |
20:35 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1029.eqiad.wmnet |
[production] |
20:28 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host mc1029.eqiad.wmnet |
[production] |
20:28 |
<effie> |
upgrade mc1029, mc2029 to memcached 1.6 |
[production] |
19:55 |
<urbanecm@deploy1001> |
Synchronized wmf-config/config/hrwiki.yaml: d53834e: Enable Growth features on hrwiki in stealth modeEnable Growth features on hrwiki in stealth mode (3/3; T275684) (duration: 00m 54s) |
[production] |
19:54 |
<urbanecm@deploy1001> |
sync-file aborted: d53834e: Enable Growth features on hrwiki in stealth modeEnable Growth features on hrwiki in stealth mode (3/3; T275684) (duration: 00m 03s) |
[production] |
19:53 |
<urbanecm@deploy1001> |
Synchronized dblists/growthexperiments.dblist: d53834e: Enable Growth features on hrwiki in stealth modeEnable Growth features on hrwiki in stealth mode (2/3; T275684) (duration: 00m 56s) |
[production] |
19:52 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: d53834e: Enable Growth features on hrwiki in stealth modeEnable Growth features on hrwiki in stealth mode (1/3; T275684) (duration: 00m 55s) |
[production] |
19:41 |
<tgr@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:666842|EventLoggingSchemas: Bump HomepageVisit version (T275615)]] (duration: 00m 56s) |
[production] |
19:34 |
<phuedx@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:667680|Revert "Revert "vector: Stage 2 of WVUI search treatment A/B test"" (T249297)]] (duration: 00m 54s) |
[production] |
19:20 |
<pt1979@cumin2001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:02 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: 599b7390c840388d97dc4cdbf1796451d4024c22: Simplify deployment of Growth team features (3/3; T276091) (duration: 01m 00s) |
[production] |
19:01 |
<urbanecm@deploy1001> |
Synchronized wmf-config/CommonSettings.php: de0f74126eddafb5375b853d543b377e78544caa: Simplify deployment of Growth team features (2/3; T276091) (duration: 00m 57s) |
[production] |
18:56 |
<pt1979@cumin2001> |
START - Cookbook sre.dns.netbox |
[production] |
18:54 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: e991806eb9dc5ec018ebc59832d02e8a6563ba0a: Simplify deployment of Growth team features (1/3; T276091) (duration: 00m 57s) |
[production] |
18:42 |
<mutante> |
mwmaint2002.mgmt - racadm serveraction powerup |
[production] |
18:26 |
<ryankemper> |
[Relforge] Lifting downtime on `relforge1004` now that T275658 is done |
[production] |
18:24 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1307.eqiad.wmnet |
[production] |
18:24 |
<mutante> |
mw1307 - back to stretch now |
[production] |
18:22 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1307.eqiad.wmnet |
[production] |
18:20 |
<mutante> |
mwmaint2002 - shutting down for maintenance |
[production] |
18:12 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1098.eqiad.wmnet with reason: REIMAGE |
[production] |
18:10 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1098.eqiad.wmnet with reason: REIMAGE |
[production] |
18:03 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on mwmaint2002.codfw.wmnet with reason: new install |
[production] |
18:03 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on mwmaint2002.codfw.wmnet with reason: new install |
[production] |
18:00 |
<volans@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1002.eqiad.wmnet with reason: REIMAGE |
[production] |
17:59 |
<mutante> |
puppetmaster1001 - generating mcrouter cert for mwmaint2002 T275905 |
[production] |
17:58 |
<volans@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1002.eqiad.wmnet with reason: REIMAGE |
[production] |
17:43 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1307.eqiad.wmnet with reason: REIMAGE |
[production] |
17:41 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1307.eqiad.wmnet with reason: REIMAGE |
[production] |
17:17 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1307.eqiad.wmnet |
[production] |
17:07 |
<mutante> |
our latest Wikipedia language edition ready to move on from the incubator https://tay.wikipedia.org |
[production] |
17:05 |
<mutante> |
new Wikimedia project language - tay - Atayal is spoken by the Atayal people of Taiwan |
[production] |
17:03 |
<jayme@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'mathoid' for release 'production' . |
[production] |
16:38 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1097.eqiad.wmnet with reason: REIMAGE |
[production] |
16:35 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1097.eqiad.wmnet with reason: REIMAGE |
[production] |
16:20 |
<jayme@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'mathoid' for release 'production' . |
[production] |
15:57 |
<jayme@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'mathoid' for release 'staging' . |
[production] |
15:11 |
<vgutierrez> |
rolling restart of ats-tls on cp[5007-5011] |
[production] |
14:49 |
<marostegui> |
Failover m3 proxy back to dbproxy1020 |
[production] |
14:32 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1030.eqiad.wmnet |
[production] |