2021-07-23
15:45 <elukey@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
15:44 <elukey@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
15:11 <elukey> stop ml-serve-ctrl1001 + gnt-instance modify -t plain ml-serve-ctrl1001.eqiad.wmnet on ganeti1009 + start instance back - T287238 [production]
14:36 <_joe_> rebuilding httpd-fcgi, mediawiki-http fixing logging T285384 [production]
14:16 <brennen> gitlab1001: running ansible to deploy [[gerrit:707236|fix puma exporter listen address]] (T275170) [production]
13:35 <otto@deploy1002> Finished deploy [analytics/refinery@15521b3]: Add property disabling gobblin lock - T271232 (duration: 03m 32s) [production]
13:31 <otto@deploy1002> Started deploy [analytics/refinery@15521b3]: Add property disabling gobblin lock - T271232 [production]
12:16 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on mw[1440-1442].eqiad.wmnet with reason: setup new canary mw api servers in eqiad D8 https://phabricator.wikimedia.org/T279309 [production]
12:16 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on mw[1440-1442].eqiad.wmnet with reason: setup new canary mw api servers in eqiad D8 https://phabricator.wikimedia.org/T279309 [production]
12:15 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on mw1439.eqiad.wmnet with reason: setup new canary mw api servers in eqiad D8 https://phabricator.wikimedia.org/T279309 [production]
12:15 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on mw1439.eqiad.wmnet with reason: setup new canary mw api servers in eqiad D8 https://phabricator.wikimedia.org/T279309 [production]
11:50 <marostegui> Change innodb_checksum_algorithm to full_crc32 on pc1011-1014 and pc2011-2014 - T287244 [production]
11:17 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1446.eqiad.wmnet [production]
11:17 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1445.eqiad.wmnet [production]
11:11 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1443.eqiad.wmnet [production]
11:11 <dzahn@cumin1001> conftool action : set/weight=30; selector: name=mw144[3-6].eqiad.wmnet [production]
11:00 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on mw[1443,1445-1446].eqiad.wmnet with reason: new host [production]
11:00 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on mw[1443,1445-1446].eqiad.wmnet with reason: new host [production]
10:58 <arturo> adding packages to buster-wikimedia/thirdparty/kubeadm-k8s-1-19 @ apt1001 [production]
10:02 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1442.eqiad.wmnet [production]
09:57 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1441.eqiad.wmnet [production]
09:49 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1440.eqiad.wmnet [production]
09:47 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1439.eqiad.wmnet [production]
09:20 <hashar@deploy1002> Finished deploy [integration/docroot@edae2b4]: doc: add footer link to wikitech documentation (duration: 00m 11s) [production]
09:20 <hashar@deploy1002> Started deploy [integration/docroot@edae2b4]: doc: add footer link to wikitech documentation [production]
08:59 <dzahn@cumin1001> conftool action : set/weight=30; selector: name=mw144[0-2].eqiad.wmnet [production]
08:58 <dzahn@cumin1001> conftool action : set/weight=30; selector: name=mw1439.eqiad.wmnet [production]
08:56 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on mw[1439-1442].eqiad.wmnet with reason: new host [production]
08:56 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on mw[1439-1442].eqiad.wmnet with reason: new host [production]
08:24 <elukey> run 'gnt-instance modify -t plain ml-serve-ctrl1002.eqiad.wmnet' on ganeti1009 as test to track down latency/perf issues with kubelets [production]
03:11 <ryankemper> T287223 Installed `nginx-light` on all of `cloudelastic*`, and it looks like `relforge` didn't need the upgrade. This operation is done. [production]
03:09 <ryankemper> T287223 Installed `nginx-light` on all of `elastic1*` (eqiad) [production]
03:06 <ryankemper> T287223 Installed `nginx-light` on all of `elastic2*` (codfw) [production]
02:53 <ejegg> updated Fundraising CiviCRM from 819c11307d to 739c936298 [production]
02:26 <ryankemper> [WDQS] Pooled `wdqs1004` (all caught up on its mountain of lag) [production]
01:28 <ejegg> updated payments-wiki from 844b59ee42 to cc5d14ea7f [production]
01:20 <legoktm> legoktm@deneb:~$ docker rmi docker-registry.wikimedia.org/mwcachedir:0.0.1 # T287222 [production]
2021-07-22
23:35 <derick@deploy1002> Synchronized php-1.37.0-wmf.15/includes/preferences/DefaultPreferencesFactory.php: Backport: [[gerrit:706003|Make sure enable responsive mode UI reflects actual preference value (T285402)]] (duration: 00m 56s) [production]
19:26 <otto@deploy1002> Synchronized wmf-config/InitialiseSettings.php: Finalize several EventLogging -> Event Platform migrations - T282855 T238138 T282562 T271168 (duration: 00m 55s) [production]
19:08 <legoktm@cumin1001> END (PASS) - Cookbook sre.switchdc.mediawiki.00-warmup-caches (exit_code=0) [production]
19:07 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw142[1-2].eqiad.wmnet [production]
19:07 <mutante> mw1421, mw1422 - scap pull, re-pool as new API servers after reimaging, previously appservers [production]
19:06 <legoktm@cumin1001> START - Cookbook sre.switchdc.mediawiki.00-warmup-caches [production]
19:05 <dzahn@cumin1001> conftool action : set/weight=30; selector: name=mw142[1-2].eqiad.wmnet [production]
19:04 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw142[1-2].eqiad.wmnet [production]
19:00 <urbanecm> Start server-side upload for 1 video file (T287061) [production]
18:59 <otto@deploy1002> Finished deploy [analytics/refinery@3115f9e]: Set gobblin job.lock.dir after all - T271232 (duration: 03m 22s) [production]
18:58 <urbanecm> Start server-side upload for 1 video file (T286489) [production]
18:56 <urbanecm> Start server-side upload for 1 video file (T286665) [production]
18:56 <otto@deploy1002> Started deploy [analytics/refinery@3115f9e]: Set gobblin job.lock.dir after all - T271232 [production]