751-800 of 10000 results (31ms)
2020-12-16 §
10:35 <jbond42> reboot rpki2001 [production]
10:35 <jbond@cumin1001> START - Cookbook sre.hosts.reboot-single [production]
10:35 <jbond@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1001.eqiad.wmnet with reason: REIMAGE [production]
10:34 <jbond@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) [production]
10:30 <jbond42> reboot rpki1001 [production]
10:30 <jbond@cumin1001> START - Cookbook sre.hosts.reboot-single [production]
10:05 <gehel@cumin1001> END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) [production]
10:02 <jbond@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1001.eqiad.wmnet with reason: REIMAGE [production]
10:00 <jbond@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1001.eqiad.wmnet with reason: REIMAGE [production]
09:49 <godog> swift eqiad-prod: add weight to ms-be106[0-3] - T268435 [production]
09:32 <_joe_> reset-failed for docker report jobs on deneb, failed because of a registry gateway timeout [production]
09:31 <dcaro> removing invalid backups from cloudvirt1024 (196 in total) (T269419) [admin]
09:29 <elukey> force execution of cumin-check-aliases.service on cumin[12]001 hosts to clear alarms [production]
08:35 <gehel@cumin1001> START - Cookbook sre.wdqs.data-transfer [production]
08:23 <vgutierrez> acme-chief and acme-chief-api restarts for openssl upgrades (CVE-2020-1971) [production]
08:13 <joal> Manually push updated pageview whitelist to HDFS [analytics]
07:55 <gehel> depool wdqs1005 (catching up on lag) [production]
07:20 <marostegui> Stop mysql on db2142 to clone db1151 - T269324 [production]
06:31 <wm-bot> <samwilson> Deployed new version. T250344. [tools.extjsonuploader]
2020-12-15 §
23:47 <dduvall@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
23:45 <dduvall@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
23:34 <dduvall@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'blubberoid' for release 'staging' . [production]
22:10 <mholloway-shell@deploy1001> Synchronized wmf-config/InitialiseSettings.php: WikimediaEvents: Promote SessionTick to group1 T248987 (duration: 01m 04s) [production]
20:29 <marxarelli> group0 to 1.36.0-wmf.22 complete. no new errors or concerning rates (refs T267415) [production]
20:26 <tzatziki> reset email for User:Cnk1220 [production]
20:24 <joal> Kill restart webrequest_load oozie job after deploy [analytics]
20:06 <dduvall@deploy1001> rebuilt and synchronized wikiversions files: group0 wikis to 1.36.0-wmf.22 [production]
19:43 <joal> Deploy refinery onto HDFS [analytics]
19:32 <joal@deploy1001> Finished deploy [analytics/refinery@2202db5] (thin): Regular analytics weekly train - THIN [analytics/refinery@2202db5] (duration: 00m 08s) [production]
19:32 <joal@deploy1001> Started deploy [analytics/refinery@2202db5] (thin): Regular analytics weekly train - THIN [analytics/refinery@2202db5] [production]
19:31 <joal@deploy1001> Finished deploy [analytics/refinery@2202db5]: Regular analytics weekly train [analytics/refinery@2202db5] (duration: 16m 36s) [production]
19:14 <joal> Scap deploy refinery [analytics]
19:14 <joal@deploy1001> Started deploy [analytics/refinery@2202db5]: Regular analytics weekly train [analytics/refinery@2202db5] [production]
19:11 <htz> Added the following wikis to CVNBot6: smnwiki, skrwiki, skrwiktionary, eowikivoyage and wa.wikisource [cvn]
18:48 <dduvall@deploy1001> Pruned MediaWiki: 1.36.0-wmf.20 (duration: 04m 19s) [production]
18:41 <dduvall@deploy1001> Finished scap: testwikis wikis to 1.36.0-wmf.22 (duration: 46m 41s) [production]
18:26 <joal> Release refinery-source v0.0.141 [analytics]
18:05 <marxarelli> reloading zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/649441 [releng]
18:01 <marxarelli> deploying 2 new jenkins jobs for https://gerrit.wikimedia.org/r/c/integration/config/+/649441 [releng]
17:55 <dduvall@deploy1001> Started scap: testwikis wikis to 1.36.0-wmf.22 [production]
17:43 <Reedy> quarry-worker-02 `systemctl restart uwsgi-quarry-web.service` again, after pulling patch for T270195 [quarry]
17:43 <Reedy> quarry-worker-01 `systemctl restart uwsgi-quarry-web.service` again, after pulling patch for T270195 [quarry]
17:40 <Reedy> quarry-web-01 `systemctl restart uwsgi-quarry-web.service` again, after pulling patch for T270195 [quarry]
17:31 <Reedy> quarry-web-01 `systemctl restart uwsgi-quarry-web.service` [quarry]
17:26 <Reedy> `find /tmp -type f -mtime +30 -delete;` on quarry-web-01 T270198 [quarry]
17:23 <Reedy> `apt-get clean && apt-get autoclean` on quarry-web-01 T270198 [quarry]
16:47 <ottomata> bumped eventate-main memory limits from 300M to 600M - T249745 [production]
16:47 <otto@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . [production]
16:47 <otto@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . [production]
16:45 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1265.eqiad.wmnet with reason: REIMAGE [production]