2019-02-15
§
|
10:35 |
<gtirloni> |
reboot cloudvirt1019 |
[production] |
09:44 |
<gehel> |
repool maps100[12] |
[production] |
09:33 |
<moritzm> |
imported php-defaults debs to thirdparty/php72 |
[production] |
08:42 |
<akosiaris> |
restart gerrit to pick up https://gerrit.wikimedia.org/r/490640 T177868 |
[production] |
08:40 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Repool db1109 (duration: 00m 46s) |
[production] |
08:28 |
<moritzm> |
rolling restart of apertium to pick up Python 3.4 security update |
[production] |
07:55 |
<godog> |
bounce prometheus@ops on prometheus2004 to take a snapshot |
[production] |
06:40 |
<marostegui> |
Stop puppet on labsdb1005 to leave "max_user_connections" on my.cnf - T216170 T216208 |
[production] |
06:39 |
<marostegui> |
Restart labsdb1005 with max_user_connections = 20 T216208 |
[production] |
06:17 |
<marostegui> |
Deploy schema change on db1109 - T210713 |
[production] |
06:16 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Depool db1109 (duration: 00m 49s) |
[production] |
06:13 |
<marostegui> |
Reload haproxy on dbproxy11 to repool labsdb1009 |
[production] |
00:39 |
<mutante> |
puppetmaster1001: sudo puppet node clean bast3003.wikimedia.org ; sudo puppet node deactivate bast3003.wikimedia.org (T216199) |
[production] |
00:15 |
<jynus> |
setting labsdb1005 back into read-write |
[production] |
2019-02-14
§
|
23:47 |
<jynus> |
restarting labsdb1005 mysql in read only mode |
[production] |
23:37 |
<niharika29@deploy1001> |
Finished deploy [scholarships/scholarships@25ea138]: Update app with updated dependencies to mitigate PHPMailer error T215302 (duration: 00m 02s) |
[production] |
23:37 |
<niharika29@deploy1001> |
Started deploy [scholarships/scholarships@25ea138]: Update app with updated dependencies to mitigate PHPMailer error T215302 |
[production] |
22:07 |
<andrewbogott> |
rebuilding labvirt1012 as cloudvirt1012, T216190 |
[production] |
20:38 |
<bstorm_> |
Restarted mariadb on labsdb1005 for https://wikitech.wikimedia.org/wiki/Incident_documentation/20190214-labsdb1005 |
[production] |
20:18 |
<thcipriani> |
thcipriani@deploy1001 rebuilt and synchronized wikiversions files: all wikis to 1.33.0-wmf.17 |
[production] |
20:14 |
<thcipriani@deploy1001> |
rebuilt and synchronized wikiversions files: all wikis to 1.33.0-wmf.17 |
[production] |
20:09 |
<ejegg> |
updated fundraising CiviCRM from 02ea871b88 to 165fbf5894 |
[production] |
19:42 |
<thcipriani@deploy1001> |
Synchronized php-1.33.0-wmf.17/extensions/GrowthExperiments/modules/help: SWAT: [[gerrit:490674|Help Panel: Fix IME broken in help panel search]] T216131 (duration: 00m 54s) |
[production] |
19:14 |
<thcipriani@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:487007|Stop NavPopups gadget conflict with PagePreviews on Wikivoyage]] T214878 (duration: 00m 54s) |
[production] |
19:01 |
<mutante> |
scandium - deleting parsoid clone dir and running puppet one more time, to fix permissions to allow wikidev |
[production] |
18:52 |
<mutante> |
scandium - deleting parsoid clone dir and running puppet one more time, to fix permissions to allow wikidev |
[production] |
18:12 |
<mutante> |
scandium - deleting parsoid clone dir and running puppet |
[production] |
18:03 |
<fsero> |
upgrading tiller to 2.12.2 on eqiad |
[production] |
17:34 |
<godog> |
bounce rsyslog on wezen/lithium, tls listener timeout in icinga |
[production] |
16:59 |
<moritzm> |
restarting apertium-apy on scb1001 to pick up Python security update |
[production] |
16:39 |
<marostegui> |
Depool labsdb1009 - T210713 |
[production] |
16:26 |
<fsero> |
upgrading tiller on codfw |
[production] |
16:11 |
<fsero> |
updating tiller version on staging cluster |
[production] |
16:10 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-codfw.php: Repool db2085 - T214840 (duration: 00m 52s) |
[production] |
15:50 |
<fsero> |
building and publishing new tiller docker image on boron |
[production] |
15:50 |
<END> |
(PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) (volans@cumin1001) |
[production] |
15:43 |
<START> |
- Cookbook sre.hosts.upgrade-and-reboot (volans@cumin1001) |
[production] |
15:28 |
<volans> |
upgraded spicerack to v0.0.15 on cumin[12]001 |
[production] |
15:26 |
<volans> |
uploaded spicerack_0.0.15-1_amd64.deb to apt.wikimedia.org stretch-wikimedia |
[production] |
15:12 |
<marostegui> |
Clear idrac logs from db2085 - T214840 |
[production] |
14:45 |
<godog> |
depool and stop logstash1009 for stretch reimage - T213898 |
[production] |
14:20 |
<marostegui> |
Stop MySQL on db2085 for on-site maintenance - T214840 |
[production] |
14:12 |
<jijiki> |
Enabling puppet on thumbor* servers - T214597 |
[production] |
13:39 |
<arturo> |
T215892 icinga downtime cloudvirt1024 for 2 weeks |
[production] |
12:22 |
<zeljkof> |
EU SWAT finished |
[production] |
12:21 |
<zfilipin@deploy1001> |
Synchronized php-1.33.0-wmf.17/extensions/ExternalGuidance/: SWAT: [[gerrit:490523|Fix the eventlogging schema definition as per manifest_version=2]] (duration: 00m 55s) |
[production] |
11:43 |
<_joe_> |
restarting hhvm on mw1338, hot tc exhausted T216084 |
[production] |
11:04 |
<_joe_> |
upgrading python3-etcd on stretch T209136 |
[production] |
11:03 |
<jbond42> |
rolling security updates for curl |
[production] |
11:02 |
<jijiki> |
Disabling puppet on thumbor* servers - T214597 |
[production] |