2020-03-16
10:28 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool es1015', diff saved to https://phabricator.wikimedia.org/P10703 and previous config saved to /var/cache/conftool/dbconfig/20200316-102829-marostegui.json |
[production] |
10:17 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool es1015', diff saved to https://phabricator.wikimedia.org/P10702 and previous config saved to /var/cache/conftool/dbconfig/20200316-101707-marostegui.json |
[production] |
10:10 |
<marostegui> |
Stop mysql for upgrade on es1015 T239791 |
[production] |
10:02 |
<Amir1> |
start of ladsgroup@mwmaint1002:~$ mwscript extensions/Wikibase/repo/maintenance/rebuildItemTerms.php --wiki=wikidatawiki --batch-size=50 --sleep=0 --file=15march2217-holes-nulls.list on screen (T219123) |
[production] |
09:32 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool es1015 for upgrade and restart T239791', diff saved to https://phabricator.wikimedia.org/P10701 and previous config saved to /var/cache/conftool/dbconfig/20200316-093228-marostegui.json |
[production] |
09:30 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Promote es1011 to es2 master, this is a NOOP T239791', diff saved to https://phabricator.wikimedia.org/P10700 and previous config saved to /var/cache/conftool/dbconfig/20200316-093048-marostegui.json |
[production] |
08:15 |
<marostegui> |
Review and enable events on recently migrated 10.4 hosts - T247728 |
[production] |
08:02 |
<ema> |
cp4025 restart trafficserver-tls to clear 'tls process restarted' alert T241593 T185968 |
[production] |
07:57 |
<moritzm> |
installing libxslt security updates |
[production] |
07:52 |
<ema> |
cp4025: restart varnish-fe to clear 'child restarted' alert T185968 |
[production] |
07:47 |
<moritzm> |
installing lxml security updates |
[production] |
07:14 |
<moritzm> |
installing libgd2 security updates on jessie |
[production] |
06:54 |
<moritzm> |
removing some library packages from jessie/stretch after labstore1006/1007 dist-upgrade to buster |
[production] |
06:38 |
<_joe_> |
restart envoy with 10 requests per connection on mw2231, T247484 |
[production] |
2020-03-13
23:12 |
<bstorm_> |
rebooting labstore1006 for upgrade to stretch T224583 |
[production] |
22:49 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
22:45 |
<herron@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
22:27 |
<bstorm_> |
rebooting labstore1006 T224583 |
[production] |
22:21 |
<bstorm_> |
downtimed labstore1006 for upgrades T224583 |
[production] |
20:02 |
<mutante> |
stat1005 - ip link set en01 down ; ip link set en01 up (T247561) |
[production] |
19:30 |
<bstorm_> |
rebooting labstore1007 for upgrade to buster T224583 |
[production] |
18:51 |
<shdubsh> |
test increase fs.inotify.max_user_watches on prometheus2004 |
[production] |
17:58 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . |
[production] |
17:21 |
<mutante> |
removed squid from install1002/install2002 (formerly webproxy.(eqiad|codfw).wmnet until 2 days ago, replaced by install1003/install2003) T224576 |
[production] |
17:20 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.kafka.roll-restart-mirror-maker (exit_code=0) |
[production] |
17:09 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . |
[production] |
17:08 |
<elukey@cumin1001> |
START - Cookbook sre.kafka.roll-restart-mirror-maker |
[production] |
17:00 |
<krinkle@deploy1001> |
Synchronized dblists/: If4d17082f, Iadba5b01b, Ibe16d5f09 (duration: 01m 07s) |
[production] |
16:58 |
<krinkle@deploy1001> |
Synchronized wmf-config/config/: Ibe16d5f09 (duration: 01m 10s) |
[production] |
16:51 |
<bstorm_> |
rebooting labstore1007 for stretch upgrade T224583 |
[production] |
16:37 |
<krinkle@deploy1001> |
Synchronized wmf-config/config/: If4d17082f, Iadba5b01b (duration: 01m 11s) |
[production] |
16:18 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
16:15 |
<herron@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
16:04 |
<bstorm_> |
rebooting labstore1007 for first cycle of upgrades T224583 |
[production] |
16:02 |
<elukey> |
powercycle kafka-jumbo1006 after switch port changed - T247561 |
[production] |
15:28 |
<_joe_> |
switch envoy logging to debug on mw2231 |
[production] |
14:57 |
<cdanis> |
T247586 ✔️ cdanis@grafana1002.eqiad.wmnet ~ 🕥☕ sudo systemctl restart apache2.service |
[production] |
12:48 |
<Urbanecm> |
Password reset for SUL User:FuduBot (T247601) |
[production] |
12:16 |
<akosiaris@deploy1001> |
Synchronized wmf-config/ProductionServices.php: (no justification provided) (duration: 01m 16s) |
[production] |
10:26 |
<moritzm> |
installing python-werkzeug security updates |
[production] |
10:09 |
<vgutierrez> |
upload trafficserver 8.0.6-1wm3 to apt.wm.o (buster) - T245616 |
[production] |
09:55 |
<_joe_> |
running puppet across appservers to switch to http for eventgate-analytics T247484 |
[production] |
09:17 |
<moritzm> |
installing perl updates from Stretch point release |
[production] |
06:16 |
<vgutierrez> |
triggering OCSP response updates in eqiad,codfw and ulsfo - T247584 |
[production] |