2021-05-17
§
|
05:05 |
<kormat> |
Starting s6 eqiad failover from db1131 to db1173 - T282124 |
[production] |
04:53 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1112.eqiad.wmnet with reason: REIMAGE |
[production] |
04:50 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1112.eqiad.wmnet with reason: REIMAGE |
[production] |
04:46 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'Set db1173 with weight 0 T282124', diff saved to https://phabricator.wikimedia.org/P15976 and previous config saved to /var/cache/conftool/dbconfig/20210517-044657-kormat.json |
[production] |
04:46 |
<kormat@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 26 hosts with reason: Master switchover s6 T282124 |
[production] |
04:46 |
<kormat@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 26 hosts with reason: Master switchover s6 T282124 |
[production] |
04:35 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1112 T280492', diff saved to https://phabricator.wikimedia.org/P15975 and previous config saved to /var/cache/conftool/dbconfig/20210517-043551-marostegui.json |
[production] |
04:31 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1124', diff saved to https://phabricator.wikimedia.org/P15974 and previous config saved to /var/cache/conftool/dbconfig/20210517-043148-marostegui.json |
[production] |
02:10 |
<legoktm> |
uninstalled python3-dbg on lists1001 |
[production] |
01:31 |
<legoktm> |
restarted mailman3-web |
[production] |
00:13 |
<legoktm> |
installing python3-dbg on lists1001 |
[production] |
2021-05-16
§
|
22:45 |
<Urbanecm> |
[urbanecm@mwmaint1002 ~]$ mwscript extensions/WikimediaMaintenance/createExtensionTables.php --wiki=tawiki wikilove # T280326 |
[production] |
20:46 |
<legoktm> |
restarted mailman3-web |
[production] |
19:58 |
<Krinkle> |
deployment-mediawiki11$ apt-get install memkeys |
[releng] |
19:38 |
<legoktm> |
restarted mailman3-web |
[production] |
18:14 |
<wm-bot> |
<lucaswerkmeister> deployed 8784dddb07 (batch mode, rank per individual statement) |
[tools.ranker] |
17:29 |
<Amir1> |
restart mailman3-web |
[production] |
16:52 |
<Majavah> |
clear error state from tools-sgeexec-0905 tools-sgeexec-0907 tools-sgeexec-0936 tools-sgeexec-0941 |
[tools] |
13:44 |
<wm-bot> |
<lucaswerkmeister> deployed 72ec33e6f2 (minor improvements) |
[tools.ranker] |
09:29 |
<Majavah> |
fix labs/private merge conflicts on deployment-puppetmaster04 |
[releng] |
02:39 |
<legoktm> |
restarting mailman3-web on lists1001 again |
[production] |
00:53 |
<legoktm> |
restarted mailman3-web on lists1001, uwsgi looked like it got stuck, consuming all CPU/memory |
[production] |
2021-05-15
§
|
21:51 |
<James_F> |
Zuul: [mediawiki/tools/cli] Make mw-cli-test experimental for now T248779 |
[releng] |
21:40 |
<James_F> |
Zuul: [mediawiki/tools/cli] Add new bespoke job T248779 |
[releng] |
21:09 |
<James_F> |
Zuul: [mediawiki/extensions/MediaWikiAuth] Mark repo as archived T282955 |
[releng] |
18:57 |
<wm-bot> |
<lucaswerkmeister> deployed 93d904cb7e (batch mode, list+collective version) |
[tools.ranker] |
18:38 |
<James_F> |
Zuul: Temporarily remove TwoColConflict from gated extensions T234002 T282935. |
[releng] |
14:01 |
<wm-bot> |
<lucaswerkmeister> tool should be back up (uwsgi.log went from 181M to 77M after moving pre-2021 data to separate files) |
[tools.lexeme-forms] |
13:56 |
<wm-bot> |
<lucaswerkmeister> briefly stopping tool (few minutes) to cycle the uwsgi.log |
[tools.lexeme-forms] |
12:33 |
<Amir1> |
set fr_quality to 0 for all revisions on several wikis (T279761) |
[production] |
10:57 |
<Majavah> |
deploying https://phabricator.wikimedia.org/R2073:18138e67e4143572a96ffdeaab13ab627c4636b1 |
[tools.openstack-browser] |
09:30 |
<Majavah> |
create deployment-logstash04 to install elk7 |
[releng] |
09:22 |
<Majavah> |
beta: cherry-pick https://gerrit.wikimedia.org/r/c/operations/puppet/+/688315, remove cherry-pick for https://gerrit.wikimedia.org/r/c/operations/puppet/+/683837 T277990 |
[releng] |
07:52 |
<Majavah> |
set profile::wmcs::kubeadm::control::apiserver_cert_alternative_names hiera key and adjust config map T262562 |
[toolsbeta] |
06:54 |
<Amir1> |
migrating most of last mailing lists of T280322 |
[production] |
06:23 |
<Majavah> |
cherry pick https://gerrit.wikimedia.org/r/c/operations/puppet/+/691494/ T281986 |
[releng] |
2021-05-14
§
|
21:30 |
<Krinkle> |
Delete now-unreadable unread echo notifications from deploymentwiki and clear cache badge count cache (echo_unread_wikis: 9892 rows affected, Echo/maintenance/recomputeNotifCounts.php), T198673 |
[releng] |
21:10 |
<Krinkle> |
Delete beta cluster commonswiki.globalusage data for deploymentwiki, T198673, https://wikitech.wikimedia.org/wiki/Delete_a_wiki (86 rows affected) |
[releng] |
21:09 |
<Krinkle> |
Delete beta cluster centralauth rows relating to deploymentwiki, T198673, https://wikitech.wikimedia.org/wiki/Delete_a_wiki (12600 rows affected) |
[releng] |
20:51 |
<Krinkle> |
I broke beta `InvalidArgumentException: mcrouter-with-onhost-tier not present in $wgObjectCaches` - working on it |
[releng] |
20:42 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts people1002.eqiad.wmnet |
[production] |
20:32 |
<mutante> |
people1002 - decom'ing - please use people1003 and see list mail |
[production] |
20:31 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts people1002.eqiad.wmnet |
[production] |
19:18 |
<bstorm> |
adjusting the rate limits for bastions nfs_write upward a lot to make NFS writes faster now that the cluster is finally using 10Gb on the backend and frontend T218338 |
[tools] |
18:58 |
<cdanis@cumin1001> |
END (PASS) - Cookbook sre.network.cf (exit_code=0) |
[production] |
18:58 |
<cdanis@cumin1001> |
START - Cookbook sre.network.cf |
[production] |
18:39 |
<cdanis> |
✔️ cdanis@install1003.wikimedia.org ~ 🕝☕ sudo systemctl restart squid.service |
[production] |
18:14 |
<mutante> |
people1003/people2002: awk -F: '$6 ~ "^\/home" {print $1,$6}' /etc/passwd | while read line ; do user=${line% *}; dir=${line#* }; sudo mkdir -p ${dir}/public_html; sudo chown $user ${dir}/public_html; done (courtesy of Jbond) |
[production] |
17:49 |
<bblack> |
install1003 - restored normal resolv.conf + re-enabled+ran puppet |
[production] |
17:41 |
<bblack> |
install1003 - restart squid |
[production] |