2020-10-29
§
|
22:04 |
<mutante> |
scandium - puppet disabled again (but only until tomorrow), downtimed in Icinga, for ongoing parsoid tests from testreduce1001 |
[production] |
22:03 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
22:03 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
21:33 |
<legoktm> |
published docker-registry.tools.wmflabs.org/toolbeta-test image (T265681) |
[tools] |
21:10 |
<bstorm> |
Added another ingress node to k8s cluster in case the load spikes are the problem T266506 |
[tools] |
20:52 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
20:50 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
20:23 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
20:17 |
<herron@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
20:09 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
20:08 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
20:06 |
<robh@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
20:06 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
20:06 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
20:06 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
19:31 |
<cdanis@cumin1001> |
END (PASS) - Cookbook sre.network.cf (exit_code=0) |
[production] |
19:31 |
<cdanis@cumin1001> |
START - Cookbook sre.network.cf |
[production] |
19:25 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
19:25 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
19:22 |
<Urbanecm> |
Start of `mwscript extensions/AbuseFilter/maintenance/updateVarDumps.php --wiki=$wiki --print-orphaned-records-to=/tmp/urbanecm/$wiki-orphaned.log --progress-markers > $wiki.log` in a tmux session on mwmaint1002 (wiki=ukwiki; T246539) |
[production] |
19:13 |
<Amir1> |
rolling restart of ores uwsgi |
[production] |
19:00 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
18:58 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
18:16 |
<herron@cumin1001> |
END (ERROR) - Cookbook sre.ganeti.makevm (exit_code=97) |
[production] |
18:13 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Enable WikiLove on hewikiquote (T266744) (duration: 00m 57s) |
[production] |
18:09 |
<herron@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
18:07 |
<herron@cumin1001> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) |
[production] |
18:07 |
<herron@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
18:06 |
<herron@cumin1001> |
END (ERROR) - Cookbook sre.ganeti.makevm (exit_code=97) |
[production] |
18:06 |
<herron@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
18:06 |
<Urbanecm> |
[urbanecm@deploy1001 /srv/mediawiki-staging (master * u=)]$ sudo /usr/local/sbin/fix-staging-perms |
[production] |
18:05 |
<Urbanecm> |
[urbanecm@mwmaint1002 ~]$ mwscript extensions/WikimediaMaintenance/createExtensionTables.php --wiki=hewikiquote wikilove # T266744 |
[production] |
18:04 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: b7eaaab81e1665c478f5dc1fdb495e36c53e7863: [cswiki] Set wgGEHomepageManualAssignmentMentorsList to Wikipedie:Potřebuji pomoc/Mentoři/Manuální (T245639) (duration: 00m 57s) |
[production] |
17:50 |
<Phantom873> |
restarted CVNBot5 in #cvn-meta |
[cvn] |
17:49 |
<Phantom873> |
restarted CVNBot15 |
[cvn] |
17:48 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.network.cf (exit_code=0) |
[production] |
17:48 |
<ayounsi@cumin1001> |
START - Cookbook sre.network.cf |
[production] |
17:33 |
<Operator873> |
restarted CVNBot6-10 and 19 |
[cvn] |
17:33 |
<bstorm> |
hard rebooting tools-sgeexec-0905 and tools-sgeexec-0916 to get the grid back to full capacity |
[tools] |
17:33 |
<Phantom873> |
restarted CVNBot6-10 and 19 |
[cvn] |
17:29 |
<hashar> |
Restarted CI Jenkins a bit ago |
[production] |
17:24 |
<andrewbogott> |
signing pending puppet certs for deployment-mediawiki-07.deployment-prep.eqiad1.wikimedia.cloud and deployment-mediawiki-09.deployment-prep.eqiad1.wikimedia.cloud |
[deployment-prep] |
17:24 |
<andrewbogott> |
signing pending puppet certs for deployment-mediawiki-07.deployment-prep.eqiad1.wikimedia.cloud and deployment-mediawiki-09.deployment-prep.eqiad1.wikimedia.cloud |
[releng] |
17:23 |
<andrewbogott> |
signing pending puppet certs for deployment-kafka* nodes |
[releng] |
17:23 |
<andrewbogott> |
signing pending puppet certs for deployment-kafka* nodes |
[deployment-prep] |
17:15 |
<hashar> |
CI: killed all java agents (java upgrade) |
[production] |
17:12 |
<hashar> |
Stopping CI Jenkins |
[production] |
16:59 |
<XioNoX> |
Delete cr1-eqiad:ae2.1120 and related static routes - T265288 |
[production] |
16:57 |
<bstorm> |
silenced deployment-prep project alerts for 60 days since the downtime expired |
[admin] |
16:57 |
<James_F> |
Zuul: Add CI for node-rdkafka-factory T266058 |
[releng] |