2021-05-03
§
|
09:38 |
<joal> |
Drop already sqooped data to restart jobs |
[analytics] |
09:12 |
<dcaro> |
draining and rebooting coludvirt1021 (T280641) |
[admin] |
09:12 |
<joal@deploy1002> |
Started deploy [analytics/refinery@584ed6a] (hadoop-test): Hotfix analytics deploy (monthly sqoop) HADOOP-TEST [analytics/refinery@584ed6a] |
[production] |
09:10 |
<joal@deploy1002> |
Finished deploy [analytics/refinery@584ed6a] (thin): Hotfix analytics deploy (monthly sqoop) THIN [analytics/refinery@584ed6a] (duration: 00m 07s) |
[production] |
09:10 |
<joal@deploy1002> |
Started deploy [analytics/refinery@584ed6a] (thin): Hotfix analytics deploy (monthly sqoop) THIN [analytics/refinery@584ed6a] |
[production] |
09:09 |
<joal@deploy1002> |
Finished deploy [analytics/refinery@584ed6a]: Hotfix analytics deploy (monthly sqoop) [analytics/refinery@584ed6a] (duration: 16m 06s) |
[production] |
08:53 |
<joal> |
Deploy refinery for sqoop hotfix |
[analytics] |
08:52 |
<joal@deploy1002> |
Started deploy [analytics/refinery@584ed6a]: Hotfix analytics deploy (monthly sqoop) [analytics/refinery@584ed6a] |
[production] |
08:33 |
<elukey> |
clean up libmariadb-java from hadoop workers and clients |
[analytics] |
08:26 |
<dcaro> |
draining and rebooting coludvirt1018 (T280641) |
[admin] |
08:01 |
<moritzm> |
installing edk2 security updates |
[production] |
07:46 |
<joal> |
Kill prod sqoop job to restart after fix |
[analytics] |
07:31 |
<moritzm> |
installing libimage-exiftool-perl security updates |
[production] |
06:21 |
<Majavah> |
apply https://phabricator.wikimedia.org/R2073:12a3fc4d4f16a76bb313d78443f4579f8a5c5531 |
[tools.openstack-browser-dev] |
2021-05-02
§
|
18:58 |
<Majavah> |
add dns record upload.wikimedia.beta.wmflabs.org. -> 185.15.56.35 (deployment-cache-upload floating address) |
[releng] |
18:50 |
<Majavah> |
adjust deployment-cache* hieradata to treat upload.wikimedia.beta.wmflabs.org like upload.beta.wmflabs.org |
[releng] |
18:42 |
<Krinkle> |
Cherry-pick "mediawiki: Remove 'deployment.wikimedia' vhost from Beta Cluster" - <https://gerrit.wikimedia.org/r/c/operations/puppet/+/684117>, ref T198673 |
[releng] |
18:41 |
<Krinkle> |
Run `puppe agent -tv` on deployment-cache-text06 and deployment-mediawiki11 |
[releng] |
18:37 |
<Krinkle> |
Cherry-pick "mediawiki: Remove 'deployment.wikimedia' vhost from Beta Cluster" - https://gerrit.wikimedia.org/r/c/operations/puppet/+/684117 |
[releng] |
13:40 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on cloudmetrics1002.eqiad.wmnet with reason: Flaky host |
[production] |
13:40 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on cloudmetrics1002.eqiad.wmnet with reason: Flaky host |
[production] |
11:34 |
<wm-bot> |
<lucaswerkmeister> deployed 4c9a5f0ebf (duplicate check JS fixes) |
[tools.lexeme-forms] |
2021-05-01
§
|
19:19 |
<James_F> |
Zuul: Add atagar to the CI allow list |
[releng] |
19:12 |
<Urbanecm> |
Invalidate password for MaraBot@SUL (T281586) |
[production] |
16:58 |
<legoktm@deploy1002> |
Synchronized logos/config.yaml: Add eswiki 20th anniversary logos (duration: 00m 57s) |
[production] |
16:56 |
<legoktm@deploy1002> |
Synchronized wmf-config/logos.php: Use eswiki 20th anniversary logos (T280908) (duration: 00m 56s) |
[production] |
16:50 |
<legoktm@deploy1002> |
Synchronized static/images/project-logos/: Add eswiki 20th anniversary logos (duration: 00m 57s) |
[production] |
14:07 |
<wm-bot> |
<lucaswerkmeister> deployed 61744950f0 (l10n updates) |
[tools.lexeme-forms] |
10:37 |
<Majavah> |
installing deployment-urldownloader03 to replace 02 - T278641 |
[releng] |
07:22 |
<elukey> |
powercycle elastic2033 - no ssh, no tty available via mgmt |
[production] |
04:05 |
<Krinkle> |
Updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/684004 |
[releng] |
2021-04-30
§
|
21:54 |
<mutante> |
people1003 - rsycncing /home from peopel1002 |
[production] |
20:13 |
<dancy> |
Updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/c/integration/config/+/683987 |
[releng] |
19:21 |
<James_F> |
Docker: Publishing mediawiki-phan-taint-check-demo:0.1.1 for T257301 |
[releng] |
15:30 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudmetrics1002.eqiad.wmnet with reason: Flaky host |
[production] |
15:29 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudmetrics1002.eqiad.wmnet with reason: Flaky host |
[production] |
15:25 |
<bstorm> |
hard rebooting cloudmetrics1002 T275605 |
[production] |
14:21 |
<Majavah> |
add profile::pki::client to all deployment-prep instances to trust deployment-prep cfssl certificates, already deployed on production |
[releng] |
14:15 |
<Majavah> |
revert above as it's not working, T206158 |
[releng] |
14:13 |
<Majavah> |
deployment-cache-text: trying out useusing HTTPS for backend traffic to deployment-mediawiki11 T206158 |
[releng] |
12:37 |
<Majavah> |
force reboot deployment-cache-text06, not letting me to log in, this will disrupt beta cluster availability |
[releng] |
11:40 |
<ladsgroup@deploy1002> |
Synchronized static/favicon/wikitech.ico: Config: [[gerrit:683835|Update wikitech logo]] (duration: 00m 56s) |
[production] |
11:36 |
<ladsgroup@deploy1002> |
Synchronized static/images/project-logos/wikitech-1.5x.png: Config: [[gerrit:683835|Update wikitech logo]] (duration: 00m 56s) |
[production] |
11:34 |
<ladsgroup@deploy1002> |
Synchronized static/images/project-logos/wikitech-2x.png: Config: [[gerrit:683835|Update wikitech logo]] (duration: 00m 57s) |
[production] |
11:33 |
<ladsgroup@deploy1002> |
Synchronized static/images/project-logos/wikitech.png: Config: [[gerrit:683835|Update wikitech logo]] (duration: 00m 57s) |
[production] |
11:31 |
<ladsgroup@deploy1002> |
Synchronized logos/config.yaml: Config: [[gerrit:683835|Update wikitech logo]] (duration: 00m 57s) |
[production] |
11:16 |
<dcaro> |
draining and rebooting coludvirt1017, last one today (T280641) |
[admin] |
10:37 |
<dcaro> |
draining coludvirt1016 for reboot (T280641) |
[admin] |
09:47 |
<dcaro> |
draining coludvirt1013 for reboot (T280641) |
[admin] |
09:04 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1040.eqiad.wmnet with reason: primary nic disconnected |
[production] |