2020-06-11
§
|
11:35 |
<mlitn@deploy1001> |
Synchronized php-1.35.0-wmf.36/extensions/GrowthExperiments: Help panel: Update guidance behavior rules (duration: 01m 06s) |
[production] |
11:34 |
<jayme@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
11:34 |
<jayme@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
11:33 |
<arturo> |
reboot toolsbeta-puppetmaster-03 to try cleaning up potential kernel/filesystem problems |
[toolsbeta] |
11:32 |
<arturo> |
apparently every python script segfaults in toolsbeta-puppetmaster-03 |
[toolsbeta] |
11:28 |
<kartik@deploy1001> |
Synchronized php-1.35.0-wmf.36/extensions/ContentTranslation/modules/tools/mw.cx.tools.IssueTrackingTool.js: Backport: [[gerrit|604587|IssueTrackingTool: Fix js error in getCurrentNodeId method (T254965)]] (duration: 01m 07s) |
[production] |
11:27 |
<arturo> |
puppetdb wasn't the problem. The problem is puppet-enc segfaulting in toolsbeta-puppetmaster-03 |
[toolsbeta] |
11:21 |
<arturo> |
puppet not working because of puppetdb, run `aborrero@toolsbeta-puppetdb-02:~ $ sudo systemctl restart puppetdb` |
[toolsbeta] |
11:11 |
<arturo> |
deployed nginx-ingress for some early testing (not definitive) with code https://github.com/crookedstorm/paws/commit/bee62b3fd57f9804aa27e7b8b41fde50bd93df94 (T195217) |
[paws] |
11:08 |
<jayme@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . |
[production] |
11:04 |
<legoktm> |
restarting everything after gerrit-replica 502s fixed T255094 T255125 |
[codesearch] |
11:04 |
<mlitn@deploy1001> |
Synchronized php-1.35.0-wmf.36/extensions/MachineVision: $aliases should be an array of strings, not AliasGroup objects (duration: 01m 07s) |
[production] |
10:47 |
<moritzm> |
repooling mw1318,mw2139,mw2145,mw2147,mw2221,mw2219,mw2250,mw2350 (these were depooled, but seem all fine in Icinga and were probably just forgotten) |
[production] |
10:41 |
<filippo@cumin1001> |
conftool action : set/pooled=yes; selector: cluster=thanos,service=thanos-swift |
[production] |
10:40 |
<filippo@cumin1001> |
conftool action : set/pooled=yes; selector: cluster=thanos,service=thanos-query |
[production] |
10:37 |
<Urbanecm> |
Run `update page set page_content_model="json" where page_content_model = "CollaborationListContent" OR page_content_model = "CollaborationHubContent";` at beta enwiki (T255107) |
[releng] |
10:37 |
<moritzm> |
installing buster kernel security updates (no reboots yet, on hold for regression-free microcode update) |
[production] |
10:32 |
<godog> |
roll-restart pybal in eqiad lvs low-traffic |
[production] |
10:21 |
<mutante> |
restarting gerrit on gerrit-replica (gerrit2001) - java.lang.OutOfMemoryError: Java heap space |
[production] |
10:21 |
<Urbanecm> |
Run scap pull at mwdebug1001 to revert temporary changes |
[production] |
10:18 |
<RhinosF1> |
tools.zppixbot-test@tools-sgebastion-08:~$ grep -r -D skip "last_event_at" (in case anything seems slow, may take a while, please don't kill anything while I do it) END |
[tools.zppixbot-test] |
10:15 |
<arturo> |
added role (just a label) for ingress nodes: `kubectl label node paws-k8s-ingress-1 kubernetes.io/role=ingress` (T195217) |
[paws] |
10:14 |
<RhinosF1> |
tools.zppixbot-test@tools-sgebastion-08:~$ grep -r -D skip "last_event_at" (in case anything seems slow, may take a while, please don't kill anything while I do it) |
[tools.zppixbot-test] |
10:14 |
<Urbanecm> |
Applying temporary changes on mwdebug1001 |
[production] |
09:58 |
<moritzm> |
upgrading netmon* to PHP 7.2.31 |
[production] |
09:55 |
<marostegui> |
Upgrade es2025 |
[production] |
09:54 |
<moritzm> |
upgrading mwmaint* to PHP 7.2.31 |
[production] |
09:46 |
<moritzm> |
upgrading labweb* to PHP 7.2.31 |
[production] |
09:36 |
<elukey> |
switch piwik.wikimedia.org from matomo1001 to matomo1002 (new buster node) |
[production] |
09:02 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:00 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:48 |
<jayme@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . |
[production] |
08:48 |
<jayme@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . |
[production] |
08:42 |
<moritzm> |
imported memcached 1.6.6-1~wmf10u1 |
[production] |
08:39 |
<marostegui> |
Reimage es2024 to buster |
[production] |
08:30 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:30 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:25 |
<akosiaris@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:25 |
<akosiaris@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:25 |
<akosiaris@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:25 |
<akosiaris@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:24 |
<akosiaris@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:24 |
<akosiaris@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:24 |
<akosiaris@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
08:24 |
<akosiaris@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:23 |
<jayme@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'eventgate-logging-external' for release 'canary' . |
[production] |
08:23 |
<jayme@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'eventgate-logging-external' for release 'production' . |
[production] |
08:22 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:20 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:18 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |