2020-03-05
§
|
15:01 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'apply' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
14:55 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'apply' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
14:55 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'apply' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
14:52 |
<moritzm> |
copied hpssacli to thirdparty/hwraid for buster-wikimedia (current Gen 10 releases are named ssaducli now, but retain the old package (which only uses libc anyway) for backwards compat with gen9 on Buster) |
[production] |
14:45 |
<moritzm> |
copied hpssaducli to thirdparty/hwraid for buster-wikimedia (current releases are named ssaducli now, but retain the old package (which only uses libc anyway) for backwards compat) |
[production] |
14:45 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
14:45 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
14:25 |
<XioNoX> |
push BGP to Cloud on cr2-codfw - T245606 |
[production] |
14:13 |
<Urbanecm> |
Password reset for SUL User:Yezi Brook (T246988) |
[production] |
14:09 |
<XioNoX> |
push BGP to Cloud on cr1-codfw - T245606 |
[production] |
14:05 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:05 |
<liw@deploy1001> |
rebuilt and synchronized wikiversions files: all wikis to 1.35.0-wmf.22 |
[production] |
14:03 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:03 |
<XioNoX> |
set all eqiad/codfw PDUs, cord W thresholds to 3440 - T245655 |
[production] |
13:54 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
13:51 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
13:50 |
<pt1979@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:49 |
<pt1979@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:48 |
<marostegui> |
Stop MySQL on db1078 for reimage - T246604 |
[production] |
13:47 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1078 for reimage to buster - T246604', diff saved to https://phabricator.wikimedia.org/P10623 and previous config saved to /var/cache/conftool/dbconfig/20200305-134701-marostegui.json |
[production] |
13:26 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
13:24 |
<pt1979@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
12:56 |
<addshore> |
stop that cache warming .... |
[production] |
12:52 |
<addshore> |
START warm cache for db1111 & db1126 for Q30-32 million (100k batch selects, 30s sleep) T219123 (pass 1) |
[production] |
12:06 |
<Amir1> |
the property terms removal is finished. 312K rows deleted (T225054) |
[production] |
11:53 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2109 after reimage to buster - T246604', diff saved to https://phabricator.wikimedia.org/P10622 and previous config saved to /var/cache/conftool/dbconfig/20200305-115322-marostegui.json |
[production] |
11:45 |
<Amir1> |
deleting property terms from wb_terms in wikidatawiki (T225054) |
[production] |
11:43 |
<ladsgroup@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: [[gerrit:577218|Stop writing to the old term store for properties (T219301 T225054)]], take II (duration: 01m 04s) |
[production] |
11:42 |
<ladsgroup@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: [[gerrit:577218|Stop writing to the old term store for properties (T219301 T225054)]] (duration: 01m 04s) |
[production] |
11:29 |
<ladsgroup@deploy1001> |
Synchronized php-1.35.0-wmf.22/extensions/Wikibase: [[gerrit:576963|Schedule 1 CleanTermsIfUnusedJob per ID to clean (T244115 T246898)]] (duration: 01m 08s) |
[production] |
11:25 |
<ladsgroup@deploy1001> |
Synchronized php-1.35.0-wmf.22/extensions/Cognate: [[gerrit:576876|Exit undelete hook early if revision not found (T245869)]] (duration: 01m 04s) |
[production] |
11:20 |
<addshore@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Write to new term store up to Q87 million, was 86 (T219123) cache bust (duration: 01m 03s) |
[production] |
11:19 |
<addshore@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Write to new term store up to Q87 million, was 86 (T219123) (duration: 01m 04s) |
[production] |
11:10 |
<vgutierrez> |
Disable parent proxies on ats-tls in ulsfo - T244464 |
[production] |
11:06 |
<addshore@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Reading up to Q30M for the new term store everywhere (was Q25M) + warm db1126 & db1111 caches (T219123) cache bust (duration: 01m 04s) |
[production] |
11:04 |
<addshore@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Reading up to Q30M for the new term store everywhere (was Q25M) + warm db1126 & db1111 caches (T219123) (duration: 01m 05s) |
[production] |
11:04 |
<jbond42> |
small update to PCC https://gerrit.wikimedia.org/r/c/operations/software/puppet-compiler/+/576663 |
[production] |
10:50 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:48 |
<hnowlan@deploy1001> |
Synchronized multiversion/MWScript.php: T244549: enable running MWScript with phpdbg (duration: 01m 04s) |
[production] |
10:48 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:18 |
<oblivian@deploy1001> |
Synchronized wmf-config/ProductionServices.php: Switch parsoid calls to use envoy as a proxy (duration: 01m 07s) |
[production] |
10:14 |
<vgutierrez> |
Enable keep alive between ats-tls and varnish-fe globally - T244464 |
[production] |
10:12 |
<marostegui> |
Stop MySQL on db2109 for reimage - T246604 |
[production] |
10:11 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2109 for reimage to buster - T246604', diff saved to https://phabricator.wikimedia.org/P10621 and previous config saved to /var/cache/conftool/dbconfig/20200305-101111-marostegui.json |
[production] |
10:11 |
<addshore> |
START warm cache for db1111 & db1126 for Q25-30 million T219123 (pass 2 today) |
[production] |
09:53 |
<hashar> |
Restarting Zuul; it no longer processes Gerrit events due to a thread stuck waiting on Gerrit. T246973 |
[production] |
08:50 |
<addshore> |
START warm cache for db1111 & db1126 for Q25-30 million T219123 (pass 1 today) |
[production] |
08:12 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Fully repool db1103:3312 db1103:3314 after reimage to buster T246604', diff saved to https://phabricator.wikimedia.org/P10619 and previous config saved to /var/cache/conftool/dbconfig/20200305-081227-marostegui.json |
[production] |
07:33 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1103:3312 db1103:3314 after reimage to buster T246604', diff saved to https://phabricator.wikimedia.org/P10618 and previous config saved to /var/cache/conftool/dbconfig/20200305-073319-marostegui.json |
[production] |
07:19 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1103:3312 db1103:3314 after reimage to buster T246604', diff saved to https://phabricator.wikimedia.org/P10617 and previous config saved to /var/cache/conftool/dbconfig/20200305-071915-marostegui.json |
[production] |