2022-02-10
§
|
19:12 |
<jgiannelos@deploy1002> |
Started deploy [kartotherian/deploy@828a428] (eqiad): Configure geoshapes postgres max conns |
[production] |
19:11 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn |
[production] |
19:11 |
<bblack> |
lvs1017 rebooting for sanity-check after prod config - T301142 |
[production] |
19:08 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1181 (T300382)', diff saved to https://phabricator.wikimedia.org/P20576 and previous config saved to /var/cache/conftool/dbconfig/20220210-190840-marostegui.json |
[production] |
19:03 |
<otto@deploy1002> |
Finished deploy [airflow-dags/research@b871faf]: (no justification provided) (duration: 00m 03s) |
[production] |
19:03 |
<otto@deploy1002> |
Started deploy [airflow-dags/research@b871faf]: (no justification provided) |
[production] |
19:01 |
<otto@deploy1002> |
Finished deploy [airflow-dags/research@b871faf]: (no justification provided) (duration: 00m 27s) |
[production] |
19:01 |
<otto@deploy1002> |
Started deploy [airflow-dags/research@b871faf]: (no justification provided) |
[production] |
18:59 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1179 (T298554)', diff saved to https://phabricator.wikimedia.org/P20575 and previous config saved to /var/cache/conftool/dbconfig/20220210-185956-ladsgroup.json |
[production] |
18:53 |
<ebernhardson> |
restart all mjolnir daemons on search-loader1001 and 2001 to purge old cached node lists |
[production] |
18:53 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P20574 and previous config saved to /var/cache/conftool/dbconfig/20220210-185336-marostegui.json |
[production] |
18:52 |
<jgiannelos@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mobileapps: sync on production |
[production] |
18:51 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn |
[production] |
18:50 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn |
[production] |
18:49 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn |
[production] |
18:49 |
<jgiannelos@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply on staging |
[production] |
18:49 |
<jgiannelos@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mobileapps: apply on production |
[production] |
18:49 |
<jgiannelos@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mobileapps: sync on production |
[production] |
18:49 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn |
[production] |
18:46 |
<jgiannelos@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mobileapps: apply on staging |
[production] |
18:46 |
<jgiannelos@deploy1002> |
helmfile [codfw] START helmfile.d/services/mobileapps: apply on production |
[production] |
18:45 |
<jgiannelos@deploy1002> |
helmfile [staging] DONE helmfile.d/services/mobileapps: sync on staging |
[production] |
18:45 |
<cmjohnson@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host restbase1031.eqiad.wmnet with OS buster |
[production] |
18:45 |
<cmjohnson@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host restbase1032.eqiad.wmnet with OS buster |
[production] |
18:45 |
<jgiannelos@deploy1002> |
helmfile [staging] DONE helmfile.d/services/mobileapps: apply on production |
[production] |
18:45 |
<cmjohnson@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host restbase1033.eqiad.wmnet with OS buster |
[production] |
18:45 |
<jgiannelos@deploy1002> |
helmfile [staging] START helmfile.d/services/mobileapps: apply on staging |
[production] |
18:44 |
<jgiannelos@deploy1002> |
helmfile [staging] START helmfile.d/services/mobileapps: apply on staging |
[production] |
18:43 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn |
[production] |
18:43 |
<bblack> |
lvs1013 - stopping puppet+pybal for move to lvs1017, high-traffic1 traffic fails over to lvs1020 for now - T301142 |
[production] |
18:42 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn |
[production] |
18:42 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn |
[production] |
18:42 |
<ladsgroup@deploy1002> |
Synchronized php-1.38.0-wmf.21/includes/content/ContentHandler.php: Backport: [[gerrit:761419|ContentHandler: Avoding saving in ParserCache in search index jobs (T285993)]] (duration: 00m 50s) |
[production] |
18:41 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn |
[production] |
18:40 |
<ladsgroup@deploy1002> |
Synchronized php-1.38.0-wmf.20/includes/content/ContentHandler.php: Backport: [[gerrit:761420|ContentHandler: Avoding saving in ParserCache in search index jobs (T285993)]] (duration: 00m 50s) |
[production] |
18:40 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db1096:3315 (T300775)', diff saved to https://phabricator.wikimedia.org/P20573 and previous config saved to /var/cache/conftool/dbconfig/20220210-184012-marostegui.json |
[production] |
18:40 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1096.eqiad.wmnet with reason: Maintenance |
[production] |
18:40 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1096.eqiad.wmnet with reason: Maintenance |
[production] |
18:40 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1110 (T300775)', diff saved to https://phabricator.wikimedia.org/P20572 and previous config saved to /var/cache/conftool/dbconfig/20220210-184004-marostegui.json |
[production] |
18:38 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P20571 and previous config saved to /var/cache/conftool/dbconfig/20220210-183831-marostegui.json |
[production] |
18:31 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2088:3312 (T300510)', diff saved to https://phabricator.wikimedia.org/P20570 and previous config saved to /var/cache/conftool/dbconfig/20220210-183107-ladsgroup.json |
[production] |
18:30 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1179 (T298554)', diff saved to https://phabricator.wikimedia.org/P20569 and previous config saved to /var/cache/conftool/dbconfig/20220210-182959-ladsgroup.json |
[production] |
18:29 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1179.eqiad.wmnet with reason: Maintenance |
[production] |
18:29 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1179.eqiad.wmnet with reason: Maintenance |
[production] |
18:29 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1175 (T298554)', diff saved to https://phabricator.wikimedia.org/P20568 and previous config saved to /var/cache/conftool/dbconfig/20220210-182952-ladsgroup.json |
[production] |
18:29 |
<bblack@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
18:28 |
<jgiannelos@deploy1002> |
Finished deploy [kartotherian/deploy@a5be8ac] (eqiad): Remove references to cassandra `storage_id` (duration: 01m 01s) |
[production] |
18:27 |
<jgiannelos@deploy1002> |
Started deploy [kartotherian/deploy@a5be8ac] (eqiad): Remove references to cassandra `storage_id` |
[production] |
18:26 |
<bblack@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:25 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2088:3311 (T300510)', diff saved to https://phabricator.wikimedia.org/P20567 and previous config saved to /var/cache/conftool/dbconfig/20220210-182547-ladsgroup.json |
[production] |