2022-02-10
19:12 <jgiannelos@deploy1002> Started deploy [kartotherian/deploy@828a428] (eqiad): Configure geoshapes postgres max conns [production]
19:11 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn [production]
19:11 <bblack> lvs1017 rebooting for sanity-check after prod config - T301142 [production]
19:08 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1181 (T300382)', diff saved to https://phabricator.wikimedia.org/P20576 and previous config saved to /var/cache/conftool/dbconfig/20220210-190840-marostegui.json [production]
19:03 <otto@deploy1002> Finished deploy [airflow-dags/research@b871faf]: (no justification provided) (duration: 00m 03s) [production]
19:03 <otto@deploy1002> Started deploy [airflow-dags/research@b871faf]: (no justification provided) [production]
19:01 <otto@deploy1002> Finished deploy [airflow-dags/research@b871faf]: (no justification provided) (duration: 00m 27s) [production]
19:01 <otto@deploy1002> Started deploy [airflow-dags/research@b871faf]: (no justification provided) [production]
18:59 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1179 (T298554)', diff saved to https://phabricator.wikimedia.org/P20575 and previous config saved to /var/cache/conftool/dbconfig/20220210-185956-ladsgroup.json [production]
18:53 <ebernhardson> restart all mjolnir daemons on search-loader1001 and 2001 to purge old cached node lists [production]
18:53 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P20574 and previous config saved to /var/cache/conftool/dbconfig/20220210-185336-marostegui.json [production]
18:52 <jgiannelos@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: sync on production [production]
18:51 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn [production]
18:50 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn [production]
18:49 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn [production]
18:49 <jgiannelos@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply on staging [production]
18:49 <jgiannelos@deploy1002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply on production [production]
18:49 <jgiannelos@deploy1002> helmfile [codfw] DONE helmfile.d/services/mobileapps: sync on production [production]
18:49 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn [production]
18:46 <jgiannelos@deploy1002> helmfile [codfw] DONE helmfile.d/services/mobileapps: apply on staging [production]
18:46 <jgiannelos@deploy1002> helmfile [codfw] START helmfile.d/services/mobileapps: apply on production [production]
18:45 <jgiannelos@deploy1002> helmfile [staging] DONE helmfile.d/services/mobileapps: sync on staging [production]
18:45 <cmjohnson@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host restbase1031.eqiad.wmnet with OS buster [production]
18:45 <cmjohnson@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host restbase1032.eqiad.wmnet with OS buster [production]
18:45 <jgiannelos@deploy1002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply on production [production]
18:45 <cmjohnson@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host restbase1033.eqiad.wmnet with OS buster [production]
18:45 <jgiannelos@deploy1002> helmfile [staging] START helmfile.d/services/mobileapps: apply on staging [production]
18:44 <jgiannelos@deploy1002> helmfile [staging] START helmfile.d/services/mobileapps: apply on staging [production]
18:43 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn [production]
18:43 <bblack> lvs1013 - stopping puppet+pybal for move to lvs1017, high-traffic1 traffic fails over to lvs1020 for now - T301142 [production]
18:42 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn [production]
18:42 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn [production]
18:42 <ladsgroup@deploy1002> Synchronized php-1.38.0-wmf.21/includes/content/ContentHandler.php: Backport: [[gerrit:761419|ContentHandler: Avoding saving in ParserCache in search index jobs (T285993)]] (duration: 00m 50s) [production]
18:41 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn [production]
18:40 <ladsgroup@deploy1002> Synchronized php-1.38.0-wmf.20/includes/content/ContentHandler.php: Backport: [[gerrit:761420|ContentHandler: Avoding saving in ParserCache in search index jobs (T285993)]] (duration: 00m 50s) [production]
18:40 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db1096:3315 (T300775)', diff saved to https://phabricator.wikimedia.org/P20573 and previous config saved to /var/cache/conftool/dbconfig/20220210-184012-marostegui.json [production]
18:40 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1096.eqiad.wmnet with reason: Maintenance [production]
18:40 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1096.eqiad.wmnet with reason: Maintenance [production]
18:40 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1110 (T300775)', diff saved to https://phabricator.wikimedia.org/P20572 and previous config saved to /var/cache/conftool/dbconfig/20220210-184004-marostegui.json [production]
18:38 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P20571 and previous config saved to /var/cache/conftool/dbconfig/20220210-183831-marostegui.json [production]
18:31 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2088:3312 (T300510)', diff saved to https://phabricator.wikimedia.org/P20570 and previous config saved to /var/cache/conftool/dbconfig/20220210-183107-ladsgroup.json [production]
18:30 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1179 (T298554)', diff saved to https://phabricator.wikimedia.org/P20569 and previous config saved to /var/cache/conftool/dbconfig/20220210-182959-ladsgroup.json [production]
18:29 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1179.eqiad.wmnet with reason: Maintenance [production]
18:29 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1179.eqiad.wmnet with reason: Maintenance [production]
18:29 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1175 (T298554)', diff saved to https://phabricator.wikimedia.org/P20568 and previous config saved to /var/cache/conftool/dbconfig/20220210-182952-ladsgroup.json [production]
18:29 <bblack@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:28 <jgiannelos@deploy1002> Finished deploy [kartotherian/deploy@a5be8ac] (eqiad): Remove references to cassandra `storage_id` (duration: 01m 01s) [production]
18:27 <jgiannelos@deploy1002> Started deploy [kartotherian/deploy@a5be8ac] (eqiad): Remove references to cassandra `storage_id` [production]
18:26 <bblack@cumin1001> START - Cookbook sre.dns.netbox [production]
18:25 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2088:3311 (T300510)', diff saved to https://phabricator.wikimedia.org/P20567 and previous config saved to /var/cache/conftool/dbconfig/20220210-182547-ladsgroup.json [production]