2022-05-25
§
|
07:30 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
07:29 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
07:23 |
<kart_> |
Config: [[gerrit:798389|Enable Section Translation for Hindi in testwiki (T308834)]] |
[production] |
07:11 |
<kart_> |
Config: [[gerrit:797977|Enable Content and Section Translation in Serbian and Zulu Wikipedias (T304834 T304858)]] |
[production] |
06:38 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1143 (re)pooling @ 100%: After migrating to 10.4.25', diff saved to https://phabricator.wikimedia.org/P28475 and previous config saved to /var/cache/conftool/dbconfig/20220525-063856-root.json |
[production] |
06:23 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1143 (re)pooling @ 75%: After migrating to 10.4.25', diff saved to https://phabricator.wikimedia.org/P28474 and previous config saved to /var/cache/conftool/dbconfig/20220525-062352-root.json |
[production] |
06:20 |
<elukey> |
`elukey@an-tool1011:~$ sudo systemctl reset-failed ifup@ens13.service` - T273026 |
[production] |
06:08 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1143 (re)pooling @ 50%: After migrating to 10.4.25', diff saved to https://phabricator.wikimedia.org/P28473 and previous config saved to /var/cache/conftool/dbconfig/20220525-060848-root.json |
[production] |
05:53 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1143 (re)pooling @ 25%: After migrating to 10.4.25', diff saved to https://phabricator.wikimedia.org/P28472 and previous config saved to /var/cache/conftool/dbconfig/20220525-055344-root.json |
[production] |
05:41 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1130 (T298560)', diff saved to https://phabricator.wikimedia.org/P28471 and previous config saved to /var/cache/conftool/dbconfig/20220525-054135-ladsgroup.json |
[production] |
05:41 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1130.eqiad.wmnet with reason: Maintenance |
[production] |
05:41 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1130.eqiad.wmnet with reason: Maintenance |
[production] |
05:41 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1096:3315 (T298560)', diff saved to https://phabricator.wikimedia.org/P28470 and previous config saved to /var/cache/conftool/dbconfig/20220525-054127-ladsgroup.json |
[production] |
05:38 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1143 (re)pooling @ 10%: After migrating to 10.4.25', diff saved to https://phabricator.wikimedia.org/P28469 and previous config saved to /var/cache/conftool/dbconfig/20220525-053840-root.json |
[production] |
05:30 |
<marostegui> |
Rename revision_actor_temp on s1 T307906 |
[production] |
05:26 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1096:3315', diff saved to https://phabricator.wikimedia.org/P28468 and previous config saved to /var/cache/conftool/dbconfig/20220525-052622-ladsgroup.json |
[production] |
05:23 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1143 (re)pooling @ 5%: After migrating to 10.4.25', diff saved to https://phabricator.wikimedia.org/P28467 and previous config saved to /var/cache/conftool/dbconfig/20220525-052336-root.json |
[production] |
05:18 |
<marostegui> |
Rename revision_actor_temp on s5 T307906 |
[production] |
05:14 |
<marostegui> |
Rename revision_actor_temp on s7 T307906 |
[production] |
05:11 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1096:3315', diff saved to https://phabricator.wikimedia.org/P28466 and previous config saved to /var/cache/conftool/dbconfig/20220525-051117-ladsgroup.json |
[production] |
05:08 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1143 (re)pooling @ 1%: After migrating to 10.4.25', diff saved to https://phabricator.wikimedia.org/P28465 and previous config saved to /var/cache/conftool/dbconfig/20220525-050833-root.json |
[production] |
05:06 |
<marostegui> |
Install 10.4.25 on db1143 T308915 |
[production] |
05:05 |
<marostegui@cumin2002> |
dbctl commit (dc=all): 'Depool db1143', diff saved to https://phabricator.wikimedia.org/P28464 and previous config saved to /var/cache/conftool/dbconfig/20220525-050538-marostegui.json |
[production] |
05:03 |
<marostegui> |
Rename revision_actor_temp on s2 T307906 |
[production] |
04:57 |
<marostegui> |
Rename revision_actor_temp on s4 T307906 |
[production] |
04:56 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1096:3315 (T298560)', diff saved to https://phabricator.wikimedia.org/P28463 and previous config saved to /var/cache/conftool/dbconfig/20220525-045612-ladsgroup.json |
[production] |
02:02 |
<ebernhardson> |
restart elasticsearch_6@production-search-psi-eqiad to resolve CirrusSearchJVMGCOldPoolFlatlined alert |
[production] |
00:15 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1096:3315 (T298560)', diff saved to https://phabricator.wikimedia.org/P28462 and previous config saved to /var/cache/conftool/dbconfig/20220525-001552-ladsgroup.json |
[production] |
00:15 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1096.eqiad.wmnet with reason: Maintenance |
[production] |
00:15 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1096.eqiad.wmnet with reason: Maintenance |
[production] |
2022-05-24
§
|
22:09 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
22:08 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
22:08 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
22:04 |
<cjming> |
end of UTC late backport window |
[production] |
22:04 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
22:03 |
<mutante> |
centrallog2002 - alerted because running out of disk. /srv/syslog# find . -name *.gz -mtime +100 -delete |
[production] |
22:02 |
<cjming@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:798813|Revert "Start writing to cuc_actor in s3, kcgwiki and labtestwiki" (T233004 T309148)]] (duration: 00m 49s) |
[production] |
21:59 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:58 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
21:58 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:57 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
21:56 |
<cjming@deploy1002> |
Synchronized php-1.39.0-wmf.13/extensions/MobileFrontend: Backport: [[gerrit:798811|Follow-up I97c27fd7: Fix after-edit reload in source editor (T309068)]] (duration: 00m 48s) |
[production] |
21:42 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:41 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
21:41 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:40 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
21:36 |
<cjming@deploy1002> |
Synchronized wmf-config/InitialiseSettings-labs.php: Config: [[gerrit:798976|Update beta cluster DiscussionTools A/B test config (T304030)]] (duration: 00m 49s) |
[production] |
21:35 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:34 |
<cjming@deploy1002> |
Synchronized wmf-config/CommonSettings.php: Config: [[gerrit:771872|Disable autotopicsub user option by default (T297966)]] (duration: 00m 48s) |
[production] |
21:34 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster relforge: relforge cluster reimage - ryankemper@cumin1001 - T308606 |
[production] |