2022-05-12
§
|
08:45 |
<jmm@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
08:40 |
<klausman@cumin1001> |
START - Cookbook sre.hosts.reimage for host ores1003.eqiad.wmnet with OS buster |
[production] |
08:32 |
<jmm@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
08:31 |
<jmm@cumin1001> |
START - Cookbook sre.ganeti.makevm for new host idp-test2002.wikimedia.org |
[production] |
08:18 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Increase traffic on db1127 to test 10.6 T308126', diff saved to https://phabricator.wikimedia.org/P27799 and previous config saved to /var/cache/conftool/dbconfig/20220512-081814-marostegui.json |
[production] |
07:57 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Increase traffic on db1127 to test 10.6 T308126', diff saved to https://phabricator.wikimedia.org/P27798 and previous config saved to /var/cache/conftool/dbconfig/20220512-075703-marostegui.json |
[production] |
07:45 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ores1001.eqiad.wmnet with OS buster |
[production] |
07:34 |
<jmm@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti4001.ulsfo.wmnet with OS bullseye |
[production] |
07:33 |
<marostegui> |
dbmaint s7@codfw T308206 |
[production] |
07:32 |
<marostegui> |
dbmaint s6@eqiad T308206 |
[production] |
07:32 |
<marostegui> |
dbmaint s6@codfw T308206 |
[production] |
07:29 |
<marostegui> |
dbmaint s3@codfw T308206 |
[production] |
07:29 |
<marostegui> |
dbmaint s3@eqiad T308206 |
[production] |
07:18 |
<marostegui> |
dbmaint s7@codfw T308206 |
[production] |
07:16 |
<jmm@cumin1001> |
START - Cookbook sre.hosts.reimage for host ganeti4001.ulsfo.wmnet with OS bullseye |
[production] |
07:13 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
07:13 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
07:12 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
07:12 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
07:09 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ores1001.eqiad.wmnet with reason: host reimage |
[production] |
07:08 |
<kartik@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:791107|Enable Section Translation in cs, el, he, ko, sw and tr WPs (T304855 T304854 T298239 T304863 T304853 T304828)]] (duration: 00m 51s) |
[production] |
07:06 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
07:06 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ores1001.eqiad.wmnet with reason: host reimage |
[production] |
07:06 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
07:05 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
07:05 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
06:44 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reimage for host ores1001.eqiad.wmnet with OS buster |
[production] |
06:32 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Increase traffic on db1127 to test 10.6 T308126', diff saved to https://phabricator.wikimedia.org/P27797 and previous config saved to /var/cache/conftool/dbconfig/20220512-063217-marostegui.json |
[production] |
06:22 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Increase traffic on db1127 to test 10.6 T308126', diff saved to https://phabricator.wikimedia.org/P27796 and previous config saved to /var/cache/conftool/dbconfig/20220512-062241-marostegui.json |
[production] |
06:13 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Pool db1127 with low weight T308126', diff saved to https://phabricator.wikimedia.org/P27795 and previous config saved to /var/cache/conftool/dbconfig/20220512-061305-marostegui.json |
[production] |
05:59 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1127 T308126', diff saved to https://phabricator.wikimedia.org/P27794 and previous config saved to /var/cache/conftool/dbconfig/20220512-055918-marostegui.json |
[production] |
05:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2122 T307501', diff saved to https://phabricator.wikimedia.org/P27793 and previous config saved to /var/cache/conftool/dbconfig/20220512-054138-marostegui.json |
[production] |
05:34 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2122 T307501', diff saved to https://phabricator.wikimedia.org/P27792 and previous config saved to /var/cache/conftool/dbconfig/20220512-053444-marostegui.json |
[production] |
05:11 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2140 T308202', diff saved to https://phabricator.wikimedia.org/P27791 and previous config saved to /var/cache/conftool/dbconfig/20220512-051106-marostegui.json |
[production] |
04:07 |
<kart_> |
Updated cxserver to 2022-05-11-135122-production (T307967, T306999, T298239, T304853, T307507, T308039) |
[production] |
04:05 |
<kartik@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
04:04 |
<kartik@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
04:01 |
<kartik@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
04:01 |
<kartik@deploy1002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
03:57 |
<kartik@deploy1002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
03:56 |
<kartik@deploy1002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
2022-05-11
§
|
22:28 |
<robh> |
cp305[67] returned to service and all green in icinga, cp305[89] depooling for firmware update T243167 |
[production] |
22:00 |
<robh> |
cp305[45] returned to service and all green in icinga, cp305[67] depooling for firmware update T243167 |
[production] |
21:34 |
<robh> |
cp30[23] returned to service and all green in icinga, cp30[45] depooling for firmware update T243167 |
[production] |
21:34 |
<robh> |
cp50[23] returned to service and all green in icinga, cp50[45] depooling for firmware update T243167 |
[production] |
21:33 |
<robh> |
cp50[23] returned to service and all green in icinga, cp50[45] depooling for firmware update |
[production] |
21:01 |
<robh> |
cp305[23] going offline via T243167 for firmware updates (puppet agent disabled and depooled prior to reboot) |
[production] |
20:28 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
20:28 |
<tgr> |
T304542 running mwscript extensions/GrowthExperiments/maintenance/refreshLinkRecommendations.php hiwiki --verbose |
[production] |
20:27 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |