2022-05-12
07:16 |
<jmm@cumin1001> |
START - Cookbook sre.hosts.reimage for host ganeti4001.ulsfo.wmnet with OS bullseye |
[production] |
07:13 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
07:13 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
07:12 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
07:12 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
07:09 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ores1001.eqiad.wmnet with reason: host reimage |
[production] |
07:08 |
<kartik@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:791107|Enable Section Translation in cs, el, he, ko, sw and tr WPs (T304855 T304854 T298239 T304863 T304853 T304828)]] (duration: 00m 51s) |
[production] |
07:06 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
07:06 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ores1001.eqiad.wmnet with reason: host reimage |
[production] |
07:06 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
07:05 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
07:05 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
06:44 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reimage for host ores1001.eqiad.wmnet with OS buster |
[production] |
06:32 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Increase traffic on db1127 to test 10.6 T308126', diff saved to https://phabricator.wikimedia.org/P27797 and previous config saved to /var/cache/conftool/dbconfig/20220512-063217-marostegui.json |
[production] |
06:22 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Increase traffic on db1127 to test 10.6 T308126', diff saved to https://phabricator.wikimedia.org/P27796 and previous config saved to /var/cache/conftool/dbconfig/20220512-062241-marostegui.json |
[production] |
06:13 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Pool db1127 with low weight T308126', diff saved to https://phabricator.wikimedia.org/P27795 and previous config saved to /var/cache/conftool/dbconfig/20220512-061305-marostegui.json |
[production] |
05:59 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1127 T308126', diff saved to https://phabricator.wikimedia.org/P27794 and previous config saved to /var/cache/conftool/dbconfig/20220512-055918-marostegui.json |
[production] |
05:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2122 T307501', diff saved to https://phabricator.wikimedia.org/P27793 and previous config saved to /var/cache/conftool/dbconfig/20220512-054138-marostegui.json |
[production] |
05:34 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2122 T307501', diff saved to https://phabricator.wikimedia.org/P27792 and previous config saved to /var/cache/conftool/dbconfig/20220512-053444-marostegui.json |
[production] |
05:11 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2140 T308202', diff saved to https://phabricator.wikimedia.org/P27791 and previous config saved to /var/cache/conftool/dbconfig/20220512-051106-marostegui.json |
[production] |
04:12 |
<andrewbogott> |
rebooting primary bastion (bastion-eqiad1-03.bastion.eqiad1.wikimedia.cloud) in hopes of resolving a problem with ssh proxying |
[admin] |
04:07 |
<kart_> |
Updated cxserver to 2022-05-11-135122-production (T307967, T306999, T298239, T304853, T307507, T308039) |
[production] |
04:05 |
<kartik@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
04:04 |
<kartik@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
04:01 |
<kartik@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
04:01 |
<kartik@deploy1002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
03:57 |
<kartik@deploy1002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
03:56 |
<kartik@deploy1002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
00:46 |
<brennen> |
gitlab: disabling container registries on all existing projects (T307537) |
[releng] |
00:27 |
<wm-bot> |
<root> Set concurrencyPolicy: Forbid on oabotrefresh cronjob and deleted stale job pods |
[tools.oabot] |
00:19 |
<bd808> |
Increased count/cronjobs.batch to 60 and count/jobs.batch quota to 30 |
[tools.jarbot-ii] |
00:16 |
<bd808> |
Increased count/jobs.batch quota to 30 |
[tools.citationhunt] |
00:10 |
<bd808> |
Increased count/cronjobs.batch to 60 and count/jobs.batch quota to 30 |
[tools.jarbot-iii] |
00:06 |
<bd808> |
Increased count/jobs.batch quota to 20 |
[tools.citationhunt] |
00:04 |
<bd808> |
Increased count/jobs.batch quota to 30 |
[tools.jarbot] |
2022-05-11
23:56 |
<bd808> |
Increased count/cronjobs.batch quota from 50 to 60 |
[tools.jarbot] |
23:20 |
<brennen> |
gitlab-prod-1001.devtools: container registry currently enabled |
[releng] |
22:59 |
<wm-bot> |
<root> kubectl delete deployment.apps/scholiaanalytics |
[tools.scholiaanalytics] |
22:59 |
<wm-bot> |
<root> kubectl delete deployment.apps/scholia-analytics |
[tools.scholiaanalytics] |
22:58 |
<wm-bot> |
<root> kubectl delete deployment.apps/nginx |
[tools.scholiaanalytics] |
22:54 |
<wm-bot> |
<root> webservice stop |
[tools.scholia-analytics] |
22:52 |
<wm-bot> |
<root> kubectl delete deployment.apps/scholia-analytics4 |
[tools.scholia-analytics] |
22:52 |
<wm-bot> |
<root> kubectl delete deployment.apps/scholia-analytics3 |
[tools.scholia-analytics] |
22:52 |
<wm-bot> |
<root> kubectl delete deployment.apps/scholia-analytics2 |
[tools.scholia-analytics] |
22:50 |
<wm-bot> |
<root> kubectl delete deployment.apps/nginx-app |
[tools.scholia-analytics] |
22:28 |
<robh> |
cp305[67] returned to service and all green in icinga, cp305[89] depooling for firmware update T243167 |
[production] |
22:00 |
<robh> |
cp305[45] returned to service and all green in icinga, cp305[67] depooling for firmware update T243167 |
[production] |
21:34 |
<robh> |
cp30[23] returned to service and all green in icinga, cp30[45] depooling for firmware update T243167 |
[production] |
21:34 |
<robh> |
cp50[23] returned to service and all green in icinga, cp50[45] depooling for firmware update T243167 |
[production] |
21:33 |
<robh> |
cp50[23] returned to service and all green in icinga, cp50[45] depooling for firmware update |
[production] |