2001-2050 of 10000 results (57ms)
2022-05-31 §
07:34 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
07:34 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
07:31 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
07:30 <marostegui@cumin1001> dbctl commit (dc=all): 'es1022 (re)pooling @ 5%: After migrating it to 10.6', diff saved to https://phabricator.wikimedia.org/P29180 and previous config saved to /var/cache/conftool/dbconfig/20220531-073026-root.json [production]
07:27 <elukey> add profile k8s_mlstaging + authkey for ml-staging k8s - T302195 [production]
07:21 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
07:20 <kartik@deploy1002> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:801018|Stream config for android breadcrumbs schema]] (duration: 03m 09s) [production]
07:20 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
07:20 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
07:17 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
07:17 <XioNoX> push new pfw firewall rules - T309236 [production]
07:15 <marostegui@cumin1001> dbctl commit (dc=all): 'es1022 (re)pooling @ 1%: After migrating it to 10.6', diff saved to https://phabricator.wikimedia.org/P29179 and previous config saved to /var/cache/conftool/dbconfig/20220531-071522-root.json [production]
07:12 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
07:10 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
07:10 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
07:09 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
07:09 <kartik@deploy1002> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:800833|testwiki: Enable Section Translation in 10 Wikipedias (T308829)]] (duration: 03m 02s) [production]
07:01 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool es1022 for migration to 10.6', diff saved to https://phabricator.wikimedia.org/P29178 and previous config saved to /var/cache/conftool/dbconfig/20220531-070058-root.json [production]
06:26 <elukey> `elukey@an-master1001:~$ sudo systemctl reset-failed hadoop-clean-fairscheduler-event-logs.service` [production]
06:10 <marostegui> dbmaint s5@eqiad T298557 [production]
06:05 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1100 T308725', diff saved to https://phabricator.wikimedia.org/P29176 and previous config saved to /var/cache/conftool/dbconfig/20220531-060518-root.json [production]
06:01 <marostegui@cumin1001> dbctl commit (dc=all): 'Promote db1130 to s5 primary and set section read-write T308725', diff saved to https://phabricator.wikimedia.org/P29175 and previous config saved to /var/cache/conftool/dbconfig/20220531-060140-root.json [production]
06:01 <marostegui@cumin1001> dbctl commit (dc=all): 'Set s5 eqiad as read-only for maintenance - T308725', diff saved to https://phabricator.wikimedia.org/P29174 and previous config saved to /var/cache/conftool/dbconfig/20220531-060112-root.json [production]
06:00 <marostegui> Starting s5 eqiad failover from db1100 to db1130 - T308725 [production]
05:03 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 22 hosts with reason: Primary switchover s5 T308725 [production]
05:03 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on 22 hosts with reason: Primary switchover s5 T308725 [production]
04:58 <marostegui@cumin1001> dbctl commit (dc=all): 'Set db1130 with weight 0 T308725', diff saved to https://phabricator.wikimedia.org/P29173 and previous config saved to /var/cache/conftool/dbconfig/20220531-045824-root.json [production]
02:30 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
02:30 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
02:30 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
02:27 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
02:07 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
02:04 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
02:04 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
02:03 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
01:58 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1147 (T60674)', diff saved to https://phabricator.wikimedia.org/P29172 and previous config saved to /var/cache/conftool/dbconfig/20220531-015850-ladsgroup.json [production]
01:43 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1147', diff saved to https://phabricator.wikimedia.org/P29171 and previous config saved to /var/cache/conftool/dbconfig/20220531-014345-ladsgroup.json [production]
01:28 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1147', diff saved to https://phabricator.wikimedia.org/P29170 and previous config saved to /var/cache/conftool/dbconfig/20220531-012840-ladsgroup.json [production]
01:13 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1147 (T60674)', diff saved to https://phabricator.wikimedia.org/P29169 and previous config saved to /var/cache/conftool/dbconfig/20220531-011335-ladsgroup.json [production]
00:39 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1133.eqiad.wmnet with reason: Maintenance [production]
00:39 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1133.eqiad.wmnet with reason: Maintenance [production]
00:39 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1118 (T60674)', diff saved to https://phabricator.wikimedia.org/P29168 and previous config saved to /var/cache/conftool/dbconfig/20220531-003947-ladsgroup.json [production]
00:24 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1118', diff saved to https://phabricator.wikimedia.org/P29167 and previous config saved to /var/cache/conftool/dbconfig/20220531-002442-ladsgroup.json [production]
00:09 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1118', diff saved to https://phabricator.wikimedia.org/P29166 and previous config saved to /var/cache/conftool/dbconfig/20220531-000937-ladsgroup.json [production]
00:04 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1168 (T60674)', diff saved to https://phabricator.wikimedia.org/P29165 and previous config saved to /var/cache/conftool/dbconfig/20220531-000452-ladsgroup.json [production]
2022-05-30 §
23:54 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1118 (T60674)', diff saved to https://phabricator.wikimedia.org/P29164 and previous config saved to /var/cache/conftool/dbconfig/20220530-235432-ladsgroup.json [production]
23:49 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P29163 and previous config saved to /var/cache/conftool/dbconfig/20220530-234947-ladsgroup.json [production]
23:49 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1147 (T60674)', diff saved to https://phabricator.wikimedia.org/P29162 and previous config saved to /var/cache/conftool/dbconfig/20220530-234929-ladsgroup.json [production]
23:49 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1147.eqiad.wmnet with reason: Maintenance [production]
23:49 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1147.eqiad.wmnet with reason: Maintenance [production]