2023-01-05
ยง
|
15:08 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1113.eqiad.wmnet with reason: Maintenance |
[production] |
15:08 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db1113.eqiad.wmnet with reason: Maintenance |
[production] |
15:08 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1110 (T326156)', diff saved to https://phabricator.wikimedia.org/P42866 and previous config saved to /var/cache/conftool/dbconfig/20230105-150804-ladsgroup.json |
[production] |
14:58 |
<cgoubert@cumin1001> |
START - Cookbook sre.hosts.reboot-cluster |
[production] |
14:58 |
<cgoubert@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reboot-cluster (exit_code=97) |
[production] |
14:56 |
<claime> |
hard resetting mw1486 |
[production] |
14:52 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1110', diff saved to https://phabricator.wikimedia.org/P42865 and previous config saved to /var/cache/conftool/dbconfig/20230105-145257-ladsgroup.json |
[production] |
14:37 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1110', diff saved to https://phabricator.wikimedia.org/P42864 and previous config saved to /var/cache/conftool/dbconfig/20230105-143751-ladsgroup.json |
[production] |
14:30 |
<mlitn@deploy1002> |
Finished scap: Backport for [[gerrit:875908|Also get central description (T325831)]] (duration: 08m 32s) |
[production] |
14:23 |
<mlitn@deploy1002> |
mlitn and mlitn: Backport for [[gerrit:875908|Also get central description (T325831)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
14:22 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1110 (T326156)', diff saved to https://phabricator.wikimedia.org/P42862 and previous config saved to /var/cache/conftool/dbconfig/20230105-142244-ladsgroup.json |
[production] |
14:21 |
<mlitn@deploy1002> |
Started scap: Backport for [[gerrit:875908|Also get central description (T325831)]] |
[production] |
14:20 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1110 (T326156)', diff saved to https://phabricator.wikimedia.org/P42861 and previous config saved to /var/cache/conftool/dbconfig/20230105-142029-ladsgroup.json |
[production] |
14:20 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1110.eqiad.wmnet with reason: Maintenance |
[production] |
14:20 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db1110.eqiad.wmnet with reason: Maintenance |
[production] |
14:20 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1100 (T326156)', diff saved to https://phabricator.wikimedia.org/P42860 and previous config saved to /var/cache/conftool/dbconfig/20230105-142008-ladsgroup.json |
[production] |
14:17 |
<mlitn@deploy1002> |
Finished scap: Backport for [[gerrit:875906|Also get central description (T325831)]] (duration: 07m 57s) |
[production] |
14:11 |
<mlitn@deploy1002> |
mlitn and mlitn: Backport for [[gerrit:875906|Also get central description (T325831)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
14:09 |
<mlitn@deploy1002> |
Started scap: Backport for [[gerrit:875906|Also get central description (T325831)]] |
[production] |
14:05 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1100', diff saved to https://phabricator.wikimedia.org/P42859 and previous config saved to /var/cache/conftool/dbconfig/20230105-140501-ladsgroup.json |
[production] |
13:58 |
<Amir1> |
start of externallinks migration in elwiki (and rest of large wikis in s3) (T326314) |
[production] |
13:49 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1100', diff saved to https://phabricator.wikimedia.org/P42858 and previous config saved to /var/cache/conftool/dbconfig/20230105-134955-ladsgroup.json |
[production] |
13:46 |
<ladsgroup@deploy1002> |
Finished scap: Backport for [[gerrit:875892|Enable write both for externallinks in ten largest s3 wikis (T321662)]] (duration: 08m 54s) |
[production] |
13:42 |
<urbanecm> |
aswikiquote: Run importDump.php to import a XML dump (per new wiki importers request, running into issues with a largish page) |
[production] |
13:39 |
<ladsgroup@deploy1002> |
ladsgroup and ladsgroup: Backport for [[gerrit:875892|Enable write both for externallinks in ten largest s3 wikis (T321662)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet |
[production] |
13:38 |
<XioNoX> |
start [eqiad] faulty VC optics maintenance - T325803 |
[production] |
13:37 |
<ladsgroup@deploy1002> |
Started scap: Backport for [[gerrit:875892|Enable write both for externallinks in ten largest s3 wikis (T321662)]] |
[production] |
13:34 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1100 (T326156)', diff saved to https://phabricator.wikimedia.org/P42857 and previous config saved to /var/cache/conftool/dbconfig/20230105-133448-ladsgroup.json |
[production] |
13:32 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1100 (T326156)', diff saved to https://phabricator.wikimedia.org/P42856 and previous config saved to /var/cache/conftool/dbconfig/20230105-133234-ladsgroup.json |
[production] |
13:32 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
13:32 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
13:32 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1096:3315 (T326156)', diff saved to https://phabricator.wikimedia.org/P42855 and previous config saved to /var/cache/conftool/dbconfig/20230105-133211-ladsgroup.json |
[production] |
13:30 |
<oblivian@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply |
[production] |
13:29 |
<oblivian@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-debug: apply |
[production] |
13:21 |
<effie> |
enable puppet on all mw servers |
[production] |
13:17 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1096:3315', diff saved to https://phabricator.wikimedia.org/P42854 and previous config saved to /var/cache/conftool/dbconfig/20230105-131705-ladsgroup.json |
[production] |
13:03 |
<oblivian@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-api-ext: apply |
[production] |
13:03 |
<oblivian@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-api-ext: apply |
[production] |
13:03 |
<oblivian@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mw-api-ext: apply |
[production] |
13:03 |
<oblivian@deploy1002> |
helmfile [codfw] START helmfile.d/services/mw-api-ext: apply |
[production] |
13:03 |
<oblivian@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply |
[production] |
13:02 |
<oblivian@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-api-int: apply |
[production] |
13:02 |
<oblivian@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply |
[production] |
13:02 |
<oblivian@deploy1002> |
helmfile [codfw] START helmfile.d/services/mw-api-int: apply |
[production] |
13:02 |
<oblivian@deploy1002> |
helmfile [eqiad] [main] DONE helmfile.d/services/mw-jobrunner : sync |
[production] |
13:02 |
<oblivian@deploy1002> |
helmfile [eqiad] [canary] DONE helmfile.d/services/mw-jobrunner : sync |
[production] |
13:02 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1096:3315', diff saved to https://phabricator.wikimedia.org/P42853 and previous config saved to /var/cache/conftool/dbconfig/20230105-130158-ladsgroup.json |
[production] |
13:01 |
<oblivian@deploy1002> |
helmfile [eqiad] [main] START helmfile.d/services/mw-jobrunner : sync |
[production] |
13:01 |
<oblivian@deploy1002> |
helmfile [eqiad] [canary] START helmfile.d/services/mw-jobrunner : sync |
[production] |
13:01 |
<oblivian@deploy1002> |
helmfile [codfw] [main] DONE helmfile.d/services/mw-jobrunner : sync |
[production] |