2022-05-31
§
|
14:20 |
<jbond@deploy1002> |
Started deploy [netbox/deploy@7bbf659]: deploying v2.10.4-wmf6 to new hosts |
[production] |
14:19 |
<jbond@deploy1002> |
Finished deploy [netbox/deploy@7bbf659]: deploying v2.10.4-wmf6 to new hosts (duration: 02m 15s) |
[production] |
14:19 |
<tgr> |
doing an emergency revert for T309616 |
[production] |
14:18 |
<bking@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudelastic1006.wikimedia.org with OS bullseye |
[production] |
14:17 |
<jbond@deploy1002> |
Started deploy [netbox/deploy@7bbf659]: deploying v2.10.4-wmf6 to new hosts |
[production] |
14:07 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1109', diff saved to https://phabricator.wikimedia.org/P29202 and previous config saved to /var/cache/conftool/dbconfig/20220531-140702-ladsgroup.json |
[production] |
14:06 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1127', diff saved to https://phabricator.wikimedia.org/P29201 and previous config saved to /var/cache/conftool/dbconfig/20220531-140611-ladsgroup.json |
[production] |
14:05 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1166', diff saved to https://phabricator.wikimedia.org/P29200 and previous config saved to /var/cache/conftool/dbconfig/20220531-140528-ladsgroup.json |
[production] |
14:03 |
<jbond@cumin1001> |
conftool action : set/pooled=true; selector: dnsdisc=netbox,name=eqiad |
[production] |
14:03 |
<bking@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudelastic1004.wikimedia.org with OS bullseye |
[production] |
13:51 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1109 (T60674)', diff saved to https://phabricator.wikimedia.org/P29199 and previous config saved to /var/cache/conftool/dbconfig/20220531-135157-ladsgroup.json |
[production] |
13:51 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1127 (T309311)', diff saved to https://phabricator.wikimedia.org/P29198 and previous config saved to /var/cache/conftool/dbconfig/20220531-135105-ladsgroup.json |
[production] |
13:50 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1166 (T307525)', diff saved to https://phabricator.wikimedia.org/P29197 and previous config saved to /var/cache/conftool/dbconfig/20220531-135022-ladsgroup.json |
[production] |
13:38 |
<elukey> |
move ml-etcd100[1-3] from drbd to plain to investigate high k8s latencies for the control plane |
[production] |
13:35 |
<bking@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudelastic1004.wikimedia.org with OS bullseye |
[production] |
13:34 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1166 (T307525)', diff saved to https://phabricator.wikimedia.org/P29195 and previous config saved to /var/cache/conftool/dbconfig/20220531-133356-ladsgroup.json |
[production] |
13:33 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1166.eqiad.wmnet with reason: Maintenance |
[production] |
13:33 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1166.eqiad.wmnet with reason: Maintenance |
[production] |
13:25 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1109 (T60674)', diff saved to https://phabricator.wikimedia.org/P29194 and previous config saved to /var/cache/conftool/dbconfig/20220531-132530-ladsgroup.json |
[production] |
13:25 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1109.eqiad.wmnet with reason: Maintenance |
[production] |
13:25 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1109.eqiad.wmnet with reason: Maintenance |
[production] |
13:15 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
13:15 |
<taavi> |
taavi@mwmaint1002 ~ $ mwscript namespaceDupes.php --wiki zhwiktionary --fix # T309564 |
[production] |
13:14 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
13:14 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
13:14 |
<taavi@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:801420|zhwiktionary: Create namespace "Thesaurus" and "Citations" (T309564)]] (duration: 02m 56s) |
[production] |
13:13 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
12:59 |
<Amir1> |
killed kowiki's refreshLinkRecommendations.php (T299021) |
[production] |
12:49 |
<jbond@deploy1002> |
Finished deploy [netbox/deploy@7bbf659]: deploying v2.10.4-wmf6 to new hosts (duration: 01m 04s) |
[production] |
12:48 |
<jbond@deploy1002> |
Started deploy [netbox/deploy@7bbf659]: deploying v2.10.4-wmf6 to new hosts |
[production] |
12:48 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
12:48 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1127 (T309311)', diff saved to https://phabricator.wikimedia.org/P29193 and previous config saved to /var/cache/conftool/dbconfig/20220531-124807-ladsgroup.json |
[production] |
12:48 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1127.eqiad.wmnet with reason: Maintenance |
[production] |
12:48 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1127.eqiad.wmnet with reason: Maintenance |
[production] |
12:45 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
12:45 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
12:44 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
12:42 |
<ladsgroup@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:801661|Migrate zhwiki to read new for templatelinks (T306673)]] (duration: 03m 10s) |
[production] |
12:41 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
12:41 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
12:39 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
12:39 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
12:36 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
12:36 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
12:35 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
12:35 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
12:33 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
12:33 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
12:32 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host idp2002.wikimedia.org |
[production] |
12:13 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) idp2002.wikimedia.org on all recursors |
[production] |