2023-02-22
§
|
10:19 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes2020.codfw.wmnet with reason: host reimage |
[production] |
10:18 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes2019.codfw.wmnet with reason: host reimage |
[production] |
10:18 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes2018.codfw.wmnet with reason: host reimage |
[production] |
10:13 |
<cgoubert@cumin1001> |
END (PASS) - Cookbook sre.switchdc.mediawiki.00-reduce-ttl (exit_code=0) |
[production] |
10:08 |
<claime> |
Starting sre.switchdc.mediawiki live test preparation steps |
[production] |
10:07 |
<cgoubert@cumin1001> |
START - Cookbook sre.switchdc.mediawiki.00-reduce-ttl |
[production] |
10:05 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reimage for host kubernetes2021.codfw.wmnet with OS bullseye |
[production] |
10:04 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reimage for host kubernetes2020.codfw.wmnet with OS bullseye |
[production] |
10:04 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes2017.codfw.wmnet with reason: host reimage |
[production] |
10:04 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reimage for host kubernetes2019.codfw.wmnet with OS bullseye |
[production] |
10:04 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reimage for host kubernetes2018.codfw.wmnet with OS bullseye |
[production] |
10:01 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes2017.codfw.wmnet with reason: host reimage |
[production] |
09:59 |
<hashar@deploy1002> |
Synchronized php: group1 wikis to 1.40.0-wmf.24 refs T325587 (duration: 06m 33s) |
[production] |
09:52 |
<hashar@deploy1002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.40.0-wmf.24 refs T325587 |
[production] |
09:51 |
<nfraison@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-presto1005.eqiad.wmnet with reason: host reimage |
[production] |
09:48 |
<nfraison@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-presto1005.eqiad.wmnet with reason: host reimage |
[production] |
09:46 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reimage for host kubernetes2017.codfw.wmnet with OS bullseye |
[production] |
09:30 |
<hashar@deploy1002> |
rebuilt and synchronized wikiversions files: Revert "group1 wikis to 1.40.0-wmf.23" - T325587 |
[production] |
09:14 |
<hashar@deploy1002> |
Synchronized php: group1 wikis to 1.40.0-wmf.24 refs T325587 (duration: 06m 38s) |
[production] |
09:07 |
<hashar@deploy1002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.40.0-wmf.24 refs T325587 |
[production] |
09:03 |
<nfraison@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-presto1004.eqiad.wmnet with OS bullseye |
[production] |
08:50 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.network.cf (exit_code=0) |
[production] |
08:50 |
<ayounsi@cumin1001> |
START - Cookbook sre.network.cf |
[production] |
08:49 |
<vgutierrez> |
rolling upgrade to HAProxy 2.6.9 in codfw, eqsin, drmrs, esams and eqiad |
[production] |
08:47 |
<nfraison@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-presto1004.eqiad.wmnet with reason: host reimage |
[production] |
08:43 |
<nfraison@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-presto1004.eqiad.wmnet with reason: host reimage |
[production] |
08:36 |
<ryankemper> |
[WDQS] Repooled `wdqs20[05,07,10]` |
[production] |
08:22 |
<kartik@deploy1002> |
Finished scap: Backport for [[gerrit:890947|Content Translation: Set MT threshold to 45% for Kurdish WP (T324941)]] (duration: 10m 41s) |
[production] |
08:17 |
<nfraison@cumin1001> |
START - Cookbook sre.hosts.reimage for host an-presto1004.eqiad.wmnet with OS bullseye |
[production] |
08:14 |
<kartik@deploy1002> |
kartik: Backport for [[gerrit:890947|Content Translation: Set MT threshold to 45% for Kurdish WP (T324941)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet |
[production] |
08:12 |
<kartik@deploy1002> |
Started scap: Backport for [[gerrit:890947|Content Translation: Set MT threshold to 45% for Kurdish WP (T324941)]] |
[production] |
08:00 |
<jynus@cumin1001> |
dbctl commit (dc=all): 'Depool db1128', diff saved to https://phabricator.wikimedia.org/P44724 and previous config saved to /var/cache/conftool/dbconfig/20230222-080050-jynus.json |
[production] |
01:53 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2186.codfw.wmnet with OS bullseye |
[production] |
01:52 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002" |
[production] |
01:44 |
<pt1979@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002" |
[production] |
01:35 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2186.codfw.wmnet with reason: host reimage |
[production] |
01:33 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2186.codfw.wmnet with reason: host reimage |
[production] |
01:31 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host db2186.codfw.wmnet with OS bullseye |
[production] |
2023-02-21
§
|
23:55 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
23:50 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2169:3317 (T328255)', diff saved to https://phabricator.wikimedia.org/P44723 and previous config saved to /var/cache/conftool/dbconfig/20230221-235012-ladsgroup.json |
[production] |
23:35 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2169:3317', diff saved to https://phabricator.wikimedia.org/P44722 and previous config saved to /var/cache/conftool/dbconfig/20230221-233506-ladsgroup.json |
[production] |
23:20 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2169:3317', diff saved to https://phabricator.wikimedia.org/P44721 and previous config saved to /var/cache/conftool/dbconfig/20230221-232000-ladsgroup.json |
[production] |
23:09 |
<tzatziki> |
removing 5 files for legal compliance |
[production] |
23:04 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2169:3317 (T328255)', diff saved to https://phabricator.wikimedia.org/P44720 and previous config saved to /var/cache/conftool/dbconfig/20230221-230454-ladsgroup.json |
[production] |
23:01 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2169:3317 (T328255)', diff saved to https://phabricator.wikimedia.org/P44719 and previous config saved to /var/cache/conftool/dbconfig/20230221-230109-ladsgroup.json |
[production] |
23:01 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2169.codfw.wmnet with reason: Maintenance |
[production] |
23:00 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db2169.codfw.wmnet with reason: Maintenance |
[production] |
23:00 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2168:3317 (T328255)', diff saved to https://phabricator.wikimedia.org/P44718 and previous config saved to /var/cache/conftool/dbconfig/20230221-230048-ladsgroup.json |
[production] |
22:45 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2168:3317', diff saved to https://phabricator.wikimedia.org/P44717 and previous config saved to /var/cache/conftool/dbconfig/20230221-224542-ladsgroup.json |
[production] |
22:44 |
<eevans@cumin1001> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore1003.eqiad.wmnet: Restarting Cassandra to apply JVM 1.8.0_362 - eevans@cumin1001 |
[production] |