2022-10-12
ยง
|
14:54 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2178.codfw.wmnet with reason: Maintenance |
[production] |
14:54 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2178.codfw.wmnet with reason: Maintenance |
[production] |
14:54 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2171:3315 (T318955)', diff saved to https://phabricator.wikimedia.org/P35440 and previous config saved to /var/cache/conftool/dbconfig/20221012-145423-ladsgroup.json |
[production] |
14:39 |
<oblivian@deploy1002> |
helmfile [staging] DONE helmfile.d/services/eventstreams-internal: apply |
[production] |
14:39 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2171:3315', diff saved to https://phabricator.wikimedia.org/P35439 and previous config saved to /var/cache/conftool/dbconfig/20221012-143917-ladsgroup.json |
[production] |
14:39 |
<oblivian@deploy1002> |
helmfile [staging] START helmfile.d/services/eventstreams-internal: apply |
[production] |
14:35 |
<ladsgroup@deploy1002> |
Finished scap: Backport for [[gerrit:841874|Revert "rdbms: Instead of reconfiguring all of LB, just remove depooled db"]] (duration: 04m 37s) |
[production] |
14:31 |
<ladsgroup@deploy1002> |
ladsgroup and ladsgroup: Backport for [[gerrit:841874|Revert "rdbms: Instead of reconfiguring all of LB, just remove depooled db"]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet |
[production] |
14:30 |
<ladsgroup@deploy1002> |
Started scap: Backport for [[gerrit:841874|Revert "rdbms: Instead of reconfiguring all of LB, just remove depooled db"]] |
[production] |
14:24 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2171:3315', diff saved to https://phabricator.wikimedia.org/P35438 and previous config saved to /var/cache/conftool/dbconfig/20221012-142410-ladsgroup.json |
[production] |
14:19 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp4045.ulsfo.wmnet with OS buster |
[production] |
14:18 |
<sukhe@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp4045.ulsfo.wmnet with OS buster |
[production] |
14:09 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2171:3315 (T318955)', diff saved to https://phabricator.wikimedia.org/P35436 and previous config saved to /var/cache/conftool/dbconfig/20221012-140903-ladsgroup.json |
[production] |
14:08 |
<ladsgroup@deploy1002> |
Sync cancelled. |
[production] |
14:07 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repool db1175', diff saved to https://phabricator.wikimedia.org/P35435 and previous config saved to /var/cache/conftool/dbconfig/20221012-140746-ladsgroup.json |
[production] |
14:06 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depool db1175', diff saved to https://phabricator.wikimedia.org/P35434 and previous config saved to /var/cache/conftool/dbconfig/20221012-140626-ladsgroup.json |
[production] |
14:04 |
<volans@cumin2002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host lvs4008.mgmt.ulsfo.wmnet with reboot policy FORCED |
[production] |
13:53 |
<ladsgroup@deploy1002> |
ladsgroup and ladsgroup: Backport for [[gerrit:841873|rdbms: Instead of reconfiguring all of LB, just remove depooled db (T298485)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
13:53 |
<ladsgroup@deploy1002> |
Started scap: Backport for [[gerrit:841873|rdbms: Instead of reconfiguring all of LB, just remove depooled db (T298485)]] |
[production] |
13:49 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp4045.ulsfo.wmnet with OS buster |
[production] |
13:47 |
<volans@cumin2002> |
START - Cookbook sre.hosts.provision for host lvs4008.mgmt.ulsfo.wmnet with reboot policy FORCED |
[production] |
13:43 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2171:3315 (T318955)', diff saved to https://phabricator.wikimedia.org/P35433 and previous config saved to /var/cache/conftool/dbconfig/20221012-134306-ladsgroup.json |
[production] |
13:43 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2171.codfw.wmnet with reason: Maintenance |
[production] |
13:42 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2171.codfw.wmnet with reason: Maintenance |
[production] |
13:42 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2157 (T318955)', diff saved to https://phabricator.wikimedia.org/P35432 and previous config saved to /var/cache/conftool/dbconfig/20221012-134245-ladsgroup.json |
[production] |
13:27 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2157', diff saved to https://phabricator.wikimedia.org/P35431 and previous config saved to /var/cache/conftool/dbconfig/20221012-132738-ladsgroup.json |
[production] |
13:26 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:841895|Remove Research Incentive survey from eswiki (T318331)]] (duration: 05m 21s) |
[production] |
13:24 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts d-i-test.eqiad.wmnet |
[production] |
13:24 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
13:22 |
<urbanecm@deploy1002> |
urbanecm and dani: Backport for [[gerrit:841895|Remove Research Incentive survey from eswiki (T318331)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet |
[production] |
13:21 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:841895|Remove Research Incentive survey from eswiki (T318331)]] |
[production] |
13:21 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:829563|Move wmgSiteLogoWordmark and wmgSiteLogoTagline to logos.php (T307705)]] (duration: 07m 06s) |
[production] |
13:18 |
<jmm@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
13:14 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.decommission for hosts d-i-test.eqiad.wmnet |
[production] |
13:14 |
<urbanecm@deploy1002> |
urbanecm and stang: Backport for [[gerrit:829563|Move wmgSiteLogoWordmark and wmgSiteLogoTagline to logos.php (T307705)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
13:14 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:829563|Move wmgSiteLogoWordmark and wmgSiteLogoTagline to logos.php (T307705)]] |
[production] |
13:13 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:841854|Enable show nearby feature on a small group of wikis (T316782)]] (duration: 07m 03s) |
[production] |
13:12 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2157', diff saved to https://phabricator.wikimedia.org/P35430 and previous config saved to /var/cache/conftool/dbconfig/20221012-131232-ladsgroup.json |
[production] |
13:09 |
<moritzm> |
draining ganeti1007 T320419 |
[production] |
13:06 |
<urbanecm@deploy1002> |
urbanecm and wmde-fisch: Backport for [[gerrit:841854|Enable show nearby feature on a small group of wikis (T316782)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet |
[production] |
13:06 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:841854|Enable show nearby feature on a small group of wikis (T316782)]] |
[production] |
13:05 |
<urbanecm@deploy1002> |
backport aborted: (duration: 00m 09s) |
[production] |
13:04 |
<urbanecm@deploy1002> |
Backport cancelled. |
[production] |
12:57 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2157 (T318955)', diff saved to https://phabricator.wikimedia.org/P35429 and previous config saved to /var/cache/conftool/dbconfig/20221012-125725-ladsgroup.json |
[production] |
12:32 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2157 (T318955)', diff saved to https://phabricator.wikimedia.org/P35428 and previous config saved to /var/cache/conftool/dbconfig/20221012-123223-ladsgroup.json |
[production] |
12:32 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2157.codfw.wmnet with reason: Maintenance |
[production] |
12:32 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2157.codfw.wmnet with reason: Maintenance |
[production] |
12:32 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2137:3315 (T318955)', diff saved to https://phabricator.wikimedia.org/P35427 and previous config saved to /var/cache/conftool/dbconfig/20221012-123201-ladsgroup.json |
[production] |
12:28 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1002.eqiad.wmnet with reason: host reimage |
[production] |
12:25 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1002.eqiad.wmnet with reason: host reimage |
[production] |