2023-11-15
§
|
09:09 |
<jmm@cumin2002> |
START - Cookbook sre.cassandra.roll-reboot rolling reboot on A:cassandra-dev |
[production] |
08:37 |
<moritzm> |
rolling restart of Cassandra in cassandra-dev following migration to Puppet 7 |
[production] |
08:27 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: cassandra_dev |
[production] |
08:02 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-role for role: cassandra_dev |
[production] |
08:01 |
<marostegui@deploy2002> |
Finished scap: Backport for [[gerrit:974232|Revert "Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master"""]] (duration: 06m 54s) |
[production] |
08:00 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'depool db1127', diff saved to https://phabricator.wikimedia.org/P53483 and previous config saved to /var/cache/conftool/dbconfig/20231115-080033-arnaudb.json |
[production] |
07:55 |
<marostegui@deploy2002> |
marostegui: Continuing with sync |
[production] |
07:55 |
<marostegui@deploy2002> |
marostegui: Backport for [[gerrit:974232|Revert "Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master"""]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:54 |
<marostegui@deploy2002> |
Started scap: Backport for [[gerrit:974232|Revert "Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master"""]] |
[production] |
07:51 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host pc2013.codfw.wmnet with OS bookworm |
[production] |
07:47 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: pybaltest |
[production] |
07:37 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc2013.codfw.wmnet with reason: host reimage |
[production] |
07:35 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-role for role: pybaltest |
[production] |
07:34 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.puppet.migrate-role (exit_code=99) for role: mariadb::misc::analytics::backup |
[production] |
07:34 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on pc2013.codfw.wmnet with reason: host reimage |
[production] |
07:17 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.reimage for host pc2013.codfw.wmnet with OS bookworm |
[production] |
07:16 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc[2013-2014].codfw.wmnet,pc[1013-1014].eqiad.wmnet with reason: Reimage |
[production] |
07:16 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on pc[2013-2014].codfw.wmnet,pc[1013-1014].eqiad.wmnet with reason: Reimage |
[production] |
07:15 |
<marostegui@deploy2002> |
Finished scap: Backport for [[gerrit:974230|Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master""]] (duration: 06m 53s) |
[production] |
07:10 |
<marostegui@deploy2002> |
marostegui: Continuing with sync |
[production] |
07:10 |
<marostegui@deploy2002> |
marostegui: Backport for [[gerrit:974230|Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master""]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:10 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 40934 |
[production] |
07:08 |
<marostegui@deploy2002> |
Started scap: Backport for [[gerrit:974230|Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master""]] |
[production] |
07:07 |
<ayounsi@cumin1001> |
START - Cookbook sre.network.peering with action 'configure' for AS: 40934 |
[production] |
07:06 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 983 |
[production] |
07:05 |
<ayounsi@cumin1001> |
START - Cookbook sre.network.peering with action 'configure' for AS: 983 |
[production] |
01:22 |
<eevans@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
01:11 |
<eevans@cumin1001> |
START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
00:22 |
<eevans@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
00:05 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2182 (T348183)', diff saved to https://phabricator.wikimedia.org/P53482 and previous config saved to /var/cache/conftool/dbconfig/20231115-000545-arnaudb.json |
[production] |
2023-11-14
§
|
23:50 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P53481 and previous config saved to /var/cache/conftool/dbconfig/20231114-235039-arnaudb.json |
[production] |
23:37 |
<eevans@cumin1001> |
START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
23:35 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P53480 and previous config saved to /var/cache/conftool/dbconfig/20231114-233532-arnaudb.json |
[production] |
23:26 |
<eevans@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
23:20 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2182 (T348183)', diff saved to https://phabricator.wikimedia.org/P53479 and previous config saved to /var/cache/conftool/dbconfig/20231114-232026-arnaudb.json |
[production] |
22:52 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Depooling db2182 (T348183)', diff saved to https://phabricator.wikimedia.org/P53478 and previous config saved to /var/cache/conftool/dbconfig/20231114-225258-arnaudb.json |
[production] |
22:52 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2182.codfw.wmnet with reason: Maintenance |
[production] |
22:52 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2182.codfw.wmnet with reason: Maintenance |
[production] |
22:52 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2169:3317 (T348183)', diff saved to https://phabricator.wikimedia.org/P53477 and previous config saved to /var/cache/conftool/dbconfig/20231114-225236-arnaudb.json |
[production] |
22:37 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2169:3317', diff saved to https://phabricator.wikimedia.org/P53476 and previous config saved to /var/cache/conftool/dbconfig/20231114-223730-arnaudb.json |
[production] |
22:33 |
<eevans@cumin1001> |
START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
22:22 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2169:3317', diff saved to https://phabricator.wikimedia.org/P53474 and previous config saved to /var/cache/conftool/dbconfig/20231114-222224-arnaudb.json |
[production] |
22:19 |
<eevans@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
22:07 |
<eevans@cumin1001> |
START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
22:07 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2169:3317 (T348183)', diff saved to https://phabricator.wikimedia.org/P53473 and previous config saved to /var/cache/conftool/dbconfig/20231114-220717-arnaudb.json |
[production] |
22:05 |
<andrew@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudvirt1046.eqiad.wmnet with OS bookworm |
[production] |
22:02 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Depooling db2169:3317 (T348183)', diff saved to https://phabricator.wikimedia.org/P53472 and previous config saved to /var/cache/conftool/dbconfig/20231114-220241-arnaudb.json |
[production] |
22:02 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2169.codfw.wmnet with reason: Maintenance |
[production] |
22:02 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2169.codfw.wmnet with reason: Maintenance |
[production] |
22:02 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2168:3317 (T348183)', diff saved to https://phabricator.wikimedia.org/P53471 and previous config saved to /var/cache/conftool/dbconfig/20231114-220220-arnaudb.json |
[production] |