2023-11-15
08:37 <moritzm> rolling restart of Cassandra in cassandra-dev following migration to Puppet 7 [production]
08:27 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: cassandra_dev [production]
08:02 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: cassandra_dev [production]
08:01 <marostegui@deploy2002> Finished scap: Backport for [[gerrit:974232|Revert "Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master"""]] (duration: 06m 54s) [production]
08:00 <arnaudb@cumin1001> dbctl commit (dc=all): 'depool db1127', diff saved to https://phabricator.wikimedia.org/P53483 and previous config saved to /var/cache/conftool/dbconfig/20231115-080033-arnaudb.json [production]
07:55 <marostegui@deploy2002> marostegui: Continuing with sync [production]
07:55 <marostegui@deploy2002> marostegui: Backport for [[gerrit:974232|Revert "Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master"""]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:54 <marostegui@deploy2002> Started scap: Backport for [[gerrit:974232|Revert "Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master"""]] [production]
07:51 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host pc2013.codfw.wmnet with OS bookworm [production]
07:47 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: pybaltest [production]
07:37 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc2013.codfw.wmnet with reason: host reimage [production]
07:35 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: pybaltest [production]
07:34 <jmm@cumin2002> END (FAIL) - Cookbook sre.puppet.migrate-role (exit_code=99) for role: mariadb::misc::analytics::backup [production]
07:34 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on pc2013.codfw.wmnet with reason: host reimage [production]
07:17 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host pc2013.codfw.wmnet with OS bookworm [production]
07:16 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc[2013-2014].codfw.wmnet,pc[1013-1014].eqiad.wmnet with reason: Reimage [production]
07:16 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on pc[2013-2014].codfw.wmnet,pc[1013-1014].eqiad.wmnet with reason: Reimage [production]
07:15 <marostegui@deploy2002> Finished scap: Backport for [[gerrit:974230|Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master""]] (duration: 06m 53s) [production]
07:10 <marostegui@deploy2002> marostegui: Continuing with sync [production]
07:10 <marostegui@deploy2002> marostegui: Backport for [[gerrit:974230|Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master""]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:10 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 40934 [production]
07:08 <marostegui@deploy2002> Started scap: Backport for [[gerrit:974230|Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master""]] [production]
07:07 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'configure' for AS: 40934 [production]
07:06 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 983 [production]
07:05 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'configure' for AS: 983 [production]
01:22 <eevans@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host aqs1012.eqiad.wmnet with OS bullseye [production]
01:11 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye [production]
00:22 <eevans@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host aqs1012.eqiad.wmnet with OS bullseye [production]
00:05 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2182 (T348183)', diff saved to https://phabricator.wikimedia.org/P53482 and previous config saved to /var/cache/conftool/dbconfig/20231115-000545-arnaudb.json [production]
2023-11-14
23:50 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P53481 and previous config saved to /var/cache/conftool/dbconfig/20231114-235039-arnaudb.json [production]
23:37 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye [production]
23:35 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P53480 and previous config saved to /var/cache/conftool/dbconfig/20231114-233532-arnaudb.json [production]
23:26 <eevans@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host aqs1012.eqiad.wmnet with OS bullseye [production]
23:20 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2182 (T348183)', diff saved to https://phabricator.wikimedia.org/P53479 and previous config saved to /var/cache/conftool/dbconfig/20231114-232026-arnaudb.json [production]
22:52 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db2182 (T348183)', diff saved to https://phabricator.wikimedia.org/P53478 and previous config saved to /var/cache/conftool/dbconfig/20231114-225258-arnaudb.json [production]
22:52 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2182.codfw.wmnet with reason: Maintenance [production]
22:52 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2182.codfw.wmnet with reason: Maintenance [production]
22:52 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2169:3317 (T348183)', diff saved to https://phabricator.wikimedia.org/P53477 and previous config saved to /var/cache/conftool/dbconfig/20231114-225236-arnaudb.json [production]
22:37 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2169:3317', diff saved to https://phabricator.wikimedia.org/P53476 and previous config saved to /var/cache/conftool/dbconfig/20231114-223730-arnaudb.json [production]
22:33 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye [production]
22:22 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2169:3317', diff saved to https://phabricator.wikimedia.org/P53474 and previous config saved to /var/cache/conftool/dbconfig/20231114-222224-arnaudb.json [production]
22:19 <eevans@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host aqs1012.eqiad.wmnet with OS bullseye [production]
22:07 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye [production]
22:07 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2169:3317 (T348183)', diff saved to https://phabricator.wikimedia.org/P53473 and previous config saved to /var/cache/conftool/dbconfig/20231114-220717-arnaudb.json [production]
22:05 <andrew@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudvirt1046.eqiad.wmnet with OS bookworm [production]
22:02 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db2169:3317 (T348183)', diff saved to https://phabricator.wikimedia.org/P53472 and previous config saved to /var/cache/conftool/dbconfig/20231114-220241-arnaudb.json [production]
22:02 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2169.codfw.wmnet with reason: Maintenance [production]
22:02 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2169.codfw.wmnet with reason: Maintenance [production]
22:02 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2168:3317 (T348183)', diff saved to https://phabricator.wikimedia.org/P53471 and previous config saved to /var/cache/conftool/dbconfig/20231114-220220-arnaudb.json [production]
22:00 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1043.eqiad.wmnet with OS bookworm [production]