|
2023-11-15
ยง
|
| 12:33 |
<cmooney@cumin1001> |
START - Cookbook sre.hosts.reimage for host sretest2003.codfw.wmnet with OS bullseye |
[production] |
| 11:57 |
<stevemunene@deploy2002> |
Finished deploy [airflow-dags/wmde@91810bc]: (no justification provided) (duration: 00m 10s) |
[production] |
| 11:56 |
<stevemunene@deploy2002> |
Started deploy [airflow-dags/wmde@91810bc]: (no justification provided) |
[production] |
| 11:52 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: insetup::unowned |
[production] |
| 11:48 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-role for role: insetup::unowned |
[production] |
| 11:25 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host thanos-fe2001.codfw.wmnet |
[production] |
| 11:24 |
<taavi> |
update cr*-{codfw,eqiad} firewall policy via homer to update cloudcontrol1006 addressing |
[production] |
| 11:24 |
<btullis@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/datahub: sync on main |
[production] |
| 11:21 |
<btullis@deploy2002> |
helmfile [eqiad] START helmfile.d/services/datahub: apply on main |
[production] |
| 11:20 |
<btullis@cumin1001> |
END (ERROR) - Cookbook sre.druid.roll-restart-workers (exit_code=97) for Druid analytics cluster: Roll restart of Druid jvm daemons. |
[production] |
| 11:18 |
<btullis@cumin1001> |
START - Cookbook sre.druid.roll-restart-workers for Druid analytics cluster: Roll restart of Druid jvm daemons. |
[production] |
| 11:17 |
<btullis@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/datahub: sync on main |
[production] |
| 11:15 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-host for host thanos-fe2001.codfw.wmnet |
[production] |
| 11:14 |
<btullis@deploy2002> |
helmfile [codfw] START helmfile.d/services/datahub: apply on main |
[production] |
| 10:46 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: miscweb |
[production] |
| 10:44 |
<tchanders@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/ipoid: apply |
[production] |
| 10:42 |
<tchanders@deploy2002> |
helmfile [eqiad] START helmfile.d/services/ipoid: apply |
[production] |
| 10:41 |
<tchanders@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/ipoid: apply |
[production] |
| 10:40 |
<tchanders@deploy2002> |
helmfile [eqiad] START helmfile.d/services/ipoid: apply |
[production] |
| 10:39 |
<oblivian@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/mobileapps: sync |
[production] |
| 10:39 |
<oblivian@deploy2002> |
helmfile [codfw] START helmfile.d/services/mobileapps: sync |
[production] |
| 10:39 |
<oblivian@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/mobileapps: sync |
[production] |
| 10:39 |
<oblivian@deploy2002> |
helmfile [eqiad] START helmfile.d/services/mobileapps: sync |
[production] |
| 10:39 |
<_joe_> |
roll restart of mobileapps in codfw and eqiad |
[production] |
| 10:34 |
<oblivian@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply |
[production] |
| 10:31 |
<oblivian@deploy2002> |
helmfile [codfw] START helmfile.d/services/mw-api-int: apply |
[production] |
| 10:31 |
<oblivian@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply |
[production] |
| 10:30 |
<oblivian@deploy2002> |
helmfile [eqiad] START helmfile.d/services/mw-api-int: apply |
[production] |
| 10:22 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-role for role: miscweb |
[production] |
| 09:42 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.cassandra.roll-reboot (exit_code=0) rolling reboot on A:cassandra-dev |
[production] |
| 09:37 |
<moritzm> |
imported php-igbinary 3.2.1+2.0.8-2+wmf1+bullseye1 to component/php74 for bullseye-wikimedia |
[production] |
| 09:26 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: insetup_noferm |
[production] |
| 09:19 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-role for role: insetup_noferm |
[production] |
| 09:09 |
<jmm@cumin2002> |
START - Cookbook sre.cassandra.roll-reboot rolling reboot on A:cassandra-dev |
[production] |
| 08:37 |
<moritzm> |
rolling restart of Cassandra in cassandra-dev following migration to Puppet 7 |
[production] |
| 08:27 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: cassandra_dev |
[production] |
| 08:02 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-role for role: cassandra_dev |
[production] |
| 08:01 |
<marostegui@deploy2002> |
Finished scap: Backport for [[gerrit:974232|Revert "Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master"""]] (duration: 06m 54s) |
[production] |
| 08:00 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'depool db1127', diff saved to https://phabricator.wikimedia.org/P53483 and previous config saved to /var/cache/conftool/dbconfig/20231115-080033-arnaudb.json |
[production] |
| 07:55 |
<marostegui@deploy2002> |
marostegui: Continuing with sync |
[production] |
| 07:55 |
<marostegui@deploy2002> |
marostegui: Backport for [[gerrit:974232|Revert "Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master"""]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
| 07:54 |
<marostegui@deploy2002> |
Started scap: Backport for [[gerrit:974232|Revert "Revert "Revert "ProductionServices.php: Promote pc2014 to pc3 master"""]] |
[production] |
| 07:51 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host pc2013.codfw.wmnet with OS bookworm |
[production] |
| 07:47 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: pybaltest |
[production] |
| 07:37 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc2013.codfw.wmnet with reason: host reimage |
[production] |
| 07:35 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-role for role: pybaltest |
[production] |
| 07:34 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.puppet.migrate-role (exit_code=99) for role: mariadb::misc::analytics::backup |
[production] |
| 07:34 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on pc2013.codfw.wmnet with reason: host reimage |
[production] |
| 07:17 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.reimage for host pc2013.codfw.wmnet with OS bookworm |
[production] |
| 07:16 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc[2013-2014].codfw.wmnet,pc[1013-1014].eqiad.wmnet with reason: Reimage |
[production] |