2025-05-27
16:48 |
<jasmine@cumin1002> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker[1022-1025].eqiad.wmnet |
[production] |
16:47 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1210 (T395241)', diff saved to https://phabricator.wikimedia.org/P76526 and previous config saved to /var/cache/conftool/dbconfig/20250527-164757-fceratto.json |
[production] |
16:43 |
<jasmine@cumin1002> |
START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker[1022-1025].eqiad.wmnet |
[production] |
16:43 |
<dwisehaupt> |
stopping process-control and coworker on civi1002 and frdev1002 for updates and reboots. |
[production] |
16:42 |
<sfaci@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply |
[production] |
16:42 |
<sfaci@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply |
[production] |
16:41 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1210 (T395241)', diff saved to https://phabricator.wikimedia.org/P76525 and previous config saved to /var/cache/conftool/dbconfig/20250527-164136-fceratto.json |
[production] |
16:41 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1210.eqiad.wmnet with reason: Maintenance |
[production] |
16:41 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1200 (T395241)', diff saved to https://phabricator.wikimedia.org/P76524 and previous config saved to /var/cache/conftool/dbconfig/20250527-164110-fceratto.json |
[production] |
16:36 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cirrussearch1103.eqiad.wmnet with OS bullseye |
[production] |
16:34 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2214', diff saved to https://phabricator.wikimedia.org/P76523 and previous config saved to /var/cache/conftool/dbconfig/20250527-163416-fceratto.json |
[production] |
16:26 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1200', diff saved to https://phabricator.wikimedia.org/P76522 and previous config saved to /var/cache/conftool/dbconfig/20250527-162602-fceratto.json |
[production] |
16:21 |
<klausman@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host ml-serve1007.eqiad.wmnet |
[production] |
16:21 |
<klausman@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node pool for host ml-serve1007.eqiad.wmnet |
[production] |
16:20 |
<dwisehaupt> |
payments back out of maintenance mode after update/reboot |
[production] |
16:19 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2214', diff saved to https://phabricator.wikimedia.org/P76521 and previous config saved to /var/cache/conftool/dbconfig/20250527-161909-fceratto.json |
[production] |
16:16 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cirrussearch1103.eqiad.wmnet with reason: host reimage |
[production] |
16:15 |
<dwisehaupt> |
payments into maintenance mode for kernel update/reboot of frqueue1003 |
[production] |
16:14 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db2186.codfw.wmnet |
[production] |
16:14 |
<klausman@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-serve1007.eqiad.wmnet with OS bookworm |
[production] |
16:12 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cirrussearch1103.eqiad.wmnet with reason: host reimage |
[production] |
16:10 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1200', diff saved to https://phabricator.wikimedia.org/P76520 and previous config saved to /var/cache/conftool/dbconfig/20250527-161055-fceratto.json |
[production] |
16:10 |
<jynus@cumin1002> |
DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 4:00:00 on backup2013.codfw.wmnet with reason: Downtime hosts for reboot |
[production] |
16:10 |
<jynus@cumin1002> |
DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 4:00:00 on backup[2001-2003].codfw.wmnet,backup1013.eqiad.wmnet with reason: Downtime hosts for reboot |
[production] |
16:05 |
<marostegui@cumin1002> |
START - Cookbook sre.mysql.upgrade for db2186.codfw.wmnet |
[production] |
16:04 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2214 (T395241)', diff saved to https://phabricator.wikimedia.org/P76519 and previous config saved to /var/cache/conftool/dbconfig/20250527-160401-fceratto.json |
[production] |
15:59 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db2191 gradually with 4 steps - Pool db2191.codfw.wmnet in after cloning |
[production] |
15:57 |
<klausman@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve1007.eqiad.wmnet with reason: host reimage |
[production] |
15:57 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2214 (T395241)', diff saved to https://phabricator.wikimedia.org/P76517 and previous config saved to /var/cache/conftool/dbconfig/20250527-155720-fceratto.json |
[production] |
15:57 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2214.codfw.wmnet with reason: Maintenance |
[production] |
15:56 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host cirrussearch1103 |
[production] |
15:56 |
<bking@cumin2002> |
START - Cookbook sre.hosts.move-vlan for host cirrussearch1103 |
[production] |
15:56 |
<bking@cumin2002> |
START - Cookbook sre.hosts.reimage for host cirrussearch1103.eqiad.wmnet with OS bullseye |
[production] |
15:56 |
<swfrench@deploy1003> |
Finished scap sync-world: Noop deployment to test scap 4.170.0 - T388761 (duration: 04m 03s) |
[production] |
15:55 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1200 (T395241)', diff saved to https://phabricator.wikimedia.org/P76516 and previous config saved to /var/cache/conftool/dbconfig/20250527-155546-fceratto.json |
[production] |
15:54 |
<klausman@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve1007.eqiad.wmnet with reason: host reimage |
[production] |
15:52 |
<swfrench@deploy1003> |
Started scap sync-world: Noop deployment to test scap 4.170.0 - T388761 |
[production] |
15:51 |
<jynus@cumin1002> |
DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 4:00:00 on backup[1001-1003].eqiad.wmnet with reason: Downtime hosts for reboot |
[production] |
15:51 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2197.codfw.wmnet with reason: Maintenance |
[production] |
15:51 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2193 (T395241)', diff saved to https://phabricator.wikimedia.org/P76515 and previous config saved to /var/cache/conftool/dbconfig/20250527-155125-fceratto.json |
[production] |
15:49 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1200 (T395241)', diff saved to https://phabricator.wikimedia.org/P76514 and previous config saved to /var/cache/conftool/dbconfig/20250527-154912-fceratto.json |
[production] |
15:49 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1200.eqiad.wmnet with reason: Maintenance |
[production] |
15:48 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1185 (T395241)', diff saved to https://phabricator.wikimedia.org/P76513 and previous config saved to /var/cache/conftool/dbconfig/20250527-154846-fceratto.json |
[production] |
15:47 |
<dancy@deploy1003> |
Installation of scap version "4.170.0" completed for 2 hosts |
[production] |
15:45 |
<dancy@deploy1003> |
Installing scap version "4.170.0" for 2 host(s) |
[production] |
15:45 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cirrussearch1110.eqiad.wmnet with OS bullseye |
[production] |
15:42 |
<jforrester@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1148951|[wikifunctions] Don't grant new generic-enum rights to Functioneers for now (T391913)]], [[gerrit:1148423|Wikifunctions: Enable Wikifunction client mode on the first five Wiktionaries (T390552)]] (duration: 10m 50s) |
[production] |
15:36 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2193', diff saved to https://phabricator.wikimedia.org/P76511 and previous config saved to /var/cache/conftool/dbconfig/20250527-153618-fceratto.json |
[production] |
15:35 |
<jforrester@deploy1003> |
jforrester: Continuing with sync |
[production] |
15:33 |
<jforrester@deploy1003> |
jforrester: Backport for [[gerrit:1148951|[wikifunctions] Don't grant new generic-enum rights to Functioneers for now (T391913)]], [[gerrit:1148423|Wikifunctions: Enable Wikifunction client mode on the first five Wiktionaries (T390552)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |