451-500 of 10000 results (15ms)
2025-09-04 ยง
16:42 <rzl@deploy1003> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
16:42 <rzl@deploy1003> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
16:39 <btullis> upgrading and restarting envoyproxy on cephosd100[2-5] for T402584 [production]
16:39 <swfrench-wmf> started single-replica PHP 8.3 pilot on shellbox-syntaxhighlight in codfw - T403284 [production]
16:38 <swfrench@deploy1003> helmfile [codfw] DONE helmfile.d/services/shellbox-syntaxhighlight: apply [production]
16:37 <swfrench@deploy1003> helmfile [codfw] START helmfile.d/services/shellbox-syntaxhighlight: apply [production]
16:37 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2216 (T401906)', diff saved to https://phabricator.wikimedia.org/P82582 and previous config saved to /var/cache/conftool/dbconfig/20250904-163727-fceratto.json [production]
16:36 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=99) (T395910) [admin]
16:35 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2216 (T401906)', diff saved to https://phabricator.wikimedia.org/P82581 and previous config saved to /var/cache/conftool/dbconfig/20250904-163517-fceratto.json [production]
16:35 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2216.codfw.wmnet with reason: Maintenance [production]
16:35 <btullis> upgrading and restarting envoyproxy on cephosd1001 for T402584 [production]
16:33 <rzl@deploy1003> helmfile [staging] DONE helmfile.d/services/api-gateway: apply [production]
16:33 <swfrench@deploy1003> helmfile [staging] DONE helmfile.d/services/shellbox-syntaxhighlight: apply [production]
16:33 <rzl@deploy1003> helmfile [staging] START helmfile.d/services/api-gateway: apply [production]
16:33 <swfrench@deploy1003> helmfile [staging] START helmfile.d/services/shellbox-syntaxhighlight: apply [production]
16:32 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.bootstrap_and_add (T395910) [admin]
15:50 <jhancock@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:49 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2203 (T401906)', diff saved to https://phabricator.wikimedia.org/P82580 and previous config saved to /var/cache/conftool/dbconfig/20250904-154934-fceratto.json [production]
15:48 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2203 (T401906)', diff saved to https://phabricator.wikimedia.org/P82579 and previous config saved to /var/cache/conftool/dbconfig/20250904-154824-fceratto.json [production]
15:48 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2203.codfw.wmnet with reason: Maintenance [production]
15:48 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2202.codfw.wmnet with reason: Maintenance [production]
15:47 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2188 (T401906)', diff saved to https://phabricator.wikimedia.org/P82578 and previous config saved to /var/cache/conftool/dbconfig/20250904-154744-fceratto.json [production]
15:32 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2188', diff saved to https://phabricator.wikimedia.org/P82577 and previous config saved to /var/cache/conftool/dbconfig/20250904-153236-fceratto.json [production]
15:31 <jhancock@cumin1002> START - Cookbook sre.hosts.provision for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:27 <jhancock@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:25 <tappof> migration from prometheus3003.esams to prometheus3004 has been completed T403620 [production]
15:22 <wmbot~lucaswerkmeister@tools-bastion-13> deployed c35d575859 (l10n updates: nb) [tools.lexeme-forms]
15:22 <jhancock@cumin1002> START - Cookbook sre.hosts.provision for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:22 <moritzm> upgrade Envoyproxy on cloudweb servers T402584 [production]
15:22 <jhancock@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:20 <jhancock@cumin1002> START - Cookbook sre.hosts.provision for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:18 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:17 <moritzm> installing apache2 security updates [production]
15:17 <wmbot~lucaswerkmeister@tools-bastion-13> deployed 3226a38be4 (l10n updates: ps) [tools.ranker]
15:17 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2188', diff saved to https://phabricator.wikimedia.org/P82576 and previous config saved to /var/cache/conftool/dbconfig/20250904-151729-fceratto.json [production]
15:16 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:13 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-worker1236.eqiad.wmnet with OS bullseye [production]
15:12 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db2155 (T402925)', diff saved to https://phabricator.wikimedia.org/P82575 and previous config saved to /var/cache/conftool/dbconfig/20250904-151235-ladsgroup.json [production]
15:12 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2155.codfw.wmnet with reason: Maintenance [production]
15:12 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2147 (T402925)', diff saved to https://phabricator.wikimedia.org/P82574 and previous config saved to /var/cache/conftool/dbconfig/20250904-151223-ladsgroup.json [production]
15:11 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-worker1235.eqiad.wmnet with OS bullseye [production]
15:06 <tappof@dns1004> END - running authdns-update [production]
15:05 <tappof@dns1004> START - running authdns-update [production]
15:02 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2188 (T401906)', diff saved to https://phabricator.wikimedia.org/P82573 and previous config saved to /var/cache/conftool/dbconfig/20250904-150221-fceratto.json [production]
15:02 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:00 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:00 <jhancock@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:00 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2188 (T401906)', diff saved to https://phabricator.wikimedia.org/P82572 and previous config saved to /var/cache/conftool/dbconfig/20250904-150011-fceratto.json [production]
15:00 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2188.codfw.wmnet with reason: Maintenance [production]
14:59 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2176 (T401906)', diff saved to https://phabricator.wikimedia.org/P82571 and previous config saved to /var/cache/conftool/dbconfig/20250904-145948-fceratto.json [production]