301-350 of 10000 results (29ms)
2025-11-11 ยง
09:46 <cgoubert@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
09:46 <cgoubert@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
09:45 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
09:45 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
09:45 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
09:45 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
09:36 <aklapper@deploy2002> rebuilt and synchronized wikiversions files: group0 to 1.46.0-wmf.2 refs T408272 [production]
09:34 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2206 (T407997)', diff saved to https://phabricator.wikimedia.org/P85148 and previous config saved to /var/cache/conftool/dbconfig/20251111-093410-marostegui.json [production]
09:31 <cgoubert@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
09:31 <cgoubert@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
09:07 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P85145 and previous config saved to /var/cache/conftool/dbconfig/20251111-090712-marostegui.json [production]
08:52 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P85144 and previous config saved to /var/cache/conftool/dbconfig/20251111-085204-marostegui.json [production]
08:46 <ryankemper@cumin1002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REBOOT (2 nodes at a time) for ElasticSearch cluster search_codfw: codfw cluster reboot (apply updates) - ryankemper@cumin1002 - T390860 [production]
08:37 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es2028.codfw.wmnet with OS trixie [production]
08:36 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2172 (T407997)', diff saved to https://phabricator.wikimedia.org/P85143 and previous config saved to /var/cache/conftool/dbconfig/20251111-083657-marostegui.json [production]
08:29 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2172 (T407997)', diff saved to https://phabricator.wikimedia.org/P85142 and previous config saved to /var/cache/conftool/dbconfig/20251111-082950-marostegui.json [production]
08:29 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2172.codfw.wmnet with reason: Maintenance [production]
08:29 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2155 (T407997)', diff saved to https://phabricator.wikimedia.org/P85141 and previous config saved to /var/cache/conftool/dbconfig/20251111-082927-marostegui.json [production]
08:14 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2155', diff saved to https://phabricator.wikimedia.org/P85140 and previous config saved to /var/cache/conftool/dbconfig/20251111-081419-marostegui.json [production]
08:14 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es2028.codfw.wmnet with reason: host reimage [production]
08:13 <moritzm> installing intel-microcode security updates [production]
08:08 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on es2028.codfw.wmnet with reason: host reimage [production]
07:59 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2155', diff saved to https://phabricator.wikimedia.org/P85139 and previous config saved to /var/cache/conftool/dbconfig/20251111-075911-marostegui.json [production]
07:52 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host es2028.codfw.wmnet with OS trixie [production]
07:52 <jmm@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host es2028.codfw.wmnet with OS trixie [production]
07:44 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2155 (T407997)', diff saved to https://phabricator.wikimedia.org/P85138 and previous config saved to /var/cache/conftool/dbconfig/20251111-074404-marostegui.json [production]
07:36 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2155 (T407997)', diff saved to https://phabricator.wikimedia.org/P85137 and previous config saved to /var/cache/conftool/dbconfig/20251111-073659-marostegui.json [production]
07:36 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2155.codfw.wmnet with reason: Maintenance [production]
07:36 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2147 (T407997)', diff saved to https://phabricator.wikimedia.org/P85136 and previous config saved to /var/cache/conftool/dbconfig/20251111-073635-marostegui.json [production]
07:21 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2147', diff saved to https://phabricator.wikimedia.org/P85135 and previous config saved to /var/cache/conftool/dbconfig/20251111-072127-marostegui.json [production]
07:15 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host es2028.codfw.wmnet with OS trixie [production]
07:06 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2147', diff saved to https://phabricator.wikimedia.org/P85134 and previous config saved to /var/cache/conftool/dbconfig/20251111-070620-marostegui.json [production]
06:56 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db1220 - Depool db1220.eqiad.wmnet to then clone it to db1264.eqiad.wmnet - marostegui@cumin1003 [production]
06:56 <marostegui@cumin1003> START - Cookbook sre.mysql.depool db1220 - Depool db1220.eqiad.wmnet to then clone it to db1264.eqiad.wmnet - marostegui@cumin1003 [production]
06:55 <marostegui@cumin1003> START - Cookbook sre.mysql.clone of db1220.eqiad.wmnet onto db1264.eqiad.wmnet [production]
06:55 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1220.eqiad.wmnet with reason: Cloning another host [production]
06:53 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1224.eqiad.wmnet with reason: Cloning another host [production]
06:51 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2147 (T407997)', diff saved to https://phabricator.wikimedia.org/P85132 and previous config saved to /var/cache/conftool/dbconfig/20251111-065112-marostegui.json [production]
06:42 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2147 (T407997)', diff saved to https://phabricator.wikimedia.org/P85131 and previous config saved to /var/cache/conftool/dbconfig/20251111-064257-marostegui.json [production]
06:42 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2147.codfw.wmnet with reason: Maintenance [production]
06:33 <marostegui> Deploy schema change on x1 codfw master with replication T409733 [production]
06:31 <kart_> apertium: staging: Update to 2025-11-10-034557-production (T408515) [production]
06:29 <kartik@deploy2002> helmfile [staging] DONE helmfile.d/services/apertium: apply [production]
06:29 <kartik@deploy2002> helmfile [staging] START helmfile.d/services/apertium: apply [production]
06:24 <marostegui> Deploy schema change on x1 codfw master with replication T409101 [production]
06:21 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2203.codfw.wmnet with reason: Maintenance [production]
05:02 <mwpresync@deploy2002> Pruned MediaWiki: 1.45.0-wmf.24 (duration: 02m 30s) [production]
04:49 <mwpresync@deploy2002> Finished scap sync-world: testwikis to 1.46.0-wmf.2 refs T408272 (duration: 46m 27s) [production]
04:03 <mwpresync@deploy2002> Started scap sync-world: testwikis to 1.46.0-wmf.2 refs T408272 [production]
01:19 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcontrol1008-dev.eqiad.wmnet with OS trixie [production]