1-50 of 10000 results (18ms)
2026-04-30 ยง
15:20 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1206 (T419961)', diff saved to https://phabricator.wikimedia.org/P92094 and previous config saved to /var/cache/conftool/dbconfig/20260430-152011-fceratto.json [production]
15:19 <mutante> upgrading zuul to 14.2.0-1 on "new zuul" machines (T424879) [releng]
15:16 <bearloga@deploy1003> Finished scap sync-world: Backport for [[gerrit:1270454|EventStreamConfig: remove ABST contextual attribute (T422001)]] (duration: 07m 25s) [production]
15:11 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1206 (T419961)', diff saved to https://phabricator.wikimedia.org/P92093 and previous config saved to /var/cache/conftool/dbconfig/20260430-151157-fceratto.json [production]
15:11 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1206.eqiad.wmnet with reason: Maintenance [production]
15:11 <bearloga@deploy1003> bearloga: Continuing with deployment [production]
15:11 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1196 (T419961)', diff saved to https://phabricator.wikimedia.org/P92092 and previous config saved to /var/cache/conftool/dbconfig/20260430-151128-fceratto.json [production]
15:11 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-logging2005.codfw.wmnet with OS trixie [production]
15:10 <bearloga@deploy1003> bearloga: Backport for [[gerrit:1270454|EventStreamConfig: remove ABST contextual attribute (T422001)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
15:08 <bearloga@deploy1003> Started scap sync-world: Backport for [[gerrit:1270454|EventStreamConfig: remove ABST contextual attribute (T422001)]] [production]
15:06 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
15:06 <cscott@deploy1003> Finished scap sync-world: Backport for [[gerrit:1279453|Increase Parsoid Read Views to 60% of enwiki mobile web traffic (T424880)]] (duration: 13m 15s) [production]
15:05 <eevans@deploy1003> helmfile [staging] DONE helmfile.d/services/linked-artifacts: apply [production]
15:05 <eevans@deploy1003> helmfile [staging] START helmfile.d/services/linked-artifacts: apply [production]
15:04 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
15:03 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
15:02 <cscott@deploy1003> cscott: Continuing with deployment [production]
15:01 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1196', diff saved to https://phabricator.wikimedia.org/P92091 and previous config saved to /var/cache/conftool/dbconfig/20260430-150120-fceratto.json [production]
14:59 <dcausse@deploy1003> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
14:59 <dcausse@deploy1003> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
14:55 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1378.eqiad.wmnet with OS trixie [production]
14:55 <jclark@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
14:54 <cscott@deploy1003> cscott: Backport for [[gerrit:1279453|Increase Parsoid Read Views to 60% of enwiki mobile web traffic (T424880)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:54 <jclark@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
14:53 <akhatun@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-feature-counts-change-enrich: apply [production]
14:53 <cscott@deploy1003> Started scap sync-world: Backport for [[gerrit:1279453|Increase Parsoid Read Views to 60% of enwiki mobile web traffic (T424880)]] [production]
14:51 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1196', diff saved to https://phabricator.wikimedia.org/P92089 and previous config saved to /var/cache/conftool/dbconfig/20260430-145112-fceratto.json [production]
14:47 <akhatun@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-feature-counts-change-enrich: apply [production]
14:45 <moritzm> installing pdns security updates [production]
14:42 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-logging1007.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:41 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host kafka-logging1007.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:41 <gkyziridis@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
14:41 <gkyziridis@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
14:41 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1196 (T419961)', diff saved to https://phabricator.wikimedia.org/P92088 and previous config saved to /var/cache/conftool/dbconfig/20260430-144105-fceratto.json [production]
14:37 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1378.eqiad.wmnet with reason: host reimage [production]
14:36 <dancy@deploy1003> Installation of scap version "4.255.0" completed for 2 hosts [production]
14:34 <jclark@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1378.eqiad.wmnet with reason: host reimage [production]
14:34 <dancy@deploy1003> Installing scap version "4.255.0" for 2 host(s) [production]
14:33 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-logging2005.codfw.wmnet with reason: host reimage [production]
14:33 <dancy@deploy1003> Installation of scap version "4.252.0" completed for 2 hosts [production]
14:31 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1196 (T419961)', diff saved to https://phabricator.wikimedia.org/P92087 and previous config saved to /var/cache/conftool/dbconfig/20260430-143143-fceratto.json [production]
14:31 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1013,1017].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
14:31 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1196.eqiad.wmnet with reason: Maintenance [production]
14:31 <dancy@deploy1003> Installing scap version "4.252.0" for 2 host(s) [production]
14:30 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1195 (T419961)', diff saved to https://phabricator.wikimedia.org/P92086 and previous config saved to /var/cache/conftool/dbconfig/20260430-143054-fceratto.json [production]
14:30 <urbanecm@deploy1003> Finished scap sync-world: Backport for [[gerrit:1280368|ReassignMentees: Add logging information (T418194)]], [[gerrit:1280370|ReassignMentees: Add logging information (T418194)]] (duration: 131m 31s) [production]
14:27 <herron@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-logging2005.codfw.wmnet with reason: host reimage [production]
14:26 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1230 (T419635)', diff saved to Unable to send diff to phaste and previous config saved to /var/cache/conftool/dbconfig/20260430-142639-fceratto.json [production]
14:26 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1230.eqiad.wmnet with reason: Maintenance [production]
14:25 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-logging1007.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]