401-450 of 10000 results (22ms)
2025-07-09 §
08:18 <slyngs> Deploying Netbox v4.0.11 to production T397300 [production]
08:17 <slyngshede@cumin1002> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox [production]
08:17 <slyngshede@cumin1002> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox [production]
08:09 <aklapper@deploy1003> Finished scap sync-world: Backport for [[gerrit:1167296|Remove stdClass type hint from ApiFeedContributions::feedItem() for now (T398925)]] (duration: 08m 21s) [production]
08:04 <aklapper@deploy1003> zabe, aklapper: Continuing with sync [production]
08:03 <aklapper@deploy1003> zabe, aklapper: Backport for [[gerrit:1167296|Remove stdClass type hint from ApiFeedContributions::feedItem() for now (T398925)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:01 <aklapper@deploy1003> Started scap sync-world: Backport for [[gerrit:1167296|Remove stdClass type hint from ApiFeedContributions::feedItem() for now (T398925)]] [production]
07:58 <jelto@cumin1003> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Upgrade Replica to GitLab 18.0 [production]
07:58 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-ml: apply [production]
07:58 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-ml: apply [production]
07:50 <fceratto@cumin1002> END (PASS) - Cookbook sre.mysql.parsercache (exit_code=0) [production]
07:50 <fceratto@cumin1002> START - Cookbook sre.mysql.parsercache [production]
07:42 <fceratto@cumin1002> END (FAIL) - Cookbook sre.mysql.parsercache (exit_code=99) [production]
07:42 <fceratto@cumin1002> START - Cookbook sre.mysql.parsercache [production]
07:42 <fceratto@cumin1002> END (FAIL) - Cookbook sre.mysql.parsercache (exit_code=99) [production]
07:42 <fceratto@cumin1002> START - Cookbook sre.mysql.parsercache [production]
07:39 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1036.eqiad.wmnet with reason: Maintenance [production]
07:34 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool es1036', diff saved to https://phabricator.wikimedia.org/P78817 and previous config saved to /var/cache/conftool/dbconfig/20250709-073458-marostegui.json [production]
07:32 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
07:32 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
07:31 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
07:31 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
07:31 <kartik@deploy1003> Finished scap sync-world: Backport for [[gerrit:1152065|CX: Add virtual-cx-shared DatabaseVirtualDomains (T348513)]] (duration: 25m 21s) [production]
07:31 <moritzm> installing nginx security updates [production]
07:26 <kartik@deploy1003> kartik, abi: Continuing with sync [production]
07:23 <elukey> upload python3-docker-report 0.0.16 to bookworm-wikimedia [production]
07:23 <elukey> upload python3-docker-report to bookworm-wikimedia [production]
07:20 <elukey@deploy1003> helmfile [staging] DONE helmfile.d/services/machinetranslation: sync [production]
07:08 <kartik@deploy1003> kartik, abi: Backport for [[gerrit:1152065|CX: Add virtual-cx-shared DatabaseVirtualDomains (T348513)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
07:05 <kartik@deploy1003> Started scap sync-world: Backport for [[gerrit:1152065|CX: Add virtual-cx-shared DatabaseVirtualDomains (T348513)]] [production]
07:05 <elukey@deploy1003> helmfile [staging] START helmfile.d/services/machinetranslation: sync [production]
06:47 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2160,2232].codfw.wmnet,db[1207,1217].eqiad.wmnet with reason: migration to mariadb 10.11 [production]
06:36 <kartik@deploy1003> helmfile [staging] DONE helmfile.d/services/machinetranslation: apply [production]
06:29 <marostegui> Failover m3 from db1213 to db1250 - T398818 [production]
06:21 <kartik@deploy1003> helmfile [staging] START helmfile.d/services/machinetranslation: apply [production]
06:20 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2160,2234].codfw.wmnet,db[1213,1217,1250].eqiad.wmnet with reason: m3 master switchover T398818 [production]
06:14 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2160,2234].codfw.wmnet,db[1213,1217,1250].eqiad.wmnet with reason: m3 master switchover T398818 [production]
06:13 <kartik@deploy1003> helmfile [staging] DONE helmfile.d/services/machinetranslation: apply [production]
05:58 <kartik@deploy1003> helmfile [staging] START helmfile.d/services/machinetranslation: apply [production]
04:23 <cjming@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
04:23 <cjming@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
01:26 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=0) on host 'cloudvirt1073.eqiad.wmnet' (T394333) [admin]
01:12 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1073.eqiad.wmnet' (T394333) [admin]
2025-07-08 §
23:58 <zabe@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-experimental: apply [production]
23:58 <zabe@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-experimental: apply [production]
23:43 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es1048.eqiad.wmnet with OS bookworm [production]
23:43 <vriley@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1002" [production]
23:43 <vriley@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1002" [production]
23:34 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti1053.eqiad.wmnet with OS bookworm [production]
23:19 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es1048.eqiad.wmnet with reason: host reimage [production]