251-300 of 10000 results (27ms)
2025-03-13 ยง
14:54 <moritzm> restarting FPM on Phabricator to pick up gnutls security updates [production]
14:54 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/api-gateway: sync [production]
14:50 <lucaswerkmeister-wmde@deploy2002> tgr, lucaswerkmeister-wmde: Continuing with sync [production]
14:50 <moritzm> restarting slapd on serpens/seaborgium to pick up gnutls updates [production]
14:50 <lucaswerkmeister-wmde@deploy2002> tgr, lucaswerkmeister-wmde: Backport for [[gerrit:1127462|Enable SUL3 signup for everyone (T384218)]], [[gerrit:1127505|Set $wgSul3RolloutUserPercentage on some testwikis (T384153)]], [[gerrit:1127516|Reapply "Make WikibaseQualityConstraints use split-graph query service" (T374021)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:48 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudelastic1012.eqiad.wmnet with reason: host reimage [production]
14:47 <lucaswerkmeister-wmde@deploy2002> Started scap sync-world: Backport for [[gerrit:1127462|Enable SUL3 signup for everyone (T384218)]], [[gerrit:1127505|Set $wgSul3RolloutUserPercentage on some testwikis (T384153)]], [[gerrit:1127516|Reapply "Make WikibaseQualityConstraints use split-graph query service" (T374021)]] [production]
14:45 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudelastic1012.eqiad.wmnet with reason: host reimage [production]
14:44 <lucaswerkmeister-wmde@deploy2002> Finished scap sync-world: Backport for [[gerrit:1127208|Follow-up Ia4b9f65b6: Fix argument order passed to EditCheckFactory#create (T388722)]] (duration: 11m 31s) [production]
14:37 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde, kemayo: Continuing with sync [production]
14:35 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde, kemayo: Backport for [[gerrit:1127208|Follow-up Ia4b9f65b6: Fix argument order passed to EditCheckFactory#create (T388722)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:35 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host cloudelastic1012.eqiad.wmnet with OS bullseye [production]
14:35 <jmm@cumin2002> END (PASS) - Cookbook sre.o11y.roll-restart-reboot-logstash-collectors (exit_code=0) rolling restart_daemons on A:logstash-collector [production]
14:33 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
14:33 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
14:32 <lucaswerkmeister-wmde@deploy2002> Started scap sync-world: Backport for [[gerrit:1127208|Follow-up Ia4b9f65b6: Fix argument order passed to EditCheckFactory#create (T388722)]] [production]
14:32 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-main: apply [production]
14:32 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-main: apply [production]
14:31 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:31 <bking@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudelastic1012.eqiad.wmnet with OS bullseye [production]
14:31 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:30 <jiji@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply [production]
14:30 <jiji@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply [production]
14:27 <jmm@cumin2002> START - Cookbook sre.o11y.roll-restart-reboot-logstash-collectors rolling restart_daemons on A:logstash-collector [production]
14:26 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host ms-be2075.codfw.wmnet with OS bullseye [production]
14:26 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component components-api [toolsbeta]
14:26 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component components-api [toolsbeta]
14:24 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be2075.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
14:19 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host ms-be2075.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
14:12 <moritzm> installing gnutls security updates [production]
14:06 <jiji@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-parsoid: apply [production]
14:05 <effie> restarting parsoid on codfw [production]
14:04 <jiji@deploy2002> helmfile [codfw] START helmfile.d/services/mw-parsoid: apply [production]
14:01 <kcvelaga@deploy2002> Finished deploy [airflow-dags/analytics_product@554407c]: T362615 (duration: 01m 39s) [production]
14:00 <kcvelaga@deploy2002> Started deploy [airflow-dags/analytics_product@554407c]: T362615 [production]
13:50 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host cloudelastic1012.eqiad.wmnet with OS bullseye [production]
13:48 <stevemunene> restart the hadoop-hdfs-namenode service on an-master1004 to pick up the new hosts as well T388512 [analytics]
13:46 <bking@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudelastic1012.eqiad.wmnet with OS bullseye [production]
13:45 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host cloudelastic1012.eqiad.wmnet with OS bullseye [production]
13:45 <stevemunene> fail over the hadoop namenode services from an-master1004 to an-master1003 [analytics]
13:44 <bking@cumin2002> conftool action : set/pooled=no; selector: service=cloudelastic,name=cloudelastic1012.eqiad.wmnet [production]
13:22 <isaranto@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revision-models' for release 'main' . [production]
13:22 <isaranto@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revision-models' for release 'main' . [production]
13:21 <isaranto@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revision-models' for release 'main' . [production]
12:51 <ladsgroup@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
12:50 <ladsgroup@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply [production]
12:50 <ladsgroup@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
12:49 <ladsgroup@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply [production]
12:49 <ladsgroup@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
12:49 <ladsgroup@deploy2002> helmfile [staging] START helmfile.d/services/changeprop-jobqueue: apply [production]