151-200 of 10000 results (94ms)
2026-03-09 ยง
15:30 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host prometheus4003.ulsfo.wmnet with OS bookworm [production]
15:26 <elukey@cumin1003> END (PASS) - Cookbook sre.kafka.change-confluent-distro-version (exit_code=0) Change Confluent distribution for Kafka A:kafka-test-eqiad cluster: Change Confluent distribution. [production]
15:24 <kevinbazira@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
15:18 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host people2004.codfw.wmnet [production]
15:14 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host people2004.codfw.wmnet [production]
15:12 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs2009.codfw.wmnet with reason: host reimage [production]
15:09 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti-jumbo1001.eqiad.wmnet [production]
15:08 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs2009.codfw.wmnet with reason: host reimage [production]
15:03 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti-jumbo1001.eqiad.wmnet [production]
14:50 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs2009.codfw.wmnet with OS bookworm [production]
14:49 <bking@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wdqs2009.codfw.wmnet with OS bullseye [production]
14:45 <elukey@cumin1003> START - Cookbook sre.kafka.change-confluent-distro-version Change Confluent distribution for Kafka A:kafka-test-eqiad cluster: Change Confluent distribution. [production]
14:35 <mszwarc@deploy2002> Finished scap sync-world: Backport for [[gerrit:1249291|Hide 2fa-warning Echo category from preferences (T419111)]] (duration: 06m 07s) [production]
14:35 <fceratto@cumin1003> START - Cookbook sre.mysql.sanitize-wiki Managing sanitization for wikis kaiwiki in section s5 [production]
14:34 <fceratto@cumin1003> END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Managing sanitization for wikis urwikisource in section s5 [production]
14:31 <mszwarc@deploy2002> mszwarc: Continuing with sync [production]
14:31 <mszwarc@deploy2002> mszwarc: Backport for [[gerrit:1249291|Hide 2fa-warning Echo category from preferences (T419111)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:30 <fceratto@cumin1003> START - Cookbook sre.mysql.sanitize-wiki Managing sanitization for wikis urwikisource in section s5 [production]
14:29 <mszwarc@deploy2002> Started scap sync-world: Backport for [[gerrit:1249291|Hide 2fa-warning Echo category from preferences (T419111)]] [production]
14:25 <fceratto@cumin1003> END (PASS) - Cookbook sre.mysql.sanitize-wiki (exit_code=0) Checking sanitization for wikis urwikisource in section s5 [production]
14:22 <fceratto@cumin1003> START - Cookbook sre.mysql.sanitize-wiki Checking sanitization for wikis urwikisource in section s5 [production]
14:20 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs2009.codfw.wmnet with reason: host reimage [production]
14:15 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs2009.codfw.wmnet with reason: host reimage [production]
14:15 <phuedx@deploy2002> Finished scap sync-world: Backport for [[gerrit:1249243|JS SDK: Add getExperimentByPrefix() (T419191)]], [[gerrit:1249242|ext.wikimediaEvents: pageVisit -> loggedOutReaderRetention (T419191)]] (duration: 09m 39s) [production]
14:11 <phuedx@deploy2002> phuedx: Continuing with sync [production]
14:07 <phuedx@deploy2002> phuedx: Backport for [[gerrit:1249243|JS SDK: Add getExperimentByPrefix() (T419191)]], [[gerrit:1249242|ext.wikimediaEvents: pageVisit -> loggedOutReaderRetention (T419191)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:05 <phuedx@deploy2002> Started scap sync-world: Backport for [[gerrit:1249243|JS SDK: Add getExperimentByPrefix() (T419191)]], [[gerrit:1249242|ext.wikimediaEvents: pageVisit -> loggedOutReaderRetention (T419191)]] [production]
14:03 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host prometheus4003.ulsfo.wmnet with OS bookworm [production]
13:55 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cumin2003.codfw.wmnet [production]
13:54 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs2009.codfw.wmnet with OS bullseye [production]
13:50 <phuedx@deploy2002> Finished scap sync-world: Backport for [[gerrit:1249262|Disable MetricsPlatform extension (T416865)]] (duration: 08m 02s) [production]
13:49 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host cumin2003.codfw.wmnet [production]
13:46 <phuedx@deploy2002> phuedx, sfaci: Continuing with sync [production]
13:44 <kevinbazira@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
13:43 <phuedx@deploy2002> phuedx, sfaci: Backport for [[gerrit:1249262|Disable MetricsPlatform extension (T416865)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:42 <phuedx@deploy2002> Started scap sync-world: Backport for [[gerrit:1249262|Disable MetricsPlatform extension (T416865)]] [production]
13:39 <phuedx@deploy2002> Finished scap sync-world: Backport for [[gerrit:1248075|Confirmemail: Log delay between email sent and confirmation (T415902)]], [[gerrit:1247651|Enable confirmemail logstash channel (T415902)]] (duration: 11m 16s) [production]
13:35 <phuedx@deploy2002> mmartorana, phuedx: Continuing with sync [production]
13:30 <phuedx@deploy2002> mmartorana, phuedx: Backport for [[gerrit:1248075|Confirmemail: Log delay between email sent and confirmation (T415902)]], [[gerrit:1247651|Enable confirmemail logstash channel (T415902)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:28 <phuedx@deploy2002> Started scap sync-world: Backport for [[gerrit:1248075|Confirmemail: Log delay between email sent and confirmation (T415902)]], [[gerrit:1247651|Enable confirmemail logstash channel (T415902)]] [production]
13:10 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host prometheus4003.ulsfo.wmnet with OS bookworm [production]
13:04 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host prometheus4003.ulsfo.wmnet with OS bookworm [production]
12:55 <moritzm> installing Kerberos security updates [production]
12:29 <moritzm> installing python3.9 security updates [production]
12:11 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host prometheus4003.ulsfo.wmnet with OS bookworm [production]
12:00 <reedy@deploy2002> Finished scap sync-world: Backport for [[gerrit:1239026|Revert "CommonSettings: Temporarily set $wgOATHUserHandlesTable = true" (T416544)]], [[gerrit:1249253|CommonSettings: Remove orphaned $wgWebAuthnNewCredsDisabled]] (duration: 06m 13s) [production]
11:56 <reedy@deploy2002> reedy: Continuing with sync [production]
11:56 <reedy@deploy2002> reedy: Backport for [[gerrit:1239026|Revert "CommonSettings: Temporarily set $wgOATHUserHandlesTable = true" (T416544)]], [[gerrit:1249253|CommonSettings: Remove orphaned $wgWebAuthnNewCredsDisabled]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
11:54 <reedy@deploy2002> Started scap sync-world: Backport for [[gerrit:1239026|Revert "CommonSettings: Temporarily set $wgOATHUserHandlesTable = true" (T416544)]], [[gerrit:1249253|CommonSettings: Remove orphaned $wgWebAuthnNewCredsDisabled]] [production]
11:44 <phuedx@deploy2002> Finished scap sync-world: Backport for [[gerrit:1249245|Hooks: Really only add global logging context for pageviews]] (duration: 12m 02s) [production]