2024-07-22
13:45 <tchanders@deploy1002> tchanders: Continuing with sync [production]
13:42 <tchanders@deploy1002> tchanders: Backport for [[gerrit:1054921|Set Flow to read only on testwiki (T370322)]], [[gerrit:1054625|Enable temporary accounts on testwiki and loginwiki (T348895)]], [[gerrit:1055937|Fix logic for handling enabling temporary accounts (T348895)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:39 <tchanders@deploy1002> Started scap sync-world: Backport for [[gerrit:1054921|Set Flow to read only on testwiki (T370322)]], [[gerrit:1054625|Enable temporary accounts on testwiki and loginwiki (T348895)]], [[gerrit:1055937|Fix logic for handling enabling temporary accounts (T348895)]] [production]
13:29 <tchanders@deploy1002> Sync cancelled. [production]
13:25 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on rdb1014.eqiad.wmnet with reason: Hardware issue [production]
13:25 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on rdb1014.eqiad.wmnet with reason: Hardware issue [production]
13:21 <ayounsi@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on netbox1002.eqiad.wmnet with reason: Netbox 3 silencing [production]
13:20 <ayounsi@cumin1002> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on netbox1002.eqiad.wmnet with reason: Netbox 3 silencing [production]
13:20 <ayounsi@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on netbox2002.codfw.wmnet with reason: Netbox 3 silencing [production]
13:20 <ayounsi@cumin1002> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on netbox2002.codfw.wmnet with reason: Netbox 3 silencing [production]
13:13 <tchanders@deploy1002> tchanders: Backport for [[gerrit:1054921|Set Flow to read only on testwiki (T370322)]], [[gerrit:1054625|Enable temporary accounts on testwiki and loginwiki (T348895)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:11 <tchanders@deploy1002> Started scap sync-world: Backport for [[gerrit:1054921|Set Flow to read only on testwiki (T370322)]], [[gerrit:1054625|Enable temporary accounts on testwiki and loginwiki (T348895)]] [production]
13:07 <claime> power cycling rdb1014.eqiad.wmnet [production]
12:22 <godog> restore retention.ms=172800000 for mediawiki.httpd.accesslog [production]
11:54 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/shellbox-video: apply [production]
11:53 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/shellbox-video: apply [production]
11:17 <ladsgroup@deploy1002> Finished scap: Backport for [[gerrit:1054641|Enable ICU provided alphabetical order in the Kurdish wikis categories (T48235)]] (duration: 08m 02s) [production]
11:12 <ladsgroup@deploy1002> ebrahim, ladsgroup: Continuing with sync [production]
11:11 <ladsgroup@deploy1002> ebrahim, ladsgroup: Backport for [[gerrit:1054641|Enable ICU provided alphabetical order in the Kurdish wikis categories (T48235)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
11:09 <ladsgroup@deploy1002> Started scap sync-world: Backport for [[gerrit:1054641|Enable ICU provided alphabetical order in the Kurdish wikis categories (T48235)]] [production]
10:33 <volans> upgraded manually prometheus-ipmi-exporter to v 1.8.0-1~wmf12+1 on db1179 (leftover because was down) T368088 [production]
10:32 <Dreamy_Jazz> Running `mwscript extensions/MediaModeration/maintenance/updateMetrics.php --wiki=commonswiki --verbose` [production]
10:28 <Dreamy_Jazz> Restarting MediaModeration scanning script - https://wikitech.wikimedia.org/wiki/MediaModeration [production]
10:24 <elukey> kafka preferred-replica-election on kafka-main - T370574 [production]
09:51 <godog> set mediawiki.httpd.accesslog topic retention to 26h temporarily [production]
09:50 <mlitn@deploy1002> Finished scap: Backport for [[gerrit:1055258|Reduce weight of 'main subject' as it's used inconsistently (T367774)]] (duration: 08m 19s) [production]
09:45 <mlitn@deploy1002> cparle, mlitn: Continuing with sync [production]
09:44 <mlitn@deploy1002> cparle, mlitn: Backport for [[gerrit:1055258|Reduce weight of 'main subject' as it's used inconsistently (T367774)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
09:42 <mlitn@deploy1002> Started scap sync-world: Backport for [[gerrit:1055258|Reduce weight of 'main subject' as it's used inconsistently (T367774)]] [production]
09:40 <claime> homer 'cr*codfw*' commit 'T351074' [production]
09:30 <ayounsi@cumin1002> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 [production]
09:21 <ayounsi@cumin1002> START - Cookbook sre.deploy.python-code netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 [production]
09:03 <ayounsi@cumin1002> END (FAIL) - Cookbook sre.deploy.python-code (exit_code=99) netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 [production]
09:00 <ayounsi@cumin1002> START - Cookbook sre.deploy.python-code netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 [production]
08:56 <godog> rebalance mediawiki.httpd.accesslog partitions across brokers - T370129 [production]
08:55 <ayounsi@cumin1002> END (PASS) - Cookbook sre.postgresql.postgres-init (exit_code=0) [production]
08:50 <ayounsi@cumin1002> START - Cookbook sre.postgresql.postgres-init [production]
08:32 <elukey> restart kafka on kafka-main2005 - T370574 [production]
08:31 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on kafka-main2005.codfw.wmnet with reason: restart attempt [production]
08:30 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 0:30:00 on kafka-main2005.codfw.wmnet with reason: restart attempt [production]
08:24 <brouberol@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset: apply [production]
08:23 <brouberol@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset: apply [production]
08:07 <elukey> restart kafka on kafka-main2001 - T370574 [production]
08:06 <elukey> restart kafka on kafka-main2001 - sre.hosts.downtime [production]
08:06 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on kafka-main2001.codfw.wmnet with reason: restart attempt [production]
08:05 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 0:30:00 on kafka-main2001.codfw.wmnet with reason: restart attempt [production]
08:03 <brouberol@cumin1002> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) for hosts karapace1002.eqiad.wmnet [production]
08:00 <brouberol@cumin1002> START - Cookbook sre.hosts.decommission for hosts karapace1002.eqiad.wmnet [production]
07:39 <ayounsi@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on netbox2003.codfw.wmnet with reason: netbox upgrade prep work [production]
07:39 <ayounsi@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on netbox2003.codfw.wmnet with reason: netbox upgrade prep work [production]