2024-07-22
ยง
|
15:49 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:30:00 on kafka-test1006.eqiad.wmnet with reason: attempt to remove a data dir on disk |
[production] |
15:08 |
<dancy@deploy1002> |
Finished scap: Backport for [[gerrit:1053752|MWMultiVersion.php: Allow MW_FORCE_VERSION to pin the mw version (T369115)]] (duration: 09m 10s) |
[production] |
15:03 |
<dancy@deploy1002> |
dancy: Continuing with sync |
[production] |
15:01 |
<dancy@deploy1002> |
dancy: Backport for [[gerrit:1053752|MWMultiVersion.php: Allow MW_FORCE_VERSION to pin the mw version (T369115)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
14:59 |
<dancy@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1053752|MWMultiVersion.php: Allow MW_FORCE_VERSION to pin the mw version (T369115)]] |
[production] |
14:26 |
<zabe@deploy1002> |
Finished scap: Backport for [[gerrit:1055614|Revert^2 "Set some site names for new-ish wikis" (T363270 T360303 T360310 T363263)]] (duration: 10m 54s) |
[production] |
14:21 |
<zabe@deploy1002> |
zabe: Continuing with sync |
[production] |
14:17 |
<zabe@deploy1002> |
zabe: Backport for [[gerrit:1055614|Revert^2 "Set some site names for new-ish wikis" (T363270 T360303 T360310 T363263)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
14:15 |
<zabe@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1055614|Revert^2 "Set some site names for new-ish wikis" (T363270 T360303 T360310 T363263)]] |
[production] |
14:08 |
<tchanders@deploy1002> |
Finished scap: Backport for [[gerrit:1054921|Set Flow to read only on testwiki (T370322)]], [[gerrit:1054625|Enable temporary accounts on testwiki and loginwiki (T348895)]], [[gerrit:1055937|Fix logic for handling enabling temporary accounts (T348895)]] (duration: 07m 11s) |
[production] |
14:03 |
<tchanders@deploy1002> |
tchanders: Continuing with sync |
[production] |
14:03 |
<tchanders@deploy1002> |
tchanders: Backport for [[gerrit:1054921|Set Flow to read only on testwiki (T370322)]], [[gerrit:1054625|Enable temporary accounts on testwiki and loginwiki (T348895)]], [[gerrit:1055937|Fix logic for handling enabling temporary accounts (T348895)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
14:01 |
<tchanders@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1054921|Set Flow to read only on testwiki (T370322)]], [[gerrit:1054625|Enable temporary accounts on testwiki and loginwiki (T348895)]], [[gerrit:1055937|Fix logic for handling enabling temporary accounts (T348895)]] |
[production] |
13:45 |
<tchanders@deploy1002> |
tchanders: Continuing with sync |
[production] |
13:42 |
<tchanders@deploy1002> |
tchanders: Backport for [[gerrit:1054921|Set Flow to read only on testwiki (T370322)]], [[gerrit:1054625|Enable temporary accounts on testwiki and loginwiki (T348895)]], [[gerrit:1055937|Fix logic for handling enabling temporary accounts (T348895)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:39 |
<tchanders@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1054921|Set Flow to read only on testwiki (T370322)]], [[gerrit:1054625|Enable temporary accounts on testwiki and loginwiki (T348895)]], [[gerrit:1055937|Fix logic for handling enabling temporary accounts (T348895)]] |
[production] |
13:29 |
<tchanders@deploy1002> |
Sync cancelled. |
[production] |
13:25 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on rdb1014.eqiad.wmnet with reason: Hardware issue |
[production] |
13:25 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on rdb1014.eqiad.wmnet with reason: Hardware issue |
[production] |
13:21 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on netbox1002.eqiad.wmnet with reason: Netbox 3 silencing |
[production] |
13:20 |
<ayounsi@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on netbox1002.eqiad.wmnet with reason: Netbox 3 silencing |
[production] |
13:20 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on netbox2002.codfw.wmnet with reason: Netbox 3 silencing |
[production] |
13:20 |
<ayounsi@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on netbox2002.codfw.wmnet with reason: Netbox 3 silencing |
[production] |
13:13 |
<tchanders@deploy1002> |
tchanders: Backport for [[gerrit:1054921|Set Flow to read only on testwiki (T370322)]], [[gerrit:1054625|Enable temporary accounts on testwiki and loginwiki (T348895)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:11 |
<tchanders@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1054921|Set Flow to read only on testwiki (T370322)]], [[gerrit:1054625|Enable temporary accounts on testwiki and loginwiki (T348895)]] |
[production] |
13:07 |
<claime> |
power cycling rdb1014.eqiad.wmnet |
[production] |
12:22 |
<godog> |
restore retention.ms=172800000 for mediawiki.httpd.accesslog |
[production] |
11:54 |
<hnowlan@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/shellbox-video: apply |
[production] |
11:53 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/services/shellbox-video: apply |
[production] |
11:17 |
<ladsgroup@deploy1002> |
Finished scap: Backport for [[gerrit:1054641|Enable ICU provided alphabetical order in the Kurdish wikis categories (T48235)]] (duration: 08m 02s) |
[production] |
11:12 |
<ladsgroup@deploy1002> |
ebrahim, ladsgroup: Continuing with sync |
[production] |
11:11 |
<ladsgroup@deploy1002> |
ebrahim, ladsgroup: Backport for [[gerrit:1054641|Enable ICU provided alphabetical order in the Kurdish wikis categories (T48235)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
11:09 |
<ladsgroup@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1054641|Enable ICU provided alphabetical order in the Kurdish wikis categories (T48235)]] |
[production] |
10:33 |
<volans> |
upgraded manually prometheus-ipmi-exporter to v 1.8.0-1~wmf12+1 on db1179 (leftover because was down) T368088 |
[production] |
10:32 |
<Dreamy_Jazz> |
Running `mwscript extensions/MediaModeration/maintenance/updateMetrics.php --wiki=commonswiki --verbose` |
[production] |
10:28 |
<Dreamy_Jazz> |
Restarting MediaModeration scanning script - https://wikitech.wikimedia.org/wiki/MediaModeration |
[production] |
10:24 |
<elukey> |
kafka preferred-replica-election on kafka-main - T370574 |
[production] |
09:51 |
<godog> |
set mediawiki.httpd.accesslog topic retention to 26h temporarily |
[production] |
09:50 |
<mlitn@deploy1002> |
Finished scap: Backport for [[gerrit:1055258|Reduce weight of 'main subject' as it's used inconsistently (T367774)]] (duration: 08m 19s) |
[production] |
09:45 |
<mlitn@deploy1002> |
cparle, mlitn: Continuing with sync |
[production] |
09:44 |
<mlitn@deploy1002> |
cparle, mlitn: Backport for [[gerrit:1055258|Reduce weight of 'main subject' as it's used inconsistently (T367774)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
09:42 |
<mlitn@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1055258|Reduce weight of 'main subject' as it's used inconsistently (T367774)]] |
[production] |
09:40 |
<claime> |
homer 'cr*codfw*' commit 'T351074' |
[production] |
09:30 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 |
[production] |
09:21 |
<ayounsi@cumin1002> |
START - Cookbook sre.deploy.python-code netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 |
[production] |
09:03 |
<ayounsi@cumin1002> |
END (FAIL) - Cookbook sre.deploy.python-code (exit_code=99) netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 |
[production] |
09:00 |
<ayounsi@cumin1002> |
START - Cookbook sre.deploy.python-code netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 |
[production] |
08:56 |
<godog> |
rebalance mediawiki.httpd.accesslog partitions across brokers - T370129 |
[production] |
08:55 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.postgresql.postgres-init (exit_code=0) |
[production] |
08:50 |
<ayounsi@cumin1002> |
START - Cookbook sre.postgresql.postgres-init |
[production] |