2023-07-12
§
|
03:41 |
<rzl@deploy1002> |
helmfile [staging] START helmfile.d/services/opentelemetry-collector: apply |
[production] |
03:39 |
<rzl@deploy1002> |
helmfile [staging] START helmfile.d/services/opentelemetry-collector: apply |
[production] |
03:29 |
<rzl@deploy1002> |
helmfile [staging] DONE helmfile.d/services/opentelemetry-collector: apply |
[production] |
03:29 |
<rzl@deploy1002> |
helmfile [staging] START helmfile.d/services/opentelemetry-collector: apply |
[production] |
03:16 |
<rzl@deploy1002> |
helmfile [staging] DONE helmfile.d/services/opentelemetry-collector: apply |
[production] |
03:15 |
<rzl@deploy1002> |
helmfile [staging] START helmfile.d/services/opentelemetry-collector: apply |
[production] |
03:14 |
<rzl@deploy1002> |
helmfile [staging] DONE helmfile.d/services/opentelemetry-collector: apply |
[production] |
03:14 |
<rzl@deploy1002> |
helmfile [staging] START helmfile.d/services/opentelemetry-collector: apply |
[production] |
01:47 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6 days, 19:00:00 on wdqs[2013,2022].codfw.wmnet with reason: new host |
[production] |
01:46 |
<ryankemper@cumin1001> |
START - Cookbook sre.hosts.downtime for 6 days, 19:00:00 on wdqs[2013,2022].codfw.wmnet with reason: new host |
[production] |
2023-07-11
§
|
21:51 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
20:45 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:936097|Logos: Fixes grantswiki and idwiktionary]], [[gerrit:937177|Drop idwiktionary wordmark]], [[gerrit:937113|Always return the class as string from Html::getTextInputAttributes (T341566)]] (duration: 11m 10s) |
[production] |
20:35 |
<urbanecm@deploy1002> |
jdlrobson and urbanecm: Backport for [[gerrit:936097|Logos: Fixes grantswiki and idwiktionary]], [[gerrit:937177|Drop idwiktionary wordmark]], [[gerrit:937113|Always return the class as string from Html::getTextInputAttributes (T341566)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
20:33 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:936097|Logos: Fixes grantswiki and idwiktionary]], [[gerrit:937177|Drop idwiktionary wordmark]], [[gerrit:937113|Always return the class as string from Html::getTextInputAttributes (T341566)]] |
[production] |
20:32 |
<urbanecm@deploy1002> |
Sync cancelled. |
[production] |
20:26 |
<urbanecm@deploy1002> |
jdlrobson and urbanecm: Backport for [[gerrit:936097|Logos: Fixes grantswiki and idwiktionary]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
20:25 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:936097|Logos: Fixes grantswiki and idwiktionary]] |
[production] |
20:16 |
<bking@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
20:14 |
<bking@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) |
[production] |
19:49 |
<denisse@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:48 |
<denisse@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:57 |
<dduvall@deploy1002> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.41.0-wmf.17 refs T340245 |
[production] |
18:46 |
<bking@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
17:53 |
<dduvall@deploy1002> |
Pruned MediaWiki: 1.41.0-wmf.15 (duration: 02m 16s) |
[production] |
17:50 |
<dduvall@deploy1002> |
Finished scap: testwikis wikis to 1.41.0-wmf.17 refs T340245 (duration: 45m 50s) |
[production] |
17:05 |
<dduvall@deploy1002> |
Started scap: testwikis wikis to 1.41.0-wmf.17 refs T340245 |
[production] |
16:52 |
<rzl@deploy1002> |
helmfile [staging] START helmfile.d/services/opentelemetry-collector: apply |
[production] |
16:28 |
<vgutierrez> |
reenabling puppet in cp6002 |
[production] |
16:24 |
<rzl@deploy1002> |
helmfile [staging] START helmfile.d/services/opentelemetry-collector: apply |
[production] |
16:08 |
<sukhe> |
upgrade dns1004 to gdnsd 3.99.0~alpha2 |
[production] |
16:04 |
<eevans@cumin1001> |
END (FAIL) - Cookbook sre.cassandra.roll-restart (exit_code=99) for nodes matching restbase20[13-27].codfw.wmnet: Applying JVM update - eevans@cumin1001 |
[production] |
16:03 |
<Lucas_WMDE> |
previous backport also included [[gerrit:930712|Remove oversampling for Navigation Timing extension. (T337858)]] |
[production] |
16:02 |
<lucaswerkmeister-wmde@deploy1002> |
Finished scap: Backport for [[gerrit:936737|Add option for html label in Menu template (T340217)]] (duration: 09m 15s) |
[production] |
15:54 |
<lucaswerkmeister-wmde@deploy1002> |
jdlrobson and lucaswerkmeister-wmde: Backport for [[gerrit:936737|Add option for html label in Menu template (T340217)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet |
[production] |
15:54 |
<Krinkle> |
Deployed https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/930712 ("Remove oversampling for Navigation Timing extension.") |
[production] |
15:53 |
<lucaswerkmeister-wmde@deploy1002> |
Started scap: Backport for [[gerrit:936737|Add option for html label in Menu template (T340217)]] |
[production] |
15:48 |
<krinkle@deploy1002> |
Unlocked for deployment [ALL REPOSITORIES]: pending security problem, see mediawiki_security IRC (duration: 17m 03s) |
[production] |
15:31 |
<krinkle@deploy1002> |
Locking from deployment [ALL REPOSITORIES]: pending security problem, see mediawiki_security IRC |
[production] |
15:26 |
<krinkle@deploy1002> |
Sync cancelled. |
[production] |
15:24 |
<eevans@cumin1001> |
START - Cookbook sre.cassandra.roll-restart for nodes matching restbase20[13-27].codfw.wmnet: Applying JVM update - eevans@cumin1001 |
[production] |
15:22 |
<krinkle@deploy1002> |
phedenskog and krinkle: Backport for [[gerrit:930712|Remove oversampling for Navigation Timing extension. (T337858)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet |
[production] |
15:21 |
<krinkle@deploy1002> |
Started scap: Backport for [[gerrit:930712|Remove oversampling for Navigation Timing extension. (T337858)]] |
[production] |
15:17 |
<eevans@cumin1001> |
END (FAIL) - Cookbook sre.cassandra.roll-restart (exit_code=99) for nodes matching A:restbase-codfw: Applying JVM update - eevans@cumin1001 |
[production] |
15:09 |
<eevans@cumin1001> |
START - Cookbook sre.cassandra.roll-restart for nodes matching A:restbase-codfw: Applying JVM update - eevans@cumin1001 |
[production] |
14:49 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) for Druid public cluster: Roll restart of Druid jvm daemons. |
[production] |
14:21 |
<btullis@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/datahub: sync on main |
[production] |
14:19 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.kafka.roll-restart-mirror-maker (exit_code=0) restart MirrorMaker for Kafka A:kafka-mirror-maker-jumbo-eqiad cluster: Roll restart of jvm daemons. |
[production] |
14:17 |
<moritzm> |
restarting apache on mw canaries |
[production] |
14:17 |
<btullis@deploy1002> |
helmfile [eqiad] START helmfile.d/services/datahub: apply on main |
[production] |
14:15 |
<btullis@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/datahub: sync on main |
[production] |