2024-07-22
§
|
09:03 |
<ayounsi@cumin1002> |
END (FAIL) - Cookbook sre.deploy.python-code (exit_code=99) netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 |
[production] |
09:00 |
<ayounsi@cumin1002> |
START - Cookbook sre.deploy.python-code netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 |
[production] |
08:56 |
<godog> |
rebalance mediawiki.httpd.accesslog partitions across brokers - T370129 |
[production] |
08:55 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.postgresql.postgres-init (exit_code=0) |
[production] |
08:50 |
<ayounsi@cumin1002> |
START - Cookbook sre.postgresql.postgres-init |
[production] |
08:32 |
<elukey> |
restart kafka on kafka-main2005 - T370574 |
[production] |
08:31 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on kafka-main2005.codfw.wmnet with reason: restart attempt |
[production] |
08:30 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:30:00 on kafka-main2005.codfw.wmnet with reason: restart attempt |
[production] |
08:24 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset: apply |
[production] |
08:23 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset: apply |
[production] |
08:07 |
<elukey> |
restart kafka on kafka-main2001 - T370574 |
[production] |
08:06 |
<elukey> |
restart kafka on kafka-main2001 - sre.hosts.downtime |
[production] |
08:06 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on kafka-main2001.codfw.wmnet with reason: restart attempt |
[production] |
08:05 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:30:00 on kafka-main2001.codfw.wmnet with reason: restart attempt |
[production] |
08:03 |
<brouberol@cumin1002> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) for hosts karapace1002.eqiad.wmnet |
[production] |
08:00 |
<brouberol@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts karapace1002.eqiad.wmnet |
[production] |
07:39 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on netbox2003.codfw.wmnet with reason: netbox upgrade prep work |
[production] |
07:39 |
<ayounsi@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on netbox2003.codfw.wmnet with reason: netbox upgrade prep work |
[production] |
07:35 |
<stran@deploy1002> |
Finished scap: Backport for [[gerrit:1055771|IPInfoHandler: Move token param definition to getBodyParamSettings (T370500)]] (duration: 12m 18s) |
[production] |
07:30 |
<stran@deploy1002> |
stran: Continuing with sync |
[production] |
07:25 |
<stran@deploy1002> |
stran: Backport for [[gerrit:1055771|IPInfoHandler: Move token param definition to getBodyParamSettings (T370500)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:23 |
<stran@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1055771|IPInfoHandler: Move token param definition to getBodyParamSettings (T370500)]] |
[production] |
07:12 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on netbox1003.eqiad.wmnet with reason: netbox upgrade prep work |
[production] |
07:12 |
<ayounsi@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on netbox1003.eqiad.wmnet with reason: netbox upgrade prep work |
[production] |
02:55 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2170 (T367856)', diff saved to https://phabricator.wikimedia.org/P66880 and previous config saved to /var/cache/conftool/dbconfig/20240722-025552-marostegui.json |
[production] |
02:55 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db2170.codfw.wmnet with reason: Maintenance |
[production] |
02:55 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db2170.codfw.wmnet with reason: Maintenance |
[production] |
02:55 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2153 (T367856)', diff saved to https://phabricator.wikimedia.org/P66879 and previous config saved to /var/cache/conftool/dbconfig/20240722-025530-marostegui.json |
[production] |
02:40 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2153', diff saved to https://phabricator.wikimedia.org/P66878 and previous config saved to /var/cache/conftool/dbconfig/20240722-024023-marostegui.json |
[production] |
02:25 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2153', diff saved to https://phabricator.wikimedia.org/P66877 and previous config saved to /var/cache/conftool/dbconfig/20240722-022516-marostegui.json |
[production] |
02:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2153 (T367856)', diff saved to https://phabricator.wikimedia.org/P66876 and previous config saved to /var/cache/conftool/dbconfig/20240722-021009-marostegui.json |
[production] |
01:53 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'db1179 (re)pooling @ 100%: Maint over (T369855 T370304)', diff saved to https://phabricator.wikimedia.org/P66875 and previous config saved to /var/cache/conftool/dbconfig/20240722-015302-ladsgroup.json |
[production] |
01:37 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'db1179 (re)pooling @ 75%: Maint over (T369855 T370304)', diff saved to https://phabricator.wikimedia.org/P66874 and previous config saved to /var/cache/conftool/dbconfig/20240722-013756-ladsgroup.json |
[production] |
01:22 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'db1179 (re)pooling @ 25%: Maint over (T369855 T370304)', diff saved to https://phabricator.wikimedia.org/P66873 and previous config saved to /var/cache/conftool/dbconfig/20240722-012251-ladsgroup.json |
[production] |
01:19 |
<ladsgroup@deploy1002> |
Finished scap: Backport for [[gerrit:1055637|Stop storing missing-image-alt-text lints (T370304)]] (duration: 08m 48s) |
[production] |
01:13 |
<ladsgroup@deploy1002> |
ladsgroup: Continuing with sync |
[production] |
01:13 |
<ladsgroup@deploy1002> |
ladsgroup: Backport for [[gerrit:1055637|Stop storing missing-image-alt-text lints (T370304)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
01:10 |
<ladsgroup@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1055637|Stop storing missing-image-alt-text lints (T370304)]] |
[production] |
01:07 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'db1179 (re)pooling @ 10%: Maint over (T369855 T370304)', diff saved to https://phabricator.wikimedia.org/P66872 and previous config saved to /var/cache/conftool/dbconfig/20240722-010745-ladsgroup.json |
[production] |
2024-07-21
§
|
23:22 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2219 (T367856)', diff saved to https://phabricator.wikimedia.org/P66871 and previous config saved to /var/cache/conftool/dbconfig/20240721-232234-marostegui.json |
[production] |
23:07 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2219', diff saved to https://phabricator.wikimedia.org/P66870 and previous config saved to /var/cache/conftool/dbconfig/20240721-230727-marostegui.json |
[production] |
22:52 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2219', diff saved to https://phabricator.wikimedia.org/P66869 and previous config saved to /var/cache/conftool/dbconfig/20240721-225219-marostegui.json |
[production] |
22:44 |
<ladsgroup@deploy1002> |
Finished scap: Backport for [[gerrit:1055629|Disable missing-image-alt-text lint (T370304)]] (duration: 26m 27s) |
[production] |
22:37 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2219 (T367856)', diff saved to https://phabricator.wikimedia.org/P66868 and previous config saved to /var/cache/conftool/dbconfig/20240721-223712-marostegui.json |
[production] |
22:36 |
<ladsgroup@deploy1002> |
ladsgroup: Continuing with sync |
[production] |
22:35 |
<ladsgroup@deploy1002> |
ladsgroup: Backport for [[gerrit:1055629|Disable missing-image-alt-text lint (T370304)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
22:18 |
<ladsgroup@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1055629|Disable missing-image-alt-text lint (T370304)]] |
[production] |
08:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2219 (T367856)', diff saved to https://phabricator.wikimedia.org/P66867 and previous config saved to /var/cache/conftool/dbconfig/20240721-085853-marostegui.json |
[production] |
08:58 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2219.codfw.wmnet with reason: Maintenance |
[production] |
08:58 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2219.codfw.wmnet with reason: Maintenance |
[production] |