4251-4300 of 10000 results (62ms)
2024-07-22 ยง
09:58 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-builder [tools]
09:51 <godog> set mediawiki.httpd.accesslog topic retention to 26h temporarily [production]
09:50 <mlitn@deploy1002> Finished scap: Backport for [[gerrit:1055258|Reduce weight of 'main subject' as it's used inconsistently (T367774)]] (duration: 08m 19s) [production]
09:45 <mlitn@deploy1002> cparle, mlitn: Continuing with sync [production]
09:44 <mlitn@deploy1002> cparle, mlitn: Backport for [[gerrit:1055258|Reduce weight of 'main subject' as it's used inconsistently (T367774)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
09:43 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component jobs-api [tools]
09:43 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.component.deploy for component jobs-api [tools]
09:42 <mlitn@deploy1002> Started scap sync-world: Backport for [[gerrit:1055258|Reduce weight of 'main subject' as it's used inconsistently (T367774)]] [production]
09:40 <claime> homer 'cr*codfw*' commit 'T351074' [production]
09:32 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component jobs-api [toolsbeta]
09:32 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.component.deploy for component jobs-api [toolsbeta]
09:30 <ayounsi@cumin1002> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 [production]
09:21 <ayounsi@cumin1002> START - Cookbook sre.deploy.python-code netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 [production]
09:14 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.ceph.osd.bootstrap_and_add [admin]
09:14 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.ceph.osd.depool_and_destroy (exit_code=0) [admin]
09:03 <ayounsi@cumin1002> END (FAIL) - Cookbook sre.deploy.python-code (exit_code=99) netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 [production]
09:00 <ayounsi@cumin1002> START - Cookbook sre.deploy.python-code netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 [production]
08:56 <godog> rebalance mediawiki.httpd.accesslog partitions across brokers - T370129 [production]
08:55 <ayounsi@cumin1002> END (PASS) - Cookbook sre.postgresql.postgres-init (exit_code=0) [production]
08:50 <ayounsi@cumin1002> START - Cookbook sre.postgresql.postgres-init [production]
08:44 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.ceph.osd.depool_and_destroy [admin]
08:32 <elukey> restart kafka on kafka-main2005 - T370574 [production]
08:31 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on kafka-main2005.codfw.wmnet with reason: restart attempt [production]
08:30 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 0:30:00 on kafka-main2005.codfw.wmnet with reason: restart attempt [production]
08:24 <brouberol@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset: apply [production]
08:23 <brouberol@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset: apply [production]
08:17 <brouberol> deploy istio (adding securityContext) to dse-k8s-eqiad cluster - T362978 [analytics]
08:07 <elukey> restart kafka on kafka-main2001 - T370574 [production]
08:06 <elukey> restart kafka on kafka-main2001 - sre.hosts.downtime [production]
08:06 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on kafka-main2001.codfw.wmnet with reason: restart attempt [production]
08:05 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 0:30:00 on kafka-main2001.codfw.wmnet with reason: restart attempt [production]
08:03 <brouberol@cumin1002> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) for hosts karapace1002.eqiad.wmnet [production]
08:00 <brouberol@cumin1002> START - Cookbook sre.hosts.decommission for hosts karapace1002.eqiad.wmnet [production]
07:39 <ayounsi@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on netbox2003.codfw.wmnet with reason: netbox upgrade prep work [production]
07:39 <ayounsi@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on netbox2003.codfw.wmnet with reason: netbox upgrade prep work [production]
07:35 <stran@deploy1002> Finished scap: Backport for [[gerrit:1055771|IPInfoHandler: Move token param definition to getBodyParamSettings (T370500)]] (duration: 12m 18s) [production]
07:30 <stran@deploy1002> stran: Continuing with sync [production]
07:25 <stran@deploy1002> stran: Backport for [[gerrit:1055771|IPInfoHandler: Move token param definition to getBodyParamSettings (T370500)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:23 <stran@deploy1002> Started scap sync-world: Backport for [[gerrit:1055771|IPInfoHandler: Move token param definition to getBodyParamSettings (T370500)]] [production]
07:12 <ayounsi@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on netbox1003.eqiad.wmnet with reason: netbox upgrade prep work [production]
07:12 <ayounsi@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on netbox1003.eqiad.wmnet with reason: netbox upgrade prep work [production]
04:09 <wmbot~bd808@tools-bastion-12> Swtiched to running bot from buildservice container. [tools.ircservserv]
02:55 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2170 (T367856)', diff saved to https://phabricator.wikimedia.org/P66880 and previous config saved to /var/cache/conftool/dbconfig/20240722-025552-marostegui.json [production]
02:55 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db2170.codfw.wmnet with reason: Maintenance [production]
02:55 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db2170.codfw.wmnet with reason: Maintenance [production]
02:55 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2153 (T367856)', diff saved to https://phabricator.wikimedia.org/P66879 and previous config saved to /var/cache/conftool/dbconfig/20240722-025530-marostegui.json [production]
02:40 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2153', diff saved to https://phabricator.wikimedia.org/P66878 and previous config saved to /var/cache/conftool/dbconfig/20240722-024023-marostegui.json [production]
02:25 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2153', diff saved to https://phabricator.wikimedia.org/P66877 and previous config saved to /var/cache/conftool/dbconfig/20240722-022516-marostegui.json [production]
02:10 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2153 (T367856)', diff saved to https://phabricator.wikimedia.org/P66876 and previous config saved to /var/cache/conftool/dbconfig/20240722-021009-marostegui.json [production]
01:53 <ladsgroup@cumin1002> dbctl commit (dc=all): 'db1179 (re)pooling @ 100%: Maint over (T369855 T370304)', diff saved to https://phabricator.wikimedia.org/P66875 and previous config saved to /var/cache/conftool/dbconfig/20240722-015302-ladsgroup.json [production]