4351-4400 of 10000 results (66ms)
2024-07-29 ยง
10:14 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host mw2441.mgmt.codfw.wmnet with reboot policy GRACEFUL [production]
10:13 <marostegui@cumin1002> dbctl commit (dc=all): 'es1032 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P67000 and previous config saved to /var/cache/conftool/dbconfig/20240729-101348-root.json [production]
10:12 <godog> bounce benthos@mw_accesslog_sampler on logstash collectors [production]
10:11 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es1032.eqiad.wmnet with reason: Long schema change [production]
10:11 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on es1032.eqiad.wmnet with reason: Long schema change [production]
10:07 <cgoubert@cumin1002> START - Cookbook sre.hosts.provision for host mw2441.mgmt.codfw.wmnet with reboot policy GRACEFUL [production]
10:03 <wmbot~bsadowski1@tools-bastion-13> Restarted StewardBot/SULWatcher because of a connection loss [tools.stewardbots]
10:02 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component builds-cli [toolsbeta]
10:02 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-cli [toolsbeta]
10:02 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=99) for component builds-cli [toolsbeta]
10:02 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-cli [toolsbeta]
10:00 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan+apply for main branch [admin]
09:59 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=99) for component builds-cli [toolsbeta]
09:59 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-cli [toolsbeta]
09:59 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan+apply for main branch [admin]
09:57 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=99) for component builds-cli [toolsbeta]
09:56 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-cli [toolsbeta]
09:56 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component builds-cli [toolsbeta]
09:55 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-cli [toolsbeta]
09:54 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=99) for component builds-cli [toolsbeta]
09:54 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-cli [toolsbeta]
09:53 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=99) for component builds-cli [toolsbeta]
09:53 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-cli [toolsbeta]
09:31 <Dreamy_Jazz> Restarted MediaModeration scanning script - https://wikitech.wikimedia.org/wiki/MediaModeration [production]
09:27 <dcausse@deploy1002> Finished deploy [airflow-dags/search@7da1ef0]: search: process_sparql_query workaround oom issues (duration: 00m 20s) [production]
09:27 <dcausse@deploy1002> Started deploy [airflow-dags/search@7da1ef0]: search: process_sparql_query workaround oom issues [production]
09:22 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool es1032 investigate access denied errors', diff saved to https://phabricator.wikimedia.org/P66999 and previous config saved to /var/cache/conftool/dbconfig/20240729-092239-root.json [production]
09:16 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1244 (T367856)', diff saved to https://phabricator.wikimedia.org/P66998 and previous config saved to /var/cache/conftool/dbconfig/20240729-091658-marostegui.json [production]
09:16 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1244.eqiad.wmnet with reason: Maintenance [production]
09:16 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1244.eqiad.wmnet with reason: Maintenance [production]
09:16 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1243 (T367856)', diff saved to https://phabricator.wikimedia.org/P66997 and previous config saved to /var/cache/conftool/dbconfig/20240729-091637-marostegui.json [production]
09:09 <marostegui@cumin1002> dbctl commit (dc=all): 'Repool 25% of es1032', diff saved to https://phabricator.wikimedia.org/P66996 and previous config saved to /var/cache/conftool/dbconfig/20240729-090953-marostegui.json [production]
09:07 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es1032.eqiad.wmnet with reason: Long schema change [production]
09:07 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on es1032.eqiad.wmnet with reason: Long schema change [production]
09:07 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool es1032 investigate access denied errors', diff saved to https://phabricator.wikimedia.org/P66995 and previous config saved to /var/cache/conftool/dbconfig/20240729-090730-root.json [production]
09:01 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1243', diff saved to https://phabricator.wikimedia.org/P66994 and previous config saved to /var/cache/conftool/dbconfig/20240729-090129-marostegui.json [production]
08:46 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1243', diff saved to https://phabricator.wikimedia.org/P66992 and previous config saved to /var/cache/conftool/dbconfig/20240729-084622-marostegui.json [production]
08:31 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1243 (T367856)', diff saved to https://phabricator.wikimedia.org/P66991 and previous config saved to /var/cache/conftool/dbconfig/20240729-083115-marostegui.json [production]
08:20 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/27 [admin]
08:20 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/27 [admin]
07:54 <dcausse> closing the backport window [production]
07:53 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 24482 [production]
07:51 <dcausse@deploy1002> Finished scap: Backport for [[gerrit:1055890|GeoData: add pool counter settings (T370621)]] (duration: 11m 36s) [production]
07:47 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts karapace1001.eqiad.wmnet [production]
07:47 <brouberol@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
07:47 <brouberol@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: karapace1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin1002" [production]
07:46 <brouberol@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: karapace1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin1002" [production]
07:46 <dcausse@deploy1002> dcausse: Continuing with sync [production]
07:42 <dcausse@deploy1002> dcausse: Backport for [[gerrit:1055890|GeoData: add pool counter settings (T370621)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:41 <brouberol@cumin1002> START - Cookbook sre.dns.netbox [production]