1451-1500 of 10000 results (32ms)
2025-02-07 §
06:28 <root@cumin1002> START - Cookbook sre.mysql.upgrade for db2150.codfw.wmnet [production]
06:27 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db1174 db2150', diff saved to https://phabricator.wikimedia.org/P73358 and previous config saved to /var/cache/conftool/dbconfig/20250207-062745-marostegui.json [production]
03:42 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2199.codfw.wmnet with reason: Maintenance [production]
03:41 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2172 (T384592)', diff saved to https://phabricator.wikimedia.org/P73357 and previous config saved to /var/cache/conftool/dbconfig/20250207-034149-marostegui.json [production]
03:26 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P73356 and previous config saved to /var/cache/conftool/dbconfig/20250207-032642-marostegui.json [production]
03:14 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti1053.eqiad.wmnet with OS bookworm [production]
03:11 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P73355 and previous config saved to /var/cache/conftool/dbconfig/20250207-031134-marostegui.json [production]
02:56 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2172 (T384592)', diff saved to https://phabricator.wikimedia.org/P73354 and previous config saved to /var/cache/conftool/dbconfig/20250207-025628-marostegui.json [production]
02:00 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host ganeti1054.eqiad.wmnet with OS bookworm [production]
02:00 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.unset_maintenance (exit_code=0) [admin]
02:00 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.unset_maintenance [admin]
01:57 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1041.eqiad.wmnet [production]
01:54 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host ganeti1053.eqiad.wmnet with OS bookworm [production]
01:49 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti1053.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
01:49 <andrew@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudvirt1041.eqiad.wmnet [production]
01:48 <andrew@cloudcumin1001> END (ERROR) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=97) on hosts matched by 'D{cloudvirt1041.eqiad.wmnet}' [admin]
01:48 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host ganeti1053.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
01:47 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1041.eqiad.wmnet}' [admin]
01:42 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ganeti1053.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
01:41 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host ganeti1053.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
01:40 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=99) on hosts matched by 'D{cloudvirt1041.eqiad.wmnet}' [admin]
01:39 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1041.eqiad.wmnet}' [admin]
01:39 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1036.eqiad.wmnet}' [admin]
01:34 <bd808> Cherry-picked https://gerrit.wikimedia.org/r/c/operations/puppet/+/1117997 to deployment-puppetmaster (T385849) [releng]
01:32 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for tools-k8s-worker-nfs-7 [tools]
01:31 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1036.eqiad.wmnet}' [admin]
01:31 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1037.eqiad.wmnet}' [admin]
01:28 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-7 [tools]
01:28 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=99) for tools-k8s-worker-nfs-07 [tools]
01:28 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-07 [tools]
01:27 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=99) for tools-k8s-worker-nfs-07 [tools]
01:27 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-07 [tools]
01:12 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1037.eqiad.wmnet}' [admin]
01:12 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1038.eqiad.wmnet}' [admin]
00:53 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1038.eqiad.wmnet}' [admin]
00:53 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1039.eqiad.wmnet}' [admin]
00:42 <bd808> Shutoff deployment-parsoid14 to see if anything breaks/anyone yells (T385849) [releng]
00:33 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1039.eqiad.wmnet}' [admin]
00:26 <wmbot~lucaswerkmeister@tools-bastion-13> (new code / l10n version is not actually live yet due to T385847) [tools.ranker]
00:19 <wmbot~lucaswerkmeister@tools-bastion-13> deployed 1abc7122fa (l10n updates: lb, skr-arab) [tools.ranker]
2025-02-06 §
23:53 <bd808> Updated citoid-beta.wmflabs.org to point to deployment-docker-citoid02 [releng]
23:50 <bd808> Deleted beta-prometheus.wmflabs.org; it was pointed to an IP now owned by the mdwikioffline project. [releng]
23:48 <cstone> payments-wiki upgraded from d266fdf9 to 793998c0 [production]
23:43 <bd808> Deleted recently orphaned spiderpig.wmcloud.org proxy after discussion with dancy [releng]
23:07 <swfrench-wmf> ran cumin 'A:cp-text' 'run-puppet-agent -e "merging ATS Lua config change - T383845"' at 21:58:47 (retroactive) [production]
21:48 <swfrench-wmf> ran cumin 'A:cp-text' 'disable-puppet "merging ATS Lua config change - T383845"' [production]
21:46 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1040.eqiad.wmnet}' [admin]
21:27 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2172 (T384592)', diff saved to https://phabricator.wikimedia.org/P73352 and previous config saved to /var/cache/conftool/dbconfig/20250206-212719-marostegui.json [production]
21:27 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2172.codfw.wmnet with reason: Maintenance [production]
21:26 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2155 (T384592)', diff saved to https://phabricator.wikimedia.org/P73351 and previous config saved to /var/cache/conftool/dbconfig/20250206-212656-marostegui.json [production]