151-200 of 10000 results (23ms)
2025-09-10 ยง
10:50 <fceratto@cumin1002> START - Cookbook sre.mysql.upgrade for db1173.eqiad.wmnet [production]
10:49 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host maps1013.eqiad.wmnet [production]
10:48 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1248', diff saved to https://phabricator.wikimedia.org/P83103 and previous config saved to /var/cache/conftool/dbconfig/20250910-104813-ladsgroup.json [production]
10:42 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Set db1181 with weight 0 T404180', diff saved to https://phabricator.wikimedia.org/P83101 and previous config saved to /var/cache/conftool/dbconfig/20250910-104223-ladsgroup.json [production]
10:40 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 27 hosts with reason: Primary switchover s7 T404180 [production]
10:34 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repool db1181 T399955', diff saved to https://phabricator.wikimedia.org/P83100 and previous config saved to /var/cache/conftool/dbconfig/20250910-103436-ladsgroup.json [production]
10:33 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host maps1012.eqiad.wmnet [production]
10:33 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1248', diff saved to https://phabricator.wikimedia.org/P83099 and previous config saved to /var/cache/conftool/dbconfig/20250910-103305-ladsgroup.json [production]
10:27 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host maps1012.eqiad.wmnet [production]
10:25 <fceratto@cumin1002> dbctl commit (dc=all): 'Promote db2203 to s1 primary T404178', diff saved to https://phabricator.wikimedia.org/P83098 and previous config saved to /var/cache/conftool/dbconfig/20250910-102507-fceratto.json [production]
10:24 <federico3> Starting s1 codfw failover from db2212 to db2203 - T404178 [production]
10:17 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1248 (T402763)', diff saved to https://phabricator.wikimedia.org/P83097 and previous config saved to /var/cache/conftool/dbconfig/20250910-101758-ladsgroup.json [production]
10:17 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host maps1011.eqiad.wmnet [production]
10:14 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1181.eqiad.wmnet with reason: Glow up [production]
10:13 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Glow up db1181 (T399955)', diff saved to https://phabricator.wikimedia.org/P83096 and previous config saved to /var/cache/conftool/dbconfig/20250910-101345-ladsgroup.json [production]
10:12 <moritzm> imported imposm3 0.14.1-2 T381565 [production]
10:10 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 32 hosts with reason: Primary switchover s1 T404178 [production]
10:10 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host maps1011.eqiad.wmnet [production]
10:09 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host maps2014.codfw.wmnet [production]
10:06 <moritzm> upgrading Envoy on Phabricator T402584 [production]
10:06 <moritzm> upgrading Envoy on lists T402584 [production]
10:03 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host maps2014.codfw.wmnet [production]
10:03 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host maps2013.codfw.wmnet [production]
09:57 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host maps2013.codfw.wmnet [production]
09:57 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db1248 (T402763)', diff saved to https://phabricator.wikimedia.org/P83095 and previous config saved to /var/cache/conftool/dbconfig/20250910-095700-ladsgroup.json [production]
09:56 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1248.eqiad.wmnet with reason: Maintenance [production]
09:56 <moritzm> upgrading Envoy on lists T402584 [production]
09:55 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host maps2012.codfw.wmnet [production]
09:49 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host maps2012.codfw.wmnet [production]
09:48 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host maps2011.codfw.wmnet [production]
09:47 <moritzm> upgrading Envoy on contint T402584 [production]
09:45 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
09:44 <jmm@cumin2002> END (PASS) - Cookbook sre.o11y.roll-restart-reboot-logstash-collectors (exit_code=0) rolling restart_daemons on A:logstash-collector [production]
09:42 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host maps2011.codfw.wmnet [production]
09:38 <moritzm> upgrading Envoy on Logstash T402584 [production]
09:37 <jmm@cumin2002> START - Cookbook sre.o11y.roll-restart-reboot-logstash-collectors rolling restart_daemons on A:logstash-collector [production]
09:28 <claime> cgoubert@deploy1003:/home$ sudo lvextend -L +20G /dev/vg0/root && sudo resize2fs /dev/vg0/root - T404060 [production]
08:55 <volans> force-reboot codesearch9 VM because it's unresponsive to ssh and console and the services are down T404163 [codesearch]
08:53 <volans@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=0) [codesearch]
08:52 <volans@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=0) [codesearch]
08:52 <volans@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.vm_console [codesearch]
08:51 <volans@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.vm_console [codesearch]
08:40 <volans> add volans to toolsbeta.admin tool [toolsbeta]
08:33 <dhinus> add volans to cloud-vps domain admins: openstack role add --user volans --domain default --inherited admin [admin]
08:19 <elukey@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host ml-serve1011.eqiad.wmnet [production]
08:19 <elukey@cumin1003> START - Cookbook sre.k8s.pool-depool-node pool for host ml-serve1011.eqiad.wmnet [production]
08:19 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ml-serve1011.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART [production]
08:16 <fnegri@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.add_user_to_project (exit_code=0) for user 'volans' in role 'member' [paws]
08:16 <fnegri@cloudcumin1001> START - Cookbook wmcs.vps.add_user_to_project for user 'volans' in role 'member' [paws]
08:16 <jayme@cumin1002> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "[not really into teleological thinking] - jayme@cumin1002" [production]