4101-4150 of 10000 results (88ms)
2023-08-28 ยง
10:23 <elukey@deploy1002> helmfile [eqiad] START helmfile.d/services/changeprop: sync [production]
10:22 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2015.codfw.wmnet [production]
10:21 <cgoubert@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-api-ext: apply [production]
10:20 <cgoubert@deploy1002> helmfile [codfw] START helmfile.d/services/mw-api-ext: apply [production]
10:19 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1149', diff saved to https://phabricator.wikimedia.org/P51563 and previous config saved to /var/cache/conftool/dbconfig/20230828-101949-ladsgroup.json [production]
10:17 <fabfur> enable puppet and start pybal on lvs4008 for reboot (T344587) [production]
10:16 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs4008.ulsfo.wmnet [production]
10:16 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2015.codfw.wmnet [production]
10:15 <cgoubert@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply [production]
10:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1134 (T343718)', diff saved to https://phabricator.wikimedia.org/P51562 and previous config saved to /var/cache/conftool/dbconfig/20230828-101426-ladsgroup.json [production]
10:14 <cgoubert@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-debug: apply [production]
10:14 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1134.eqiad.wmnet with reason: Maintenance [production]
10:14 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1134.eqiad.wmnet with reason: Maintenance [production]
10:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1132 (T343718)', diff saved to https://phabricator.wikimedia.org/P51561 and previous config saved to /var/cache/conftool/dbconfig/20230828-101405-ladsgroup.json [production]
10:13 <fabfur@cumin1001> START - Cookbook sre.hosts.reboot-single for host lvs4008.ulsfo.wmnet [production]
10:12 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2141.codfw.wmnet with reason: Maintenance [production]
10:12 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2141.codfw.wmnet with reason: Maintenance [production]
10:12 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2130 (T343718)', diff saved to https://phabricator.wikimedia.org/P51560 and previous config saved to /var/cache/conftool/dbconfig/20230828-101238-ladsgroup.json [production]
10:11 <cgoubert@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-debug: apply [production]
10:11 <elukey@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop: sync [production]
10:11 <elukey@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop: sync [production]
10:10 <cgoubert@deploy1002> helmfile [codfw] START helmfile.d/services/mw-debug: apply [production]
10:10 <claime> Deploying 952812 for T344814 to mw-debug and mw-api-ext [production]
10:10 <elukey@deploy1002> helmfile [staging] DONE helmfile.d/services/changeprop: sync [production]
10:09 <elukey@deploy1002> helmfile [staging] START helmfile.d/services/changeprop: sync [production]
10:08 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2155 (T344589)', diff saved to https://phabricator.wikimedia.org/P51559 and previous config saved to /var/cache/conftool/dbconfig/20230828-100823-ladsgroup.json [production]
10:08 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1214', diff saved to https://phabricator.wikimedia.org/P51558 and previous config saved to /var/cache/conftool/dbconfig/20230828-100814-ladsgroup.json [production]
10:08 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance es2027', diff saved to https://phabricator.wikimedia.org/P51557 and previous config saved to /var/cache/conftool/dbconfig/20230828-100814-ladsgroup.json [production]
10:04 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1149 (T344589)', diff saved to https://phabricator.wikimedia.org/P51556 and previous config saved to /var/cache/conftool/dbconfig/20230828-100443-ladsgroup.json [production]
10:02 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2013.codfw.wmnet [production]
10:02 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2013.codfw.wmnet [production]
10:01 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2155 (T344589)', diff saved to https://phabricator.wikimedia.org/P51555 and previous config saved to /var/cache/conftool/dbconfig/20230828-100045-ladsgroup.json [production]
10:01 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance [production]
10:00 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance [production]
10:00 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2155.codfw.wmnet with reason: Maintenance [production]
10:00 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2155.codfw.wmnet with reason: Maintenance [production]
10:00 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2147 (T344589)', diff saved to https://phabricator.wikimedia.org/P51554 and previous config saved to /var/cache/conftool/dbconfig/20230828-100005-ladsgroup.json [production]
09:59 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1132', diff saved to https://phabricator.wikimedia.org/P51553 and previous config saved to /var/cache/conftool/dbconfig/20230828-095859-ladsgroup.json [production]
09:57 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2130', diff saved to https://phabricator.wikimedia.org/P51552 and previous config saved to /var/cache/conftool/dbconfig/20230828-095732-ladsgroup.json [production]
09:57 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1149 (T344589)', diff saved to https://phabricator.wikimedia.org/P51551 and previous config saved to /var/cache/conftool/dbconfig/20230828-095722-ladsgroup.json [production]
09:57 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1149.eqiad.wmnet with reason: Maintenance [production]
09:57 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1149.eqiad.wmnet with reason: Maintenance [production]
09:56 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1148 (T344589)', diff saved to https://phabricator.wikimedia.org/P51550 and previous config saved to /var/cache/conftool/dbconfig/20230828-095658-ladsgroup.json [production]
09:56 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2013.codfw.wmnet [production]
09:54 <fabfur> disable puppet and stop pybal on lvs4008 for reboot (T344587) [production]
09:53 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1214 (T344589)', diff saved to https://phabricator.wikimedia.org/P51549 and previous config saved to /var/cache/conftool/dbconfig/20230828-095308-ladsgroup.json [production]
09:53 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance es2027 (T344589)', diff saved to https://phabricator.wikimedia.org/P51548 and previous config saved to /var/cache/conftool/dbconfig/20230828-095308-ladsgroup.json [production]
09:48 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling es2027 (T344589)', diff saved to https://phabricator.wikimedia.org/P51547 and previous config saved to /var/cache/conftool/dbconfig/20230828-094813-ladsgroup.json [production]
09:48 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es2027.codfw.wmnet with reason: Maintenance [production]
09:47 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es2027.codfw.wmnet with reason: Maintenance [production]