2022-04-04
20:59 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1099:3311 (T298565)', diff saved to https://phabricator.wikimedia.org/P24066 and previous config saved to /var/cache/conftool/dbconfig/20220404-205932-ladsgroup.json [production]
20:59 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1099.eqiad.wmnet with reason: Maintenance [production]
20:59 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1099.eqiad.wmnet with reason: Maintenance [production]
20:59 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1169 (T298565)', diff saved to https://phabricator.wikimedia.org/P24065 and previous config saved to /var/cache/conftool/dbconfig/20220404-205924-ladsgroup.json [production]
20:44 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1169', diff saved to https://phabricator.wikimedia.org/P24064 and previous config saved to /var/cache/conftool/dbconfig/20220404-204419-ladsgroup.json [production]
20:40 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp1081.eqiad.wmnet [production]
20:40 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp5010.eqsin.wmnet [production]
20:37 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp3061.esams.wmnet [production]
20:32 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp1081.eqiad.wmnet [production]
20:31 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp5010.eqsin.wmnet [production]
20:30 <urbanecm> UTC late B&C window completed [production]
20:29 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp3061.esams.wmnet [production]
20:29 <urbanecm@deploy1002> Synchronized wmf-config/InitialiseSettings.php: 8c81de9c732adef4537226ec6a7023fef40f3396: Remove wgWMEIPAddressCopyActionEnabled from Beta and production config (T296469) (duration: 00m 51s) [production]
20:29 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1169', diff saved to https://phabricator.wikimedia.org/P24063 and previous config saved to /var/cache/conftool/dbconfig/20220404-202914-ladsgroup.json [production]
20:26 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp5006.eqsin.wmnet [production]
20:21 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp1080.eqiad.wmnet [production]
20:16 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
20:16 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp5006.eqsin.wmnet [production]
20:15 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
20:15 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
20:15 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp4027.ulsfo.wmnet [production]
20:14 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
20:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1169 (T298565)', diff saved to https://phabricator.wikimedia.org/P24062 and previous config saved to /var/cache/conftool/dbconfig/20220404-201409-ladsgroup.json [production]
20:11 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp1080.eqiad.wmnet [production]
20:10 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp3060.esams.wmnet [production]
20:05 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp4027.ulsfo.wmnet [production]
20:00 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp3060.esams.wmnet [production]
20:00 <sukhe@cumin2002> END (ERROR) - Cookbook sre.hosts.reboot-single (exit_code=97) for host cp3060.esams.wmnet [production]
20:00 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp3060.esams.wmnet [production]
19:56 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp5005.eqsin.wmnet [production]
19:51 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp5005.eqsin.wmnet [production]
19:50 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp2040.codfw.wmnet [production]
19:43 <herron@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lists1001.wikimedia.org [production]
19:42 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp2040.codfw.wmnet [production]
19:39 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp1079.eqiad.wmnet [production]
19:38 <herron@cumin1001> START - Cookbook sre.hosts.reboot-single for host lists1001.wikimedia.org [production]
19:37 <herron@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kafkamon1002.eqiad.wmnet [production]
19:35 <herron@cumin1001> START - Cookbook sre.hosts.reboot-single for host kafkamon1002.eqiad.wmnet [production]
19:35 <herron@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kafkamon2002.codfw.wmnet [production]
19:33 <herron@cumin1001> START - Cookbook sre.hosts.reboot-single for host kafkamon2002.codfw.wmnet [production]
19:29 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp1079.eqiad.wmnet [production]
19:22 <herron@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host centrallog1001.eqiad.wmnet [production]
19:17 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1169 (T298565)', diff saved to https://phabricator.wikimedia.org/P24061 and previous config saved to /var/cache/conftool/dbconfig/20220404-191750-ladsgroup.json [production]
19:17 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1169.eqiad.wmnet with reason: Maintenance [production]
19:17 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1169.eqiad.wmnet with reason: Maintenance [production]
19:17 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1164 (T298565)', diff saved to https://phabricator.wikimedia.org/P24060 and previous config saved to /var/cache/conftool/dbconfig/20220404-191743-ladsgroup.json [production]
19:16 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5005.eqsin.wmnet,service=ats-tls [production]
19:16 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5005.eqsin.wmnet,service=ats-be [production]
19:16 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5005.eqsin.wmnet,service=varnish-fe [production]
19:16 <herron@cumin1001> START - Cookbook sre.hosts.reboot-single for host centrallog1001.eqiad.wmnet [production]