2022-11-01
14:10 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on ganeti1028.eqiad.wmnet with reason: host reimage [production]
14:10 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti1028.eqiad.wmnet with reason: host reimage [production]
14:04 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1156', diff saved to https://phabricator.wikimedia.org/P37375 and previous config saved to /var/cache/conftool/dbconfig/20221101-140402-ladsgroup.json [production]
14:04 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-debug: apply [production]
14:02 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mw-debug: apply [production]
14:02 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply [production]
14:02 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-debug: apply [production]
14:02 <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:851626|[GrowthExperiments] Remove wmgGEFeaturesMayBeAvailableToNewcomers]] (duration: 04m 32s) [production]
14:00 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1131', diff saved to https://phabricator.wikimedia.org/P37374 and previous config saved to /var/cache/conftool/dbconfig/20221101-135827-ladsgroup.json [production]
13:59 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P37373 and previous config saved to /var/cache/conftool/dbconfig/20221101-135811-ladsgroup.json [production]
13:59 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2138:3312', diff saved to https://phabricator.wikimedia.org/P37372 and previous config saved to /var/cache/conftool/dbconfig/20221101-135800-ladsgroup.json [production]
13:59 <moritzm> draining ganeti1016 for eventual reimage T311687 [production]
13:59 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti1028.eqiad.wmnet with OS bullseye [production]
13:59 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
13:59 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp4038.ulsfo.wmnet [production]
13:59 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
13:57 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp4037.ulsfo.wmnet [production]
13:56 <moritzm> installing exim4 security updates on buster [production]
13:55 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-debug: apply [production]
13:54 <urbanecm@deploy1002> Started scap: Backport for [[gerrit:851626|[GrowthExperiments] Remove wmgGEFeaturesMayBeAvailableToNewcomers]] [production]
13:53 <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:851631|Copy reverse-proxy-staging.php to reverse-proxy-labs.php]], [[gerrit:851630|"reverse-proxy-staging.php" -> "reverse-staging-labs.php"]], [[gerrit:851633|Delete "reverse-proxy-staging.php"]] (duration: 04m 30s) [production]
13:53 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mw-debug: apply [production]
13:53 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply [production]
13:53 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-debug: apply [production]
13:52 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2158 (T318955)', diff saved to https://phabricator.wikimedia.org/P37371 and previous config saved to /var/cache/conftool/dbconfig/20221101-135120-ladsgroup.json [production]
13:51 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 16:00:00 on db2095.codfw.wmnet with reason: Maintenance [production]
13:51 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 16:00:00 on db2095.codfw.wmnet with reason: Maintenance [production]
13:51 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2158.codfw.wmnet with reason: Maintenance [production]
13:50 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 8:00:00 on db2158.codfw.wmnet with reason: Maintenance [production]
13:49 <urbanecm@deploy1002> urbanecm and zabe: Backport for [[gerrit:851631|Copy reverse-proxy-staging.php to reverse-proxy-labs.php]], [[gerrit:851630|"reverse-proxy-staging.php" -> "reverse-staging-labs.php"]], [[gerrit:851633|Delete "reverse-proxy-staging.php"]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
13:49 <urbanecm@deploy1002> Started scap: Backport for [[gerrit:851631|Copy reverse-proxy-staging.php to reverse-proxy-labs.php]], [[gerrit:851630|"reverse-proxy-staging.php" -> "reverse-staging-labs.php"]], [[gerrit:851633|Delete "reverse-proxy-staging.php"]] [production]
13:48 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1156', diff saved to https://phabricator.wikimedia.org/P37370 and previous config saved to /var/cache/conftool/dbconfig/20221101-134854-ladsgroup.json [production]
13:43 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1131 (T318955)', diff saved to https://phabricator.wikimedia.org/P37369 and previous config saved to /var/cache/conftool/dbconfig/20221101-134318-ladsgroup.json [production]
13:43 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2154 (T318605)', diff saved to https://phabricator.wikimedia.org/P37368 and previous config saved to /var/cache/conftool/dbconfig/20221101-134302-ladsgroup.json [production]
13:43 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2138:3312', diff saved to https://phabricator.wikimedia.org/P37367 and previous config saved to /var/cache/conftool/dbconfig/20221101-134252-ladsgroup.json [production]
13:42 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1131 (T318955)', diff saved to https://phabricator.wikimedia.org/P37366 and previous config saved to /var/cache/conftool/dbconfig/20221101-134108-ladsgroup.json [production]
13:42 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1131.eqiad.wmnet with reason: Maintenance [production]
13:42 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp4037.ulsfo.wmnet [production]
13:42 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 8:00:00 on db1131.eqiad.wmnet with reason: Maintenance [production]
13:40 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1113:3316 (T318955)', diff saved to https://phabricator.wikimedia.org/P37365 and previous config saved to /var/cache/conftool/dbconfig/20221101-134045-ladsgroup.json [production]
13:39 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2141.codfw.wmnet with reason: Maintenance [production]
13:39 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 8:00:00 on db2141.codfw.wmnet with reason: Maintenance [production]
13:38 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2129 (T318955)', diff saved to https://phabricator.wikimedia.org/P37364 and previous config saved to /var/cache/conftool/dbconfig/20221101-133857-ladsgroup.json [production]
13:37 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1156 (T318950)', diff saved to https://phabricator.wikimedia.org/P37363 and previous config saved to /var/cache/conftool/dbconfig/20221101-133346-ladsgroup.json [production]
13:37 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1156 (T318950)', diff saved to https://phabricator.wikimedia.org/P37362 and previous config saved to /var/cache/conftool/dbconfig/20221101-133132-ladsgroup.json [production]
13:35 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance [production]
13:33 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance [production]
13:33 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1156.eqiad.wmnet with reason: Maintenance [production]
13:31 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1156.eqiad.wmnet with reason: Maintenance [production]
13:31 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1146:3312 (T318950)', diff saved to https://phabricator.wikimedia.org/P37361 and previous config saved to /var/cache/conftool/dbconfig/20221101-133113-ladsgroup.json [production]