2022-05-18
15:07 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 10:00:00 on db1110.eqiad.wmnet with reason: Maintenance [production]
15:07 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1144:3315 (T298555)', diff saved to https://phabricator.wikimedia.org/P27956 and previous config saved to /var/cache/conftool/dbconfig/20220518-150714-ladsgroup.json [production]
15:04 <btullis@deploy1002> helmfile [eqiad] DONE helmfile.d/services/datahub: sync on main [production]
15:04 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1006.eqiad.wmnet [production]
15:04 <vgutierrez> rolling upgrade to HAProxy 2.4.17 in eqiad - T307444 [production]
15:03 <btullis@deploy1002> helmfile [eqiad] START helmfile.d/services/datahub: apply on main [production]
14:56 <btullis@deploy1002> helmfile [codfw] DONE helmfile.d/services/datahub: sync on main [production]
14:56 <btullis@deploy1002> helmfile [codfw] START helmfile.d/services/datahub: apply on main [production]
14:56 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1168 (T303603)', diff saved to https://phabricator.wikimedia.org/P27955 and previous config saved to /var/cache/conftool/dbconfig/20220518-145603-ladsgroup.json [production]
14:55 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
14:54 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
14:52 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1144:3315', diff saved to https://phabricator.wikimedia.org/P27954 and previous config saved to /var/cache/conftool/dbconfig/20220518-145208-ladsgroup.json [production]
14:45 <jnuche@deploy1002> rebuilt and synchronized wikiversions files: Set commonswiki to 1.39.0-wmf.12 [production]
14:40 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P27952 and previous config saved to /var/cache/conftool/dbconfig/20220518-144058-ladsgroup.json [production]
14:39 <jnuche@deploy1002> scap failed: average error rate on 6/8 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org for details) [production]
14:37 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1144:3315', diff saved to https://phabricator.wikimedia.org/P27951 and previous config saved to /var/cache/conftool/dbconfig/20220518-143703-ladsgroup.json [production]
14:25 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P27949 and previous config saved to /var/cache/conftool/dbconfig/20220518-142553-ladsgroup.json [production]
14:21 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1144:3315 (T298555)', diff saved to https://phabricator.wikimedia.org/P27948 and previous config saved to /var/cache/conftool/dbconfig/20220518-142158-ladsgroup.json [production]
14:15 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
14:10 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1168 (T303603)', diff saved to https://phabricator.wikimedia.org/P27947 and previous config saved to /var/cache/conftool/dbconfig/20220518-141048-ladsgroup.json [production]
14:10 <vgutierrez> rolling upgrade to HAProxy 2.4.17 in esams - T307444 [production]
14:09 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
14:09 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
14:08 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1168 (T303603)', diff saved to https://phabricator.wikimedia.org/P27946 and previous config saved to /var/cache/conftool/dbconfig/20220518-140812-ladsgroup.json [production]
14:08 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1168.eqiad.wmnet with reason: Maintenance [production]
14:08 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1168.eqiad.wmnet with reason: Maintenance [production]
14:08 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1180 (T303603)', diff saved to https://phabricator.wikimedia.org/P27945 and previous config saved to /var/cache/conftool/dbconfig/20220518-140804-ladsgroup.json [production]
14:02 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
13:57 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
13:52 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P27944 and previous config saved to /var/cache/conftool/dbconfig/20220518-135259-ladsgroup.json [production]
13:51 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
13:51 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
13:44 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
13:44 <jforrester@deploy1002> Synchronized multiversion/MWMultiVersion.php: Config: [[gerrit:740304|Make use of the ?? operator in more trivial situations]] (duration: 00m 53s) [production]
13:43 <jforrester@deploy1002> Synchronized wmf-config/Wikibase.php: Config: [[gerrit:740304|Make use of the ?? operator in more trivial situations]] (duration: 00m 52s) [production]
13:42 <jforrester@deploy1002> Synchronized w/health-check.php: Config: [[gerrit:740304|Make use of the ?? operator in more trivial situations]] (duration: 00m 52s) [production]
13:40 <jforrester@deploy1002> Synchronized rpc/RunJobs.php: Config: [[gerrit:740304|Make use of the ?? operator in more trivial situations]] (duration: 00m 51s) [production]
13:40 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be2060.codfw.wmnet with OS bullseye [production]
13:39 <jforrester@deploy1002> Synchronized docroot/noc/conf/highlight.php: Config: [[gerrit:740304|Make use of the ?? operator in more trivial situations]] (duration: 00m 51s) [production]
13:39 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
13:39 <volans@cumin1001> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ns-recursor1.openstack.codfw1dev.wikimediacloud.org on all recursors [production]
13:39 <volans@cumin1001> START - Cookbook sre.dns.wipe-cache ns-recursor1.openstack.codfw1dev.wikimediacloud.org on all recursors [production]
13:39 <volans@cumin1001> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ns-recursor0.openstack.codfw1dev.wikimediacloud.org on all recursors [production]
13:39 <volans@cumin1001> START - Cookbook sre.dns.wipe-cache ns-recursor0.openstack.codfw1dev.wikimediacloud.org on all recursors [production]
13:38 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
13:38 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
13:38 <jforrester@deploy1002> Synchronized docroot/wwwportal/w/search-redirect.php: Config: [[gerrit:740304|Make use of the ?? operator in more trivial situations]] (duration: 00m 51s) [production]
13:37 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P27943 and previous config saved to /var/cache/conftool/dbconfig/20220518-133753-ladsgroup.json [production]
13:37 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
13:36 <volans@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]