2022-05-18
08:19 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti4003.ulsfo.wmnet with reason: host reimage [production]
08:18 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1113:3316', diff saved to https://phabricator.wikimedia.org/P27904 and previous config saved to /var/cache/conftool/dbconfig/20220518-081852-ladsgroup.json [production]
08:17 <mvernon@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2056.codfw.wmnet with reason: host reimage [production]
08:16 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti4003.ulsfo.wmnet with reason: host reimage [production]
08:12 <jnuche@deploy1002> deploy-promote aborted: (duration: 03m 02s) [production]
08:11 <hashar> Jenkins CI is down, can't connect to the agents [production]
08:11 <moritzm> upgrading ganeti packages in eqsin to Ganeti 3.0 T308211 [production]
08:03 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1113:3316 (T303603)', diff saved to https://phabricator.wikimedia.org/P27903 and previous config saved to /var/cache/conftool/dbconfig/20220518-080347-ladsgroup.json [production]
08:03 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1130 (T298555)', diff saved to https://phabricator.wikimedia.org/P27902 and previous config saved to /var/cache/conftool/dbconfig/20220518-080339-ladsgroup.json [production]
08:03 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db1130.eqiad.wmnet with reason: Maintenance [production]
08:03 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 10:00:00 on db1130.eqiad.wmnet with reason: Maintenance [production]
08:02 <mvernon@cumin2002> START - Cookbook sre.hosts.reimage for host ms-be2056.codfw.wmnet with OS bullseye [production]
07:59 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti4003.ulsfo.wmnet with OS bullseye [production]
07:58 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1113:3316 (T303603)', diff saved to https://phabricator.wikimedia.org/P27900 and previous config saved to /var/cache/conftool/dbconfig/20220518-075826-ladsgroup.json [production]
07:58 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1113.eqiad.wmnet with reason: Maintenance [production]
07:58 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1113.eqiad.wmnet with reason: Maintenance [production]
07:56 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1163 (T298560)', diff saved to https://phabricator.wikimedia.org/P27898 and previous config saved to /var/cache/conftool/dbconfig/20220518-075620-ladsgroup.json [production]
07:56 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 16:00:00 on db1163.eqiad.wmnet with reason: Maintenance [production]
07:56 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 16:00:00 on db1163.eqiad.wmnet with reason: Maintenance [production]
07:54 <hashar> Restarting CI Jenkins [production]
07:41 <moritzm> imported jenkins 2.332.3 to thirdparty/ci for buster-wikimedia [production]
07:36 <dcausse> closing UTC morning backport window [production]
07:34 <dcausse@deploy1002> Synchronized php-1.39.0-wmf.12/extensions/WikibaseCirrusSearch/src/Query/HasLicenseFeature.php: Backport: [[gerrit:792650|haslicense: Apply minimum_should_match for elastic 7.x (T288765)]] (duration: 00m 52s) [production]
07:32 <dcausse@deploy1002> Synchronized php-1.39.0-wmf.12/extensions/CirrusSearch/includes/Query/FullTextSimpleMatchQueryBuilder.php: Backport: [[gerrit:792649|Resolve minimum_should_match warnings during random scoring (T288765)]] (duration: 00m 56s) [production]
07:30 <hashar> Restarting CI Jenkins [production]
07:29 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
07:28 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
07:28 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
07:27 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
07:23 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host cumin1001.eqiad.wmnet [production]
07:17 <marostegui> Cold reset wtp1045.mgmt ipmi [production]
07:14 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host cumin1001.eqiad.wmnet [production]
01:05 <ejegg> updated fundraising CiviCRM from d45afdfc to b8b8c177 [production]
2022-05-17
23:36 <ejegg> updated payments-wiki from 590fac28 to d9d63a3d [production]
22:31 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
22:29 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1122 (T300774)', diff saved to https://phabricator.wikimedia.org/P27896 and previous config saved to /var/cache/conftool/dbconfig/20220517-222904-ladsgroup.json [production]
22:27 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
22:27 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
22:23 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
22:18 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
22:17 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
22:17 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
22:16 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
22:16 <urbanecm@deploy1002> Synchronized wmf-config/interwiki.php: c2151b3: Update interwiki cache (duration: 00m 52s) [production]
22:15 <urbanecm@deploy1002> Synchronized langlist: cd704d4f: langlist: add kcg language (T305279) (duration: 00m 53s) [production]
22:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1122', diff saved to https://phabricator.wikimedia.org/P27895 and previous config saved to /var/cache/conftool/dbconfig/20220517-221359-ladsgroup.json [production]
21:58 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1122', diff saved to https://phabricator.wikimedia.org/P27894 and previous config saved to /var/cache/conftool/dbconfig/20220517-215854-ladsgroup.json [production]
21:52 <mutante> alert1001 - systemctl start certspotter (after alert that the unit was failed. happens sometimes) [production]
21:43 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1122 (T300774)', diff saved to https://phabricator.wikimedia.org/P27893 and previous config saved to /var/cache/conftool/dbconfig/20220517-214349-ladsgroup.json [production]
21:25 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1122 (T300774)', diff saved to https://phabricator.wikimedia.org/P27892 and previous config saved to /var/cache/conftool/dbconfig/20220517-212530-ladsgroup.json [production]