951-1000 of 10000 results (95ms)
2023-08-09 §
08:32 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2098.codfw.wmnet with reason: Maintenance [production]
08:32 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1167.eqiad.wmnet with reason: Maintenance [production]
08:32 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2098.codfw.wmnet with reason: Maintenance [production]
07:58 <mvernon@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1003.eqiad.wmnet [production]
07:52 <mvernon@cumin1001> START - Cookbook sre.hosts.reboot-single for host thanos-be1003.eqiad.wmnet [production]
07:12 <kartik@deploy1002> Finished scap: Backport for [[gerrit:946852|testwiki: Enable Section Translation for 7 Wikipedias (T343211)]] (duration: 09m 58s) [production]
07:05 <kartik@deploy1002> kartik: Continuing with sync [production]
07:03 <kartik@deploy1002> kartik: Backport for [[gerrit:946852|testwiki: Enable Section Translation for 7 Wikipedias (T343211)]] synced to the testservers mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) [production]
07:02 <kartik@deploy1002> Started scap: Backport for [[gerrit:946852|testwiki: Enable Section Translation for 7 Wikipedias (T343211)]] [production]
06:52 <root@cumin2002> END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Jkieserman out of all services on: 33 hosts [production]
06:51 <root@cumin2002> START - Cookbook sre.idm.logout Logging Jkieserman out of all services on: 33 hosts [production]
06:51 <root@cumin2002> END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Jkieserman out of all services on: 716 hosts [production]
06:51 <root@cumin2002> START - Cookbook sre.idm.logout Logging Jkieserman out of all services on: 716 hosts [production]
06:47 <root@cumin2002> END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Jkieserman out of all services on: 1309 hosts [production]
06:46 <root@cumin2002> START - Cookbook sre.idm.logout Logging Jkieserman out of all services on: 1309 hosts [production]
06:46 <root@cumin2002> END (FAIL) - Cookbook sre.idm.logout (exit_code=99) Logging Jmads out of all services on: 1309 hosts [production]
06:46 <root@cumin2002> START - Cookbook sre.idm.logout Logging Jmads out of all services on: 1309 hosts [production]
06:18 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1146:3314 (T342617)', diff saved to https://phabricator.wikimedia.org/P50222 and previous config saved to /var/cache/conftool/dbconfig/20230809-061826-ladsgroup.json [production]
06:18 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1146.eqiad.wmnet with reason: Maintenance [production]
06:18 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1146.eqiad.wmnet with reason: Maintenance [production]
01:31 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2137:3314 (T342617)', diff saved to https://phabricator.wikimedia.org/P50219 and previous config saved to /var/cache/conftool/dbconfig/20230809-013145-ladsgroup.json [production]
01:31 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2137.codfw.wmnet with reason: Maintenance [production]
01:31 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2137.codfw.wmnet with reason: Maintenance [production]
01:31 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2136 (T342617)', diff saved to https://phabricator.wikimedia.org/P50218 and previous config saved to /var/cache/conftool/dbconfig/20230809-013124-ladsgroup.json [production]
01:16 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2136', diff saved to https://phabricator.wikimedia.org/P50217 and previous config saved to /var/cache/conftool/dbconfig/20230809-011618-ladsgroup.json [production]
01:01 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2136', diff saved to https://phabricator.wikimedia.org/P50216 and previous config saved to /var/cache/conftool/dbconfig/20230809-010112-ladsgroup.json [production]
00:46 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2136 (T342617)', diff saved to https://phabricator.wikimedia.org/P50215 and previous config saved to /var/cache/conftool/dbconfig/20230809-004605-ladsgroup.json [production]
00:38 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1145.eqiad.wmnet with reason: Maintenance [production]
00:38 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1145.eqiad.wmnet with reason: Maintenance [production]
00:38 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1144:3314 (T342617)', diff saved to https://phabricator.wikimedia.org/P50214 and previous config saved to /var/cache/conftool/dbconfig/20230809-003817-ladsgroup.json [production]
00:23 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1144:3314', diff saved to https://phabricator.wikimedia.org/P50213 and previous config saved to /var/cache/conftool/dbconfig/20230809-002310-ladsgroup.json [production]
00:08 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1144:3314', diff saved to https://phabricator.wikimedia.org/P50212 and previous config saved to /var/cache/conftool/dbconfig/20230809-000804-ladsgroup.json [production]
2023-08-08 §
23:52 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1144:3314 (T342617)', diff saved to https://phabricator.wikimedia.org/P50211 and previous config saved to /var/cache/conftool/dbconfig/20230808-235258-ladsgroup.json [production]
22:33 <urbanecm> mwmaint1002: stop persistRevisionThreadItems.php frwiki instance because of T343859 (cc T315510) [production]
22:04 <bking@deploy1002> Finished deploy [wdqs/wdqs@f1a6177] (wcqs): f1a6177 (duration: 00m 17s) [production]
22:03 <bking@deploy1002> Started deploy [wdqs/wdqs@f1a6177] (wcqs): f1a6177 [production]
21:57 <bking@cumin1001> END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) [production]
21:46 <bking@cumin1001> START - Cookbook sre.wdqs.data-transfer [production]
21:46 <bking@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wcqs1003.eqiad.wmnet with OS bullseye [production]
21:22 <brett> Exported varnish-modules 0.15.0-4 for bookworm-wikimedia (T342154) [production]
21:18 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wcqs1003.eqiad.wmnet with reason: host reimage [production]
21:15 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on wcqs1003.eqiad.wmnet with reason: host reimage [production]
21:06 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox circuit ID 108 [production]
21:06 <ayounsi@cumin1001> START - Cookbook sre.network.debug for Netbox circuit ID 108 [production]
21:04 <bking@cumin1001> conftool action : set/pooled=no; selector: name=wcqs1003.eqiad.wmnet,service=wcqs [production]
21:02 <bking@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=wcqs,name=eqiad [production]
21:02 <bking@cumin1001> START - Cookbook sre.hosts.reimage for host wcqs1003.eqiad.wmnet with OS bullseye [production]
20:58 <bking@deploy1002> Finished deploy [wdqs/wdqs@f1a6177] (wcqs): f1a6177 (duration: 00m 17s) [production]
20:58 <bking@deploy1002> Started deploy [wdqs/wdqs@f1a6177] (wcqs): f1a6177 [production]
20:57 <bking@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wcqs1002.eqiad.wmnet with OS bullseye [production]