2024-06-17 |
10:31 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.rename from mw2323 to wikikube-worker2003 |
[production] |
10:29 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2204 (T367261)', diff saved to https://phabricator.wikimedia.org/P65107 and previous config saved to /var/cache/conftool/dbconfig/20240617-102938-marostegui.json |
[production] |
10:26 |
<jynus> |
restarting db2183, db2184 |
[production] |
10:24 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
10:24 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Fix AAAA records for mw232[3-9] - cgoubert@cumin1002" |
[production] |
10:21 |
<cgoubert@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Fix AAAA records for mw232[3-9] - cgoubert@cumin1002" |
[production] |
10:17 |
<cgoubert@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
10:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2204', diff saved to https://phabricator.wikimedia.org/P65106 and previous config saved to /var/cache/conftool/dbconfig/20240617-101431-marostegui.json |
[production] |
10:11 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
10:10 |
<kamila@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
10:09 |
<claime> |
Depooling mw2323.codfw.wmnet,mw2324.codfw.wmnet,mw2326.codfw.wmnet,mw2327.codfw.wmnet,mw2328.codfw.wmnet,mw2329.codfw.wmnet for reimage - T351074 |
[production] |
10:08 |
<claime> |
Depooling mw2323.codfw.wmnet,mw2324.codfw.wmnet,mw2326.codfw.wmnet,mw2327.codfw.wmnet,mw2328.codfw.wmnet,mw2329.codfw.wmnet for reimage |
[production] |
10:01 |
<claime> |
draining and cordoning mw2321 - T367702 |
[production] |
10:01 |
<brouberol@cumin2002> |
START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling reboot on A:kafka-jumbo-eqiad |
[production] |
10:01 |
<taavi@deploy1002> |
Finished scap: Backport for [[gerrit:1041742|Stop loading OSM i18n (T161553)]] (duration: 34m 07s) |
[production] |
09:59 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2204', diff saved to https://phabricator.wikimedia.org/P65104 and previous config saved to /var/cache/conftool/dbconfig/20240617-095924-marostegui.json |
[production] |
09:54 |
<jayme@deploy1002> |
Finished deploy [docker-pkg/deploy@38eb04d]: Update docker-pkg to 4.0.1 (duration: 00m 24s) |
[production] |
09:53 |
<jayme@deploy1002> |
Started deploy [docker-pkg/deploy@38eb04d]: Update docker-pkg to 4.0.1 |
[production] |
09:52 |
<jayme@deploy1002> |
Finished deploy [docker-pkg/deploy@4dbea81]: Update docker-pkg to 4.0.1 (duration: 00m 38s) |
[production] |
09:51 |
<jayme@deploy1002> |
Started deploy [docker-pkg/deploy@4dbea81]: Update docker-pkg to 4.0.1 |
[production] |
09:49 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1170.eqiad.wmnet with reason: Maintenance |
[production] |
09:49 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1170.eqiad.wmnet with reason: Maintenance |
[production] |
09:49 |
<taavi@deploy1002> |
taavi: Continuing with sync |
[production] |
09:49 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1158 (T364069)', diff saved to https://phabricator.wikimedia.org/P65103 and previous config saved to /var/cache/conftool/dbconfig/20240617-094926-marostegui.json |
[production] |
09:48 |
<taavi@deploy1002> |
taavi: Backport for [[gerrit:1041742|Stop loading OSM i18n (T161553)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
09:44 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2204 (T367261)', diff saved to https://phabricator.wikimedia.org/P65102 and previous config saved to /var/cache/conftool/dbconfig/20240617-094417-marostegui.json |
[production] |
09:40 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2204 (T367261)', diff saved to https://phabricator.wikimedia.org/P65101 and previous config saved to /var/cache/conftool/dbconfig/20240617-094034-marostegui.json |
[production] |
09:40 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2204.codfw.wmnet with reason: Maintenance |
[production] |
09:40 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2204.codfw.wmnet with reason: Maintenance |
[production] |
09:34 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2197.codfw.wmnet with reason: Maintenance |
[production] |
09:34 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2197.codfw.wmnet with reason: Maintenance |
[production] |
09:34 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2189 (T367261)', diff saved to https://phabricator.wikimedia.org/P65100 and previous config saved to /var/cache/conftool/dbconfig/20240617-093427-marostegui.json |
[production] |
09:34 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P65099 and previous config saved to /var/cache/conftool/dbconfig/20240617-093419-marostegui.json |
[production] |
09:26 |
<taavi@deploy1002> |
Started scap: Backport for [[gerrit:1041742|Stop loading OSM i18n (T161553)]] |
[production] |
09:19 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P65098 and previous config saved to /var/cache/conftool/dbconfig/20240617-091920-marostegui.json |
[production] |
09:19 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P65097 and previous config saved to /var/cache/conftool/dbconfig/20240617-091912-marostegui.json |
[production] |
09:05 |
<brouberol@cumin2002> |
END (PASS) - Cookbook sre.kafka.roll-restart-reboot-brokers (exit_code=0) rolling reboot on A:kafka-test-eqiad |
[production] |
09:04 |
<_joe_> |
removed damaged AOF file for redis rdb1014-6379, resyncing with primary |
[production] |
09:04 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P65096 and previous config saved to /var/cache/conftool/dbconfig/20240617-090413-marostegui.json |
[production] |
09:04 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1158 (T364069)', diff saved to https://phabricator.wikimedia.org/P65095 and previous config saved to /var/cache/conftool/dbconfig/20240617-090405-marostegui.json |
[production] |
09:01 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:1046599|throttle: Fix exemption for ongoing course]] (duration: 25m 05s) |
[production] |
08:53 |
<claime> |
hardcycling rdb1014 |
[production] |
08:49 |
<cgoubert@cumin1002> |
conftool action : set/pooled=inactive; selector: name=mw2321.codfw.wmnet |
[production] |
08:49 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2189 (T367261)', diff saved to https://phabricator.wikimedia.org/P65094 and previous config saved to /var/cache/conftool/dbconfig/20240617-084906-marostegui.json |
[production] |
08:40 |
<claime> |
powercycling rdb1014 |
[production] |
08:38 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2189.codfw.wmnet with reason: Maintenance |
[production] |
08:38 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2189.codfw.wmnet with reason: Maintenance |
[production] |
08:37 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T367261)', diff saved to https://phabricator.wikimedia.org/P65093 and previous config saved to /var/cache/conftool/dbconfig/20240617-083755-marostegui.json |
[production] |
08:36 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:1046599|throttle: Fix exemption for ongoing course]] |
[production] |
08:25 |
<brouberol@cumin2002> |
START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling reboot on A:kafka-test-eqiad |
[production] |