2024-06-17
§
|
09:34 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2197.codfw.wmnet with reason: Maintenance |
[production] |
09:34 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2189 (T367261)', diff saved to https://phabricator.wikimedia.org/P65100 and previous config saved to /var/cache/conftool/dbconfig/20240617-093427-marostegui.json |
[production] |
09:34 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P65099 and previous config saved to /var/cache/conftool/dbconfig/20240617-093419-marostegui.json |
[production] |
09:27 |
<Lucas_WMDE> |
wb-reconcile: shut down instance, I don’t think it’s needed at the moment (was running MediaWiki 1.35) |
[wikidata-dev] |
09:26 |
<taavi@deploy1002> |
Started scap: Backport for [[gerrit:1041742|Stop loading OSM i18n (T161553)]] |
[production] |
09:19 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P65098 and previous config saved to /var/cache/conftool/dbconfig/20240617-091920-marostegui.json |
[production] |
09:19 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P65097 and previous config saved to /var/cache/conftool/dbconfig/20240617-091912-marostegui.json |
[production] |
09:18 |
<Lucas_WMDE> |
deleted ssr-termbox.wmflabs.org proxy, backend (http://172.16.4.123:3030) was already gone (no known instance with IP .123 and no obvious existing instance that would fit) |
[wikidata-dev] |
09:16 |
<Lucas_WMDE> |
fedprops-opennext: shut down instance, MediaWiki was broken anyway (1.43 needs PHP 7.4.3+, 7.2 installed) and I believe the instance isn’t needed anymore |
[wikidata-dev] |
09:15 |
<Lucas_WMDE> |
reference-island: shut down instance, pretty sure it’s not needed anymore [re-log with correct name] |
[wikidata-dev] |
09:14 |
<Lucas_WMDE> |
reference:island: shut down instance, pretty sure it’s not needed anymore |
[wikidata-dev] |
09:05 |
<brouberol@cumin2002> |
END (PASS) - Cookbook sre.kafka.roll-restart-reboot-brokers (exit_code=0) rolling reboot on A:kafka-test-eqiad |
[production] |
09:04 |
<_joe_> |
removed damaged AOF file for redis rdb1014-6379, resyncing with primary |
[production] |
09:04 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P65096 and previous config saved to /var/cache/conftool/dbconfig/20240617-090413-marostegui.json |
[production] |
09:04 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1158 (T364069)', diff saved to https://phabricator.wikimedia.org/P65095 and previous config saved to /var/cache/conftool/dbconfig/20240617-090405-marostegui.json |
[production] |
09:01 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:1046599|throttle: Fix exemption for ongoing course]] (duration: 25m 05s) |
[production] |
08:53 |
<claime> |
hardcycling rdb1014 |
[production] |
08:49 |
<cgoubert@cumin1002> |
conftool action : set/pooled=inactive; selector: name=mw2321.codfw.wmnet |
[production] |
08:49 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2189 (T367261)', diff saved to https://phabricator.wikimedia.org/P65094 and previous config saved to /var/cache/conftool/dbconfig/20240617-084906-marostegui.json |
[production] |
08:47 |
<Lucas_WMDE> |
queripulator shut down instance, I’m pretty sure it’s not needed anymore |
[wikidata-dev] |
08:40 |
<claime> |
powercycling rdb1014 |
[production] |
08:38 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2189.codfw.wmnet with reason: Maintenance |
[production] |
08:38 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2189.codfw.wmnet with reason: Maintenance |
[production] |
08:37 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T367261)', diff saved to https://phabricator.wikimedia.org/P65093 and previous config saved to /var/cache/conftool/dbconfig/20240617-083755-marostegui.json |
[production] |
08:36 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:1046599|throttle: Fix exemption for ongoing course]] |
[production] |
08:25 |
<brouberol@cumin2002> |
START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling reboot on A:kafka-test-eqiad |
[production] |
08:22 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P65092 and previous config saved to /var/cache/conftool/dbconfig/20240617-082248-marostegui.json |
[production] |
08:21 |
<hashar> |
Updating Jenkins jobs for Quibble 1.9.0 https://gerrit.wikimedia.org/r/1044421 |
[releng] |
08:07 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P65091 and previous config saved to /var/cache/conftool/dbconfig/20240617-080741-marostegui.json |
[production] |
07:52 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T367261)', diff saved to https://phabricator.wikimedia.org/P65090 and previous config saved to /var/cache/conftool/dbconfig/20240617-075234-marostegui.json |
[production] |
07:45 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2148 (T367261)', diff saved to https://phabricator.wikimedia.org/P65089 and previous config saved to /var/cache/conftool/dbconfig/20240617-074542-marostegui.json |
[production] |
07:45 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2148.codfw.wmnet with reason: Maintenance |
[production] |
07:45 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2148.codfw.wmnet with reason: Maintenance |
[production] |
07:45 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2138 (T367261)', diff saved to https://phabricator.wikimedia.org/P65088 and previous config saved to /var/cache/conftool/dbconfig/20240617-074530-marostegui.json |
[production] |
07:30 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2138', diff saved to https://phabricator.wikimedia.org/P65087 and previous config saved to /var/cache/conftool/dbconfig/20240617-073023-marostegui.json |
[production] |
07:15 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2138', diff saved to https://phabricator.wikimedia.org/P65086 and previous config saved to /var/cache/conftool/dbconfig/20240617-071516-marostegui.json |
[production] |
07:00 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2138 (T367261)', diff saved to https://phabricator.wikimedia.org/P65085 and previous config saved to /var/cache/conftool/dbconfig/20240617-070009-marostegui.json |
[production] |
06:56 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db2189 (T352010)', diff saved to https://phabricator.wikimedia.org/P65084 and previous config saved to /var/cache/conftool/dbconfig/20240617-065647-ladsgroup.json |
[production] |
06:56 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2189.codfw.wmnet with reason: Maintenance |
[production] |
06:56 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2189.codfw.wmnet with reason: Maintenance |
[production] |
06:56 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175 (T352010)', diff saved to https://phabricator.wikimedia.org/P65083 and previous config saved to /var/cache/conftool/dbconfig/20240617-065625-ladsgroup.json |
[production] |
06:53 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2138 (T367261)', diff saved to https://phabricator.wikimedia.org/P65082 and previous config saved to /var/cache/conftool/dbconfig/20240617-065357-marostegui.json |
[production] |
06:53 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2138.codfw.wmnet with reason: Maintenance |
[production] |
06:53 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2138.codfw.wmnet with reason: Maintenance |
[production] |
06:53 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2126 (T367261)', diff saved to https://phabricator.wikimedia.org/P65081 and previous config saved to /var/cache/conftool/dbconfig/20240617-065335-marostegui.json |
[production] |
06:41 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P65080 and previous config saved to /var/cache/conftool/dbconfig/20240617-064118-ladsgroup.json |
[production] |
06:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P65079 and previous config saved to /var/cache/conftool/dbconfig/20240617-063923-root.json |
[production] |
06:38 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2126', diff saved to https://phabricator.wikimedia.org/P65078 and previous config saved to /var/cache/conftool/dbconfig/20240617-063826-marostegui.json |
[production] |
06:26 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P65077 and previous config saved to /var/cache/conftool/dbconfig/20240617-062612-ladsgroup.json |
[production] |
06:25 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1170 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P65076 and previous config saved to /var/cache/conftool/dbconfig/20240617-062511-root.json |
[production] |