2024-04-29
§
|
06:34 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2186.codfw.wmnet with reason: Maintenance |
[production] |
06:34 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2186.codfw.wmnet with reason: Maintenance |
[production] |
06:34 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2128.codfw.wmnet with reason: Maintenance |
[production] |
06:34 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db2128.codfw.wmnet with reason: Maintenance |
[production] |
06:34 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2123 (T361627)', diff saved to https://phabricator.wikimedia.org/P61314 and previous config saved to /var/cache/conftool/dbconfig/20240429-063412-marostegui.json |
[production] |
06:24 |
<marostegui> |
Restart sanitarium instances in eqiad T363276 |
[production] |
06:23 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1212.eqiad.wmnet with reason: host reimage |
[production] |
06:21 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es1023.eqiad.wmnet with reason: host reimage |
[production] |
06:20 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1212.eqiad.wmnet with reason: host reimage |
[production] |
06:19 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2123', diff saved to https://phabricator.wikimedia.org/P61313 and previous config saved to /var/cache/conftool/dbconfig/20240429-061905-marostegui.json |
[production] |
06:17 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on es1023.eqiad.wmnet with reason: host reimage |
[production] |
06:14 |
<marostegui> |
Restart sanitarium instances in codfw T363276 |
[production] |
06:06 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host db1212.eqiad.wmnet with OS bookworm |
[production] |
06:04 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1212', diff saved to https://phabricator.wikimedia.org/P61312 and previous config saved to /var/cache/conftool/dbconfig/20240429-060423-root.json |
[production] |
06:03 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2123', diff saved to https://phabricator.wikimedia.org/P61311 and previous config saved to /var/cache/conftool/dbconfig/20240429-060358-marostegui.json |
[production] |
06:02 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host es1023.eqiad.wmnet with OS bookworm |
[production] |
05:58 |
<marostegui@deploy1002> |
Finished scap: Backport for [[gerrit:1024725|Revert "db-production.php: Disable writes on es5"]] (duration: 14m 47s) |
[production] |
05:48 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2123 (T361627)', diff saved to https://phabricator.wikimedia.org/P61310 and previous config saved to /var/cache/conftool/dbconfig/20240429-054850-marostegui.json |
[production] |
05:46 |
<marostegui@deploy1002> |
marostegui: Continuing with sync |
[production] |
05:46 |
<marostegui@deploy1002> |
marostegui: Backport for [[gerrit:1024725|Revert "db-production.php: Disable writes on es5"]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
05:45 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2123 (T361627)', diff saved to https://phabricator.wikimedia.org/P61309 and previous config saved to /var/cache/conftool/dbconfig/20240429-054519-marostegui.json |
[production] |
05:45 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2123.codfw.wmnet with reason: Maintenance |
[production] |
05:44 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db2123.codfw.wmnet with reason: Maintenance |
[production] |
05:44 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2198.codfw.wmnet with reason: Maintenance |
[production] |
05:44 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2198.codfw.wmnet with reason: Maintenance |
[production] |
05:44 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2195 (T352010)', diff saved to https://phabricator.wikimedia.org/P61308 and previous config saved to /var/cache/conftool/dbconfig/20240429-054413-ladsgroup.json |
[production] |
05:43 |
<marostegui@deploy1002> |
Started scap: Backport for [[gerrit:1024725|Revert "db-production.php: Disable writes on es5"]] |
[production] |
05:41 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool es1023 T361548', diff saved to https://phabricator.wikimedia.org/P61306 and previous config saved to /var/cache/conftool/dbconfig/20240429-054158-marostegui.json |
[production] |
05:40 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Promote es1024 to es5 primary T361548', diff saved to https://phabricator.wikimedia.org/P61305 and previous config saved to /var/cache/conftool/dbconfig/20240429-054035-marostegui.json |
[production] |
05:40 |
<marostegui> |
Starting es5 eqiad failover from es1023 to es1024 T361548 |
[production] |
05:35 |
<marostegui@deploy1002> |
Finished scap: Backport for [[gerrit:1024997|db-production.php: Disable writes on es5 (T361548)]] (duration: 26m 58s) |
[production] |
05:29 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2195', diff saved to https://phabricator.wikimedia.org/P61304 and previous config saved to /var/cache/conftool/dbconfig/20240429-052906-ladsgroup.json |
[production] |
05:23 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set es1024 with weight 0 T361548', diff saved to https://phabricator.wikimedia.org/P61303 and previous config saved to /var/cache/conftool/dbconfig/20240429-052311-root.json |
[production] |
05:22 |
<marostegui@deploy1002> |
marostegui: Continuing with sync |
[production] |
05:22 |
<marostegui@deploy1002> |
marostegui: Backport for [[gerrit:1024997|db-production.php: Disable writes on es5 (T361548)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
05:13 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2195', diff saved to https://phabricator.wikimedia.org/P61302 and previous config saved to /var/cache/conftool/dbconfig/20240429-051359-ladsgroup.json |
[production] |
05:08 |
<marostegui@deploy1002> |
Started scap: Backport for [[gerrit:1024997|db-production.php: Disable writes on es5 (T361548)]] |
[production] |
05:05 |
<root@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 6 hosts with reason: Primary switchover es5 T322187 |
[production] |
05:04 |
<root@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 6 hosts with reason: Primary switchover es5 T322187 |
[production] |
04:58 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2195 (T352010)', diff saved to https://phabricator.wikimedia.org/P61301 and previous config saved to /var/cache/conftool/dbconfig/20240429-045851-ladsgroup.json |
[production] |
03:11 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudbackup1003.eqiad.wmnet with OS bookworm |
[production] |
02:17 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudbackup1003.eqiad.wmnet with reason: host reimage |
[production] |
02:14 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudbackup1003.eqiad.wmnet with reason: host reimage |
[production] |
01:42 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudbackup1003.eqiad.wmnet with OS bookworm |
[production] |