3551-3600 of 10000 results (93ms)
2024-04-29 ยง
06:47 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2159', diff saved to https://phabricator.wikimedia.org/P61318 and previous config saved to /var/cache/conftool/dbconfig/20240429-064717-root.json [production]
06:46 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1212.eqiad.wmnet with OS bookworm [production]
06:44 <marostegui@cumin1002> dbctl commit (dc=all): 'db1212 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P61317 and previous config saved to /var/cache/conftool/dbconfig/20240429-064420-root.json [production]
06:43 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es1023.eqiad.wmnet with OS bookworm [production]
06:38 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2128 (T361627)', diff saved to https://phabricator.wikimedia.org/P61316 and previous config saved to /var/cache/conftool/dbconfig/20240429-063819-marostegui.json [production]
06:34 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2128 (T361627)', diff saved to https://phabricator.wikimedia.org/P61315 and previous config saved to /var/cache/conftool/dbconfig/20240429-063450-marostegui.json [production]
06:34 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2186.codfw.wmnet with reason: Maintenance [production]
06:34 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db2186.codfw.wmnet with reason: Maintenance [production]
06:34 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2128.codfw.wmnet with reason: Maintenance [production]
06:34 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db2128.codfw.wmnet with reason: Maintenance [production]
06:34 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2123 (T361627)', diff saved to https://phabricator.wikimedia.org/P61314 and previous config saved to /var/cache/conftool/dbconfig/20240429-063412-marostegui.json [production]
06:24 <marostegui> Restart sanitarium instances in eqiad T363276 [production]
06:23 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1212.eqiad.wmnet with reason: host reimage [production]
06:21 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es1023.eqiad.wmnet with reason: host reimage [production]
06:20 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db1212.eqiad.wmnet with reason: host reimage [production]
06:19 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2123', diff saved to https://phabricator.wikimedia.org/P61313 and previous config saved to /var/cache/conftool/dbconfig/20240429-061905-marostegui.json [production]
06:17 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on es1023.eqiad.wmnet with reason: host reimage [production]
06:14 <marostegui> Restart sanitarium instances in codfw T363276 [production]
06:06 <marostegui@cumin1002> START - Cookbook sre.hosts.reimage for host db1212.eqiad.wmnet with OS bookworm [production]
06:04 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db1212', diff saved to https://phabricator.wikimedia.org/P61312 and previous config saved to /var/cache/conftool/dbconfig/20240429-060423-root.json [production]
06:03 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2123', diff saved to https://phabricator.wikimedia.org/P61311 and previous config saved to /var/cache/conftool/dbconfig/20240429-060358-marostegui.json [production]
06:02 <marostegui@cumin1002> START - Cookbook sre.hosts.reimage for host es1023.eqiad.wmnet with OS bookworm [production]
05:58 <marostegui@deploy1002> Finished scap: Backport for [[gerrit:1024725|Revert "db-production.php: Disable writes on es5"]] (duration: 14m 47s) [production]
05:48 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2123 (T361627)', diff saved to https://phabricator.wikimedia.org/P61310 and previous config saved to /var/cache/conftool/dbconfig/20240429-054850-marostegui.json [production]
05:46 <marostegui@deploy1002> marostegui: Continuing with sync [production]
05:46 <marostegui@deploy1002> marostegui: Backport for [[gerrit:1024725|Revert "db-production.php: Disable writes on es5"]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
05:45 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2123 (T361627)', diff saved to https://phabricator.wikimedia.org/P61309 and previous config saved to /var/cache/conftool/dbconfig/20240429-054519-marostegui.json [production]
05:45 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2123.codfw.wmnet with reason: Maintenance [production]
05:44 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db2123.codfw.wmnet with reason: Maintenance [production]
05:44 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2198.codfw.wmnet with reason: Maintenance [production]
05:44 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2198.codfw.wmnet with reason: Maintenance [production]
05:44 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2195 (T352010)', diff saved to https://phabricator.wikimedia.org/P61308 and previous config saved to /var/cache/conftool/dbconfig/20240429-054413-ladsgroup.json [production]
05:43 <marostegui@deploy1002> Started scap: Backport for [[gerrit:1024725|Revert "db-production.php: Disable writes on es5"]] [production]
05:41 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool es1023 T361548', diff saved to https://phabricator.wikimedia.org/P61306 and previous config saved to /var/cache/conftool/dbconfig/20240429-054158-marostegui.json [production]
05:40 <marostegui@cumin1002> dbctl commit (dc=all): 'Promote es1024 to es5 primary T361548', diff saved to https://phabricator.wikimedia.org/P61305 and previous config saved to /var/cache/conftool/dbconfig/20240429-054035-marostegui.json [production]
05:40 <marostegui> Starting es5 eqiad failover from es1023 to es1024 T361548 [production]
05:35 <marostegui@deploy1002> Finished scap: Backport for [[gerrit:1024997|db-production.php: Disable writes on es5 (T361548)]] (duration: 26m 58s) [production]
05:29 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2195', diff saved to https://phabricator.wikimedia.org/P61304 and previous config saved to /var/cache/conftool/dbconfig/20240429-052906-ladsgroup.json [production]
05:23 <marostegui@cumin1002> dbctl commit (dc=all): 'Set es1024 with weight 0 T361548', diff saved to https://phabricator.wikimedia.org/P61303 and previous config saved to /var/cache/conftool/dbconfig/20240429-052311-root.json [production]
05:22 <marostegui@deploy1002> marostegui: Continuing with sync [production]
05:22 <marostegui@deploy1002> marostegui: Backport for [[gerrit:1024997|db-production.php: Disable writes on es5 (T361548)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
05:13 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2195', diff saved to https://phabricator.wikimedia.org/P61302 and previous config saved to /var/cache/conftool/dbconfig/20240429-051359-ladsgroup.json [production]
05:08 <marostegui@deploy1002> Started scap: Backport for [[gerrit:1024997|db-production.php: Disable writes on es5 (T361548)]] [production]
05:05 <root@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 6 hosts with reason: Primary switchover es5 T322187 [production]
05:04 <root@cumin1002> START - Cookbook sre.hosts.downtime for 1:00:00 on 6 hosts with reason: Primary switchover es5 T322187 [production]
04:58 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2195 (T352010)', diff saved to https://phabricator.wikimedia.org/P61301 and previous config saved to /var/cache/conftool/dbconfig/20240429-045851-ladsgroup.json [production]
03:11 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudbackup1003.eqiad.wmnet with OS bookworm [production]
02:17 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudbackup1003.eqiad.wmnet with reason: host reimage [production]
02:14 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudbackup1003.eqiad.wmnet with reason: host reimage [production]
01:42 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudbackup1003.eqiad.wmnet with OS bookworm [production]